WorldWideScience

Sample records for web usage mining

  1. World Wide Web Usage Mining Systems and Technologies

    Directory of Open Access Journals (Sweden)

    Wen-Chen Hu

    2003-08-01

    Full Text Available Web usage mining is used to discover interesting user navigation patterns and can be applied to many real-world problems, such as improving Web sites/pages, making additional topic or product recommendations, user/customer behavior studies, etc. This article provides a survey and analysis of current Web usage mining systems and technologies. A Web usage mining system performs five major tasks: i data gathering, ii data preparation, iii navigation pattern discovery, iv pattern analysis and visualization, and v pattern applications. Each task is explained in detail and its related technologies are introduced. A list of major research systems and projects concerning Web usage mining is also presented, and a summary of Web usage mining is given in the last section.

  2. Fuzzy Clustering: An Approachfor Mining Usage Profilesfrom Web

    OpenAIRE

    Ms.Archana N. Boob; Prof. D. M. Dakhane

    2012-01-01

    Web usage mining is an application of data mining technology to mining the data of the web server log file. It can discover the browsing patterns of user and some kind of correlations between the web pages. Web usage mining provides the support for the web site design, providing personalization server and other business making decision, etc. Web mining applies the data mining, the artificial intelligence and the chart technology and so on to the web data and traces users' visiting characteris...

  3. Study on online community user motif using web usage mining

    Science.gov (United States)

    Alphy, Meera; Sharma, Ajay

    2016-04-01

    The Web usage mining is the application of data mining, which is used to extract useful information from the online community. The World Wide Web contains at least 4.73 billion pages according to Indexed Web and it contains at least 228.52 million pages according Dutch Indexed web on 6th august 2015, Thursday. It’s difficult to get needed data from these billions of web pages in World Wide Web. Here is the importance of web usage mining. Personalizing the search engine helps the web user to identify the most used data in an easy way. It reduces the time consumption; automatic site search and automatic restore the useful sites. This study represents the old techniques to latest techniques used in pattern discovery and analysis in web usage mining from 1996 to 2015. Analyzing user motif helps in the improvement of business, e-commerce, personalisation and improvement of websites.

  4. Discovering More Accurate Frequent Web Usage Patterns

    OpenAIRE

    Bayir, Murat Ali; Toroslu, Ismail Hakki; Cosar, Ahmet; Fidan, Guven

    2008-01-01

    Web usage mining is a type of web mining, which exploits data mining techniques to discover valuable information from navigation behavior of World Wide Web users. As in classical data mining, data preparation and pattern discovery are the main issues in web usage mining. The first phase of web usage mining is the data processing phase, which includes the session reconstruction operation from server logs. Session reconstruction success directly affects the quality of the frequent patterns disc...

  5. Association and Sequence Mining in Web Usage

    Directory of Open Access Journals (Sweden)

    Claudia Elena DINUCA

    2011-06-01

    Full Text Available Web servers worldwide generate a vast amount of information on web users’ browsing activities. Several researchers have studied these so-called clickstream or web access log data to better understand and characterize web users. Clickstream data can be enriched with information about the content of visited pages and the origin (e.g., geographic, organizational of the requests. The goal of this project is to analyse user behaviour by mining enriched web access log data. With the continued growth and proliferation of e-commerce, Web services, and Web-based information systems, the volumes of click stream and user data collected by Web-based organizations in their daily operations has reached astronomical proportions. This information can be exploited in various ways, such as enhancing the effectiveness of websites or developing directed web marketing campaigns. The discovered patterns are usually represented as collections of pages, objects, or re-sources that are frequently accessed by groups of users with common needs or interests. The focus of this paper is to provide an overview how to use frequent pattern techniques for discovering different types of patterns in a Web log database. In this paper we will focus on finding association as a data mining technique to extract potentially useful knowledge from web usage data. I implemented in Java, using NetBeans IDE, a program for identification of pages’ association from sessions. For exemplification, we used the log files from a commercial web site.

  6. Web Mining

    Science.gov (United States)

    Fürnkranz, Johannes

    The World-Wide Web provides every internet citizen with access to an abundance of information, but it becomes increasingly difficult to identify the relevant pieces of information. Research in web mining tries to address this problem by applying techniques from data mining and machine learning to Web data and documents. This chapter provides a brief overview of web mining techniques and research areas, most notably hypertext classification, wrapper induction, recommender systems and web usage mining.

  7. Constructing a web recommender system using web usage mining and user’s profiles

    Directory of Open Access Journals (Sweden)

    T. Mombeini

    2014-12-01

    Full Text Available The World Wide Web is a great source of information, which is nowadays being widely used due to the availability of useful information changing, dynamically. However, the large number of webpages often confuses many users and it is hard for them to find information on their interests. Therefore, it is necessary to provide a system capable of guiding users towards their desired choices and services. Recommender systems search among a large collection of user interests and recommend those, which are likely to be favored the most by the user. Web usage mining was designed to function on web server records, which are included in user search results. Therefore, recommender servers use the web usage mining technique to predict users’ browsing patterns and recommend those patterns in the form of a suggestion list. In this article, a recommender system based on web usage mining phases (online and offline was proposed. In the offline phase, the first step is to analyze user access records to identify user sessions. Next, user profiles are built using data from server records based on the frequency of access to pages, the time spent by the user on each page and the date of page view. Date is of importance since it is more possible for users to request new pages more than old ones and old pages are less probable to be viewed, as users mostly look for new information. Following the creation of user profiles, users are categorized in clusters using the Fuzzy C-means clustering algorithm and S(c criterion based on their similarities. In the online phase, a neural network is offered to identify the suggested model while online suggestions are generated using the suggestion module for the active user. Search engines analyze suggestion lists based on rate of user interest in pages and page rank and finally suggest appropriate pages to the active user. Experiments show that the proposed method of predicting user recent requested pages has more accuracy and

  8. Data Preparation for Web Mining – A survey

    OpenAIRE

    Amog Rajenderan

    2012-01-01

    An accepted trend is to categorize web mining intothree main areas: web content mining, webstructure mining and web usage mining. Webcontent mining involves extractingdetails/information from the contents of webpagesand performing things like knowledge synthesis.Web structure mining involves the usage of graphtheory to understand website structure/hierarchy.Web usage mining involves the mining of usefulinformation from things like server logs, tounderstand what the user does while on the inte...

  9. Web Usage Mining, Pattern Discovery dan Log File

    OpenAIRE

    Tri Suratno; Toni Prahasto; Adian Fatchur Rochim

    2014-01-01

    Analysis  of  data  to  access  the  server  can  provide  significant  and  useful  information  for  performance  improvement,  restructuring  andimproving the effectiveness of a web site. Data mining is one of the most effective way to detect a series of patterns of information from large amounts of data. Application of  data mining  on  Internet use  called web  mining  is a set of  data mining  techniques  are  used  for the web. Web mining technologies and data mining is a combination o...

  10. An Application for Data Preprocessing and Models Extractions in Web Usage Mining

    Directory of Open Access Journals (Sweden)

    Claudia Elena DINUCA

    2011-11-01

    Full Text Available Web servers worldwide generate a vast amount of information on web users’ browsing activities. Several researchers have studied these so-called clickstream or web access log data to better understand and characterize web users. The goal of this application is to analyze user behaviour by mining enriched web access log data. With the continued growth and proliferation of e-commerce, Web services, and Web-based information systems, the volumes of click stream and user data collected by Web-based organizations in their daily operations has reached astronomical proportions. This information can be exploited in various ways, such as enhancing the effectiveness of websites or developing directed web marketing campaigns. The discovered patterns are usually represented as collections of pages, objects, or re-sources that are frequently accessed by groups of users with common needs or interests. In this paper we will focus on displaying the way how it was implemented the application for data preprocessing and extracting different data models from web logs data, finding association as a data mining technique to extract potentially useful knowledge from web usage data. We find different data models navigation patterns by analysing the log files of the web-site. I implemented the application in Java using NetBeans IDE. For exemplification, I used the log files data from a commercial web site www.nice-layouts.com.

  11. SEMANTIC WEB MINING: ISSUES AND CHALLENGES

    OpenAIRE

    Karan Singh*, Anil kumar, Arun Kumar Yadav

    2016-01-01

    The combination of the two fast evolving scientific research areas “Semantic Web” and “Web Mining” are well-known as “Semantic Web Mining” in computer science. These two areas cover way for the mining of related and meaningful information from the web, by this means giving growth to the term “Semantic Web Mining”. The “Semantic Web” makes mining easy and “Web Mining” can construct new structure of Web. Web Mining applies Data Mining technique on web content, Structure and Usage. This paper gi...

  12. Applying Web usage mining for personalizing hyperlinks in Web-based adaptive educational systems

    NARCIS (Netherlands)

    Romero, C.; Ventura, S.; Zafra, A.; Bra, de P.M.E.

    2009-01-01

    Nowadays, the application of Web mining techniques in e-learning and Web-based adaptive educational systems is increasing exponentially. In this paper, we propose an advanced architecture for a personalization system to facilitate Web mining. A specific Web mining tool is developed and a recommender

  13. Applying Web Usage Mining for Personalizing Hyperlinks in Web-Based Adaptive Educational Systems

    Science.gov (United States)

    Romero, Cristobal; Ventura, Sebastian; Zafra, Amelia; de Bra, Paul

    2009-01-01

    Nowadays, the application of Web mining techniques in e-learning and Web-based adaptive educational systems is increasing exponentially. In this paper, we propose an advanced architecture for a personalization system to facilitate Web mining. A specific Web mining tool is developed and a recommender engine is integrated into the AHA! system in…

  14. A Dynamic Recommender System for Improved Web Usage Mining and CRM Using Swarm Intelligence.

    Science.gov (United States)

    Alphy, Anna; Prabakaran, S

    2015-01-01

    In modern days, to enrich e-business, the websites are personalized for each user by understanding their interests and behavior. The main challenges of online usage data are information overload and their dynamic nature. In this paper, to address these issues, a WebBluegillRecom-annealing dynamic recommender system that uses web usage mining techniques in tandem with software agents developed for providing dynamic recommendations to users that can be used for customizing a website is proposed. The proposed WebBluegillRecom-annealing dynamic recommender uses swarm intelligence from the foraging behavior of a bluegill fish. It overcomes the information overload by handling dynamic behaviors of users. Our dynamic recommender system was compared against traditional collaborative filtering systems. The results show that the proposed system has higher precision, coverage, F1 measure, and scalability than the traditional collaborative filtering systems. Moreover, the recommendations given by our system overcome the overspecialization problem by including variety in recommendations.

  15. USING WEB MINING IN E-COMMERCE APPLICATIONS

    Directory of Open Access Journals (Sweden)

    Claudia Elena Dinucă

    2011-09-01

    Full Text Available Nowadays, the web is an important part of our daily life. The web is now the best medium of doing business. Large companies rethink their business strategy using the web to improve business. Business carried on the Web offers the opportunity to potential customers or partners where their products and specific business can be found. Business presence through a company web site has several advantages as it breaks the barrier of time and space compared with the existence of a physical office. To differentiate through the Internet economy, winning companies have realized that e-commerce transactions is more than just buying / selling, appropriate strategies are key to improve competitive power. One effective technique used for this purpose is data mining. Data mining is the process of extracting interesting knowledge from data. Web mining is the use of data mining techniques to extract information from web data. This article presents the three components of web mining: web usage mining, web structure mining and web content mining.

  16. WEB STRUCTURE MINING USING PAGERANK, IMPROVED PAGERANK – AN OVERVIEW

    Directory of Open Access Journals (Sweden)

    V. Lakshmi Praba

    2011-03-01

    Full Text Available Web Mining is the extraction of interesting and potentially useful patterns and information from Web. It includes Web documents, hyperlinks between documents, and usage logs of web sites. The significant task for web mining can be listed out as Information Retrieval, Information Selection / Extraction, Generalization and Analysis. Web information retrieval tools consider only the text on pages and ignore information in the links. The goal of Web structure mining is to explore structural summary about web. Web structure mining focusing on link information is an important aspect of web data. This paper presents an overview of the PageRank, Improved Page Rank and its working functionality in web structure mining.

  17. WEB STRUCTURE MINING

    Directory of Open Access Journals (Sweden)

    CLAUDIA ELENA DINUCĂ

    2011-01-01

    Full Text Available The World Wide Web became one of the most valuable resources for information retrievals and knowledge discoveries due to the permanent increasing of the amount of data available online. Taking into consideration the web dimension, the users get easily lost in the web’s rich hyper structure. Application of data mining methods is the right solution for knowledge discovery on the Web. The knowledge extracted from the Web can be used to raise the performances for Web information retrievals, question answering and Web based data warehousing. In this paper, I provide an introduction of Web mining categories and I focus on one of these categories: the Web structure mining. Web structure mining, one of three categories of web mining for data, is a tool used to identify the relationship between Web pages linked by information or direct link connection. It offers information about how different pages are linked together to form this huge web. Web Structure Mining finds hidden basic structures and uses hyperlinks for more web applications such as web search.

  18. Automated web usage data mining and recommendation system using K-Nearest Neighbor (KNN classification method

    Directory of Open Access Journals (Sweden)

    D.A. Adeniyi

    2016-01-01

    Full Text Available The major problem of many on-line web sites is the presentation of many choices to the client at a time; this usually results to strenuous and time consuming task in finding the right product or information on the site. In this work, we present a study of automatic web usage data mining and recommendation system based on current user behavior through his/her click stream data on the newly developed Really Simple Syndication (RSS reader website, in order to provide relevant information to the individual without explicitly asking for it. The K-Nearest-Neighbor (KNN classification method has been trained to be used on-line and in Real-Time to identify clients/visitors click stream data, matching it to a particular user group and recommend a tailored browsing option that meet the need of the specific user at a particular time. To achieve this, web users RSS address file was extracted, cleansed, formatted and grouped into meaningful session and data mart was developed. Our result shows that the K-Nearest Neighbor classifier is transparent, consistent, straightforward, simple to understand, high tendency to possess desirable qualities and easy to implement than most other machine learning techniques specifically when there is little or no prior knowledge about data distribution.

  19. Using Open Web APIs in Teaching Web Mining

    Science.gov (United States)

    Chen, Hsinchun; Li, Xin; Chau, M.; Ho, Yi-Jen; Tseng, Chunju

    2009-01-01

    With the advent of the World Wide Web, many business applications that utilize data mining and text mining techniques to extract useful business information on the Web have evolved from Web searching to Web mining. It is important for students to acquire knowledge and hands-on experience in Web mining during their education in information systems…

  20. Web Page Recommendation Using Web Mining

    OpenAIRE

    Modraj Bhavsar; Mrs. P. M. Chavan

    2014-01-01

    On World Wide Web various kind of content are generated in huge amount, so to give relevant result to user web recommendation become important part of web application. On web different kind of web recommendation are made available to user every day that includes Image, Video, Audio, query suggestion and web page. In this paper we are aiming at providing framework for web page recommendation. 1) First we describe the basics of web mining, types of web mining. 2) Details of each...

  1. Is Toscana A Formal Concept Analysis Based Solution In Web Usage Mining?

    Directory of Open Access Journals (Sweden)

    Dan-Andrei SITAR-TĂUT

    2012-01-01

    Full Text Available Analyzing large amount of data come from web logs represents a complex, but challenging nowadays problem with implication in various fields, thing that lets open a way for theoretically infinite approaches an implementations. The main goal of our paper represents the possibility of applying the formal concept analysis as viable solution of sustaining the web mining process, based on a technological open-source solution called TOSCANA.

  2. Web Mining and Social Networking

    DEFF Research Database (Denmark)

    Xu, Guandong; Zhang, Yanchun; Li, Lin

    This book examines the techniques and applications involved in the Web Mining, Web Personalization and Recommendation and Web Community Analysis domains, including a detailed presentation of the principles, developed algorithms, and systems of the research in these areas. The applications of web ...... sense of individuals or communities. The volume will benefit both academic and industry communities interested in the techniques and applications of web search, web data management, web mining and web knowledge discovery, as well as web community and social network analysis.......This book examines the techniques and applications involved in the Web Mining, Web Personalization and Recommendation and Web Community Analysis domains, including a detailed presentation of the principles, developed algorithms, and systems of the research in these areas. The applications of web...... mining, and the issue of how to incorporate web mining into web personalization and recommendation systems are also reviewed. Additionally, the volume explores web community mining and analysis to find the structural, organizational and temporal developments of web communities and reveal the societal...

  3. Web Mining and Social Networking

    CERN Document Server

    Xu, Guandong; Li, Lin

    2011-01-01

    This book examines the techniques and applications involved in the Web Mining, Web Personalization and Recommendation and Web Community Analysis domains, including a detailed presentation of the principles, developed algorithms, and systems of the research in these areas. The applications of web mining, and the issue of how to incorporate web mining into web personalization and recommendation systems are also reviewed. Additionally, the volume explores web community mining and analysis to find the structural, organizational and temporal developments of web communities and reveal the societal s

  4. URL Mining Using Agglomerative Clustering Algorithm

    Directory of Open Access Journals (Sweden)

    Chinmay R. Deshmukh

    2015-02-01

    Full Text Available Abstract The tremendous growth of the web world incorporates application of data mining techniques to the web logs. Data Mining and World Wide Web encompasses an important and active area of research. Web log mining is analysis of web log files with web pages sequences. Web mining is broadly classified as web content mining web usage mining and web structure mining. Web usage mining is a technique to discover usage patterns from Web data in order to understand and better serve the needs of Web-based applications. URL mining refers to a subclass of Web mining that helps us to investigate the details of a Uniform Resource Locator. URL mining can be advantageous in the fields of security and protection. The paper introduces a technique for mining a collection of user transactions with an Internet search engine to discover clusters of similar queries and similar URLs. The information we exploit is a clickthrough data each record consist of a users query to a search engine along with the URL which the user selected from among the candidates offered by search engine. By viewing this dataset as a bipartite graph with the vertices on one side corresponding to queries and on the other side to URLs one can apply an agglomerative clustering algorithm to the graphs vertices to identify related queries and URLs.

  5. Integration of Web mining and web crawler: Relevance and State of Art

    OpenAIRE

    Subhendu kumar pani; Deepak Mohapatra,; Bikram Keshari Ratha

    2010-01-01

    This study presents the role of web crawler in web mining environment. As the growth of the World Wide Web exceeded all expectations,the research on Web mining is growing more and more.web mining research topic which combines two of the activated research areas: Data Mining and World Wide Web .So, the World Wide Web is a very advanced area for data mining research. Search engines that are based on web crawling framework also used in web mining to find theinteracted web pages. This paper discu...

  6. Data mining approach to web application intrusions detection

    Science.gov (United States)

    Kalicki, Arkadiusz

    2011-10-01

    Web applications became most popular medium in the Internet. Popularity, easiness of web application script languages and frameworks together with careless development results in high number of web application vulnerabilities and high number of attacks performed. There are several types of attacks possible because of improper input validation: SQL injection Cross-site scripting, Cross-Site Request Forgery (CSRF), web spam in blogs and others. In order to secure web applications intrusion detection (IDS) and intrusion prevention systems (IPS) are being used. Intrusion detection systems are divided in two groups: misuse detection (traditional IDS) and anomaly detection. This paper presents data mining based algorithm for anomaly detection. The principle of this method is the comparison of the incoming HTTP traffic with a previously built profile that contains a representation of the "normal" or expected web application usage sequence patterns. The frequent sequence patterns are found with GSP algorithm. Previously presented detection method was rewritten and improved. Some tests show that the software catches malicious requests, especially long attack sequences, results quite good with medium length sequences, for short length sequences must be complemented with other methods.

  7. Web Mining of Hotel Customer Survey Data

    Directory of Open Access Journals (Sweden)

    Richard S. Segall

    2008-12-01

    Full Text Available This paper provides an extensive literature review and list of references on the background of web mining as applied specifically to hotel customer survey data. This research applies the techniques of web mining to actual text of written comments for hotel customers using Megaputer PolyAnalyst®. Web mining functionalities utilized include those such as clustering, link analysis, key word and phrase extraction, taxonomy, and dimension matrices. This paper provides screen shots of the web mining applications using Megaputer PolyAnalyst®. Conclusions and future directions of the research are presented.

  8. A COMPARATIVE ANALYSIS OF WEB INFORMATION EXTRACTION TECHNIQUES DEEP LEARNING vs. NAÏVE BAYES vs. BACK PROPAGATION NEURAL NETWORKS IN WEB DOCUMENT EXTRACTION

    OpenAIRE

    J. Sharmila; A. Subramani

    2016-01-01

    Web mining related exploration is getting the chance to be more essential these days in view of the reason that a lot of information is overseen through the web. Web utilization is expanding in an uncontrolled way. A particular framework is required for controlling such extensive measure of information in the web space. Web mining is ordered into three noteworthy divisions: Web content mining, web usage mining and web structure mining. Tak-Lam Wong has proposed a web content mining methodolog...

  9. The design and implementation of web mining in web sites security

    Science.gov (United States)

    Li, Jian; Zhang, Guo-Yin; Gu, Guo-Chang; Li, Jian-Li

    2003-06-01

    The backdoor or information leak of Web servers can be detected by using Web Mining techniques on some abnormal Web log and Web application log data. The security of Web servers can be enhanced and the damage of illegal access can be avoided. Firstly, the system for discovering the patterns of information leakages in CGI scripts from Web log data was proposed. Secondly, those patterns for system administrators to modify their codes and enhance their Web site security were provided. The following aspects were described: one is to combine web application log with web log to extract more information, so web data mining could be used to mine web log for discovering the information that firewall and Information Detection System cannot find. Another approach is to propose an operation module of web site to enhance Web site security. In cluster server session, Density-Based Clustering technique is used to reduce resource cost and obtain better efficiency.

  10. ANALYSIS OF WEB MINING APPLICATIONS AND BENEFICIAL AREAS

    Directory of Open Access Journals (Sweden)

    Khaleel Ahmad

    2011-10-01

    Full Text Available The main purpose of this paper is to study the process of Web mining techniques, features, application ( e-commerce and e-business and its beneficial areas. Web mining has become more popular and its widely used in varies application areas (such as business intelligent system, e-commerce and e-business. The e-commerce or e-business results are bettered by the application of the mining techniques such as data mining and text mining, among all the mining techniques web mining is better.

  11. AN EFFICIENT WEB PERSONALIZATION APPROACH TO DISCOVER USER INTERESTED DIRECTORIES

    Directory of Open Access Journals (Sweden)

    M. Robinson Joel

    2014-04-01

    Full Text Available Web Usage Mining is the application of data mining technique used to retrieve the web usage from web proxy log file. Web Usage Mining consists of three major stages: preprocessing, clustering and pattern analysis. This paper explains each of these stages in detail. In this proposed approach, the web directories are discovered based on the user’s interestingness. The web proxy log file undergoes a preprocessing phase to improve the quality of data. Fuzzy Clustering Algorithm is used to cluster the user and session into disjoint clusters. In this paper, an effective approach is presented for Web personalization based on an Advanced Apriori algorithm. It is used to select the user interested web directories. The proposed method is compared with the existing web personalization methods like Objective Probabilistic Directory Miner (OPDM, Objective Community Directory Miner (OCDM and Objective Clustering and Probabilistic Directory Miner (OCPDM. The result shows that the proposed approach provides better results than the aforementioned existing approaches. At last, an application is developed with the user interested directories and web usage details.

  12. Semantic Web Requirements through Web Mining Techniques

    OpenAIRE

    Hassanzadeh, Hamed; Keyvanpour, Mohammad Reza

    2012-01-01

    In recent years, Semantic web has become a topic of active research in several fields of computer science and has applied in a wide range of domains such as bioinformatics, life sciences, and knowledge management. The two fast-developing research areas semantic web and web mining can complement each other and their different techniques can be used jointly or separately to solve the issues in both areas. In addition, since shifting from current web to semantic web mainly depends on the enhance...

  13. A Two-Tiered Model for Analyzing Library Web Site Usage Statistics, Part 1: Web Server Logs.

    Science.gov (United States)

    Cohen, Laura B.

    2003-01-01

    Proposes a two-tiered model for analyzing web site usage statistics for academic libraries: one tier for library administrators that analyzes measures indicating library use, and a second tier for web site managers that analyzes measures aiding in server maintenance and site design. Discusses the technology of web site usage statistics, and…

  14. Earth Science Mining Web Services

    Science.gov (United States)

    Pham, Long; Lynnes, Christopher; Hegde, Mahabaleshwa; Graves, Sara; Ramachandran, Rahul; Maskey, Manil; Keiser, Ken

    2008-01-01

    To allow scientists further capabilities in the area of data mining and web services, the Goddard Earth Sciences Data and Information Services Center (GES DISC) and researchers at the University of Alabama in Huntsville (UAH) have developed a system to mine data at the source without the need of network transfers. The system has been constructed by linking together several pre-existing technologies: the Simple Scalable Script-based Science Processor for Measurements (S4PM), a processing engine at he GES DISC; the Algorithm Development and Mining (ADaM) system, a data mining toolkit from UAH that can be configured in a variety of ways to create customized mining processes; ActiveBPEL, a workflow execution engine based on BPEL (Business Process Execution Language); XBaya, a graphical workflow composer; and the EOS Clearinghouse (ECHO). XBaya is used to construct an analysis workflow at UAH using ADam components, which are also installed remotely at the GES DISC, wrapped as Web Services. The S4PM processing engine searches ECHO for data using space-time criteria, staging them to cache, allowing the ActiveBPEL engine to remotely orchestras the processing workflow within S4PM. As mining is completed, the output is placed in an FTP holding area for the end user. The goals are to give users control over the data they want to process, while mining data at the data source using the server's resources rather than transferring the full volume over the internet. These diverse technologies have been infused into a functioning, distributed system with only minor changes to the underlying technologies. The key to the infusion is the loosely coupled, Web-Services based architecture: All of the participating components are accessible (one way or another) through (Simple Object Access Protocol) SOAP-based Web Services.

  15. Analysis of Usage Patterns in Large Multimedia Websites

    Science.gov (United States)

    Singh, Rahul; Bhattarai, Bibek

    User behavior in a website is a critical indicator of the web site's usability and success. Therefore an understanding of usage patterns is essential to website design optimization. In this context, large multimedia websites pose a significant challenge for comprehension of the complex and diverse user behaviors they sustain. This is due to the complexity of analyzing and understanding user-data interactions in media-rich contexts. In this chapter we present a novel multi-perspective approach for usability analysis of large media rich websites. Our research combines multimedia web content analysis with elements of web-log analysis and visualization/visual mining of web usage metadata. Multimedia content analysis allows direct estimation of the information-cues presented to a user by the web content. Analysis of web logs and usage-metadata, such as location, type, and frequency of interactions provides a complimentary perspective on the site's usage. The entire set of information is leveraged through powerful visualization and interactive querying techniques to provide analysis of usage patterns, measure of design quality, as well as the ability to rapidly identify problems in the web-site design. Experiments on media rich sites including the SkyServer - a large multimedia web-based astronomy information repository demonstrate the efficacy and promise of the proposed approach.

  16. Text mining of web-based medical content

    CERN Document Server

    Neustein, Amy

    2014-01-01

    Text Mining of Web-Based Medical Content examines web mining for extracting useful information that can be used for treating and monitoring the healthcare of patients. This work provides methodological approaches to designing mapping tools that exploit data found in social media postings. Specific linguistic features of medical postings are analyzed vis-a-vis available data extraction tools for culling useful information.

  17. Web-based pathology practice examination usage

    Directory of Open Access Journals (Sweden)

    Edward C Klatt

    2014-01-01

    Full Text Available Context: General and subject specific practice examinations for students in health sciences studying pathology were placed onto a free public internet web site entitled web path and were accessed four clicks from the home web site menu. Subjects and Methods: Multiple choice questions were coded into. html files with JavaScript functions for web browser viewing in a timed format. A Perl programming language script with common gateway interface for web page forms scored examinations and placed results into a log file on an internet computer server. The four general review examinations of 30 questions each could be completed in up to 30 min. The 17 subject specific examinations of 10 questions each with accompanying images could be completed in up to 15 min each. The results of scores and user educational field of study from log files were compiled from June 2006 to January 2014. Results: The four general review examinations had 31,639 accesses with completion of all questions, for a completion rate of 54% and average score of 75%. A score of 100% was achieved by 7% of users, ≥90% by 21%, and ≥50% score by 95% of users. In top to bottom web page menu order, review examination usage was 44%, 24%, 17%, and 15% of all accessions. The 17 subject specific examinations had 103,028 completions, with completion rate 73% and average score 74%. Scoring at 100% was 20% overall, ≥90% by 37%, and ≥50% score by 90% of users. The first three menu items on the web page accounted for 12.6%, 10.0%, and 8.2% of all completions, and the bottom three accounted for no more than 2.2% each. Conclusions: Completion rates were higher for shorter 10 questions subject examinations. Users identifying themselves as MD/DO scored higher than other users, averaging 75%. Usage was higher for examinations at the top of the web page menu. Scores achieved suggest that a cohort of serious users fully completing the examinations had sufficient preparation to use them to support

  18. Usage reporting on recorded lectures using educational data mining

    NARCIS (Netherlands)

    Gorissen, Pierre; Van Bruggen, Jan; Jochems, Wim

    2012-01-01

    Gorissen, P., Van Bruggen, J., & Jochems, W. M. G. (2012). Usage reporting on recorded lectures using educational data mining. International Journal of Learning Technology, 7, 23-40. doi:10.1504/IJLT.2012.046864

  19. Experimental economics for web mining

    OpenAIRE

    Tagiew, Rustam; Ignatov, Dmitry I.; Amroush, Fadi

    2014-01-01

    This paper offers a step towards research infrastructure, which makes data from experimental economics efficiently usable for analysis of web data. We believe that regularities of human behavior found in experimental data also emerge in real world web data. A format for data from experiments is suggested, which enables its publication as open data. Once standardized datasets of experiments are available on-line, web mining can take advantages from this data. Further, the questions about the o...

  20. OntoGene web services for biomedical text mining.

    Science.gov (United States)

    Rinaldi, Fabio; Clematide, Simon; Marques, Hernani; Ellendorff, Tilia; Romacker, Martin; Rodriguez-Esteban, Raul

    2014-01-01

    Text mining services are rapidly becoming a crucial component of various knowledge management pipelines, for example in the process of database curation, or for exploration and enrichment of biomedical data within the pharmaceutical industry. Traditional architectures, based on monolithic applications, do not offer sufficient flexibility for a wide range of use case scenarios, and therefore open architectures, as provided by web services, are attracting increased interest. We present an approach towards providing advanced text mining capabilities through web services, using a recently proposed standard for textual data interchange (BioC). The web services leverage a state-of-the-art platform for text mining (OntoGene) which has been tested in several community-organized evaluation challenges,with top ranked results in several of them.

  1. Análisis de sesiones de la web del Cindoc: una aproximación a la minería de uso web

    OpenAIRE

    Ortega-Priego, José-Luis

    2005-01-01

    This paper try an usability and navigability study of the Cindoc web site through web log files of the main server for october 2003. For this, web mining are used, concretly, web usage mining techniques to the detection of sessions with the aim of determine navigation patterns and design faults. Several design problems are detected in the navigation menu, in the layouth of the contents and in the web structure. Different navigation identificated patterns are discussed and many advices are ...

  2. Graph Mining Meets the Semantic Web

    Energy Technology Data Exchange (ETDEWEB)

    Lee, Sangkeun (Matt) [ORNL; Sukumar, Sreenivas R [ORNL; Lim, Seung-Hwan [ORNL

    2015-01-01

    The Resource Description Framework (RDF) and SPARQL Protocol and RDF Query Language (SPARQL) were introduced about a decade ago to enable flexible schema-free data interchange on the Semantic Web. Today, data scientists use the framework as a scalable graph representation for integrating, querying, exploring and analyzing data sets hosted at different sources. With increasing adoption, the need for graph mining capabilities for the Semantic Web has emerged. We address that need through implementation of three popular iterative Graph Mining algorithms (Triangle count, Connected component analysis, and PageRank). We implement these algorithms as SPARQL queries, wrapped within Python scripts. We evaluate the performance of our implementation on 6 real world data sets and show graph mining algorithms (that have a linear-algebra formulation) can indeed be unleashed on data represented as RDF graphs using the SPARQL query interface.

  3. Data Mining Web Services for Science Data Repositories

    Science.gov (United States)

    Graves, S.; Ramachandran, R.; Keiser, K.; Maskey, M.; Lynnes, C.; Pham, L.

    2006-12-01

    The maturation of web services standards and technologies sets the stage for a distributed "Service-Oriented Architecture" (SOA) for NASA's next generation science data processing. This architecture will allow members of the scientific community to create and combine persistent distributed data processing services and make them available to other users over the Internet. NASA has initiated a project to create a suite of specialized data mining web services designed specifically for science data. The project leverages the Algorithm Development and Mining (ADaM) toolkit as its basis. The ADaM toolkit is a robust, mature and freely available science data mining toolkit that is being used by several research organizations and educational institutions worldwide. These mining services will give the scientific community a powerful and versatile data mining capability that can be used to create higher order products such as thematic maps from current and future NASA satellite data records with methods that are not currently available. The package of mining and related services are being developed using Web Services standards so that community-based measurement processing systems can access and interoperate with them. These standards-based services allow users different options for utilizing them, from direct remote invocation by a client application to deployment of a Business Process Execution Language (BPEL) solutions package where a complex data mining workflow is exposed to others as a single service. The ability to deploy and operate these services at a data archive allows the data mining algorithms to be run where the data are stored, a more efficient scenario than moving large amounts of data over the network. This will be demonstrated in a scenario in which a user uses a remote Web-Service-enabled clustering algorithm to create cloud masks from satellite imagery at the Goddard Earth Sciences Data and Information Services Center (GES DISC).

  4. Mining for Social Media: Usage Patterns of Small Businesses

    OpenAIRE

    Balan, Shilpa; Rege, Janhavi

    2017-01-01

    Background: Information can now be rapidly exchanged due to social media. Due to its openness, Twitter has generated massive amounts of data. In this paper, we apply data mining and analytics to extract the usage patterns of social media by small businesses. Objectives: The aim of this paper is to describe with an example how data mining can be applied to social media. This paper further examines the impact of social media on small businesses. The Twitter posts related to small businesses are...

  5. Web-based Media at European Universities: Systems, Usage, and Motivation

    DEFF Research Database (Denmark)

    Godsk, Mikkel

    2009-01-01

    This paper presents the results of two surveys analyzing the usage of and the systems available for web-based media at European universities, and how the teachers can be motivated to increase their usage of such materials in their teaching practice. The surveys were carried out April-May 2009 among...... obvious. The surveys also show that many teachers are already using web-based media in their teaching practice and by addressing some of their teaching circumstances it would be possible to increase the usage even further. Based on these results the paper presents five initiatives to motivate the teachers...

  6. Web-video-mining-supported workflow modeling for laparoscopic surgeries.

    Science.gov (United States)

    Liu, Rui; Zhang, Xiaoli; Zhang, Hao

    2016-11-01

    As quality assurance is of strong concern in advanced surgeries, intelligent surgical systems are expected to have knowledge such as the knowledge of the surgical workflow model (SWM) to support their intuitive cooperation with surgeons. For generating a robust and reliable SWM, a large amount of training data is required. However, training data collected by physically recording surgery operations is often limited and data collection is time-consuming and labor-intensive, severely influencing knowledge scalability of the surgical systems. The objective of this research is to solve the knowledge scalability problem in surgical workflow modeling with a low cost and labor efficient way. A novel web-video-mining-supported surgical workflow modeling (webSWM) method is developed. A novel video quality analysis method based on topic analysis and sentiment analysis techniques is developed to select high-quality videos from abundant and noisy web videos. A statistical learning method is then used to build the workflow model based on the selected videos. To test the effectiveness of the webSWM method, 250 web videos were mined to generate a surgical workflow for the robotic cholecystectomy surgery. The generated workflow was evaluated by 4 web-retrieved videos and 4 operation-room-recorded videos, respectively. The evaluation results (video selection consistency n-index ≥0.60; surgical workflow matching degree ≥0.84) proved the effectiveness of the webSWM method in generating robust and reliable SWM knowledge by mining web videos. With the webSWM method, abundant web videos were selected and a reliable SWM was modeled in a short time with low labor cost. Satisfied performances in mining web videos and learning surgery-related knowledge show that the webSWM method is promising in scaling knowledge for intelligent surgical systems. Copyright © 2016 Elsevier B.V. All rights reserved.

  7. Web mining in soft computing framework: relevance, state of the art and future directions.

    Science.gov (United States)

    Pal, S K; Talwar, V; Mitra, P

    2002-01-01

    The paper summarizes the different characteristics of Web data, the basic components of Web mining and its different types, and the current state of the art. The reason for considering Web mining, a separate field from data mining, is explained. The limitations of some of the existing Web mining methods and tools are enunciated, and the significance of soft computing (comprising fuzzy logic (FL), artificial neural networks (ANNs), genetic algorithms (GAs), and rough sets (RSs) are highlighted. A survey of the existing literature on "soft Web mining" is provided along with the commercially available systems. The prospective areas of Web mining where the application of soft computing needs immediate attention are outlined with justification. Scope for future research in developing "soft Web mining" systems is explained. An extensive bibliography is also provided.

  8. Preprocessing and Content/Navigational Pages Identification as Premises for an Extended Web Usage Mining Model Development

    Directory of Open Access Journals (Sweden)

    Daniel MICAN

    2009-01-01

    Full Text Available From its appearance until nowadays, the internet saw a spectacular growth not only in terms of websites number and information volume, but also in terms of the number of visitors. Therefore, the need of an overall analysis regarding both the web sites and the content provided by them was required. Thus, a new branch of research was developed, namely web mining, that aims to discover useful information and knowledge, based not only on the analysis of websites and content, but also on the way in which the users interact with them. The aim of the present paper is to design a database that captures only the relevant data from logs in a way that will allow to store and manage large sets of temporal data with common tools in real time. In our work, we rely on different web sites or website sections with known architecture and we test several hypotheses from the literature in order to extend the framework to sites with unknown or chaotic structure, which are non-transparent in determining the type of visited pages. In doing this, we will start from non-proprietary, preexisting raw server logs.

  9. Comparing usage of a web and app stress management intervention: An observational study

    Directory of Open Access Journals (Sweden)

    Leanne G. Morrison

    2018-06-01

    Full Text Available Choices in the design and delivery of digital health behaviour interventions may have a direct influence on subsequent usage and engagement. Few studies have been able to make direct, detailed comparisons of differences in usage between interventions that are delivered via web or app. This study compared the usage of two versions of a digital stress management intervention, one delivered via a website (Healthy Paths and the other delivered via an app (Healthy Mind. Design modifications were introduced within Healthy Mind to take account of reported differences in how individuals engage with websites compared to apps and mobile phones. Data were collected as part of an observational study nested within a broader exploratory trial of Healthy Mind. Objective usage of Healthy Paths and Healthy Mind were automatically recorded, including frequency and duration of logins, access to specific components within the intervention and order of page/screen visits. Usage was compared for a two week period following initial registration. In total, 381 participants completed the registration process for Healthy Paths (web and 162 participants completed the registration process for Healthy Mind (app. App users logged in twice as often (Mdn = 2.00 as web users (Mdn = 1.00, U = 13,059.50, p ≤ 0.001, but spent half as much time (Mdn = 5.23 min on the intervention compared to web users (Mdn = 10.52 min, U = 19,740.00, p ≤ 0.001. Visual exploration of usage patterns over time revealed that a significantly higher proportion of app users (n = 126, 82.35% accessed both types of support available within the intervention (i.e. awareness and change-focused tools compared to web users (n = 92, 40.17%, χ2(1, n = 382 = 66.60, p < 0.001. This study suggests that the digital platform used to deliver an intervention (i.e. web versus app and specific design choices (e.g. navigation, length and volume of content may be

  10. Usage Of Asp.Net Ajax for Binus School Serpong Web Applications

    Directory of Open Access Journals (Sweden)

    Karto Iskandar

    2016-03-01

    Full Text Available Today web applications have become a necessity and many companies use them as a communication tool to keep in touch with their customers. The usage of Web Application in current time increases as the numberof internet users has been rised. For reason of Rich Internet Application, the desktop application developer wasmoved to web application developer with AJAX technology. BINUS School Serpong is a Cambridge Curriculum base International School that uses web application for access every information about the school. By usingAJAX, performance of web application should be improved and the bandwidth usage is decreased. Problems thatoccur at BINUS School Serpong is not all part of the web application that uses AJAX. This paper introducesusage of AJAX in ASP.NET with C# programming language in web application BINUS School Serpong. It is expected by using ASP.NET AJAX, BINUS School Serpong website performance will be faster because of reducing web page reload. The methodology used in this paper is literature study. Results from this study are to prove that the ASP.NET AJAX can be used easily and improve BINUS School Serpong website performance. Conclusion of this paper is the implementation of ASP.NET AJAX improves performance of web application in BINUS School Serpong.

  11. AN INNOVATIVE WEB MINING APPLICATION ON BLOGS - A LAYOUT

    Directory of Open Access Journals (Sweden)

    S. Prakash

    2012-01-01

    Full Text Available Blogs and Web services agree to express user’s opinions and interests, in the form of small text messages which gives abbreviated and highly personalized remarks in real-time. Recognizing emotion is really significant for a text-based communication tool such as blogs. Nowadays, user opinions in the structure of comments, reviews in blogs have been utilized by researchers for various purposes. Among them the application of sentiment analysis techniques to these opinions is an interesting one. This paper deals with a proposal of a software structural design for constructing Web mining applications in the blog world. The design includes blog crawling and data mining algorithms, to offer a full-fledged and flexible key for constructing general-purpose Web mining applications. The structural design allocates some significant customizations, such as the construction of adapters for reading text from different blogs, and the utilization of different pre-processing methods and data mining procedures. The core of this paper is on explaining the innovative software structural design of the general framework offering thorough information about the data mining sub-framework.

  12. Antecedents of Continued Usage Intentions of Web-Based Learning Management System in Tanzania

    Science.gov (United States)

    Lwoga, Edda Tandi; Komba, Mercy

    2015-01-01

    Purpose: The purpose of this paper is to examine factors that predict students' continued usage intention of web-based learning management systems (LMS) in Tanzania, with a specific focus on the School of Business of Mzumbe University. Specifically, the study investigated major predictors of actual usage and continued usage intentions of…

  13. Effect of Temporal Relationships in Associative Rule Mining for Web Log Data

    Science.gov (United States)

    Mohd Khairudin, Nazli; Mustapha, Aida

    2014-01-01

    The advent of web-based applications and services has created such diverse and voluminous web log data stored in web servers, proxy servers, client machines, or organizational databases. This paper attempts to investigate the effect of temporal attribute in relational rule mining for web log data. We incorporated the characteristics of time in the rule mining process and analysed the effect of various temporal parameters. The rules generated from temporal relational rule mining are then compared against the rules generated from the classical rule mining approach such as the Apriori and FP-Growth algorithms. The results showed that by incorporating the temporal attribute via time, the number of rules generated is subsequently smaller but is comparable in terms of quality. PMID:24587757

  14. Mining for Social Media: Usage Patterns of Small Businesses

    Directory of Open Access Journals (Sweden)

    Balan Shilpa

    2017-03-01

    Full Text Available Background: Information can now be rapidly exchanged due to social media. Due to its openness, Twitter has generated massive amounts of data. In this paper, we apply data mining and analytics to extract the usage patterns of social media by small businesses. Objectives: The aim of this paper is to describe with an example how data mining can be applied to social media. This paper further examines the impact of social media on small businesses. The Twitter posts related to small businesses are analyzed in detail. Methods/Approach: The patterns of social media usage by small businesses are observed using IBM Watson Analytics. In this paper, we particularly analyze tweets on Twitter for the hashtag #smallbusiness. Results: It is found that the number of females posting topics related to small business on Twitter is greater than the number of males. It is also found that the number of negative posts in Twitter is relatively low. Conclusions: Small firms are beginning to understand the importance of social media to realize their business goals. For future research, further analysis can be performed on the date and time the tweets were posted.

  15. Using Clustering Techniques To Detect Usage Patterns in a Web-based Information System.

    Science.gov (United States)

    Chen, Hui-Min; Cooper, Michael D.

    2001-01-01

    This study developed an analytical approach to detecting groups with homogenous usage patterns in a Web-based information system. Principal component analysis was used for data reduction, cluster analysis for categorizing usage into groups. The methodology was demonstrated and tested using two independent samples of user sessions from the…

  16. The Usage of Web 2.0 as a Media Promotion in Indonesia University Libraries

    Directory of Open Access Journals (Sweden)

    Nove E. Variant Anna

    2015-04-01

    Full Text Available The usage of web 2.0 has become popular among young people in Indonesia. One of the purpose of using web 2.0 is for promotion in some university libraries. The emerging of the web 2.0 as promotional media is corelating with the development of digital library. The paper aims are (1 to describe the usage of web 2.0 for academic libraries promotion. (2 to describe the information / content of those web 2.0. (3 to describe the promotion activity through web 2.0. This research population is all university libraries in Indonesia, but only 40 university libraries that conduct promotion through web 2.0. The website observation is done between May-July 2013. The research results are (1 the university libraries in Indonesia are use facebook, twitter, and flicker to promote library programs and interaction with users. The web 2.0 consist of information about new book release, user education, general information about library services, and information literacy. (3 some of univerity libraries taking seriously and actively promote their library services, but some of them are don’t use the web 2.0.

  17. The Usage of Web 2.0 as a Media Promotion in Indonesia University Libraries

    Directory of Open Access Journals (Sweden)

    Nove E. Variant Anna

    2018-01-01

    Full Text Available The usage of web 2.0 has become popular among young people in Indonesia. One of the purpose of using web 2.0 is for promotion in some university libraries. The emerging of the web 2.0 as promotional media is corelating with the development of digital library. The paper aims are (1 to describe the usage of web 2.0 for academic libraries promotion. (2 to describe the information / content of those web 2.0. (3 to describe the promotion activity through web 2.0. This research population is all university libraries in Indonesia, but only 40 university librraries that conduct promotion through web 2.0. The website observation is done between May-July 2013. The research results are (1 the university libraries in Indonesia are use facebook, twitter, and flikr to promote library programs and interaction with users. The web 2.0 consist of information about new book release, user education, general information about library services, and information literacy. (3 some of univerity libraries taking seriously and actively promote their library services, but some of them are don’t use the web 2.0.

  18. GROUPING WEB ACCESS SEQUENCES uSING SEQUENCE ALIGNMENT METHOD

    OpenAIRE

    BHUPENDRA S CHORDIA; KRISHNAKANT P ADHIYA

    2011-01-01

    In web usage mining grouping of web access sequences can be used to determine the behavior or intent of a set of users. Grouping websessions is how to measure the similarity between web sessions. There are many shortcomings in traditional measurement methods. The taskof grouping web sessions based on similarity and consists of maximizing the intra-group similarity while minimizing the inter-groupsimilarity is done using sequence alignment method. This paper introduces a new method to group we...

  19. Introduction to the JASIST Special Topic Issue on Web Retrieval and Mining: A Machine Learning Perspective.

    Science.gov (United States)

    Chen, Hsinchun

    2003-01-01

    Discusses information retrieval techniques used on the World Wide Web. Topics include machine learning in information extraction; relevance feedback; information filtering and recommendation; text classification and text clustering; Web mining, based on data mining techniques; hyperlink structure; and Web size. (LRW)

  20. The Role of Virtual Reference in Library Web Site Design: A Qualitative Source for Usage Data

    Science.gov (United States)

    Powers, Amanda Clay; Shedd, Julie; Hill, Clay

    2011-01-01

    Gathering qualitative information about usage behavior of library Web sites is a time-consuming process requiring the active participation of patron communities. Libraries that collect virtual reference transcripts, however, hold valuable data regarding how the library Web site is used that could benefit Web designers. An analysis of virtual…

  1. On-Board Mining in the Sensor Web

    Science.gov (United States)

    Tanner, S.; Conover, H.; Graves, S.; Ramachandran, R.; Rushing, J.

    2004-12-01

    On-board data mining can contribute to many research and engineering applications, including natural hazard detection and prediction, intelligent sensor control, and the generation of customized data products for direct distribution to users. The ability to mine sensor data in real time can also be a critical component of autonomous operations, supporting deep space missions, unmanned aerial and ground-based vehicles (UAVs, UGVs), and a wide range of sensor meshes, webs and grids. On-board processing is expected to play a significant role in the next generation of NASA, Homeland Security, Department of Defense and civilian programs, providing for greater flexibility and versatility in measurements of physical systems. In addition, the use of UAV and UGV systems is increasing in military, emergency response and industrial applications. As research into the autonomy of these vehicles progresses, especially in fleet or web configurations, the applicability of on-board data mining is expected to increase significantly. Data mining in real time on board sensor platforms presents unique challenges. Most notably, the data to be mined is a continuous stream, rather than a fixed store such as a database. This means that the data mining algorithms must be modified to make only a single pass through the data. In addition, the on-board environment requires real time processing with limited computing resources, thus the algorithms must use fixed and relatively small amounts of processing time and memory. The University of Alabama in Huntsville is developing an innovative processing framework for the on-board data and information environment. The Environment for On-Board Processing (EVE) and the Adaptive On-board Data Processing (AODP) projects serve as proofs-of-concept of advanced information systems for remote sensing platforms. The EVE real-time processing infrastructure will upload, schedule and control the execution of processing plans on board remote sensors. These plans

  2. Using ant-behavior-based simulation model AntWeb to improve website organization

    Science.gov (United States)

    Li, Weigang; Pinheiro Dib, Marcos V.; Teles, Wesley M.; Morais de Andrade, Vlaudemir; Alves de Melo, Alba C. M.; Cariolano, Judas T.

    2002-03-01

    Some web usage mining algorithms showed the potential application to find the difference among the organizations expected by visitors to the website. However, there are still no efficient method and criterion for a web administrator to measure the performance of the modification. In this paper, we developed an AntWeb, a model inspired by ants' behavior to simulate the sequence of visiting the website, in order to measure the efficient of the web structure. We implemented a web usage mining algorithm using backtrack to the intranet website of the Politec Informatic Ltd., Brazil. We defined throughput (the number of visitors to reach their target pages per time unit relates to the total number of visitors) as an index to measure the website's performance. We also used the link in a web page to represent the effect of visitors' pheromone trails. For every modification in the website organization, for example, putting a link from the expected location to the target object, the simulation reported the value of throughput as a quick answer about this modification. The experiment showed the stability of our simulation model, and a positive modification to the intranet website of the Politec.

  3. Beyond accuracy: creating interoperable and scalable text-mining web services.

    Science.gov (United States)

    Wei, Chih-Hsuan; Leaman, Robert; Lu, Zhiyong

    2016-06-15

    The biomedical literature is a knowledge-rich resource and an important foundation for future research. With over 24 million articles in PubMed and an increasing growth rate, research in automated text processing is becoming increasingly important. We report here our recently developed web-based text mining services for biomedical concept recognition and normalization. Unlike most text-mining software tools, our web services integrate several state-of-the-art entity tagging systems (DNorm, GNormPlus, SR4GN, tmChem and tmVar) and offer a batch-processing mode able to process arbitrary text input (e.g. scholarly publications, patents and medical records) in multiple formats (e.g. BioC). We support multiple standards to make our service interoperable and allow simpler integration with other text-processing pipelines. To maximize scalability, we have preprocessed all PubMed articles, and use a computer cluster for processing large requests of arbitrary text. Our text-mining web service is freely available at http://www.ncbi.nlm.nih.gov/CBBresearch/Lu/Demo/tmTools/#curl : Zhiyong.Lu@nih.gov. Published by Oxford University Press 2016. This work is written by US Government employees and is in the public domain in the US.

  4. Web based parallel/distributed medical data mining using software agents

    Energy Technology Data Exchange (ETDEWEB)

    Kargupta, H.; Stafford, B.; Hamzaoglu, I.

    1997-12-31

    This paper describes an experimental parallel/distributed data mining system PADMA (PArallel Data Mining Agents) that uses software agents for local data accessing and analysis and a web based interface for interactive data visualization. It also presents the results of applying PADMA for detecting patterns in unstructured texts of postmortem reports and laboratory test data for Hepatitis C patients.

  5. Web Usage Mining: Application to an Online Educational Digital Library Service

    Science.gov (United States)

    Palmer, Bart C.

    2012-01-01

    This dissertation was situated in the crossroads of educational data mining (EDM), educational digital libraries (such as the National Science Digital Library; http://nsdl.org), and examination of teacher behaviors while creating online learning resources in an end-user authoring system, the Instructional Architect (IA; http://ia.usu.edu). The…

  6. A Visualization Tool to Analyse Usage of Web-Based Interventions: The Example of Positive Online Weight Reduction (POWeR)

    Science.gov (United States)

    Smith, Emily; Bradbury, Katherine; Morrison, Leanne; Dennison, Laura; Michaelides, Danius; Yardley, Lucy

    2015-01-01

    Background Attrition is a significant problem in Web-based interventions. Consequently, this research aims to identify the relation between Web usage and benefit from such interventions. A visualization tool has been developed that enables researchers to more easily examine large datasets on intervention usage that can be difficult to make sense of using traditional descriptive or statistical techniques alone. Objective This paper demonstrates how the visualization tool was used to explore patterns in participants’ use of a Web-based weight management intervention, termed "positive online weight reduction (POWeR)." We also demonstrate how the visualization tool can be used to perform subsequent statistical analyses of the association between usage patterns, participant characteristics, and intervention outcome. Methods The visualization tool was used to analyze data from 132 participants who had accessed at least one session of the POWeR intervention. Results There was a drop in usage of optional sessions after participants had accessed the initial, core POWeR sessions, but many users nevertheless continued to complete goal and weight reviews. The POWeR tools relating to the food diary and steps diary were reused most often. Differences in participant characteristics and usage of other intervention components were identified between participants who did and did not choose to access optional POWeR sessions (in addition to the initial core sessions) or reuse the food and steps diaries. Reuse of the steps diary and the getting support tools was associated with greater weight loss. Conclusions The visualization tool provided a quick and efficient method for exploring patterns of Web usage, which enabled further analyses of whether different usage patterns were associated with participant characteristics or differences in intervention outcome. Further usage of visualization techniques is recommended to (1) make sense of large datasets more quickly and efficiently; (2

  7. Web Approach for Ontology-Based Classification, Integration, and Interdisciplinary Usage of Geoscience Metadata

    Directory of Open Access Journals (Sweden)

    B Ritschel

    2012-10-01

    Full Text Available The Semantic Web is a W3C approach that integrates the different sources of semantics within documents and services using ontology-based techniques. The main objective of this approach in the geoscience domain is the improvement of understanding, integration, and usage of Earth and space science related web content in terms of data, information, and knowledge for machines and people. The modeling and representation of semantic attributes and relations within and among documents can be realized by human readable concept maps and machine readable OWL documents. The objectives for the usage of the Semantic Web approach in the GFZ data center ISDC project are the design of an extended classification of metadata documents for product types related to instruments, platforms, and projects as well as the integration of different types of metadata related to data product providers, users, and data centers. Sources of content and semantics for the description of Earth and space science product types and related classes are standardized metadata documents (e.g., DIF documents, publications, grey literature, and Web pages. Other sources are information provided by users, such as tagging data and social navigation information. The integration of controlled vocabularies as well as folksonomies plays an important role in the design of well formed ontologies.

  8. Mining the inner structure of the Web graph

    International Nuclear Information System (INIS)

    Donato, Debora; Leonardi, Stefano; Millozzi, Stefano; Tsaparas, Panayiotis

    2008-01-01

    Despite being the sum of decentralized and uncoordinated efforts by heterogeneous groups and individuals, the World Wide Web exhibits a well-defined structure, characterized by several interesting properties. This structure was clearly revealed by Broder et al (2000 Graph structure in the web Comput. Netw. 33 309) who presented the evocative bow-tie picture of the Web. Although, the bow-tie structure is a relatively clear abstraction of the macroscopic picture of the Web, it is quite uninformative with respect to the finer details of the Web graph. In this paper, we mine the inner structure of the Web graph. We present a series of measurements on the Web, which offer a better understanding of the individual components of the bow-tie. In the process, we develop algorithmic techniques for performing these measurements. We discover that the scale-free properties permeate all the components of the bow-tie which exhibit the same macroscopic properties as the Web graph itself. However, close inspection reveals that their inner structure is quite distinct. We show that the Web graph does not exhibit self similarity within its components, and we propose a possible alternative picture for the Web graph, as it emerges from our experiments

  9. A Hybrid Data Mining Approach for Credit Card Usage Behavior Analysis

    Science.gov (United States)

    Tsai, Chieh-Yuan

    Credit card is one of the most popular e-payment approaches in current online e-commerce. To consolidate valuable customers, card issuers invest a lot of money to maintain good relationship with their customers. Although several efforts have been done in studying card usage motivation, few researches emphasize on credit card usage behavior analysis when time periods change from t to t+1. To address this issue, an integrated data mining approach is proposed in this paper. First, the customer profile and their transaction data at time period t are retrieved from databases. Second, a LabelSOM neural network groups customers into segments and identify critical characteristics for each group. Third, a fuzzy decision tree algorithm is used to construct usage behavior rules of interesting customer groups. Finally, these rules are used to analysis the behavior changes between time periods t and t+1. An implementation case using a practical credit card database provided by a commercial bank in Taiwan is illustrated to show the benefits of the proposed framework.

  10. Mining Web-based Educational Systems to Predict Student Learning Achievements

    Directory of Open Access Journals (Sweden)

    José del Campo-Ávila

    2015-03-01

    Full Text Available Educational Data Mining (EDM is getting great importance as a new interdisciplinary research field related to some other areas. It is directly connected with Web-based Educational Systems (WBES and Data Mining (DM, a fundamental part of Knowledge Discovery in Databases. The former defines the context: WBES store and manage huge amounts of data. Such data are increasingly growing and they contain hidden knowledge that could be very useful to the users (both teachers and students. It is desirable to identify such knowledge in the form of models, patterns or any other representation schema that allows a better exploitation of the system. The latter reveals itself as the tool to achieve such discovering. Data mining must afford very complex and different situations to reach quality solutions. Therefore, data mining is a research field where many advances are being done to accommodate and solve emerging problems. For this purpose, many techniques are usually considered. In this paper we study how data mining can be used to induce student models from the data acquired by a specific Web-based tool for adaptive testing, called SIETTE. Concretely we have used top down induction decision trees algorithms to extract the patterns because these models, decision trees, are easily understandable. In addition, the conducted validation processes have assured high quality models.

  11. A Watercolor NPR System with Web-Mining 3D Color Charts

    Science.gov (United States)

    Chen, Lieu-Hen; Ho, Yi-Hsin; Liu, Ting-Yu; Hsieh, Wen-Chieh

    In this paper, we propose a watercolor image synthesizing system which integrates the user-personalized color charts based on web-mining technologies with the 3D Watercolor NPR system. Through our system, users can personalize their own color palette by using keywords such as the name of the artist or by choosing color sets on an emotional map. The related images are searched from web by adopting web mining technology, and the appropriate colors are extracted to construct the color chart by analyzing these images. Then, the color chart is rendered in a 3D visualization system which allows users to view and manage the distribution of colors interactively. Then, users can use these colors on our watercolor NPR system with a sketch-based GUI which allows users to manipulate watercolor attributes of object intuitively and directly.

  12. A COMPARATIVE ANALYSIS OF WEB INFORMATION EXTRACTION TECHNIQUES DEEP LEARNING vs. NAÏVE BAYES vs. BACK PROPAGATION NEURAL NETWORKS IN WEB DOCUMENT EXTRACTION

    Directory of Open Access Journals (Sweden)

    J. Sharmila

    2016-01-01

    Full Text Available Web mining related exploration is getting the chance to be more essential these days in view of the reason that a lot of information is overseen through the web. Web utilization is expanding in an uncontrolled way. A particular framework is required for controlling such extensive measure of information in the web space. Web mining is ordered into three noteworthy divisions: Web content mining, web usage mining and web structure mining. Tak-Lam Wong has proposed a web content mining methodology in the exploration with the aid of Bayesian Networks (BN. In their methodology, they were learning on separating the web data and characteristic revelation in view of the Bayesian approach. Roused from their investigation, we mean to propose a web content mining methodology, in view of a Deep Learning Algorithm. The Deep Learning Algorithm gives the interest over BN on the basis that BN is not considered in any learning architecture planning like to propose system. The main objective of this investigation is web document extraction utilizing different grouping algorithm and investigation. This work extricates the data from the web URL. This work shows three classification algorithms, Deep Learning Algorithm, Bayesian Algorithm and BPNN Algorithm. Deep Learning is a capable arrangement of strategies for learning in neural system which is connected like computer vision, speech recognition, and natural language processing and biometrics framework. Deep Learning is one of the simple classification technique and which is utilized for subset of extensive field furthermore Deep Learning has less time for classification. Naive Bayes classifiers are a group of basic probabilistic classifiers in view of applying Bayes hypothesis with concrete independence assumptions between the features. At that point the BPNN algorithm is utilized for classification. Initially training and testing dataset contains more URL. We extract the content presently from the dataset. The

  13. Mining social media and web searches for disease detection.

    Science.gov (United States)

    Yang, Y Tony; Horneffer, Michael; DiLisio, Nicole

    2013-04-28

    Web-based social media is increasingly being used across different settings in the health care industry. The increased frequency in the use of the Internet via computer or mobile devices provides an opportunity for social media to be the medium through which people can be provided with valuable health information quickly and directly. While traditional methods of detection relied predominately on hierarchical or bureaucratic lines of communication, these often failed to yield timely and accurate epidemiological intelligence. New web-based platforms promise increased opportunities for a more timely and accurate spreading of information and analysis. This article aims to provide an overview and discussion of the availability of timely and accurate information. It is especially useful for the rapid identification of an outbreak of an infectious disease that is necessary to promptly and effectively develop public health responses. These web-based platforms include search queries, data mining of web and social media, process and analysis of blogs containing epidemic key words, text mining, and geographical information system data analyses. These new sources of analysis and information are intended to complement traditional sources of epidemic intelligence. Despite the attractiveness of these new approaches, further study is needed to determine the accuracy of blogger statements, as increases in public participation may not necessarily mean the information provided is more accurate.

  14. Mining social media and web searches for disease detection

    Directory of Open Access Journals (Sweden)

    Y. Tony Yang

    2013-05-01

    Full Text Available Web-based social media is increasingly being used across different settings in the health care industry. The increased frequency in the use of the Internet via computer or mobile devices provides an opportunity for social media to be the medium through which people can be provided with valuable health information quickly and directly. While traditional methods of detection relied predominately on hierarchical or bureaucratic lines of communication, these often failed to yield timely and accurate epidemiological intelligence. New web-based platforms promise increased opportunities for a more timely and accurate spreading of information and analysis. This article aims to provide an overview and discussion of the availability of timely and accurate information. It is especially useful for the rapid identification of an outbreak of an infectious disease that is necessary to promptly and effectively develop public health responses. These web-based platforms include search queries, data mining of web and social media, process and analysis of blogs containing epidemic key words, text mining, and geographical information system data analyses. These new sources of analysis and information are intended to complement traditional sources of epidemic intelligence. Despite the attractiveness of these new approaches, further study is needed to determine the accuracy of blogger statements, as increases in public participation may not necessarily mean the information provided is more accurate.

  15. Usage of a generic web-based self-management intervention for breast cancer survivors: substudy analysis of the BREATH trial.

    Science.gov (United States)

    van den Berg, Sanne W; Peters, Esmee J; Kraaijeveld, J Frank; Gielissen, Marieke F M; Prins, Judith B

    2013-08-19

    Generic fully automated Web-based self-management interventions are upcoming, for example, for the growing number of breast cancer survivors. It is hypothesized that the use of these interventions is more individualized and that users apply a large amount of self-tailoring. However, technical usage evaluations of these types of interventions are scarce and practical guidelines are lacking. To gain insight into meaningful usage parameters to evaluate the use of generic fully automated Web-based interventions by assessing how breast cancer survivors use a generic self-management website. Final aim is to propose practical recommendations for researchers and information and communication technology (ICT) professionals who aim to design and evaluate the use of similar Web-based interventions. The BREAst cancer ehealTH (BREATH) intervention is a generic unguided fully automated website with stepwise weekly access and a fixed 4-month structure containing 104 intervention ingredients (ie, texts, tasks, tests, videos). By monitoring https-server requests, technical usage statistics were recorded for the intervention group of the randomized controlled trial. Observed usage was analyzed by measures of frequency, duration, and activity. Intervention adherence was defined as continuous usage, or the proportion of participants who started using the intervention and continued to log in during all four phases. By comparing observed to minimal intended usage (frequency and activity), different user groups were defined. Usage statistics for 4 months were collected from 70 breast cancer survivors (mean age 50.9 years). Frequency of logins/person ranged from 0 to 45, total duration/person from 0 to 2324 minutes (38.7 hours), and activity from opening none to all intervention ingredients. 31 participants continued logging in to all four phases resulting in an intervention adherence rate of 44.3% (95% CI 33.2-55.9). Nine nonusers (13%), 30 low users (43%), and 31 high users (44%) were

  16. A WebGIS Decision Support System for Management of Abandoned Mines

    Directory of Open Access Journals (Sweden)

    Ranka Stanković

    2016-07-01

    Full Text Available This paper presents the development of a WebGIS application aimed at providing safe and reliable data needed for reclamation of abandoned mines in national parks and other protected areas in Vojvodina in compliance with existing legal regulations. The geodatabase model for this application has been developed using UML and the CASE tool Microsoft Visio featuring an interface with ArcGIS. The WebGIS application was developed using GeoServer, an open source tool in the Java programming language, with integrated PostgreSQL DB and the possibility of generating and publishing WMS, WFS and KML services. The WebGIS application is publicly available, based on an appropriate central database, which for the first time encompasses all available data on abandoned mines in Vojvodina, and as such may serve as a model for similar databases on the territory of the Republic of Serbia.

  17. A Generic Framework for Extraction of Knowledge from Social Web Sources (Social Networking Websites) for an Online Recommendation System

    Science.gov (United States)

    Sathick, Javubar; Venkat, Jaya

    2015-01-01

    Mining social web data is a challenging task and finding user interest for personalized and non-personalized recommendation systems is another important task. Knowledge sharing among web users has become crucial in determining usage of web data and personalizing content in various social websites as per the user's wish. This paper aims to design a…

  18. Web 2.0 usage among New Zealand learners: Findings on gender difference

    Directory of Open Access Journals (Sweden)

    Ning Wei

    Full Text Available In this paper, gender differences in Web 2.0 usage by postgraduate students in New Zealand are presented. 84 postgraduate students drawn from two different convenience samples were surveyed to discover the extent to which they used and were familiar with Web 2.0 applications. According to Cuadrado-García, Ruiz-Molina and Montoro-Pons (2010, p. 367, \\"men and women differ in their interaction with technology\\". In this study, gender differences in the use of different Web 2.0 applications and technologies have been considered. Whilst findings from this study are limited by the way in which the populations were sampled, the sample size and having a majority of international students with English as a second language, it is interesting to note that there were only minor differences between the ways in which male and female postgraduate students use Web 2.0 applications.

  19. New energy opinion leaders' lifestyles and media usage - applying data mining decision tree analysis for UNIDO - ICHET web site users

    International Nuclear Information System (INIS)

    Tsai, M.; Veziroglu, A.; Warren, S.; Que, Y.

    2007-01-01

    According to the innovation diffusion research, the innovators, opinion leaders, and diffusion agents play vital roles in promoting the acceptance of innovation. The innovators and opinion leaders must be able to cope with the high degree of uncertainty about an innovation and usually they have higher innovation-related media usage than the majority. Based on consumer behavior studies, lifestyle analysis could help researchers divide consumers into different lifestyle groups to understand and predict consumer behaviors. Lifestyle allows researchers to investigate consumers via their activities, interests and opinions instead of using demographic variables. The purpose of this research is to investigate how new energy innovators and opinion leaders' different lifestyles affect their new energy product adoption, and their media usage regarding new energy reports or promotion. In order to achieve the purposes listed above, the researchers need to locate and contact the potential innovators and opinion leaders in this field. Thus the researchers cooperate with UNIDO-ICHET to launch this survey. This cross-discipline online survey was formally launched from Aug 2005 to Oct 2006. The result of this survey successfully collected 2040 new energy innovators and opinion leaders' information. The researchers analyzed the data using SPSS statistics software and Data Mining decision tree analysis. Then the researchers divided new energy innovators into four groups: social-oriented, young modern, conservative, and show-off-oriented. They also analyzed which lifestyle groups are better targets for innovation agencies to launch innovation-related promotions or campaigns

  20. A fuzzy method for improving the functionality of search engines based on user's web interactions

    Directory of Open Access Journals (Sweden)

    Farzaneh Kabirbeyk

    2015-04-01

    Full Text Available Web mining has been widely used to discover knowledge from various sources in the web. One of the important tools in web mining is mining of web user’s behavior that is considered as a way to discover the potential knowledge of web user’s interaction. Nowadays, Website personalization is regarded as a popular phenomenon among web users and it plays an important role in facilitating user access and provides information of users’ requirements based on their own interests. Extracting important features about web user behavior plays a significant role in web usage mining. Such features are page visit frequency in each session, visit duration, and dates of visiting a certain pages. This paper presents a method to predict user’s interest and to propose a list of pages based on their interests by identifying user’s behavior based on fuzzy techniques called fuzzy clustering method. Due to the user’s different interests and use of one or more interest at a time, user’s interest may belong to several clusters and fuzzy clustering provide a possible overlap. Using the resulted cluster helps extract fuzzy rules. This helps detecting user’s movement pattern and using neural network a list of suggested pages to the users is provided.

  1. A Survey of Bioinformatics Database and Software Usage through Mining the Literature.

    Directory of Open Access Journals (Sweden)

    Geraint Duck

    Full Text Available Computer-based resources are central to much, if not most, biological and medical research. However, while there is an ever expanding choice of bioinformatics resources to use, described within the biomedical literature, little work to date has provided an evaluation of the full range of availability or levels of usage of database and software resources. Here we use text mining to process the PubMed Central full-text corpus, identifying mentions of databases or software within the scientific literature. We provide an audit of the resources contained within the biomedical literature, and a comparison of their relative usage, both over time and between the sub-disciplines of bioinformatics, biology and medicine. We find that trends in resource usage differs between these domains. The bioinformatics literature emphasises novel resource development, while database and software usage within biology and medicine is more stable and conservative. Many resources are only mentioned in the bioinformatics literature, with a relatively small number making it out into general biology, and fewer still into the medical literature. In addition, many resources are seeing a steady decline in their usage (e.g., BLAST, SWISS-PROT, though some are instead seeing rapid growth (e.g., the GO, R. We find a striking imbalance in resource usage with the top 5% of resource names (133 names accounting for 47% of total usage, and over 70% of resources extracted being only mentioned once each. While these results highlight the dynamic and creative nature of bioinformatics research they raise questions about software reuse, choice and the sharing of bioinformatics practice. Is it acceptable that so many resources are apparently never reused? Finally, our work is a step towards automated extraction of scientific method from text. We make the dataset generated by our study available under the CC0 license here: http://dx.doi.org/10.6084/m9.figshare.1281371.

  2. Engineers and the Web: An analysis of real life gaps in information usage

    NARCIS (Netherlands)

    Kraaijenbrink, Jeroen

    2007-01-01

    Engineers face a wide range of gaps when trying to identify, acquire, and utilize information from the Web. To be able to avoid creating such gaps, it is essential to understand them in detail. This paper reports the results of a study of the real life gaps in information usage processes of 17

  3. Usage of Web Service in Mobile Application for Parents and Students in Binus School Serpong

    Directory of Open Access Journals (Sweden)

    Karto Iskandar

    2016-09-01

    Full Text Available A web service is a service offered by a device electronically to communicate with other electronic device using the World wide web. Smartphone is an electronic device that almost everyone has, especially student and parent for getting information about the school. In BINUS School Serpong mobile application, web services used for getting data from web server like student and menu data. Problem faced by BINUS School Serpong today is the time-consuming application update when using the native application while the application updates are very frequent. To resolve this problem, BINUS School Serpong mobile application will use the web service. This article showed the usage of web services with XML for retrieving data of student. The result from this study is that by using web service, smartphone can retrieve data consistently between multiple platforms. 

  4. Data pre-processing for web log mining: Case study of commercial bank website usage analysis

    Directory of Open Access Journals (Sweden)

    Jozef Kapusta

    2013-01-01

    Full Text Available We use data cleaning, integration, reduction and data conversion methods in the pre-processing level of data analysis. Data processing techniques improve the overall quality of the patterns mined. The paper describes using of standard pre-processing methods for preparing data of the commercial bank website in the form of the log file obtained from the web server. Data cleaning, as the simplest step of data pre-processing, is non–trivial as the analysed content is highly specific. We had to deal with the problem of frequent changes of the content and even frequent changes of the structure. Regular changes in the structure make use of the sitemap impossible. We presented approaches how to deal with this problem. We were able to create the sitemap dynamically just based on the content of the log file. In this case study, we also examined just the one part of the website over the standard analysis of an entire website, as we did not have access to all log files for the security reason. As the result, the traditional practices had to be adapted for this special case. Analysing just the small fraction of the website resulted in the short session time of regular visitors. We were not able to use recommended methods to determine the optimal value of session time. Therefore, we proposed new methods based on outliers identification for raising the accuracy of the session length in this paper.

  5. Social big data mining

    CERN Document Server

    Ishikawa, Hiroshi

    2015-01-01

    Social Media. Big Data and Social Data. Hypotheses in the Era of Big Data. Social Big Data Applications. Basic Concepts in Data Mining. Association Rule Mining. Clustering. Classification. Prediction. Web Structure Mining. Web Content Mining. Web Access Log Mining, Information Extraction and Deep Web Mining. Media Mining. Scalability and Outlier Detection.

  6. Using an improved association rules mining optimization algorithm in web-based mobile-learning system

    Science.gov (United States)

    Huang, Yin; Chen, Jianhua; Xiong, Shaojun

    2009-07-01

    Mobile-Learning (M-learning) makes many learners get the advantages of both traditional learning and E-learning. Currently, Web-based Mobile-Learning Systems have created many new ways and defined new relationships between educators and learners. Association rule mining is one of the most important fields in data mining and knowledge discovery in databases. Rules explosion is a serious problem which causes great concerns, as conventional mining algorithms often produce too many rules for decision makers to digest. Since Web-based Mobile-Learning System collects vast amounts of student profile data, data mining and knowledge discovery techniques can be applied to find interesting relationships between attributes of learners, assessments, the solution strategies adopted by learners and so on. Therefore ,this paper focus on a new data-mining algorithm, combined with the advantages of genetic algorithm and simulated annealing algorithm , called ARGSA(Association rules based on an improved Genetic Simulated Annealing Algorithm), to mine the association rules. This paper first takes advantage of the Parallel Genetic Algorithm and Simulated Algorithm designed specifically for discovering association rules. Moreover, the analysis and experiment are also made to show the proposed method is superior to the Apriori algorithm in this Mobile-Learning system.

  7. Surfing for thinness: a pilot study of pro-eating disorder Web site usage in adolescents with eating disorders.

    Science.gov (United States)

    Wilson, Jenny L; Peebles, Rebecka; Hardy, Kristina K; Litt, Iris F

    2006-12-01

    Pro-eating disorder Web sites are communities of individuals who engage in disordered eating and use the Internet to discuss their activities. Pro-recovery sites, which are less numerous, express a recovery-oriented perspective. This pilot study investigated the awareness and usage of pro-eating disorder Web sites among adolescents with eating disorders and their parents and explored associations with health and quality of life. This was a cross-sectional study of 698 families of patients (aged 10-22 years) diagnosed with an eating disorder at Stanford between 1997 and 2004. Anonymous surveys were mailed and offered in clinic. Survey content included questions about disease severity, health outcomes, Web site usage, and parental knowledge of eating disorder Web site usage. Surveys were returned by 182 individuals: 76 patients and 106 parents. Parents frequently (52.8%) were aware of pro-eating disorder sites, but an equal number did not know whether their child visited these sites, and only 27.6% had discussed them with their child. Most (62.5%) parents, however, did not know about pro-recovery sites. Forty-one percent of patients visited pro-recovery sites, 35.5% visited pro-eating disorder sites, 25.0% visited both, and 48.7% visited neither. While visiting pro-eating disorder sites, 96.0% reported learning new weight loss or purging techniques. However, 46.4% of pro-recovery site visitors also learned new techniques. Pro-eating disorder site users did not differ from nonusers in health outcomes but reported spending less time on school or schoolwork and had a longer duration of illness. Users of both pro-eating disorder and pro-recovery sites were hospitalized more than users of neither site. Pro-eating disorder site usage was prevalent among adolescents with eating disorders, yet parents had little knowledge of this. Although use of these sites was not associated with other health outcomes, usage may have a negative impact on quality of life and result in

  8. AN EFFECTIVE RECOMMENDATIONS BY DIFFUSION ALGORITHM FOR WEB GRAPH MINING

    Directory of Open Access Journals (Sweden)

    S. Vasukipriya

    2013-04-01

    Full Text Available The information on the World Wide Web grows in an explosive rate. Societies are relying more on the Web for their miscellaneous needs of information. Recommendation systems are active information filtering systems that attempt to present the information items like movies, music, images, books recommendations, tags recommendations, query suggestions, etc., to the users. Various kinds of data bases are used for the recommendations; fundamentally these data bases can be molded in the form of many types of graphs. Aiming at provided that a general framework on effective DR (Recommendations by Diffusion algorithm for web graphs mining. First introduce a novel graph diffusion model based on heat diffusion. This method can be applied to both undirected graphs and directed graphs. Then it shows how to convert different Web data sources into correct graphs in our models.

  9. Combining Data Warehouse and Data Mining Techniques for Web Log Analysis

    DEFF Research Database (Denmark)

    Pedersen, Torben Bach; Jespersen, Søren; Thorhauge, Jesper

    2008-01-01

    a number of approaches thatcombine data warehousing and data mining techniques in order to analyze Web logs.After introducing the well-known click and session data warehouse (DW) schemas,the chapter presents the subsession schema, which allows fast queries on sequences...

  10. Effective Filtering of Query Results on Updated User Behavioral Profiles in Web Mining

    Directory of Open Access Journals (Sweden)

    S. Sadesh

    2015-01-01

    Full Text Available Web with tremendous volume of information retrieves result for user related queries. With the rapid growth of web page recommendation, results retrieved based on data mining techniques did not offer higher performance filtering rate because relationships between user profile and queries were not analyzed in an extensive manner. At the same time, existing user profile based prediction in web data mining is not exhaustive in producing personalized result rate. To improve the query result rate on dynamics of user behavior over time, Hamilton Filtered Regime Switching User Query Probability (HFRS-UQP framework is proposed. HFRS-UQP framework is split into two processes, where filtering and switching are carried out. The data mining based filtering in our research work uses the Hamilton Filtering framework to filter user result based on personalized information on automatic updated profiles through search engine. Maximized result is fetched, that is, filtered out with respect to user behavior profiles. The switching performs accurate filtering updated profiles using regime switching. The updating in profile change (i.e., switches regime in HFRS-UQP framework identifies the second- and higher-order association of query result on the updated profiles. Experiment is conducted on factors such as personalized information search retrieval rate, filtering efficiency, and precision ratio.

  11. Archival classification: new usage scenarios among semantic web and traditio of digital samples

    Directory of Open Access Journals (Sweden)

    Alessandro Alfier

    2017-05-01

    Full Text Available Starting from the acknowledgement of the basic purpose assigned by tradition to classification within documents management, the article faces the issues related to new needs and usage, related to the digital scenarios, that would allow classification to consolidate its tradition of effectiveness in a new digital environment. The key point of the article is represented by the in-depth analysis of the possible synergies between classification-related activities and the International Standard for Describing Functions (ISDF, developed by ICA in 2007. The article highlights how an approach to classification elaborated from the ISDF perspective allows classification itself to enrich from purposes and semantic web related usage, and with the traditio of digital documents.

  12. Wireless sensing of gas in mining with web service in real time

    Directory of Open Access Journals (Sweden)

    Juan Mauricio Salamanca

    2014-12-01

    hierarchically in order to transmit the data to the entrance of the mine. Finally, the network configuration is done until the system enters in mode sleep (idle when it is not receiving information, in this way the consuming power decreased, increasing the autonomy of the batteries. This paper describes the design, implementation and operation of a gas monitoring system in mining with web service inreal-time based on a network of Zigbee sensors.

  13. Mining and Utilizing Dataset Relevancy from Oceanographic Dataset Metadata, Usage Metrics, and User Feedback to Improve Data Discovery and Access

    Data.gov (United States)

    National Aeronautics and Space Administration — We propose to mine and utilize the combination of Earth Science dataset, metadata with usage metrics and user feedback to objectively extract relevance for improved...

  14. Lecture Attendance and Web Based Lecture Technologies: A Comparison of Student Perceptions and Usage Patterns

    Science.gov (United States)

    von Konsky, Brian R.; Ivins, Jim; Gribble, Susan J.

    2009-01-01

    This paper investigates the impact of web based lecture recordings on learning and attendance at lectures. Student opinions regarding the perceived value of the recordings were evaluated in the context of usage patterns and final marks, and compared with attendance data and student perceptions regarding the usefulness of lectures. The availability…

  15. Optimizing the Information Presentation on Mining Potential by using Web Services Technology with Restful Protocol

    Science.gov (United States)

    Abdillah, T.; Dai, R.; Setiawan, E.

    2018-02-01

    This study aims to develop the application of Web Services technology with RestFul Protocol to optimize the information presentation on mining potential. This study used User Interface Design approach for the information accuracy and relevance as well as the Web Service for the reliability in presenting the information. The results show that: the information accuracy and relevance regarding mining potential can be seen from the achievement of User Interface implementation in the application that is based on the following rules: The consideration of the appropriate colours and objects, the easiness of using the navigation, and users’ interaction with the applications that employs symbols and languages understood by the users; the information accuracy and relevance related to mining potential can be observed by the information presented by using charts and Tool Tip Text to help the users understand the provided chart/figure; the reliability of the information presentation is evident by the results of Web Services testing in Figure 4.5.6. This study finds out that User Interface Design and Web Services approaches (for the access of different Platform apps) are able to optimize the presentation. The results of this study can be used as a reference for software developers and Provincial Government of Gorontalo.

  16. The spread of scientific information: insights from the web usage statistics in PLoS article-level metrics.

    Directory of Open Access Journals (Sweden)

    Koon-Kiu Yan

    Full Text Available The presence of web-based communities is a distinctive signature of Web 2.0. The web-based feature means that information propagation within each community is highly facilitated, promoting complex collective dynamics in view of information exchange. In this work, we focus on a community of scientists and study, in particular, how the awareness of a scientific paper is spread. Our work is based on the web usage statistics obtained from the PLoS Article Level Metrics dataset compiled by PLoS. The cumulative number of HTML views was found to follow a long tail distribution which is reasonably well-fitted by a lognormal one. We modeled the diffusion of information by a random multiplicative process, and thus extracted the rates of information spread at different stages after the publication of a paper. We found that the spread of information displays two distinct decay regimes: a rapid downfall in the first month after publication, and a gradual power law decay afterwards. We identified these two regimes with two distinct driving processes: a short-term behavior driven by the fame of a paper, and a long-term behavior consistent with citation statistics. The patterns of information spread were found to be remarkably similar in data from different journals, but there are intrinsic differences for different types of web usage (HTML views and PDF downloads versus XML. These similarities and differences shed light on the theoretical understanding of different complex systems, as well as a better design of the corresponding web applications that is of high potential marketing impact.

  17. The spread of scientific information: insights from the web usage statistics in PLoS article-level metrics.

    Science.gov (United States)

    Yan, Koon-Kiu; Gerstein, Mark

    2011-01-01

    The presence of web-based communities is a distinctive signature of Web 2.0. The web-based feature means that information propagation within each community is highly facilitated, promoting complex collective dynamics in view of information exchange. In this work, we focus on a community of scientists and study, in particular, how the awareness of a scientific paper is spread. Our work is based on the web usage statistics obtained from the PLoS Article Level Metrics dataset compiled by PLoS. The cumulative number of HTML views was found to follow a long tail distribution which is reasonably well-fitted by a lognormal one. We modeled the diffusion of information by a random multiplicative process, and thus extracted the rates of information spread at different stages after the publication of a paper. We found that the spread of information displays two distinct decay regimes: a rapid downfall in the first month after publication, and a gradual power law decay afterwards. We identified these two regimes with two distinct driving processes: a short-term behavior driven by the fame of a paper, and a long-term behavior consistent with citation statistics. The patterns of information spread were found to be remarkably similar in data from different journals, but there are intrinsic differences for different types of web usage (HTML views and PDF downloads versus XML). These similarities and differences shed light on the theoretical understanding of different complex systems, as well as a better design of the corresponding web applications that is of high potential marketing impact.

  18. Geovisualization of Local and Regional Migration Using Web-mined Demographics

    Science.gov (United States)

    Schuermann, R. T.; Chow, T. E.

    2014-11-01

    The intent of this research was to augment and facilitate analyses, which gauges the feasibility of web-mined demographics to study spatio-temporal dynamics of migration. As a case study, we explored the spatio-temporal dynamics of Vietnamese Americans (VA) in Texas through geovisualization of mined demographic microdata from the World Wide Web. Based on string matching across all demographic attributes, including full name, address, date of birth, age and phone number, multiple records of the same entity (i.e. person) over time were resolved and reconciled into a database. Migration trajectories were geovisualized through animated sprites by connecting the different addresses associated with the same person and segmenting the trajectory into small fragments. Intra-metropolitan migration patterns appeared at the local scale within many metropolitan areas. At the scale of metropolitan area, varying degrees of immigration and emigration manifest different types of migration clusters. This paper presents a methodology incorporating GIS methods and cartographic design to produce geovisualization animation, enabling the cognitive identification of migration patterns at multiple scales. Identification of spatio-temporal patterns often stimulates further research to better understand the phenomenon and enhance subsequent modeling.

  19. Research on the optimization strategy of web search engine based on data mining

    Science.gov (United States)

    Chen, Ronghua

    2018-04-01

    With the wide application of search engines, web site information has become an important way for people to obtain information. People have found that they are growing in an increasingly explosive manner. Web site information is verydifficult to find the information they need, and now the search engine can not meet the need, so there is an urgent need for the network to provide website personalized information service, data mining technology for this new challenge is to find a breakthrough. In order to improve people's accuracy of finding information from websites, a website search engine optimization strategy based on data mining is proposed, and verified by website search engine optimization experiment. The results show that the proposed strategy improves the accuracy of the people to find information, and reduces the time for people to find information. It has an important practical value.

  20. A construction scheme of web page comment information extraction system based on frequent subtree mining

    Science.gov (United States)

    Zhang, Xiaowen; Chen, Bingfeng

    2017-08-01

    Based on the frequent sub-tree mining algorithm, this paper proposes a construction scheme of web page comment information extraction system based on frequent subtree mining, referred to as FSM system. The entire system architecture and the various modules to do a brief introduction, and then the core of the system to do a detailed description, and finally give the system prototype.

  1. Web services-based text-mining demonstrates broad impacts for interoperability and process simplification.

    Science.gov (United States)

    Wiegers, Thomas C; Davis, Allan Peter; Mattingly, Carolyn J

    2014-01-01

    The Critical Assessment of Information Extraction systems in Biology (BioCreAtIvE) challenge evaluation tasks collectively represent a community-wide effort to evaluate a variety of text-mining and information extraction systems applied to the biological domain. The BioCreative IV Workshop included five independent subject areas, including Track 3, which focused on named-entity recognition (NER) for the Comparative Toxicogenomics Database (CTD; http://ctdbase.org). Previously, CTD had organized document ranking and NER-related tasks for the BioCreative Workshop 2012; a key finding of that effort was that interoperability and integration complexity were major impediments to the direct application of the systems to CTD's text-mining pipeline. This underscored a prevailing problem with software integration efforts. Major interoperability-related issues included lack of process modularity, operating system incompatibility, tool configuration complexity and lack of standardization of high-level inter-process communications. One approach to potentially mitigate interoperability and general integration issues is the use of Web services to abstract implementation details; rather than integrating NER tools directly, HTTP-based calls from CTD's asynchronous, batch-oriented text-mining pipeline could be made to remote NER Web services for recognition of specific biological terms using BioC (an emerging family of XML formats) for inter-process communications. To test this concept, participating groups developed Representational State Transfer /BioC-compliant Web services tailored to CTD's NER requirements. Participants were provided with a comprehensive set of training materials. CTD evaluated results obtained from the remote Web service-based URLs against a test data set of 510 manually curated scientific articles. Twelve groups participated in the challenge. Recall, precision, balanced F-scores and response times were calculated. Top balanced F-scores for gene, chemical and

  2. The Influence of Perceived Organizational Injustice towards Workplace Personal Web Usage and Work Productivity in Indonesia

    Directory of Open Access Journals (Sweden)

    Nur Fathonah

    2014-11-01

    Full Text Available Workplace personal web usage (WPWU is an employee’s activity in using internet for non-related task during working hours. It is considered a counterproductive behavior when done excessively because it can interrupt employee’s productivity, but it can increase creativity and eliminate boredom when used in a rational amount. The objective of this study was to prove whether perceived organizational injustice had influence on WPWU which affected work productivity. A total of 222 respondents working in various industries were gathered through web-survey. By using multinomial logistic regression analysis, this study found that high level use of internet for unrelated jobs between 2 to 4 hours a day was influenced by respondents’ perception of not getting fair treatment and incentive for being good performer, which then caused them to perform very low completion of tasks. There were two contrasting views regarding this result; organizations considered it as deviant behavior because it reduced employees’ performance whereas employees regarded it as just short breaks to get rid of stress. Hence, this finding suggested that companies should redesign its internet policies to accommodate “Work-Life Blend”; blending work and personal lives, as a consequence of cultural shift in the era of globalization and new technologies. Keywords: Organizational Justice, Workplace Personal Web Usage, Work Productivity, Work-Life Blend, Indonesia.

  3. adaptation et selection de varietes performantes de riz pluvi

    African Journals Online (AJOL)

    PROJET SOJA

    Mots-clés : « Web usage mining », extraction de connaissances, fichier Log, méthode. « A Priori », algorithme, théorie de décision. Abstract. Log file treatment and exploration of the web site, for extracting and mining the knowledges : Web usage mining. The aim of this work is to design and produce one tool and implement, ...

  4. Mining

    Directory of Open Access Journals (Sweden)

    Khairullah Khan

    2014-09-01

    Full Text Available Opinion mining is an interesting area of research because of its applications in various fields. Collecting opinions of people about products and about social and political events and problems through the Web is becoming increasingly popular every day. The opinions of users are helpful for the public and for stakeholders when making certain decisions. Opinion mining is a way to retrieve information through search engines, Web blogs and social networks. Because of the huge number of reviews in the form of unstructured text, it is impossible to summarize the information manually. Accordingly, efficient computational methods are needed for mining and summarizing the reviews from corpuses and Web documents. This study presents a systematic literature survey regarding the computational techniques, models and algorithms for mining opinion components from unstructured reviews.

  5. QuadBase2: web server for multiplexed guanine quadruplex mining and visualization

    Science.gov (United States)

    Dhapola, Parashar; Chowdhury, Shantanu

    2016-01-01

    DNA guanine quadruplexes or G4s are non-canonical DNA secondary structures which affect genomic processes like replication, transcription and recombination. G4s are computationally identified by specific nucleotide motifs which are also called putative G4 (PG4) motifs. Despite the general relevance of these structures, there is currently no tool available that can allow batch queries and genome-wide analysis of these motifs in a user-friendly interface. QuadBase2 (quadbase.igib.res.in) presents a completely reinvented web server version of previously published QuadBase database. QuadBase2 enables users to mine PG4 motifs in up to 178 eukaryotes through the EuQuad module. This module interfaces with Ensembl Compara database, to allow users mine PG4 motifs in the orthologues of genes of interest across eukaryotes. PG4 motifs can be mined across genes and their promoter sequences in 1719 prokaryotes through ProQuad module. This module includes a feature that allows genome-wide mining of PG4 motifs and their visualization as circular histograms. TetraplexFinder, the module for mining PG4 motifs in user-provided sequences is now capable of handling up to 20 MB of data. QuadBase2 is a comprehensive PG4 motif mining tool that further expands the configurations and algorithms for mining PG4 motifs in a user-friendly way. PMID:27185890

  6. Concept and Establishment of the Mine Information System within the CROMAC GIP Project

    Directory of Open Access Journals (Sweden)

    Zvonko Biljecki

    2006-12-01

    Full Text Available In order to solve mine problems in the Republic of Croatia, a unique project CROMAC GIP (Croatian Mine Action Centre Geoinformation Project has been initiated significantly increasing the functional quality of the existing Mine Information System (MIS. Since mine problems are closely related to space, geodata are a crucial part of MIS intended for monitoring and planning of demining. Since the moment the Croatian Mine Action Centre was funded till today, the process of demining has progressed. The implementation of a topographic database in accordance with the CROTIS data model and the usage of orthophoto data produced according to the official product specifications can be pointed out in that progress. Usage of such geodata requires a sophisticated information system that enables a simultaneous usage of geodata and other data connected with solving mine problems. In order to reach all goals in demining and to use all advantages of geodata, it was indispensable to upgrade the existing Mine Information System by merging geodata and HCR data and to collect new data according to the standardized procedures, but controlling at the same time the quality and automated procedures of uploading into the system. Apart from being constructed in accordance with the Standard Operative Procedures (SOP, the modernised MIS is also based on generally accepted standards in the field of geoinformation and it is implemented on advanced technology. The core of the system is the Oracle database, and GeoMedia is a WebMap Professional tool on the basis of which the distribution and the work with spatial data is possible on intranet/Internet. In order to achieve full efficiency of the system, it is necessary to provide high quality and updated geodata. In this respect, photogrammetric data are the most efficient solution.

  7. Web of Things-Based Remote Monitoring System for Coal Mine Safety Using Wireless Sensor Network

    OpenAIRE

    Bo, Cheng; Xin, Cheng; Zhongyi, Zhai; Chengwen, Zhang; Junliang, Chen

    2014-01-01

    Frequent accidents have occurred in coal mine enterprises; therefore, raising the technological level of coal mine safety monitoring systems is an urgent problem. Wireless sensor networks (WSN), as a new field of research, have broad application prospects. This paper proposes a Web of Things- (WoT-) based remote monitoring system that takes full advantage of wireless sensor networks in combination with the CAN bus communication technique that abstracts the underground sensor data and capabili...

  8. BAGEL4: a user-friendly web server to thoroughly mine RiPPs and bacteriocins.

    Science.gov (United States)

    van Heel, Auke J; de Jong, Anne; Song, Chunxu; Viel, Jakob H; Kok, Jan; Kuipers, Oscar P

    2018-05-21

    Interest in secondary metabolites such as RiPPs (ribosomally synthesized and posttranslationally modified peptides) is increasing worldwide. To facilitate the research in this field we have updated our mining web server. BAGEL4 is faster than its predecessor and is now fully independent from ORF-calling. Gene clusters of interest are discovered using the core-peptide database and/or through HMM motifs that are present in associated context genes. The databases used for mining have been updated and extended with literature references and links to UniProt and NCBI. Additionally, we have included automated promoter and terminator prediction and the option to upload RNA expression data, which can be displayed along with the identified clusters. Further improvements include the annotation of the context genes, which is now based on a fast blast against the prokaryote part of the UniRef90 database, and the improved web-BLAST feature that dynamically loads structural data such as internal cross-linking from UniProt. Overall BAGEL4 provides the user with more information through a user-friendly web-interface which simplifies data evaluation. BAGEL4 is freely accessible at http://bagel4.molgenrug.nl.

  9. Placing Music Artists and Songs in Time Using Editorial Metadata and Web Mining Techniques

    NARCIS (Netherlands)

    Bountouridis, D.; Veltkamp, R.C.; Balen, J.M.H. van

    2013-01-01

    This paper investigates the novel task of situating music artists and songs in time, thereby adding contextual information that typically correlates with an artist’s similarities, collaborations and influences. The proposed method makes use of editorial metadata in conjunction with web mining

  10. Working with Data: Discovering Knowledge through Mining and Analysis; Systematic Knowledge Management and Knowledge Discovery; Text Mining; Methodological Approach in Discovering User Search Patterns through Web Log Analysis; Knowledge Discovery in Databases Using Formal Concept Analysis; Knowledge Discovery with a Little Perspective.

    Science.gov (United States)

    Qin, Jian; Jurisica, Igor; Liddy, Elizabeth D.; Jansen, Bernard J; Spink, Amanda; Priss, Uta; Norton, Melanie J.

    2000-01-01

    These six articles discuss knowledge discovery in databases (KDD). Topics include data mining; knowledge management systems; applications of knowledge discovery; text and Web mining; text mining and information retrieval; user search patterns through Web log analysis; concept analysis; data collection; and data structure inconsistency. (LRW)

  11. UKRVO Astronomical WEB Services

    Directory of Open Access Journals (Sweden)

    Mazhaev, O.E.

    2017-01-01

    Full Text Available Ukraine Virtual Observatory (UkrVO has been a member of the International Virtual Observatory Alliance (IVOA since 2011. The virtual observatory (VO is not a magic solution to all problems of data storing and processing, but it provides certain standards for building infrastructure of astronomical data center. The astronomical databases help data mining and offer to users an easy access to observation metadata, images within celestial sphere and results of image processing. The astronomical web services (AWS of UkrVO give to users handy tools for data selection from large astronomical catalogues for a relatively small region of interest in the sky. Examples of the AWS usage are showed.

  12. A node linkage approach for sequential pattern mining.

    Directory of Open Access Journals (Sweden)

    Osvaldo Navarro

    Full Text Available Sequential Pattern Mining is a widely addressed problem in data mining, with applications such as analyzing Web usage, examining purchase behavior, and text mining, among others. Nevertheless, with the dramatic increase in data volume, the current approaches prove inefficient when dealing with large input datasets, a large number of different symbols and low minimum supports. In this paper, we propose a new sequential pattern mining algorithm, which follows a pattern-growth scheme to discover sequential patterns. Unlike most pattern growth algorithms, our approach does not build a data structure to represent the input dataset, but instead accesses the required sequences through pseudo-projection databases, achieving better runtime and reducing memory requirements. Our algorithm traverses the search space in a depth-first fashion and only preserves in memory a pattern node linkage and the pseudo-projections required for the branch being explored at the time. Experimental results show that our new approach, the Node Linkage Depth-First Traversal algorithm (NLDFT, has better performance and scalability in comparison with state of the art algorithms.

  13. The Impact of Media Richness on the Usage of Web 2.0 Services for Knowledge Transfer

    DEFF Research Database (Denmark)

    Gyamfi, Albert

    2016-01-01

    The study investigates the impact of the use of web 2.0 applications on knowledge transfer in the Cocoa Sector in Ghana. Transferring knowledge via social media websites has received widespread attention by organizations. However, in most developing countries like Ghana, knowledge transfer still...... proposed that the usage of web 2.0 applications for the different modes of knowledge transfer can be affected by their media richness. And the use of web 2.0 applications for the knowledge transfer modes can influence knowledge transfer success. The study was conducted using a mixed method approach...... remains a major challenge, especially in the Cocoa Sector. The selection of media for a given task depends on the richness of the media and the characteristics of the task. The four modes of knowledge transfer theorized by Nonaka, require the use of media with varying degrees of richness. The study...

  14. Technologies for Decreasing Mining Losses

    Science.gov (United States)

    Valgma, Ingo; Väizene, Vivika; Kolats, Margit; Saarnak, Martin

    2013-12-01

    In case of stratified deposits like oil shale deposit in Estonia, mining losses depend on mining technologies. Current research focuses on extraction and separation possibilities of mineral resources. Selective mining, selective crushing and separation tests have been performed, showing possibilities of decreasing mining losses. Rock crushing and screening process simulations were used for optimizing rock fractions. In addition mine backfilling, fine separation, and optimized drilling and blasting have been analyzed. All tested methods show potential and depend on mineral usage. Usage in addition depends on the utilization technology. The questions like stability of the material flow and influences of the quality fluctuations to the final yield are raised.

  15. Usage Analysis of Web 2.0 and Library 2.0 Tools by Librarians in Kwara State Academic Libraries

    Science.gov (United States)

    Tella, Adeyinka; Soluoku, Taofeeqat

    2016-01-01

    This study analysed the usage of Web 2.0 and Library 2.0 tools by librarians in Kwara State academic libraries. A sample of 40 librarians was surveyed through total enumeration sampling technique from four different tertiary education institutions libraries in Kwara State, Nigeria. Questionnaire was used for the collection of data. The collected…

  16. An Educational Data Mining Approach to Concept Map Construction for Web based Learning

    Directory of Open Access Journals (Sweden)

    Anal ACHARYA

    2017-01-01

    Full Text Available This aim of this article is to study the use of Educational Data Mining (EDM techniques in constructing concept maps for organizing knowledge in web based learning systems whereby studying their synergistic effects in enhancing learning. This article first provides a tutorial based introduction to EDM. The applicability of web based learning systems in enhancing the efficiency of EDM techniques in real time environment is investigated. Web based learning systems often use a tool for organizing knowledge. This article explores the use of one such tool called concept map for this purpose. The pioneering works by various researchers who proposed web based learning systems in personalized and collaborative environment in this arena are next presented. A set of parameters are proposed based on which personalized and collaborative learning applications may be generalized and their performances compared. It is found that personalized learning environment uses EDM techniques more exhaustively compared to collaborative learning for concept map construction in web based environment. This article can be used as a starting point for freshers who would like to use EDM techniques for concept map construction for web based learning purposes.

  17. Energy efficient technologies for the mining industry

    Energy Technology Data Exchange (ETDEWEB)

    Klein, B.; Bamber, A.; Weatherwax, T.; Dozdiak, J.; Nadolski, S.; Roufail, R.; Parry, J.; Roufail, R.; Tong, L.; Hall, R. [British Columbia Univ., Vancouver, BC (Canada). Centre for Environmental Research in Minerals, Metals and Materials, Norman B. Keevil Inst. of Mining Engineering

    2010-07-01

    Mining in British Columbia is the second largest industrial electricity consumer. This presentation highlighted methods to help the mining industry reduce their energy requirements by limiting waste and improving efficiency. The measures are aimed at optimizing energy-use and efficiency in mining and processing and identifying opportunities and methods of improving this efficiency. Energy conservation in comminution and beneficiation is a primary focus of research activities at the University of British Columbia (UBC). The objective is to reduce energy usage in metal mines by 20 per cent overall. Open pit copper, gold and molybdenum mines are being targeted. Projects underway at UBC were outlined, with particular reference to energy usage, recovery and alternative energy sources; preconcentration; reducing energy usage from comminution in sorting, high pressure grinding rolls and high speed stirred mills; Hydromet; other energy efficient technologies such as control and flotation; and carbon dioxide sequestration. Studies were conducted at various mining facilities, including mines in Sudbury, Ontario. tabs., figs.

  18. Rare disease diagnosis: A review of web search, social media and large-scale data-mining approaches.

    Science.gov (United States)

    Svenstrup, Dan; Jørgensen, Henrik L; Winther, Ole

    2015-01-01

    Physicians and the general public are increasingly using web-based tools to find answers to medical questions. The field of rare diseases is especially challenging and important as shown by the long delay and many mistakes associated with diagnoses. In this paper we review recent initiatives on the use of web search, social media and data mining in data repositories for medical diagnosis. We compare the retrieval accuracy on 56 rare disease cases with known diagnosis for the web search tools google.com, pubmed.gov, omim.org and our own search tool findzebra.com. We give a detailed description of IBM's Watson system and make a rough comparison between findzebra.com and Watson on subsets of the Doctor's dilemma dataset. The recall@10 and recall@20 (fraction of cases where the correct result appears in top 10 and top 20) for the 56 cases are found to be be 29%, 16%, 27% and 59% and 32%, 18%, 34% and 64%, respectively. Thus, FindZebra has a significantly (p mining tools and social media are some of the areas that hold promise.

  19. Mining web-based data to assess public response to environmental events

    International Nuclear Information System (INIS)

    Cha, YoonKyung; Stow, Craig A.

    2015-01-01

    We explore how the analysis of web-based data, such as Twitter and Google Trends, can be used to assess the social relevance of an environmental accident. The concept and methods are applied in the shutdown of drinking water supply at the city of Toledo, Ohio, USA. Toledo's notice, which persisted from August 1 to 4, 2014, is a high-profile event that directly influenced approximately half a million people and received wide recognition. The notice was given when excessive levels of microcystin, a byproduct of cyanobacteria blooms, were discovered at the drinking water treatment plant on Lake Erie. Twitter mining results illustrated an instant response to the Toledo incident, the associated collective knowledge, and public perception. The results from Google Trends, on the other hand, revealed how the Toledo event raised public attention on the associated environmental issue, harmful algal blooms, in a long-term context. Thus, when jointly applied, Twitter and Google Trend analysis results offer complementary perspectives. Web content aggregated through mining approaches provides a social standpoint, such as public perception and interest, and offers context for establishing and evaluating environmental management policies. - The joint application of Twitter and Google Trend analysis to an environmental event offered both short and long-term patterns of public perception and interest on the event

  20. Cluo: Web-Scale Text Mining System For Open Source Intelligence Purposes

    Directory of Open Access Journals (Sweden)

    Przemyslaw Maciolek

    2013-01-01

    Full Text Available The amount of textual information published on the Internet is considered tobe in billions of web pages, blog posts, comments, social media updates andothers. Analyzing such quantities of data requires high level of distribution –both data and computing. This is especially true in case of complex algorithms,often used in text mining tasks.The paper presents a prototype implementation of CLUO – an Open SourceIntelligence (OSINT system, which extracts and analyzes significant quantitiesof openly available information.

  1. ArrayMining: a modular web-application for microarray analysis combining ensemble and consensus methods with cross-study normalization

    Directory of Open Access Journals (Sweden)

    Krasnogor Natalio

    2009-10-01

    Full Text Available Abstract Background Statistical analysis of DNA microarray data provides a valuable diagnostic tool for the investigation of genetic components of diseases. To take advantage of the multitude of available data sets and analysis methods, it is desirable to combine both different algorithms and data from different studies. Applying ensemble learning, consensus clustering and cross-study normalization methods for this purpose in an almost fully automated process and linking different analysis modules together under a single interface would simplify many microarray analysis tasks. Results We present ArrayMining.net, a web-application for microarray analysis that provides easy access to a wide choice of feature selection, clustering, prediction, gene set analysis and cross-study normalization methods. In contrast to other microarray-related web-tools, multiple algorithms and data sets for an analysis task can be combined using ensemble feature selection, ensemble prediction, consensus clustering and cross-platform data integration. By interlinking different analysis tools in a modular fashion, new exploratory routes become available, e.g. ensemble sample classification using features obtained from a gene set analysis and data from multiple studies. The analysis is further simplified by automatic parameter selection mechanisms and linkage to web tools and databases for functional annotation and literature mining. Conclusion ArrayMining.net is a free web-application for microarray analysis combining a broad choice of algorithms based on ensemble and consensus methods, using automatic parameter selection and integration with annotation databases.

  2. A citation analysis of the research reports of the Central Mining Institute. Mining and Environment using the Web of Science, Scopus, BazTech, and Google Scholar: A case study

    OpenAIRE

    Magdalena Bemke-Switilnik; Aneta Drabek

    2015-01-01

    This paper presents the analysis of a Polish mining sciences journal (Prace Naukowe GIG. Górnictwo i Środowisko; title in English: Research Reports of the Central Mining Institute. Mining and Environment; acronym in English [RRCMIME]). The analysis is based on data from the following sources: the Web of Science (WoS), Scopus, BazTech (a bibliographic database containing citations from Polish Technical Journals), and Google Scholar (GS). The data from the WoS and Scopus were collected manually...

  3. Chemotext: A Publicly Available Web Server for Mining Drug-Target-Disease Relationships in PubMed.

    Science.gov (United States)

    Capuzzi, Stephen J; Thornton, Thomas E; Liu, Kammy; Baker, Nancy; Lam, Wai In; O'Banion, Colin P; Muratov, Eugene N; Pozefsky, Diane; Tropsha, Alexander

    2018-02-26

    Elucidation of the mechanistic relationships between drugs, their targets, and diseases is at the core of modern drug discovery research. Thousands of studies relevant to the drug-target-disease (DTD) triangle have been published and annotated in the Medline/PubMed database. Mining this database affords rapid identification of all published studies that confirm connections between vertices of this triangle or enable new inferences of such connections. To this end, we describe the development of Chemotext, a publicly available Web server that mines the entire compendium of published literature in PubMed annotated by Medline Subject Heading (MeSH) terms. The goal of Chemotext is to identify all known DTD relationships and infer missing links between vertices of the DTD triangle. As a proof-of-concept, we show that Chemotext could be instrumental in generating new drug repurposing hypotheses or annotating clinical outcomes pathways for known drugs. The Chemotext Web server is freely available at http://chemotext.mml.unc.edu .

  4. Verification of the fulfilment of the purposes of Basel II, Pillar 3 through application of the web log mining methods

    Directory of Open Access Journals (Sweden)

    M. Munk

    2012-01-01

    Full Text Available The objective of the paper is the verification of the fulfilment of the purposes of Basel II, Pillar 3 – market discipline during the recent financial crisis. The objective of the paper is to describe the current state of the working out of the project that is focused on the analysis of the market participants’ interest in mandatory disclosure of financial information by a commercial bank by means of advanced methods of web log mining. The output of the realized project will be the verification of the assumptions related to the purposes of Basel III by means of the web mining methods, the recommendations for possible reduction of mandatory disclosure of information under Basel II and III, the proposal of the methodology for data preparation for web log mining in this application domain and the generalised procedure for users’ behaviour modelling dependent on time. The schedule of the project has been divided into three phases. The paper deals with its first phase that is focusing on the data pre-processing, analysis and evaluation of the required information under Basel II, Pillar 3 since 2008 and its disclosure into the web site of a commercial bank. The authors introduce the methodologies for data preparation and known heuristic methods for path completion into web log files with respect to the particularity of investigated application domain. They propose scientific methods for modelling users’ behaviour of the webpages related to Pillar 3 with respect to time.

  5. ESTminer: a Web interface for mining EST contig and cluster databases.

    Science.gov (United States)

    Huang, Yecheng; Pumphrey, Janie; Gingle, Alan R

    2005-03-01

    ESTminer is a Web application and database schema for interactive mining of expressed sequence tag (EST) contig and cluster datasets. The Web interface contains a query frame that allows the selection of contigs/clusters with specific cDNA library makeup or a threshold number of members. The results are displayed as color-coded tree nodes, where the color indicates the fractional size of each cDNA library component. The nodes are expandable, revealing library statistics as well as EST or contig members, with links to sequence data, GenBank records or user configurable links. Also, the interface allows 'queries within queries' where the result set of a query is further filtered by the subsequent query. ESTminer is implemented in Java/JSP and the package, including MySQL and Oracle schema creation scripts, is available from http://cggc.agtec.uga.edu/Data/download.asp agingle@uga.edu.

  6. Soil food web changes during spontaneous succession at post mining sites: a possible ecosystem engineering effect on food web organization?

    Science.gov (United States)

    Frouz, Jan; Thébault, Elisa; Pižl, Václav; Adl, Sina; Cajthaml, Tomáš; Baldrián, Petr; Háněl, Ladislav; Starý, Josef; Tajovský, Karel; Materna, Jan; Nováková, Alena; de Ruiter, Peter C

    2013-01-01

    Parameters characterizing the structure of the decomposer food web, biomass of the soil microflora (bacteria and fungi) and soil micro-, meso- and macrofauna were studied at 14 non-reclaimed 1- 41-year-old post-mining sites near the town of Sokolov (Czech Republic). These observations on the decomposer food webs were compared with knowledge of vegetation and soil microstructure development from previous studies. The amount of carbon entering the food web increased with succession age in a similar way as the total amount of C in food web biomass and the number of functional groups in the food web. Connectance did not show any significant changes with succession age, however. In early stages of the succession, the bacterial channel dominated the food web. Later on, in shrub-dominated stands, the fungal channel took over. Even later, in the forest stage, the bacterial channel prevailed again. The best predictor of fungal bacterial ratio is thickness of fermentation layer. We argue that these changes correspond with changes in topsoil microstructure driven by a combination of plant organic matter input and engineering effects of earthworms. In early stages, soil is alkaline, and a discontinuous litter layer on the soil surface promotes bacterial biomass growth, so the bacterial food web channel can dominate. Litter accumulation on the soil surface supports the development of the fungal channel. In older stages, earthworms arrive, mix litter into the mineral soil and form an organo-mineral topsoil, which is beneficial for bacteria and enhances the bacterial food web channel.

  7. A Study on Information Search and Commitment Strategies on Web Environment and Internet Usage Self-Efficacy Beliefs of University Students'

    Science.gov (United States)

    Geçer, Aynur Kolburan

    2014-01-01

    This study addresses university students' information search and commitment strategies on web environment and internet usage self-efficacy beliefs in terms of such variables as gender, department, grade level and frequency of internet use; and whether there is a significant relation between these beliefs. Descriptive method was used in the study.…

  8. Usage of Data Mining at Financial Decision Making

    Directory of Open Access Journals (Sweden)

    Levent BORAN

    2014-06-01

    Full Text Available The knowledge age requires controlling every kind of information. Recognition of patterns in data may provide previously unknown and useful information that can provide competitive advantages. If related techniques are applied on financial statements, it is possible to acquire valuable information about companies’ financial situations. It is considered that data mining could be an alternative of common financial analysis techniques such as vertical analysis, horizontal analysis, trend analysis and ratio analysis. Against existing financial analysis methods, data mining provides some advantages, which are ability of manipulation of huge data and competence of obtaining previously unknown information. There exist two major constraints of data mining implementation that are lack of experts on both data mining and related domains and cost of computer software and hardware used.

  9. What Is Different about E-Books? A MINES for Libraries® Analysis of Academic and Health Sciences Research Libraries' E-Book Usage

    Science.gov (United States)

    Plum, Terry; Franklin, Brinley

    2015-01-01

    Building on the theoretical proposals of Kevin Guthrie and others concerning the transition from print books to e-books in academic and health sciences libraries, this paper presents data collected using the MINES for Libraries® e-resource survey methodology. Approximately 6,000 e-book uses were analyzed from a sample of e-resource usage at…

  10. An Introduction to Social Semantic Web Mining & Big Data Analytics for Political Attitudes and Mentalities Research

    Directory of Open Access Journals (Sweden)

    Markus Schatten

    2015-01-01

    Full Text Available The social web has become a major repository of social and behavioral data that is of exceptional interest to the social science and humanities research community. Computer science has only recently developed various technologies and techniques that allow for harvesting, organizing and analyzing such data and provide knowledge and insights into the structure and behavior or people on-line. Some of these techniques include social web mining, conceptual and social network analysis and modeling, tag clouds, topic maps, folksonomies, complex network visualizations, modeling of processes on networks, agent based models of social network emergence, speech recognition, computer vision, natural language processing, opinion mining and sentiment analysis, recommender systems, user profiling and semantic wikis. All of these techniques are briefly introduced, example studies are given and ideas as well as possible directions in the field of political attitudes and mentalities are given. In the end challenges for future studies are discussed.

  11. Cementitious backfill in mining

    Energy Technology Data Exchange (ETDEWEB)

    Taute, A; Spice, J; Wingrove, A C [Van Niekerk, Kleyn Edwards (South Africa)

    1993-03-01

    This article describes the need for increased usage of backfill material in mining and presents some of the considerations for use of cemented materials. Laboratory test results obtained using a variety of cementitious binders and mine tailings are presented. 3 figs., 1 tab.

  12. Mining the Social Web Analyzing Data from Facebook, Twitter, LinkedIn, and Other Social Media Sites

    CERN Document Server

    Russell, Matthew

    2011-01-01

    Want to tap the tremendous amount of valuable social data in Facebook, Twitter, LinkedIn, and Google+? This refreshed edition helps you discover who's making connections with social media, what they're talking about, and where they're located. You'll learn how to combine social web data, analysis techniques, and visualization to find what you've been looking for in the social haystack-as well as useful information you didn't know existed. Each standalone chapter introduces techniques for mining data in different areas of the social Web, including blogs and email. All you need to get started

  13. Socio-contextual Network Mining for User Assistance in Web-based Knowledge Gathering Tasks

    Science.gov (United States)

    Rajendran, Balaji; Kombiah, Iyakutti

    Web-based Knowledge Gathering (WKG) is a specialized and complex information seeking task carried out by many users on the web, for their various learning, and decision-making requirements. We construct a contextual semantic structure by observing the actions of the users involved in WKG task, in order to gain an understanding of their task and requirement. We also build a knowledge warehouse in the form of a master Semantic Link Network (SLX) that accommodates and assimilates all the contextual semantic structures. This master SLX, which is a socio-contextual network, is then mined to provide contextual inputs to the current users through their agents. We validated our approach through experiments and analyzed the benefits to the users in terms of resource explorations and the time saved. The results are positive enough to motivate us to implement in a larger scale.

  14. SA-Search: a web tool for protein structure mining based on a Structural Alphabet

    OpenAIRE

    Guyon, Frédéric; Camproux, Anne-Claude; Hochez, Joëlle; Tufféry, Pierre

    2004-01-01

    SA-Search is a web tool that can be used to mine for protein structures and extract structural similarities. It is based on a hidden Markov model derived Structural Alphabet (SA) that allows the compression of three-dimensional (3D) protein conformations into a one-dimensional (1D) representation using a limited number of prototype conformations. Using such a representation, classical methods developed for amino acid sequences can be employed. Currently, SA-Search permits the performance of f...

  15. Informing child welfare policy and practice: using knowledge discovery and data mining technology via a dynamic Web site.

    Science.gov (United States)

    Duncan, Dean F; Kum, Hye-Chung; Weigensberg, Elizabeth Caplick; Flair, Kimberly A; Stewart, C Joy

    2008-11-01

    Proper management and implementation of an effective child welfare agency requires the constant use of information about the experiences and outcomes of children involved in the system, emphasizing the need for comprehensive, timely, and accurate data. In the past 20 years, there have been many advances in technology that can maximize the potential of administrative data to promote better evaluation and management in the field of child welfare. Specifically, this article discusses the use of knowledge discovery and data mining (KDD), which makes it possible to create longitudinal data files from administrative data sources, extract valuable knowledge, and make the information available via a user-friendly public Web site. This article demonstrates a successful project in North Carolina where knowledge discovery and data mining technology was used to develop a comprehensive set of child welfare outcomes available through a public Web site to facilitate information sharing of child welfare data to improve policy and practice.

  16. Usage of Safety Gloves in the Gold Mining Industry

    CSIR Research Space (South Africa)

    Scheepers, JCE

    1978-10-01

    Full Text Available The safety departments of 31 mines were visited, and the data obtained was used to determine to what extent safety gloves were being used in the gold mining industry. The frequency of occurrence of hand injuries amongst black workers of the gold...

  17. EVALUATION OF WEB SEARCHING METHOD USING A NOVEL WPRR ALGORITHM FOR TWO DIFFERENT CASE STUDIES

    Directory of Open Access Journals (Sweden)

    V. Lakshmi Praba

    2012-04-01

    Full Text Available The World-Wide Web provides every internet citizen with access to an abundance of information, but it becomes increasingly difficult to identify the relevant pieces of information. Research in web mining tries to address this problem by applying techniques from data mining and machine learning to web data and documents. Web content mining and web structure mining have important roles in identifying the relevant web page. Relevancy of web page denotes how well a retrieved web page or set of web pages meets the information need of the user. Page Rank, Weighted Page Rank and Hypertext Induced Topic Selection (HITS are existing algorithms which considers only web structure mining. Vector Space Model (VSM, Cover Density Ranking (CDR, Okapi similarity measurement (Okapi and Three-Level Scoring method (TLS are some of existing relevancy score methods which consider only web content mining. In this paper, we propose a new algorithm, Weighted Page with Relevant Rank (WPRR which is blend of both web content mining and web structure mining that demonstrates the relevancy of the page with respect to given query for two different case scenarios. It is shown that WPRR’s performance is better than the existing algorithms.

  18. Ultrabroadband photonic Internet: data mining approach to security aspects

    Science.gov (United States)

    Kalicki, Arkadiusz

    2009-06-01

    Web applications became most popular medium in the Internet. Popularity, easiness of web application frameworks together with careless development results in high number of vulnerabilities and attacks. There are several types of attacks possible because of improper input validation. SQL injection is ability to execute arbitrary SQL queries in a database through an existing application. Cross-site scripting is the vulnerability which allows malicious web users to inject code into the web pages viewed by other users. Cross-Site Request Forgery (CSRF) is an attack that tricks the victim into loading a page that contains malicious request. Web spam in blogs. In order to secure web applications intrusion detection (IDS) and intrusion prevention systems (IPS) are being used. Intrusion detection systems are divided in two groups: misuse detection (traditional IDS) and anomaly detection. Misuse detection systems are signature based, have high accuracy in detecting many kinds of known attacks but cannot detect unknown and emerging attacks. This can be complemented with anomaly based intrusion detection and prevention systems. This paper presents anomaly driven proxy as an IPS and data mining based algorithm which was used to detecting anomalies. The principle of this method is the comparison of the incoming HTTP traffic with a previously built profile that contains a representation of the "normal" or expected web application usage sequence patterns. The frequent sequence patterns are found with GSP algorithm. Some basic tests show that the software catches malicious requests.

  19. Nuclear expert web mining system: monitoring and analysis of nuclear acceptance by information retrieval and opinion extraction on the Internet

    Energy Technology Data Exchange (ETDEWEB)

    Reis, Thiago; Barroso, Antonio C.O.; Imakuma, Kengo, E-mail: thiagoreis@usp.b, E-mail: barroso@ipen.b, E-mail: kimakuma@ipen.b [Instituto de Pesquisas Energeticas e Nucleares (IPEN/CNEN-SP), Sao Paulo, SP (Brazil)

    2011-07-01

    This paper presents a research initiative that aims to collect nuclear related information and to analyze opinionated texts by mining the hypertextual data environment and social networks web sites on the Internet. Different from previous approaches that employed traditional statistical techniques, it is being proposed a novel Web Mining approach, built using the concept of Expert Systems, for massive and autonomous data collection and analysis. The initial step has been accomplished, resulting in a framework design that is able to gradually encompass a set of evolving techniques, methods, and theories in such a way that this work will build a platform upon which new researches can be performed more easily by just substituting modules or plugging in new ones. Upon completion it is expected that this research will contribute to the understanding of the population views on nuclear technology and its acceptance. (author)

  20. Nuclear expert web mining system: monitoring and analysis of nuclear acceptance by information retrieval and opinion extraction on the Internet

    International Nuclear Information System (INIS)

    Reis, Thiago; Barroso, Antonio C.O.; Imakuma, Kengo

    2011-01-01

    This paper presents a research initiative that aims to collect nuclear related information and to analyze opinionated texts by mining the hypertextual data environment and social networks web sites on the Internet. Different from previous approaches that employed traditional statistical techniques, it is being proposed a novel Web Mining approach, built using the concept of Expert Systems, for massive and autonomous data collection and analysis. The initial step has been accomplished, resulting in a framework design that is able to gradually encompass a set of evolving techniques, methods, and theories in such a way that this work will build a platform upon which new researches can be performed more easily by just substituting modules or plugging in new ones. Upon completion it is expected that this research will contribute to the understanding of the population views on nuclear technology and its acceptance. (author)

  1. SalanderMaps: A rapid overview about felt earthquakes through data mining of web-accesses

    Science.gov (United States)

    Kradolfer, Urs

    2013-04-01

    While seismological observatories detect and locate earthquakes based on measurements of the ground motion, they neither know a priori whether an earthquake has been felt by the public nor is it known, where it has been felt. Such information is usually gathered by evaluating feedback reported by the public through on-line forms on the web. However, after a felt earthquake in Switzerland, many people visit the webpages of the Swiss Seismological Service (SED) at the ETH Zurich and each such visit leaves traces in the logfiles on our web-servers. Data mining techniques, applied to these logfiles and mining publicly available data bases on the internet open possibilities to obtain previously unknown information about our virtual visitors. In order to provide precise information to authorities and the media, it would be desirable to rapidly know from which locations these web-accesses origin. The method 'Salander' (Seismic Activitiy Linked to Area codes - Nimble Detection of Earthquake Rumbles) will be introduced and it will be explained, how the IP-addresses (each computer or router directly connected to the internet has a unique IP-address; an example would be 129.132.53.5) of a sufficient amount of our virtual visitors were linked to their geographical area. This allows us to unprecedentedly quickly know whether and where an earthquake was felt in Switzerland. It will also be explained, why the method Salander is superior to commercial so-called geolocation products. The corresponding products of the Salander method, animated SalanderMaps, which are routinely generated after each earthquake with a magnitude of M>2 in Switzerland (http://www.seismo.ethz.ch/prod/salandermaps/, available after March 2013), demonstrate how the wavefield of earthquakes propagates through Switzerland and where it was felt. Often, such information is available within less than 60 seconds after origin time, and we always get a clear picture within already five minutes after origin time

  2. The Voice of Chinese Health Consumers: A Text Mining Approach to Web-Based Physician Reviews.

    Science.gov (United States)

    Hao, Haijing; Zhang, Kunpeng

    2016-05-10

    Many Web-based health care platforms allow patients to evaluate physicians by posting open-end textual reviews based on their experiences. These reviews are helpful resources for other patients to choose high-quality doctors, especially in countries like China where no doctor referral systems exist. Analyzing such a large amount of user-generated content to understand the voice of health consumers has attracted much attention from health care providers and health care researchers. The aim of this paper is to automatically extract hidden topics from Web-based physician reviews using text-mining techniques to examine what Chinese patients have said about their doctors and whether these topics differ across various specialties. This knowledge will help health care consumers, providers, and researchers better understand this information. We conducted two-fold analyses on the data collected from the "Good Doctor Online" platform, the largest online health community in China. First, we explored all reviews from 2006-2014 using descriptive statistics. Second, we applied the well-known topic extraction algorithm Latent Dirichlet Allocation to more than 500,000 textual reviews from over 75,000 Chinese doctors across four major specialty areas to understand what Chinese health consumers said online about their doctor visits. On the "Good Doctor Online" platform, 112,873 out of 314,624 doctors had been reviewed at least once by April 11, 2014. Among the 772,979 textual reviews, we chose to focus on four major specialty areas that received the most reviews: Internal Medicine, Surgery, Obstetrics/Gynecology and Pediatrics, and Chinese Traditional Medicine. Among the doctors who received reviews from those four medical specialties, two-thirds of them received more than two reviews and in a few extreme cases, some doctors received more than 500 reviews. Across the four major areas, the most popular topics reviewers found were the experience of finding doctors, doctors' technical

  3. HC StratoMineR: A Web-Based Tool for the Rapid Analysis of High-Content Datasets.

    Science.gov (United States)

    Omta, Wienand A; van Heesbeen, Roy G; Pagliero, Romina J; van der Velden, Lieke M; Lelieveld, Daphne; Nellen, Mehdi; Kramer, Maik; Yeong, Marley; Saeidi, Amir M; Medema, Rene H; Spruit, Marco; Brinkkemper, Sjaak; Klumperman, Judith; Egan, David A

    2016-10-01

    High-content screening (HCS) can generate large multidimensional datasets and when aligned with the appropriate data mining tools, it can yield valuable insights into the mechanism of action of bioactive molecules. However, easy-to-use data mining tools are not widely available, with the result that these datasets are frequently underutilized. Here, we present HC StratoMineR, a web-based tool for high-content data analysis. It is a decision-supportive platform that guides even non-expert users through a high-content data analysis workflow. HC StratoMineR is built by using My Structured Query Language for storage and querying, PHP: Hypertext Preprocessor as the main programming language, and jQuery for additional user interface functionality. R is used for statistical calculations, logic and data visualizations. Furthermore, C++ and graphical processor unit power is diffusely embedded in R by using the rcpp and rpud libraries for operations that are computationally highly intensive. We show that we can use HC StratoMineR for the analysis of multivariate data from a high-content siRNA knock-down screen and a small-molecule screen. It can be used to rapidly filter out undesirable data; to select relevant data; and to perform quality control, data reduction, data exploration, morphological hit picking, and data clustering. Our results demonstrate that HC StratoMineR can be used to functionally categorize HCS hits and, thus, provide valuable information for hit prioritization.

  4. Provenance-Based Approaches to Semantic Web Service Discovery and Usage

    Science.gov (United States)

    Narock, Thomas William

    2012-01-01

    The World Wide Web Consortium defines a Web Service as "a software system designed to support interoperable machine-to-machine interaction over a network." Web Services have become increasingly important both within and across organizational boundaries. With the recent advent of the Semantic Web, web services have evolved into semantic…

  5. Segmenting The Web 2.0 Market: Behavioural And Usage Patterns Of Social Web Consumers

    NARCIS (Netherlands)

    Lorenzo Romero, Carlota; Constantinides, Efthymios; Alarcon-del-Amo, Maria-del-Carmen

    2010-01-01

    The evolution of the commercial Internet to the current phase, commonly called Web 2.0 (or Social Web) has firmly positioned the web not only as a commercial but also as a social communication platform: an online environment facilitating peer-to-peer interaction, socialization, co-operation and

  6. Recommendations for Benchmarking Web Site Usage among Academic Libraries.

    Science.gov (United States)

    Hightower, Christy; Sih, Julie; Tilghman, Adam

    1998-01-01

    To help library directors and Web developers create a benchmarking program to compare statistics of academic Web sites, the authors analyzed the Web server log files of 14 university science and engineering libraries. Recommends a centralized voluntary reporting structure coordinated by the Association of Research Libraries (ARL) and a method for…

  7. Usage of cell nomenclature in biomedical literature

    KAUST Repository

    Kafkas, Senay; Sarntivijai, Sirarat; Hoehndorf, Robert

    2017-01-01

    large scale for understanding the level of uptake of cell nomenclature in literature by scientists. In this study, we analyse the usage of cell nomenclature, both in Vivo, and in Vitro in biomedical literature by using text mining methods and present our

  8. What explains usage of mobile physician-rating apps? Results from a web-based questionnaire.

    Science.gov (United States)

    Bidmon, Sonja; Terlutter, Ralf; Röttl, Johanna

    2014-06-11

    Consumers are increasingly accessing health-related information via mobile devices. Recently, several apps to rate and locate physicians have been released in the United States and Germany. However, knowledge about what kinds of variables explain usage of mobile physician-rating apps is still lacking. This study analyzes factors influencing the adoption of and willingness to pay for mobile physician-rating apps. A structural equation model was developed based on the Technology Acceptance Model and the literature on health-related information searches and usage of mobile apps. Relationships in the model were analyzed for moderating effects of physician-rating website (PRW) usage. A total of 1006 randomly selected German patients who had visited a general practitioner at least once in the 3 months before the beginning of the survey were randomly selected and surveyed. A total of 958 usable questionnaires were analyzed by partial least squares path modeling and moderator analyses. The suggested model yielded a high model fit. We found that perceived ease of use (PEOU) of the Internet to gain health-related information, the sociodemographic variables age and gender, and the psychographic variables digital literacy, feelings about the Internet and other Web-based applications in general, patients' value of health-related knowledgeability, as well as the information-seeking behavior variables regarding the amount of daily private Internet use for health-related information, frequency of using apps for health-related information in the past, and attitude toward PRWs significantly affected the adoption of mobile physician-rating apps. The sociodemographic variable age, but not gender, and the psychographic variables feelings about the Internet and other Web-based applications in general and patients' value of health-related knowledgeability, but not digital literacy, were significant predictors of willingness to pay. Frequency of using apps for health-related information

  9. Discovering Student Web Usage Profiles Using Markov Chains

    Science.gov (United States)

    Marques, Alice; Belo, Orlando

    2011-01-01

    Nowadays, Web based platforms are quite common in any university, supporting a very diversified set of applications and services. Ranging from personal management to student evaluation processes, Web based platforms are doing a great job providing a very flexible way of working, promote student enrolment, and making access to academic information…

  10. Mining the human phenome using semantic web technologies: a case study for Type 2 Diabetes.

    Science.gov (United States)

    Pathak, Jyotishman; Kiefer, Richard C; Bielinski, Suzette J; Chute, Christopher G

    2012-01-01

    The ability to conduct genome-wide association studies (GWAS) has enabled new exploration of how genetic variations contribute to health and disease etiology. However, historically GWAS have been limited by inadequate sample size due to associated costs for genotyping and phenotyping of study subjects. This has prompted several academic medical centers to form "biobanks" where biospecimens linked to personal health information, typically in electronic health records (EHRs), are collected and stored on large number of subjects. This provides tremendous opportunities to discover novel genotype-phenotype associations and foster hypothesis generation. In this work, we study how emerging Semantic Web technologies can be applied in conjunction with clinical and genotype data stored at the Mayo Clinic Biobank to mine the phenotype data for genetic associations. In particular, we demonstrate the role of using Resource Description Framework (RDF) for representing EHR diagnoses and procedure data, and enable federated querying via standardized Web protocols to identify subjects genotyped with Type 2 Diabetes for discovering gene-disease associations. Our study highlights the potential of Web-scale data federation techniques to execute complex queries.

  11. Models and methods for building web recommendation systems

    OpenAIRE

    Stekh, Yu.; Artsibasov, V.

    2012-01-01

    Modern Word Wide Web contains a large number of Web sites and pages in each Web site. Web recommendation system (recommendation system for web pages) are typically implemented on web servers and use the data obtained from the collection viewed web templates (implicit data) or user registration data (explicit data). In article considering methods and algorithms of web recommendation system based on the technology of data mining (web mining). Сучасна мережа Інтернет містить велику кількість веб...

  12. Web server's reliability improvements using recurrent neural networks

    DEFF Research Database (Denmark)

    Madsen, Henrik; Albu, Rǎzvan-Daniel; Felea, Ioan

    2012-01-01

    In this paper we describe an interesting approach to error prediction illustrated by experimental results. The application consists of monitoring the activity for the web servers in order to collect the specific data. Predicting an error with severe consequences for the performance of a server (t...... usage, network usage and memory usage. We collect different data sets from monitoring the web server's activity and for each one we predict the server's reliability with the proposed recurrent neural network. © 2012 Taylor & Francis Group...

  13. Mobile response in web panels

    NARCIS (Netherlands)

    de Bruijne, M.A.; Wijnant, A.

    2014-01-01

    This article investigates unintended mobile access to surveys in online, probability-based panels. We find that spontaneous tablet usage is drastically increasing in web surveys, while smartphone usage remains low. Further, we analyze the bias of respondent profiles using smartphones and tablets

  14. Do usage and scientific collaboration associate with citation impact

    Energy Technology Data Exchange (ETDEWEB)

    Chi, P.S.; Glänzel, W.

    2016-07-01

    In this study usage counts and times cited from Web of Science Core Collection (WoS) were collected for each article published in 2013 with Belgian, Israeli and Iranian addresses. We investigate the relations among three indicators related to citation impact, usage counts coauthorship, respectively. In addition, we apply the method of Characteristic Scores and Scal (CSS) to analyse the distributions of citations and usage counts. The results show that citations and usage counts in WoS correlate to each other significantly, especially in the social sciences. However, the increase of the number of co-authors does not increase usage counts or citations significantly. Furthermore, the stability of CSS-class distributions proves the availability of CSS in characterising both usage and citation distributions. (Author)

  15. Booster fans : some considerations for their usage in underground coal mines

    Energy Technology Data Exchange (ETDEWEB)

    Gillies, S.; Slaughter, C. [Missouri Univ. of Science and Technology, Rolla, MO (United States); Calizaya, F. [Utah Univ., Salt Lake City, UT (United States); Wu, H.W. [Gillies Wu Mining Technology Pty Ltd., Brisbane, QLD (Australia)

    2010-07-01

    This paper reported on a study that investigated the conditions under which booster fans can be used safely and efficiently in underground coal mines. Booster fans are installed in series with a main surface fan and are used to boost the air pressure of the ventilation air passing through it. Several coal mining countries use booster fans, but in the United States, they are only used in metal/non-metal mines due to concerns of uncontrolled recirculation. This study investigated installations of booster fans in non-US underground coal mines where safe and efficient atmospheric conditions are achieved. The purpose was to collect reliable information on airway resistances and flow requirements typical in large US coal mines. The study showed that safe booster fan installations are found in both high and low gas conditions, and sometimes where workings are located at great depths. The interlocking systems within the booster fan can control the underground fans and avoid recirculation when surface fans are unexpectedly turned off. Another purpose of the study was to determine when booster fans become a more viable solution in coal mines due to increases in air requirements at higher production rates. It was concluded that a new fan selection algorithm to produce recirculation-free ventilation designs will be developed to enable US coal mine operators to develop ventilation designs to extract coal seams from depths greater than 1000 m. 17 refs., 1 fig.

  16. A web server for mining Comparative Genomic Hybridization (CGH) data

    Science.gov (United States)

    Liu, Jun; Ranka, Sanjay; Kahveci, Tamer

    2007-11-01

    Advances in cytogenetics and molecular biology has established that chromosomal alterations are critical in the pathogenesis of human cancer. Recurrent chromosomal alterations provide cytological and molecular markers for the diagnosis and prognosis of disease. They also facilitate the identification of genes that are important in carcinogenesis, which in the future may help in the development of targeted therapy. A large amount of publicly available cancer genetic data is now available and it is growing. There is a need for public domain tools that allow users to analyze their data and visualize the results. This chapter describes a web based software tool that will allow researchers to analyze and visualize Comparative Genomic Hybridization (CGH) datasets. It employs novel data mining methodologies for clustering and classification of CGH datasets as well as algorithms for identifying important markers (small set of genomic intervals with aberrations) that are potentially cancer signatures. The developed software will help in understanding the relationships between genomic aberrations and cancer types.

  17. Measurment of Web Usability: Web Page of Hacettepe University Department of Information Management

    OpenAIRE

    Nazan Özenç Uçak; Tolga Çakmak

    2009-01-01

    Today, information is produced increasingly in electronic form and retrieval of information is provided via web pages. As a result of the rise of the number of web pages, many of them seem to comprise similar contents but different designs. In this respect, presenting information over the web pages according to user expectations and specifications is important in terms of effective usage of information. This study provides an insight about web usability studies that are executed for measuring...

  18. SQUAT: A web tool to mine human, murine and avian SAGE data

    Directory of Open Access Journals (Sweden)

    Besson Jérémy

    2008-09-01

    Full Text Available Abstract Background There is an increasing need in transcriptome research for gene expression data and pattern warehouses. It is of importance to integrate in these warehouses both raw transcriptomic data, as well as some properties encoded in these data, like local patterns. Description We have developed an application called SQUAT (SAGE Querying and Analysis Tools which is available at: http://bsmc.insa-lyon.fr/squat/. This database gives access to both raw SAGE data and patterns mined from these data, for three species (human, mouse and chicken. This database allows to make simple queries like "In which biological situations is my favorite gene expressed?" as well as much more complex queries like: ≪what are the genes that are frequently co-over-expressed with my gene of interest in given biological situations?≫. Connections with external web databases enrich biological interpretations, and enable sophisticated queries. To illustrate the power of SQUAT, we show and analyze the results of three different queries, one of which led to a biological hypothesis that was experimentally validated. Conclusion SQUAT is a user-friendly information retrieval platform, which aims at bringing some of the state-of-the-art mining tools to biologists.

  19. What Explains Usage of Mobile Physician-Rating Apps? Results From a Web-Based Questionnaire

    Science.gov (United States)

    Terlutter, Ralf; Röttl, Johanna

    2014-01-01

    Background Consumers are increasingly accessing health-related information via mobile devices. Recently, several apps to rate and locate physicians have been released in the United States and Germany. However, knowledge about what kinds of variables explain usage of mobile physician-rating apps is still lacking. Objective This study analyzes factors influencing the adoption of and willingness to pay for mobile physician-rating apps. A structural equation model was developed based on the Technology Acceptance Model and the literature on health-related information searches and usage of mobile apps. Relationships in the model were analyzed for moderating effects of physician-rating website (PRW) usage. Methods A total of 1006 randomly selected German patients who had visited a general practitioner at least once in the 3 months before the beginning of the survey were randomly selected and surveyed. A total of 958 usable questionnaires were analyzed by partial least squares path modeling and moderator analyses. Results The suggested model yielded a high model fit. We found that perceived ease of use (PEOU) of the Internet to gain health-related information, the sociodemographic variables age and gender, and the psychographic variables digital literacy, feelings about the Internet and other Web-based applications in general, patients’ value of health-related knowledgeability, as well as the information-seeking behavior variables regarding the amount of daily private Internet use for health-related information, frequency of using apps for health-related information in the past, and attitude toward PRWs significantly affected the adoption of mobile physician-rating apps. The sociodemographic variable age, but not gender, and the psychographic variables feelings about the Internet and other Web-based applications in general and patients’ value of health-related knowledgeability, but not digital literacy, were significant predictors of willingness to pay. Frequency of

  20. Technical Note: On The Usage and Development of the AWAKE Web Server and Web Applications

    CERN Document Server

    Berger, Dillon Tanner

    2017-01-01

    The purpose of this technical note is to give a brief explanation of the AWAKE Web Server, the current web applications it serves, and how to edit, maintain, and update the source code. The majority of this paper is dedicated to the development of the server and its web applications.

  1. The Effects of Web 2.0 Technologies Usage in Programming Languages Lesson on the Academic Success, Interrogative Learning Skills and Attitudes of Students towards Programming Languages

    Science.gov (United States)

    Gençtürk, Abdullah Tarik; Korucu, Agah Tugrul

    2017-01-01

    It is observed that teacher candidates receiving education in the department of Computer and Instructional Technologies Education are not able to gain enough experience and knowledge in "Programming Languages" lesson. The goal of this study is to analyse the effects of web 2.0 technologies usage in programming languages lesson on the…

  2. The Influence of Perceived Organizational Injustice towards Workplace Personal Web Usage and Work Productivity in Indonesia

    Directory of Open Access Journals (Sweden)

    Nur Fathonah

    2014-10-01

    Full Text Available Workplace personal web usage (WPWU is an employee’s activity in using internet for non-related task during working hours. It is considered a counterproductive behavior when done excessively because it can interrupt employee’s productivity, but it can increase creativity and eliminate bore- dom when used in a rational amount. The objective of this study was to prove whether perceived organizational injustice had influence on WPWU which affected work productivity. A total of 222 respondents working in various industries were gathered through web-survey. By using multino- mial logistic regression analysis, this study found that high level use of internet for unrelated jobs between 2 to 4 hours a day was influenced by respondents’ perception of not getting fair treatment and incentive for being good performer, which then caused them to perform very low completion of tasks. There were two contrasting views regarding this result; organizations considered it as deviant behavior because it reduced employees’ performance whereas employees regarded it as just short breaks to get rid of stress. Hence, this finding suggested that companies should redesign its internet policies to accommodate “Work-Life Blend”; blending work and personal lives, as a consequence of cultural shift in the era of globalization and new technologies.

  3. Mining Tasks from the Web Anchor Text Graph: MSR Notebook Paper for the TREC 2015 Tasks Track

    Science.gov (United States)

    2015-11-20

    Mining Tasks from the Web Anchor Text Graph: MSR Notebook Paper for the TREC 2015 Tasks Track Paul N. Bennett Microsoft Research Redmond, USA pauben...anchor text graph has proven useful in the general realm of query reformulation [2], we sought to quantify the value of extracting key phrases from...anchor text in the broader setting of the task understanding track. Given a query, our approach considers a simple method for identifying a relevant

  4. Evaluating The Markov Assumption For Web Usage Mining

    DEFF Research Database (Denmark)

    Jespersen, S.; Pedersen, Torben Bach; Thorhauge, J.

    2003-01-01

    ) model~\\cite{borges99data}. These techniques typically rely on the \\textit{Markov assumption with history depth} $n$, i.e., it is assumed that the next requested page is only dependent on the last $n$ pages visited. This is not always valid, i.e. false browsing patterns may be discovered. However, to our...

  5. Semantic web for integrated network analysis in biomedicine.

    Science.gov (United States)

    Chen, Huajun; Ding, Li; Wu, Zhaohui; Yu, Tong; Dhanapalan, Lavanya; Chen, Jake Y

    2009-03-01

    The Semantic Web technology enables integration of heterogeneous data on the World Wide Web by making the semantics of data explicit through formal ontologies. In this article, we survey the feasibility and state of the art of utilizing the Semantic Web technology to represent, integrate and analyze the knowledge in various biomedical networks. We introduce a new conceptual framework, semantic graph mining, to enable researchers to integrate graph mining with ontology reasoning in network data analysis. Through four case studies, we demonstrate how semantic graph mining can be applied to the analysis of disease-causal genes, Gene Ontology category cross-talks, drug efficacy analysis and herb-drug interactions analysis.

  6. Usage of Cable Bolts for Gateroad Maintenance in Soft Rocks

    Directory of Open Access Journals (Sweden)

    Iurii Khalymendyk

    2014-01-01

    Originality/value: 1. There are no regulations and state standards in regard to cable bolt installation parameters in the mines of Ukraine, consequently the usage of cable bolts for gateroad maintenance required preliminary testing under geological conditions at the Western Donbass mines with soft enclosing rocks. 2. Combining levelling with observations using extensometers allowed for the detection of the rock layers' uniform sagging zone in the roof of the gateroad.

  7. The use of web ontology languages and other semantic web tools in drug discovery.

    Science.gov (United States)

    Chen, Huajun; Xie, Guotong

    2010-05-01

    To optimize drug development processes, pharmaceutical companies require principled approaches to integrate disparate data on a unified infrastructure, such as the web. The semantic web, developed on the web technology, provides a common, open framework capable of harmonizing diversified resources to enable networked and collaborative drug discovery. We survey the state of art of utilizing web ontologies and other semantic web technologies to interlink both data and people to support integrated drug discovery across domains and multiple disciplines. Particularly, the survey covers three major application categories including: i) semantic integration and open data linking; ii) semantic web service and scientific collaboration and iii) semantic data mining and integrative network analysis. The reader will gain: i) basic knowledge of the semantic web technologies; ii) an overview of the web ontology landscape for drug discovery and iii) a basic understanding of the values and benefits of utilizing the web ontologies in drug discovery. i) The semantic web enables a network effect for linking open data for integrated drug discovery; ii) The semantic web service technology can support instant ad hoc collaboration to improve pipeline productivity and iii) The semantic web encourages publishing data in a semantic way such as resource description framework attributes and thus helps move away from a reliance on pure textual content analysis toward more efficient semantic data mining.

  8. Web usage data as a means of evaluating public health messaging and outreach.

    Science.gov (United States)

    Tian, Hao; Brimmer, Dana J; Lin, Jin-Mann S; Tumpey, Abbigail J; Reeves, William C

    2009-12-21

    The Internet is increasingly utilized by researchers, health care providers, and the public to seek medical information. The Internet also provides a powerful tool for public health messaging. Understanding the needs of the intended audience and how they use websites is critical for website developers to provide better services to the intended users. The aim of the study was to examine the utilization of the chronic fatigue syndrome (CFS) website at the Centers for Disease Control and Prevention (CDC). We evaluated (1) CFS website utilization, (2) outcomes of a CDC CFS public awareness campaign, and (3) user behavior related to public awareness campaign materials and CFS continuing medical education courses. To describe and evaluate Web utilization, we collected Web usage data over an 18-month period and extracted page views, visits, referring domains, and geographic locations. We used page views as the primary measure for the CFS awareness outreach effort. We utilized market basket analysis and Markov chain model techniques to describe user behavior related to utilization of campaign materials and continuing medical education courses. The CDC CFS website received 3,647,736 views from more than 50 countries over the 18-month period and was the 33rd most popular CDC website. States with formal CFS programs had higher visiting density, such as Washington, DC; Georgia; and New Jersey. Most visits (71%) were from Web search engines, with 16% from non-search-engine sites and 12% from visitors who had bookmarked the site. The public awareness campaign was associated with a sharp increase and subsequent quick drop in Web traffic. Following the campaign, user interest shifted from information targeting consumer basic knowledge to information for health care professionals. The market basket analysis showed that visitors preferred the 60-second radio clip public service announcement over the 30-second one. Markov chain model results revealed that most visitors took the

  9. Social Web mining and exploitation for serious applications: Technosocial Predictive Analytics and related technologies for public health, environmental and national security surveillance.

    Science.gov (United States)

    Kamel Boulos, Maged N; Sanfilippo, Antonio P; Corley, Courtney D; Wheeler, Steve

    2010-10-01

    This paper explores Technosocial Predictive Analytics (TPA) and related methods for Web "data mining" where users' posts and queries are garnered from Social Web ("Web 2.0") tools such as blogs, micro-blogging and social networking sites to form coherent representations of real-time health events. The paper includes a brief introduction to commonly used Social Web tools such as mashups and aggregators, and maps their exponential growth as an open architecture of participation for the masses and an emerging way to gain insight about people's collective health status of whole populations. Several health related tool examples are described and demonstrated as practical means through which health professionals might create clear location specific pictures of epidemiological data such as flu outbreaks. Copyright 2010 Elsevier Ireland Ltd. All rights reserved.

  10. Classification algorithm of Web document in ionization radiation

    International Nuclear Information System (INIS)

    Geng Zengmin; Liu Wanchun

    2005-01-01

    Resources in the Internet is numerous. It is one of research directions of Web mining (WM) how to mine the resource of some calling or trade more efficiently. The paper studies the classification of Web document in ionization radiation (IR) based on the algorithm of Bayes, Rocchio, Widrow-Hoff, and analyses the result of trial effect. (authors)

  11. Two-step web-mining approach to study geology/geophysics-related open-source software projects

    Science.gov (United States)

    Behrends, Knut; Conze, Ronald

    2013-04-01

    Geology/geophysics is a highly interdisciplinary science, overlapping with, for instance, physics, biology and chemistry. In today's software-intensive work environments, geoscientists often encounter new open-source software from scientific fields that are only remotely related to the own field of expertise. We show how web-mining techniques can help to carry out systematic discovery and evaluation of such software. In a first step, we downloaded ~500 abstracts (each consisting of ~1 kb UTF-8 text) from agu-fm12.abstractcentral.com. This web site hosts the abstracts of all publications presented at AGU Fall Meeting 2012, the world's largest annual geology/geophysics conference. All abstracts belonged to the category "Earth and Space Science Informatics", an interdisciplinary label cross-cutting many disciplines such as "deep biosphere", "atmospheric research", and "mineral physics". Each publication was represented by a highly structured record with ~20 short data attributes, the largest authorship-record being the unstructured "abstract" field. We processed texts of the abstracts with the statistics software "R" to calculate a corpus and a term-document matrix. Using R package "tm", we applied text-mining techniques to filter data and develop hypotheses about software-development activities happening in various geology/geophysics fields. Analyzing the term-document matrix with basic techniques (e.g., word frequencies, co-occurences, weighting) as well as more complex methods (clustering, classification) several key pieces of information were extracted. For example, text-mining can be used to identify scientists who are also developers of open-source scientific software, and the names of their programming projects and codes can also be identified. In a second step, based on the intermediate results found by processing the conference-abstracts, any new hypotheses can be tested in another webmining subproject: by merging the dataset with open data from github

  12. Data Mining of Web-Based Documents on Social Networking Sites That Included Suicide-Related Words Among Korean Adolescents.

    Science.gov (United States)

    Song, Juyoung; Song, Tae Min; Seo, Dong-Chul; Jin, Jae Hyun

    2016-12-01

    To investigate online search activity of suicide-related words in South Korean adolescents through data mining of social media Web sites as the suicide rate in South Korea is one of the highest in the world. Out of more than 2.35 billion posts for 2 years from January 1, 2011 to December 31, 2012 on 163 social media Web sites in South Korea, 99,693 suicide-related documents were retrieved by Crawler and analyzed using text mining and opinion mining. These data were further combined with monthly employment rate, monthly rental prices index, monthly youth suicide rate, and monthly number of reported bully victims to fit multilevel models as well as structural equation models. The link from grade pressure to suicide risk showed the largest standardized path coefficient (beta = .357, p < .001) in structural models and a significant random effect (p < .01) in multilevel models. Depression was a partial mediator between suicide risk and grade pressure, low body image, victims of bullying, and concerns about disease. The largest total effect was observed in the grade pressure to depression to suicide risk. The multilevel models indicate about 27% of the variance in the daily suicide-related word search activity is explained by month-to-month variations. A lower employment rate, a higher rental prices index, and more bullying were associated with an increased suicide-related word search activity. Academic pressure appears to be the biggest contributor to Korean adolescents' suicide risk. Real-time suicide-related word search activity monitoring and response system needs to be developed. Copyright © 2016 Society for Adolescent Health and Medicine. Published by Elsevier Inc. All rights reserved.

  13. A STUDY OF TEXT MINING METHODS, APPLICATIONS,AND TECHNIQUES

    OpenAIRE

    R. Rajamani*1 & S. Saranya2

    2017-01-01

    Data mining is used to extract useful information from the large amount of data. It is used to implement and solve different types of research problems. The research related areas in data mining are text mining, web mining, image mining, sequential pattern mining, spatial mining, medical mining, multimedia mining, structure mining and graph mining. Text mining also referred to text of data mining, it is also called knowledge discovery in text (KDT) or knowledge of intelligent text analysis. T...

  14. A privacy-preserving sharing method of electricity usage using self-organizing map

    Directory of Open Access Journals (Sweden)

    Yuichi Nakamura

    2018-03-01

    Full Text Available Smart meters for measuring electricity usage are expected in electricity usage management. Although the relevant power supplier stores the measured data, the data are worth sharing among power suppliers because the entire data of a city will be required to control the regional grid stability or demand–supply balance. Even though many techniques and methods of privacy-preserving data mining have been studied to share data while preserving data privacy, a study on sharing electricity usage data is still lacking. In this paper, we propose a sharing method of electricity usage while preserving data privacy using a self-organizing map. Keywords: Privacy preserving, Data sharing, Self-Organizing map

  15. Web 2.0 (and Beyond)

    NARCIS (Netherlands)

    P.A. Arora (Payal)

    2015-01-01

    textabstractWeb 2.0 is a term coined to mark a new era of Internet usage driven by user interactivity and collaboration in generating content, moving away from the static information dissemination model associated with Web 1.0. It became common in early 2000 with the growth of social network sites,

  16. Big data mining: In-database Oracle data mining over hadoop

    Science.gov (United States)

    Kovacheva, Zlatinka; Naydenova, Ina; Kaloyanova, Kalinka; Markov, Krasimir

    2017-07-01

    Big data challenges different aspects of storing, processing and managing data, as well as analyzing and using data for business purposes. Applying Data Mining methods over Big Data is another challenge because of huge data volumes, variety of information, and the dynamic of the sources. Different applications are made in this area, but their successful usage depends on understanding many specific parameters. In this paper we present several opportunities for using Data Mining techniques provided by the analytical engine of RDBMS Oracle over data stored in Hadoop Distributed File System (HDFS). Some experimental results are given and they are discussed.

  17. Game-Theoretic Models for Usage-based Maintenance Contract

    Science.gov (United States)

    Husniah, H.; Wangsaputra, R.; Cakravastia, A.; Iskandar, B. P.

    2018-03-01

    A usage-based maintenance contracts with coordination and non coordination between two parties is studied in this paper. The contract is applied to a dump truck operated in a mining industry. The situation under study is that an agent offers service contract to the owner of the truck after warranty ends. This contract has only a time limit but no usage limit. If the total usage per period exceeds the maximum usage allowed in the contract, then the owner will be charged an additional cost. In general, the agent (Original Equipment Manufacturer/OEM) provides a full coverage of maintenance, which includes PM and CM under the lease contract. The decision problem for the owner is to select the best option offered that fits to its requirement, and the decision problem for the agent is to find the optimal maintenance efforts for a given price of the service option offered. We first find the optimal decisions using coordination scheme and then with non coordination scheme for both parties.

  18. Design of an Interface for Page Rank Calculation using Web Link Attributes Information

    Directory of Open Access Journals (Sweden)

    Jeyalatha SIVARAMAKRISHNAN

    2010-01-01

    Full Text Available This paper deals with the Web Structure Mining and the different Structure Mining Algorithms like Page Rank, HITS, Trust Rank and Sel-HITS. The functioning of these algorithms are discussed. An incremental algorithm for calculation of PageRank using an interface has been formulated. This algorithm makes use of Web Link Attributes Information as key parameters and has been implemented using Visibility and Position of a Link. The application of Web Structure Mining Algorithm in an Academic Search Application has been discussed. The present work can be a useful input to Web Users, Faculty, Students and Web Administrators in a University Environment.

  19. Experienced ethical issues of personalized data-mined media services

    DEFF Research Database (Denmark)

    Sørensen, Jannick Kirk

    2008-01-01

    This tentative PhD project description concerns the ethnographic examination of users’ experience of privacy issues and usability related to personalized data mined (web-) services for media content.......This tentative PhD project description concerns the ethnographic examination of users’ experience of privacy issues and usability related to personalized data mined (web-) services for media content....

  20. APFEL Web a web-based application for the graphical visualization of parton distribution functions

    CERN Document Server

    Carrazza, Stefano; Palazzo, Daniele; Rojo, Juan

    2015-01-01

    We present APFEL Web, a web-based application designed to provide a flexible user-friendly tool for the graphical visualization of parton distribution functions (PDFs). In this note we describe the technical design of the APFEL Web application, motivating the choices and the framework used for the development of this project. We document the basic usage of APFEL Web and show how it can be used to provide useful input for a variety of collider phenomenological studies. Finally we provide some examples showing the output generated by the application.

  1. APFEL Web: a web-based application for the graphical visualization of parton distribution functions

    International Nuclear Information System (INIS)

    Carrazza, Stefano; Ferrara, Alfio; Palazzo, Daniele; Rojo, Juan

    2015-01-01

    We present APFEL Web, a Web-based application designed to provide a flexible user-friendly tool for the graphical visualization of parton distribution functions. In this note we describe the technical design of the APFEL Web application, motivating the choices and the framework used for the development of this project. We document the basic usage of APFEL Web and show how it can be used to provide useful input for a variety of collider phenomenological studies. Finally we provide some examples showing the output generated by the application. (note)

  2. HEP Outreach, Inreach, and Web 2.0

    International Nuclear Information System (INIS)

    Goldfarb, Steven

    2011-01-01

    I report on current usage of multimedia and social networking 'Web 2.0' tools for Education and Outreach in high-energy physics, and discuss their potential for internal communication within large worldwide collaborations, such as those of the LHC. Following a brief description of the history of Web 2.0 development, I present a survey of the most popular sites and describe their usage in HEP to disseminate information to students and the general public. I then discuss the potential of certain specific tools, such as document and multimedia sharing sites, for boosting the speed and effectiveness of information exchange within the collaborations. I conclude with a brief discussion of the successes and failures of these tools, and make suggestions for improved usage in the future.

  3. HEP Outreach, Inreach, and Web 2.0

    Science.gov (United States)

    Goldfarb, Steven

    2011-12-01

    I report on current usage of multimedia and social networking "Web 2.0" tools for Education and Outreach in high-energy physics, and discuss their potential for internal communication within large worldwide collaborations, such as those of the LHC. Following a brief description of the history of Web 2.0 development, I present a survey of the most popular sites and describe their usage in HEP to disseminate information to students and the general public. I then discuss the potential of certain specific tools, such as document and multimedia sharing sites, for boosting the speed and effectiveness of information exchange within the collaborations. I conclude with a brief discussion of the successes and failures of these tools, and make suggestions for improved usage in the future.

  4. A new measurement of workload in Web application reliability assessment

    Directory of Open Access Journals (Sweden)

    CUI Xia

    2015-02-01

    Full Text Available Web application has been popular in various fields of social life.It becomes more and more important to study the reliability of Web application.In this paper the definition of Web application failure is firstly brought out,and then the definition of Web application reliability.By analyzing data in the IIS server logs and selecting corresponding usage and information delivery failure data,the paper study the feasibility of Web application reliability assessment from the perspective of Web software system based on IIS server logs.Because the usage for a Web site often has certain regularity,a new measurement of workload in Web application reliability assessment is raised.In this method,the unit is removed by weighted average technique;and the weights are assessed by setting objective function and optimization.Finally an experiment was raised for validation.The experiment result shows the assessment of Web application reliability base on the new workload is better.

  5. Uncoolness factor of collaborative Web Mining Tools (WMT

    Directory of Open Access Journals (Sweden)

    Juan Luis Chulilla

    2009-12-01

    Full Text Available The recent development of social mining is a useful and direct analogy to talking about the less visible part of the adoption of successive waves of social software. The striking fact of visibility decrease as each type of social software matures should be taken into account for any comprehensive analysis of the relation between collectives and Internet technologies. One of the main results of this relation is the social data mining of Internet, which both gives sense to virtual communities and produces contents via feedback. We are just at the beginning of the adoption of new ways of social data mining, which will be significant when grow mature and become invisible.

  6. A demanding web-based PACS supported by web services technology

    Science.gov (United States)

    Costa, Carlos M. A.; Silva, Augusto; Oliveira, José L.; Ribeiro, Vasco G.; Ribeiro, José

    2006-03-01

    During the last years, the ubiquity of web interfaces have pushed practically all PACS suppliers to develop client applications in which clinical practitioners can receive and analyze medical images, using conventional personal computers and Web browsers. However, due to security and performance issues, the utilization of these software packages has been restricted to Intranets. Paradigmatically, one of the most important advantages of digital image systems is to simplify the widespread sharing and remote access of medical data between healthcare institutions. This paper analyses the traditional PACS drawbacks that contribute to their reduced usage in the Internet and describes a PACS based on Web Services technology that supports a customized DICOM encoding syntax and a specific compression scheme providing all historical patient data in a unique Web interface.

  7. Sentiment Analysis and Opinion Mining

    CERN Document Server

    Liu, Bing

    2012-01-01

    Sentiment analysis and opinion mining is the field of study that analyzes people's opinions, sentiments, evaluations, attitudes, and emotions from written language. It is one of the most active research areas in natural language processing and is also widely studied in data mining, Web mining, and text mining. In fact, this research has spread outside of computer science to the management sciences and social sciences due to its importance to business and society as a whole. The growing importance of sentiment analysis coincides with the growth of social media such as reviews, forum discussions

  8. A study on PubMed search tag usage pattern: association rule mining of a full-day PubMed query log.

    Science.gov (United States)

    Mosa, Abu Saleh Mohammad; Yoo, Illhoi

    2013-01-09

    The practice of evidence-based medicine requires efficient biomedical literature search such as PubMed/MEDLINE. Retrieval performance relies highly on the efficient use of search field tags. The purpose of this study was to analyze PubMed log data in order to understand the usage pattern of search tags by the end user in PubMed/MEDLINE search. A PubMed query log file was obtained from the National Library of Medicine containing anonymous user identification, timestamp, and query text. Inconsistent records were removed from the dataset and the search tags were extracted from the query texts. A total of 2,917,159 queries were selected for this study issued by a total of 613,061 users. The analysis of frequent co-occurrences and usage patterns of the search tags was conducted using an association mining algorithm. The percentage of search tag usage was low (11.38% of the total queries) and only 2.95% of queries contained two or more tags. Three out of four users used no search tag and about two-third of them issued less than four queries. Among the queries containing at least one tagged search term, the average number of search tags was almost half of the number of total search terms. Navigational search tags are more frequently used than informational search tags. While no strong association was observed between informational and navigational tags, six (out of 19) informational tags and six (out of 29) navigational tags showed strong associations in PubMed searches. The low percentage of search tag usage implies that PubMed/MEDLINE users do not utilize the features of PubMed/MEDLINE widely or they are not aware of such features or solely depend on the high recall focused query translation by the PubMed's Automatic Term Mapping. The users need further education and interactive search application for effective use of the search tags in order to fulfill their biomedical information needs from PubMed/MEDLINE.

  9. Interactive text mining with Pipeline Pilot: a bibliographic web-based tool for PubMed.

    Science.gov (United States)

    Vellay, S G P; Latimer, N E Miller; Paillard, G

    2009-06-01

    Text mining has become an integral part of all research in the medical field. Many text analysis software platforms support particular use cases and only those. We show an example of a bibliographic tool that can be used to support virtually any use case in an agile manner. Here we focus on a Pipeline Pilot web-based application that interactively analyzes and reports on PubMed search results. This will be of interest to any scientist to help identify the most relevant papers in a topical area more quickly and to evaluate the results of query refinement. Links with Entrez databases help both the biologist and the chemist alike. We illustrate this application with Leishmaniasis, a neglected tropical disease, as a case study.

  10. SA-Search: a web tool for protein structure mining based on a Structural Alphabet.

    Science.gov (United States)

    Guyon, Frédéric; Camproux, Anne-Claude; Hochez, Joëlle; Tufféry, Pierre

    2004-07-01

    SA-Search is a web tool that can be used to mine for protein structures and extract structural similarities. It is based on a hidden Markov model derived Structural Alphabet (SA) that allows the compression of three-dimensional (3D) protein conformations into a one-dimensional (1D) representation using a limited number of prototype conformations. Using such a representation, classical methods developed for amino acid sequences can be employed. Currently, SA-Search permits the performance of fast 3D similarity searches such as the extraction of exact words using a suffix tree approach, and the search for fuzzy words viewed as a simple 1D sequence alignment problem. SA-Search is available at http://bioserv.rpbs.jussieu.fr/cgi-bin/SA-Search.

  11. Two Algorithms for Web Applications Assessment

    Directory of Open Access Journals (Sweden)

    Stavros Ioannis Valsamidis

    2011-09-01

    Full Text Available The usage of web applications can be measured with the use of metrics. In a LMS, a typical web application, there are no appropriate metrics which would facilitate their qualitative and quantitative measurement. The purpose of this paper is to propose the use of existing techniques with a different way, in order to analyze the log file of a typical LMS and deduce useful conclusions. Three metrics for course usage measurement are used. It also describes two algorithms for course classification and suggestion actions. The metrics and the algorithms and were in Open eClass LMS tracking data of an academic institution. The results from 39 courses presented interest insights. Although the case study concerns a LMS it can also be applied to other web applications such as e-government, e-commerce, e-banking, blogs e.t.c.

  12. Kernel Methods for Mining Instance Data in Ontologies

    Science.gov (United States)

    Bloehdorn, Stephan; Sure, York

    The amount of ontologies and meta data available on the Web is constantly growing. The successful application of machine learning techniques for learning of ontologies from textual data, i.e. mining for the Semantic Web, contributes to this trend. However, no principal approaches exist so far for mining from the Semantic Web. We investigate how machine learning algorithms can be made amenable for directly taking advantage of the rich knowledge expressed in ontologies and associated instance data. Kernel methods have been successfully employed in various learning tasks and provide a clean framework for interfacing between non-vectorial data and machine learning algorithms. In this spirit, we express the problem of mining instances in ontologies as the problem of defining valid corresponding kernels. We present a principled framework for designing such kernels by means of decomposing the kernel computation into specialized kernels for selected characteristics of an ontology which can be flexibly assembled and tuned. Initial experiments on real world Semantic Web data enjoy promising results and show the usefulness of our approach.

  13. University Students’ Web 2.0 Technologies Usage, Skill Levels and Educational Usage

    OpenAIRE

    Baran, Bahar; Ata, Figen

    2013-01-01

    This study aims to find out university students’ use of Web 2.0 technologies in terms of frequencies, skill levels and educational use and to understand whether or not these variables differ for gender, foreign language levels, computer ownership and the Internet connection duration. Accessible population of this study is the entire Dokuz Eylul University students. In the sample, the researchers collected data from 2776 university students of the university. In the context of the study, blog,...

  14. Legal aspects of search and mining of nuclear ores under Brazilian law

    International Nuclear Information System (INIS)

    Godinho, T.M.

    1980-06-01

    The legal aspects of mining in the Brazilian law its general principles, the basic concepts and rules established in the constitution of Brazil, in the mining code and in special laws are analysed. The rules for mining and usage of nuclear ores and other ores of interest to the nuclear field are emphasized. (A.L.) [pt

  15. Stratification-Based Outlier Detection over the Deep Web.

    Science.gov (United States)

    Xian, Xuefeng; Zhao, Pengpeng; Sheng, Victor S; Fang, Ligang; Gu, Caidong; Yang, Yuanfeng; Cui, Zhiming

    2016-01-01

    For many applications, finding rare instances or outliers can be more interesting than finding common patterns. Existing work in outlier detection never considers the context of deep web. In this paper, we argue that, for many scenarios, it is more meaningful to detect outliers over deep web. In the context of deep web, users must submit queries through a query interface to retrieve corresponding data. Therefore, traditional data mining methods cannot be directly applied. The primary contribution of this paper is to develop a new data mining method for outlier detection over deep web. In our approach, the query space of a deep web data source is stratified based on a pilot sample. Neighborhood sampling and uncertainty sampling are developed in this paper with the goal of improving recall and precision based on stratification. Finally, a careful performance evaluation of our algorithm confirms that our approach can effectively detect outliers in deep web.

  16. Informal Learning through Expertise Mining in the Social Web

    Science.gov (United States)

    Valencia-Garcia, Rafael; Garcia-Sanchez, Francisco; Casado-Lumbreras, Cristina; Castellanos-Nieves, Dagoberto; Fernandez-Breis, Jesualdo Tomas

    2012-01-01

    The advent of Web 2.0, also called the Social Web, has changed the way people interact with the Web. Assisted by the technologies associated with this new trend, users now play a much more active role as content providers. This Web paradigm shift has also changed how companies operate and interact with their employees, partners and customers. The…

  17. LHCb Computing Resource usage in 2017

    CERN Document Server

    Bozzi, Concezio

    2018-01-01

    This document reports the usage of computing resources by the LHCb collaboration during the period January 1st – December 31st 2017. The data in the following sections have been compiled from the EGI Accounting portal: https://accounting.egi.eu. For LHCb specific information, the data is taken from the DIRAC Accounting at the LHCb DIRAC Web portal: http://lhcb-portal-dirac.cern.ch.

  18. Co-clustering Analysis of Weblogs Using Bipartite Spectral Projection Approach

    DEFF Research Database (Denmark)

    Xu, Guandong; Zong, Yu; Dolog, Peter

    2010-01-01

    Web clustering is an approach for aggregating Web objects into various groups according to underlying relationships among them. Finding co-clusters of Web objects is an interesting topic in the context of Web usage mining, which is able to capture the underlying user navigational interest...... and content preference simultaneously. In this paper we will present an algorithm using bipartite spectral clustering to co-cluster Web users and pages. The usage data of users visiting Web sites is modeled as a bipartite graph and the spectral clustering is then applied to the graph representation of usage...... data. The proposed approach is evaluated by experiments performed on real datasets, and the impact of using various clustering algorithms is also investigated. Experimental results have demonstrated the employed method can effectively reveal the subset aggregates of Web users and pages which...

  19. Google Scholar Usage: An Academic Library's Experience

    Science.gov (United States)

    Wang, Ya; Howard, Pamela

    2012-01-01

    Google Scholar is a free service that provides a simple way to broadly search for scholarly works and to connect patrons with the resources libraries provide. The researchers in this study analyzed Google Scholar usage data from 2006 for three library tools at San Francisco State University: SFX link resolver, Web Access Management proxy server,…

  20. Development of a Mine Rescue Drilling System (MRDS)

    Energy Technology Data Exchange (ETDEWEB)

    Raymond, David W. [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States); Gaither, Katherine N. [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States); Polsky, Yarom [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States); Knudsen, Steven D. [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States); Broome, Scott Thomas [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States); Su, Jiann-Cherng [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States); Blankenship, Douglas A. [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States); Costin, Laurence S. [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)

    2014-06-01

    Sandia National Laboratories (Sandia) has a long history in developing compact, mobile, very high-speed drilling systems and this technology could be applied to increasing the rate at which boreholes are drilled during a mine accident response. The present study reviews current technical approaches, primarily based on technology developed under other programs, analyzes mine rescue specific requirements to develop a conceptual mine rescue drilling approach, and finally, proposes development of a phased mine rescue drilling system (MRDS) that accomplishes (1) development of rapid drilling MRDS equipment; (2) structuring improved web communication through the Mine Safety & Health Administration (MSHA) web site; (3) development of an improved protocol for employment of existing drilling technology in emergencies; (4) deployment of advanced technologies to complement mine rescue drilling operations during emergency events; and (5) preliminary discussion of potential future technology development of specialized MRDS equipment. This phased approach allows for rapid fielding of a basic system for improved rescue drilling, with the ability to improve the system over time at a reasonable cost.

  1. Lightweight monitoring and control system for coal mine safety using REST style.

    Science.gov (United States)

    Cheng, Bo; Cheng, Xin; Chen, Junliang

    2015-01-01

    The complex environment of a coal mine requires the underground environment, devices and miners to be constantly monitored to ensure safe coal production. However, existing coal mines do not meet these coverage requirements because blind spots occur when using a wired network. In this paper, we develop a Web-based, lightweight remote monitoring and control platform using a wireless sensor network (WSN) with the REST style to collect temperature, humidity and methane concentration data in a coal mine using sensor nodes. This platform also collects information on personnel positions inside the mine. We implement a RESTful application programming interface (API) that provides access to underground sensors and instruments through the Web such that underground coal mine physical devices can be easily interfaced to remote monitoring and control applications. We also implement three different scenarios for Web-based, lightweight remote monitoring and control of coal mine safety and measure and analyze the system performance. Finally, we present the conclusions from this study and discuss future work. Copyright © 2014 ISA. Published by Elsevier Ltd. All rights reserved.

  2. Web-Based Analysis for Decision Support Systems

    African Journals Online (AJOL)

    pc

    2018-03-05

    Mar 5, 2018 ... such as web mining, social analytics, and data mining were examined. ... Additionally, the systems possess superb interaction capability which enables .... technologies has a significant impact on DSS design especially ..... Evaluating the Impact of User Characteristics and Different Layouts on an Interactive ...

  3. Project management in mine actions using Multi-Criteria-Analysis-based decision support system

    Directory of Open Access Journals (Sweden)

    Marko Mladineo

    2014-12-01

    Full Text Available In this paper, a Web-based Decision Support System (Web DSS, that supports humanitarian demining operations and restoration of mine-contaminated areas, is presented. The financial shortage usually triggers a need for priority setting in Project Management in Mine actions. As part of the FP7 Project TIRAMISU, a specialized Web DSS has been developed to achieve a fully transparent priority setting process. It allows stakeholders and donors to actively join the decision making process using a user-friendly and intuitive Web application. The main advantage of this Web DSS is its unique way of managing a mine action project using Multi-Criteria Analysis (MCA, namely the PROMETHEE method, in order to select priorities for demining actions. The developed Web DSS allows decision makers to use several predefined scenarios (different criteria weights or to develop their own, so it allows project managers to compare different demining possibilities with ease.

  4. Design research of uranium mine borehole database

    International Nuclear Information System (INIS)

    Xie Huaming; Hu Guangdao; Zhu Xianglin; Chen Dehua; Chen Miaoshun

    2008-01-01

    With short supply of energy sources, exploration of uranium mine have been enhanced, but data storage, analysis and usage of exploration data of uranium mine are not highly computerized currently in China, the data is poor shared and used that it can not adapt the need of production and research. It will be well done, if the data are stored and managed in a database system. The concept structure design, logic structure design and data integrity checks are discussed according to the demand of applications and the analysis of exploration data of uranium mine. An application of the database is illustrated finally. (authors)

  5. Intelligent Information Retrieval and Web Mining Architecture Using SOA

    Science.gov (United States)

    El-Bathy, Naser Ibrahim

    2010-01-01

    The study of this dissertation provides a solution to a very specific problem instance in the area of data mining, data warehousing, and service-oriented architecture in publishing and newspaper industries. The research question focuses on the integration of data mining and data warehousing. The research problem focuses on the development of…

  6. Web pages of Slovenian public libraries

    Directory of Open Access Journals (Sweden)

    Silva Novljan

    2002-01-01

    Full Text Available Libraries should offer their patrons web sites which establish the unmistakeable concept (public of library, the concept that cannot be mistaken for other information brokers and services available on the Internet, but inside this framework of the concept of library, would show a diversity which directs patrons to other (public libraries. This can be achieved by reliability, quality of information and services, and safety of usage.Achieving this, patrons regard library web sites as important reference sources deserving continuous usage for obtaining relevant information. Libraries excuse investment in the development and sustainance of their web sites by the number of visits and by patron satisfaction. The presented research, made on a sample of Slovene public libraries’web sites, determines how the libraries establish their purpose and role, as well as the given professional recommendations in web site design.The results uncover the striving of libraries for the modernisation of their functions,major attention is directed to the presentation of classic libraries and their activities,lesser to the expansion of available contents and electronic sources. Pointing to their diversity is significant since it is not a result of patrons’ needs, but more the consequence of improvisation, too little attention to selection, availability, organisation and formation of different kind of information and services on the web sites. Based on the analysis of a common concept of the public library web site, certain activities for improving the existing state of affairs are presented in the paper.

  7. The world wide web: exploring a new advertising environment.

    Science.gov (United States)

    Johnson, C R; Neath, I

    1999-01-01

    The World Wide Web currently boasts millions of users in the United States alone and is likely to continue to expand both as a marketplace and as an advertising environment. Three experiments explored advertising in the Web environment, in particular memory for ads as they appear in everyday use across the Web. Experiments 1 and 2 examined the effect of advertising repetition on the retention of familiar and less familiar brand names, respectively. Experiment 1 demonstrated that repetition of a banner ad within multiple web pages can improve recall of familiar brand names, and Experiment 2 demonstrated that repetition can improve recognition of less familiar brand names. Experiment 3 directly compared the retention of familiar and less familiar brand names that were promoted by static and dynamic ads and demonstrated that the use of dynamic advertising can increase brand name recall, though only for familiar brand names. This study also demonstrated that, in the Web environment, much as in other advertising environments, familiar brand names possess a mnemonic advantage not possessed by less familiar brand names. Finally, data regarding Web usage gathered from all experiments confirm reports that Web usage among males tends to exceed that among females.

  8. Context mining and integration into predictive web analytics

    NARCIS (Netherlands)

    Kiseleva, Y.

    2013-01-01

    Predictive Web Analytics is aimed at understanding behavioural patterns of users of various web-based applications: e-commerce, ubiquitous and mobile computing, and computational advertising. Within these applications business decisions often rely on two types of predictions: an overall or

  9. PaaS for web applications with OpenShift Origin

    OpenAIRE

    Lossent, A; Rodriguez Peon, A; Wagner, A

    2017-01-01

    The CERN Web Frameworks team has deployed OpenShift Origin to facilitate deployment of web applications and to improving efficiency in terms of computing resource usage. OpenShift leverages Docker containers and Kubernetes orchestration to provide a Platform-as-a-service solution oriented for web applications. We will review use cases and how OpenShift was integrated with other services such as source control, web site management and authentication services.

  10. PaaS for web applications with OpenShift Origin

    Science.gov (United States)

    Lossent, A.; Rodriguez Peon, A.; Wagner, A.

    2017-10-01

    The CERN Web Frameworks team has deployed OpenShift Origin to facilitate deployment of web applications and to improving efficiency in terms of computing resource usage. OpenShift leverages Docker containers and Kubernetes orchestration to provide a Platform-as-a-service solution oriented for web applications. We will review use cases and how OpenShift was integrated with other services such as source control, web site management and authentication services.

  11. Creating Usage Context-Based Object Similarities to Boost Recommender Systems in Technology Enhanced Learning

    Science.gov (United States)

    Niemann, Katja; Wolpers, Martin

    2015-01-01

    In this paper, we introduce a new way of detecting semantic similarities between learning objects by analysing their usage in web portals. Our approach relies on the usage-based relations between the objects themselves rather then on the content of the learning objects or on the relations between users and learning objects. We then take this new…

  12. Collecting conditions usage metadata to optimize current and future ATLAS software and processing

    CERN Document Server

    AUTHOR|(INSPIRE)INSPIRE-00064378; The ATLAS collaboration; Formica, Andrea; Gallas, Elizabeth; Oda, Susumu; Rinaldi, Lorenzo; Rybkin, Grigori; Verducci, Monica

    2017-01-01

    Conditions data (for example: alignment, calibration, data quality) are used extensively in the processing of real and simulated data in ATLAS. The volume and variety of the conditions data needed by different types of processing are quite diverse, so optimizing its access requires a careful understanding of conditions usage patterns. These patterns can be quantified by mining representative log files from each type of processing and gathering detailed information about conditions usage for that type of processing into a central repository.

  13. Patterns of Internet Usage: Learning Sphere and the Socio-cultural Context

    Directory of Open Access Journals (Sweden)

    Hossein Ebrahimabadi

    2009-11-01

    Full Text Available In addition to the curriculum and the learning targets, there are some other points –as “the culture of the real life”, “patterns of communication and virtual-life’s experiencing”, and generally “pattern of communication and internet usage”- should be considered in evaluating internet. Applying results of a survey on the impacts of both the web-based and the traditional educational methods on students’ learning and motivation, the present study explores the patterns of internet usage. Research method is experimental, using the t test for independent groups and analyzing multi-variable regression, and some points as the population, method of sampling and data gathering is explained in the article. Results show that there is a meaningful difference between the grades of the test group and the witness group; thus variable of “the internet usage” could predict changes in learning. In other words, supra-usage of internet would decrease learning and curriculum development. However, using internet for scientific and schooling would cause students to correlate their patterns of computer and internet usage. As results show, decline in entertaining usage of internet is related to the socio-cultural context, way and amount of participating in the web, and the quality of virtual learning sphere, rather than the interest or disinterest of the users.

  14. Service mining framework and application

    CERN Document Server

    Chang, Wei-Lun

    2014-01-01

    The shifting focus of service from the 1980s to 2000s has proved that IT not only lowers the cost of service but creates avenues to enhance and increase revenue through service. The new type of service, e-service, is mobile, flexible, interactive, and interchangeable. While service science provides an avenue for future service researches, the specific research areas from the IT perspective still need to be elaborated. This book introduces a novel concept-service mining-to address several research areas from technology, model, management, and application perspectives. Service mining is defined as "a systematical process including service discovery, service experience, service recovery, and service retention to discover unique patterns and exceptional values within the existing services." The goal of service mining is similar to data mining, text mining, or web mining, and aims to "detect something new" from the service pool. The major difference is the feature of service is quite distinct from the mining targe...

  15. An Evaluative Methodology for Virtual Communities Using Web Analytics

    Science.gov (United States)

    Phippen, A. D.

    2004-01-01

    The evaluation of virtual community usage and user behaviour has its roots in social science approaches such as interview, document analysis and survey. Little evaluation is carried out using traffic or protocol analysis. Business approaches to evaluating customer/business web site usage are more advanced, in particular using advanced web…

  16. Awareneness and usage of web 2.0 tools among lecturers in ...

    African Journals Online (AJOL)

    Findings from the study revealed a high level of awareness and use of Web 2.0 tools among the lecturers in Nigerian universities while facebook, youtube, linkedln, twitter, wikis, and podcasting were found to be the popular tools among the lecturers. Also, facebook, linkedln, and wikis were found to be the most used Web ...

  17. Prediction of users webpage access behaviour using association ...

    Indian Academy of Sciences (India)

    pages mainly depended on the support and lift measure whereas confidence assumed ... Apriori algorithm; association rules; data mining; MSNBC; web usage .... clustering was used in finding the user access patterns from web access log. .... satisfied the minimum support and confidence of 0.6% and 100% respectively.

  18. Deploying and sharing U-Compare workflows as web services.

    Science.gov (United States)

    Kontonatsios, Georgios; Korkontzelos, Ioannis; Kolluru, Balakrishna; Thompson, Paul; Ananiadou, Sophia

    2013-02-18

    U-Compare is a text mining platform that allows the construction, evaluation and comparison of text mining workflows. U-Compare contains a large library of components that are tuned to the biomedical domain. Users can rapidly develop biomedical text mining workflows by mixing and matching U-Compare's components. Workflows developed using U-Compare can be exported and sent to other users who, in turn, can import and re-use them. However, the resulting workflows are standalone applications, i.e., software tools that run and are accessible only via a local machine, and that can only be run with the U-Compare platform. We address the above issues by extending U-Compare to convert standalone workflows into web services automatically, via a two-click process. The resulting web services can be registered on a central server and made publicly available. Alternatively, users can make web services available on their own servers, after installing the web application framework, which is part of the extension to U-Compare. We have performed a user-oriented evaluation of the proposed extension, by asking users who have tested the enhanced functionality of U-Compare to complete questionnaires that assess its functionality, reliability, usability, efficiency and maintainability. The results obtained reveal that the new functionality is well received by users. The web services produced by U-Compare are built on top of open standards, i.e., REST and SOAP protocols, and therefore, they are decoupled from the underlying platform. Exported workflows can be integrated with any application that supports these open standards. We demonstrate how the newly extended U-Compare enhances the cross-platform interoperability of workflows, by seamlessly importing a number of text mining workflow web services exported from U-Compare into Taverna, i.e., a generic scientific workflow construction platform.

  19. A Web-Based GIS for Reporting Water Usage in the High Plains Underground Water Conservation District

    Science.gov (United States)

    Jia, M.; Deeds, N.; Winckler, M.

    2012-12-01

    The High Plains Underground Water Conservation District (HPWD) is the largest and oldest of the Texas water conservation districts, and oversees approximately 1.7 million irrigated acres. Recent rule changes have motivated HPWD to develop a more automated system to allow owners and operators to report well locations, meter locations, meter readings, the association between meters and wells, and contiguous acres. INTERA, Inc. has developed a web-based interactive system for HPWD water users to report water usage and for the district to better manage its water resources. The HPWD web management system utilizes state-of-the-art GIS techniques, including cloud-based Amazon EC2 virtual machine, ArcGIS Server, ArcSDE and ArcGIS Viewer for Flex, to support web-based water use management. The system enables users to navigate to their area of interest using a well-established base-map and perform a variety of operations and inquiries against their spatial features. The application currently has six components: user privilege management, property management, water meter registration, area registration, meter-well association and water use report. The system is composed of two main databases: spatial database and non-spatial database. With the help of Adobe Flex application at the front end and ArcGIS Server as the middle-ware, the spatial feature geometry and attributes update will be reflected immediately in the back end. As a result, property owners, along with the HPWD staff, collaborate together to weave the fabric of the spatial database. Interactions between the spatial and non-spatial databases are established by Windows Communication Foundation (WCF) services to record water-use report, user-property associations, owner-area associations, as well as meter-well associations. Mobile capabilities will be enabled in the near future for field workers to collect data and synchronize them to the spatial database. The entire solution is built on a highly scalable cloud

  20. Proceedings. Fourth international symposium on mine mechanisation and automation

    Energy Technology Data Exchange (ETDEWEB)

    Gurgenci, H.; Hood, M. [eds.

    1997-12-31

    Papers in the first volume are presented under the following session headings: drilling; mining robotics; machine monitoring; mine automation systems; reliability and maintenance; mine automation - communications mechanical excavation of medium-strength rock; and new mining equipment technologies. The second volume covers: mechanical excavation of hard rock; autonomous vehicles; mechanical excavation industry experience; machine guidance; applications of rock mechanics, mine planning management and scheduling; orebody delineation; and safety. Selected papers have been abstracted separately for the IEA Coal Research databases available on CD-ROM and the worldwide web.

  1. Carbon and nitrogen stable isotopes and metal concentration in food webs from a mining-impacted coastal lagoon

    International Nuclear Information System (INIS)

    Marin-Guirao, Lazaro; Lloret, Javier; Marin, Arnaldo

    2008-01-01

    Two food webs from the Mar Menor coastal lagoon, differing in the distance from the desert-stream through which mining wastes were discharged, were examined by reference to essential (Zn and Cu) and non-essential (Pb and Cd) metal concentrations and stable isotopes content (C and N). The partial extraction technique applied, which reflects the availability of metals to organisms after sediment ingestion, showed higher bioavailable metal concentrations in sediments from the station influenced by the mining discharges, in agreement with the higher metal concentrations observed in organisms, which in many cases exceeded the regulatory limits established in Spanish legislation concerning seafood. Spatial differences in essential metal concentrations in the fauna suggest that several organisms are exposed to metal levels above their regulation capacity. Differences in isotopic composition were found between both food webs, the wadi-influenced station showing higher δ 15 N values and lower δ 13 C levels, due to the discharge of urban waste waters and by the entrance of freshwater and allochthonous marsh plants. The linear-regressions between trophic levels (as indicated by δ 15 N) and the metal content indicated that biomagnification does not occur. In the case of invertebrates, since the 'handle strategy' of the species and the physiological requirements of the organisms, among other factors, determine the final concentration of a specific element, no clear relationships between trophic level and the metal content are to be expected. For their part, fish communities did not show clear patterns in the case of any of the analyzed metals, probably because most fish species have similar metal requirements, and because biological factors also intervened. Finally, since the study deals with metals, assumptions concerning trophic transfer factors calculation may not be suitable since the metal burden originates not only from the prey but also from adsorption over the body

  2. Carbon and nitrogen stable isotopes and metal concentration in food webs from a mining-impacted coastal lagoon

    Energy Technology Data Exchange (ETDEWEB)

    Marin-Guirao, Lazaro [Departamento de Ecologia e Hidrologia, Facultad de Biologia, Universidad de Murcia, 30100-Murcia (Spain)], E-mail: lamarin@um.es; Lloret, Javier; Marin, Arnaldo [Departamento de Ecologia e Hidrologia, Facultad de Biologia, Universidad de Murcia, 30100-Murcia (Spain)

    2008-04-01

    Two food webs from the Mar Menor coastal lagoon, differing in the distance from the desert-stream through which mining wastes were discharged, were examined by reference to essential (Zn and Cu) and non-essential (Pb and Cd) metal concentrations and stable isotopes content (C and N). The partial extraction technique applied, which reflects the availability of metals to organisms after sediment ingestion, showed higher bioavailable metal concentrations in sediments from the station influenced by the mining discharges, in agreement with the higher metal concentrations observed in organisms, which in many cases exceeded the regulatory limits established in Spanish legislation concerning seafood. Spatial differences in essential metal concentrations in the fauna suggest that several organisms are exposed to metal levels above their regulation capacity. Differences in isotopic composition were found between both food webs, the wadi-influenced station showing higher {delta}{sup 15}N values and lower {delta}{sup 13}C levels, due to the discharge of urban waste waters and by the entrance of freshwater and allochthonous marsh plants. The linear-regressions between trophic levels (as indicated by {delta}{sup 15}N) and the metal content indicated that biomagnification does not occur. In the case of invertebrates, since the 'handle strategy' of the species and the physiological requirements of the organisms, among other factors, determine the final concentration of a specific element, no clear relationships between trophic level and the metal content are to be expected. For their part, fish communities did not show clear patterns in the case of any of the analyzed metals, probably because most fish species have similar metal requirements, and because biological factors also intervened. Finally, since the study deals with metals, assumptions concerning trophic transfer factors calculation may not be suitable since the metal burden originates not only from the prey but

  3. A Generic Framework for Extraction of Knowledge from Social Web Sources (Social Networking Websites for an Online Recommendation System

    Directory of Open Access Journals (Sweden)

    Javubar Sathick

    2015-04-01

    Full Text Available Mining social web data is a challenging task and finding user interest for personalized and non-personalized recommendation systems is another important task. Knowledge sharing among web users has become crucial in determining usage of web data and personalizing content in various social websites as per the user’s wish. This paper aims to design a framework for extracting knowledge from web sources for the end users to take a right decision at a crucial juncture. The web data is collected from various web sources and structured appropriately and stored as an ontology based data repository. The proposed framework implements an online recommender application for the learners online who pursue their graduation in an open and distance learning environment. This framework possesses three phases: data repository, knowledge engine, and online recommendation system. The data repository possesses common data which is attained by the process of acquiring data from various web sources. The knowledge engine collects the semantic data from the ontology based data repository and maps it to the user through the query processor component. Establishment of an online recommendation system is used to make recommendations to the user for a decision making process. This research work is implemented with the help of an experimental case study which deals with an online recommendation system for the career guidance of a learner. The online recommendation application is implemented with the help of R-tool, NLP parser and clustering algorithm.This research study will help users to attain semantic knowledge from heterogeneous web sources and to make decisions.

  4. Accelerator Physics Code Web Repository

    CERN Document Server

    Zimmermann, Frank; Bellodi, G; Benedetto, E; Dorda, U; Giovannozzi, Massimo; Papaphilippou, Y; Pieloni, T; Ruggiero, F; Rumolo, G; Schmidt, F; Todesco, E; Zotter, Bruno W; Payet, J; Bartolini, R; Farvacque, L; Sen, T; Chin, Y H; Ohmi, K; Oide, K; Furman, M; Qiang, J; Sabbi, G L; Seidl, P A; Vay, J L; Friedman, A; Grote, D P; Cousineau, S M; Danilov, V; Holmes, J A; Shishlo, A; Kim, E S; Cai, Y; Pivi, M; Kaltchev, D I; Abell, D T; Katsouleas, Thomas C; Boine-Frankenheim, O; Franchetti, G; Hofmann, I; Machida, S; Wei, J

    2006-01-01

    In the framework of the CARE HHH European Network, we have developed a web-based dynamic acceleratorphysics code repository. We describe the design, structure and contents of this repository, illustrate its usage, and discuss our future plans, with emphasis on code benchmarking.

  5. ACCELERATION PHYSICS CODE WEB REPOSITORY.

    Energy Technology Data Exchange (ETDEWEB)

    WEI, J.

    2006-06-26

    In the framework of the CARE HHH European Network, we have developed a web-based dynamic accelerator-physics code repository. We describe the design, structure and contents of this repository, illustrate its usage, and discuss our future plans, with emphasis on code benchmarking.

  6. Data mining usage in health care management: literature survey and decision tree application

    Directory of Open Access Journals (Sweden)

    Dijana Ćosić

    2008-02-01

    Full Text Available Aim To show the benefits of data mining in health care management.In this example, we are going to show a way to raise awarenessof women in terms of contraceptive methods they use (do notuse.Methods Goal of the data mining analysis was to determine ifthere are common characteristics of the women according to theirchoice of contraception (typical classification problem. Therefore,we decided to use decision trees. We have generated a CHAIDmodel in “Statistica”, based on the database that was formed as aresult of an Indonesian research that was conducted in 1987. Thesample contains married women who were either not pregnant ordid not know if they were pregnant at the time of the interview.The database consists of 1473 cases. Also, an extensive internetsearch was conducted in order to detect a number of articles citedin scientific databases published on the subject of data mining inhealth care management.Results It has shown that the most important variable in case ofwomen’s choice of contraceptive methods is – a husband’s profession.Also we retrieved 221 articles published on the application ofdata mining in health care.Conclusion The goal of the paper is achieved in two ways: first,retrieving 221 articles published on the subject we have proved thebenefits of data mining in the health care management. Second,the decision tree method is successfully applied in explanation ofwomen’s choice of contraceptive methods.

  7. Dental practice websites: creating a Web presence.

    Science.gov (United States)

    Miller, Syrene A; Forrest, Jane L

    2002-07-01

    Web technology provides an opportunity for dentists to showcase their practice philosophy, quality of care, office setting, and staff in a creative manner. Having a Website provides a practice with innovative and cost-effective communications and marketing tools for current and potential patients who use the Internet. The main benefits of using a Website to promote one's practice are: Making office time more productive, tasks more timely, follow-up less necessary Engaging patients in an interactive and visual learning process Providing online forms and procedure examples for patients Projecting a competent and current image Tracking the usage of Web pages. Several options are available when considering the development of a Website. These options range in cost based on customization of the site and ongoing support services, such as site updates, technical assistance, and Web usage statistics. In most cases, Websites are less expensive than advertising in the phone book. Options in creating a Website include building one's own, employing a company that offers Website templates, and employing a company that offers customized sites. These development options and benefits will continue to grow as individuals access the Web and more information and sites become available.

  8. Participants, usage, and use patterns of a web-based intervention for the prevention of depression within a randomized controlled trial.

    Science.gov (United States)

    Kelders, Saskia M; Bohlmeijer, Ernst T; Van Gemert-Pijnen, Julia Ewc

    2013-08-20

    nonadherers and adherers, and fewer sessions to complete the lesson than adherers. Furthermore, late nonadherers seemed to have a shorter total duration of sessions than adherers. By using log data combined with baseline characteristics of participants, we extracted valuable lessons for redesign of this intervention and the design of Web-based interventions in general. First, although characteristics of respondents can significantly predict adherence, their predictive value is small. Second, it is important to design Web-based interventions to foster adherence and usage of all features in an intervention. Dutch Trial Register Number: NTR3007; http://www.trialregister.nl/trialreg/admin/rctview.asp?TC=3007 (Archived by WebCite at http://www.webcitation.org/6ILhI3rd8).

  9. Learners’ Evaluation Based on Data Mining in a Web Based Learning Environment

    Directory of Open Access Journals (Sweden)

    İdris GÖKSU

    2015-06-01

    Full Text Available This study has been done in order to determine the efficiency level in the extend of learners’ evaluation by means of comparing the Web Based Learning (WBL with traditional face to face learning. In this respect, the effect of WBL and traditional environment has been analyzed in the class of Visual Programming I, and the learners have been evaluated with the rule based data mining method in a WBL environment. The study has been conducted according to experimental design with pre-test and post-test groups. Experimental group has attended the class in WBL environment, and the control group in a traditional class environment. In accordance with the pre-test and post-test scores of experimental and control groups, both methods have been proved to be effective. According the average scores of post-test, the learners in experimental groups have been more successful than the ones in the control group. The guiding of WBL system prepared for the study has been found to be significant in terms of both underlining the points in which the learners are unsuccessful in a short time and having trust in the system technically.

  10. Usage of cell nomenclature in biomedical literature

    KAUST Repository

    Kafkas, Senay

    2017-12-21

    Background Cell lines and cell types are extensively studied in biomedical research yielding to a significant amount of publications each year. Identifying cell lines and cell types precisely in publications is crucial for science reproducibility and knowledge integration. There are efforts for standardisation of the cell nomenclature based on ontology development to support FAIR principles of the cell knowledge. However, it is important to analyse the usage of cell nomenclature in publications at a large scale for understanding the level of uptake of cell nomenclature in literature by scientists. In this study, we analyse the usage of cell nomenclature, both in Vivo, and in Vitro in biomedical literature by using text mining methods and present our results. Results We identified 59% of the cell type classes in the Cell Ontology and 13% of the cell line classes in the Cell Line Ontology in the literature. Our analysis showed that cell line nomenclature is much more ambiguous compared to the cell type nomenclature. However, trends indicate that standardised nomenclature for cell lines and cell types are being increasingly used in publications by the scientists. Conclusions Our findings provide an insight to understand how experimental cells are described in publications and may allow for an improved standardisation of cell type and cell line nomenclature as well as can be utilised to develop efficient text mining applications on cell types and cell lines. All data generated in this study is available at https://github.com/shenay/CellNomenclatureStudy.

  11. Evaluation of ecological constraints on peat mining in New Brunswick

    Energy Technology Data Exchange (ETDEWEB)

    Gautreau-Daigle, H

    1990-07-01

    A study was undertaken to obtain baseline information on moose and waterfowl usage of peatlands in the Escuminac bog complex in New Brunswick, in order to determine the impact of existing peat mining activities and to assist in making decisions regarding future resource development. The bog complex comprises a relatively large number of freshwater ponds which support breeding populations for waterfowl and serve as staging areas during bird migrations. Aerial surveys were carried out to quantify the use of these ponds by waterfowl and to determine changes in their level of use as a result of peat extraction. Results indicate that usage of ponds by birds seems mostly limited to staging and migration, except for black and ring-necked ducks. Those species are the most significant users of bog ponds and have been found to breed and raise young in the ponds. Some areas were found to get more waterfowl than others, but this was not shown to be related to peat mining activity. Active mined areas were devoid of waterfowl, but this area was a relatively small portion of the total bog area. The moose survey examined moose activity in a control area (without peat mining) and a representative bog area where peat mining occurred. Results do not indicate a difference in the moose activity patterns between the two areas. 9 refs., 25 figs., 17 tabs.

  12. Data mining application in customer relationship management for hospital inpatients.

    Science.gov (United States)

    Lee, Eun Whan

    2012-09-01

    This study aims to discover patients loyal to a hospital and model their medical service usage patterns. Consequently, this study proposes a data mining application in customer relationship management (CRM) for hospital inpatients. A recency, frequency, monetary (RFM) model has been applied toward 14,072 patients discharged from a university hospital. Cluster analysis was conducted to segment customers, and it modeled the patterns of the loyal customers' medical services usage via a decision tree. Patients were divided into two groups according to the variables of the RFM model and the group which had significantly high frequency of medical use and expenses was defined as loyal customers, a target market. As a result of the decision tree, the predictable factors of the loyal clients were; length of stay, certainty of selectable treatment, surgery, number of accompanying treatments, kind of patient room, and department from which they were discharged. Particularly, this research showed that when a patient within the internal medicine department who did not have surgery stayed for more than 13.5 days, their probability of being a classified as a loyal customer was 70.0%. To discover a hospital's loyal patients and model their medical usage patterns, the application of data-mining has been suggested. This paper suggests practical use of combining segmentation, targeting, positioning (STP) strategy and the RFM model with data-mining in CRM.

  13. Improving the web site's effectiveness by considering each page's temporal information

    NARCIS (Netherlands)

    Li, ZG; Sun, MT; Dunham, MH; Xiao, YQ; Dong, G; Tang, C; Wang, W

    2003-01-01

    Improving the effectiveness of a web site is always one of its owner's top concerns. By focusing on analyzing web users' visiting behavior, web mining researchers have developed a variety of helpful methods, based upon association rules, clustering, prediction and so on. However, we have found

  14. Data mining methods

    CERN Document Server

    Chattamvelli, Rajan

    2015-01-01

    DATA MINING METHODS, Second Edition discusses both theoretical foundation and practical applications of datamining in a web field including banking, e-commerce, medicine, engineering and management. This book starts byintroducing data and information, basic data type, data category and applications of data mining. The second chapterbriefly reviews data visualization technology and importance in data mining. Fundamentals of probability and statisticsare discussed in chapter 3, and novel algorithm for sample covariants are derived. The next two chapters give an indepthand useful discussion of data warehousing and OLAP. Decision trees are clearly explained and a new tabularmethod for decision tree building is discussed. The chapter on association rules discusses popular algorithms andcompares various algorithms in summary table form. An interesting application of genetic algorithm is introduced inthe next chapter. Foundations of neural networks are built from scratch and the back propagation algorithm is derived...

  15. Analyzing Web Server Logs to Improve a Site's Usage. The Systems Librarian

    Science.gov (United States)

    Breeding, Marshall

    2005-01-01

    This column describes ways to streamline and optimize how a Web site works in order to improve both its usability and its visibility. The author explains how to analyze logs and other system data to measure the effectiveness of the Web site design and search engine.

  16. Off the Beaten tracks: Exploring Three Aspects of Web Navigation

    NARCIS (Netherlands)

    Weinreich, H.; Obendorf, H.; Herder, E.; Mayer, M.; Edmonds, H.; Hawkey, K.; Kellar, M.; Turnbull, D.

    2006-01-01

    This paper presents results of a long-term client-side Web usage study, updating previous studies that range in age from five to ten years. We focus on three aspects of Web navigation: changes in the distribution of navigation actions, speed of navigation and within-page navigation. “Navigation

  17. Development of a web-based, underground coalmine gas outburst information management system

    Energy Technology Data Exchange (ETDEWEB)

    Naj Aziz; Richard Caladine; Lucia Tome; Ken Cram; Devendra Vyas [University of Wollongong, NSW (Australia)

    2007-04-15

    The primary objective of this project was to develop an online coal mine outburst information management system to provide the coal mining industry with the necessary information and knowledge on outbursts via the World Wide Web. The Website has been constructed using the standard web format. Access to the site is by standard web browsers. The address of the site is http://www.uow.edu.au/eng/outburst. The website has 85 conference papers which were held in Australia, dating as far back as the 1980's, various seminar presentations, more than 250 references, a limited but important collection of international papers, direct links to ACARP and NERRDC publication lists, links to several leading organisations of particular interest in mine gas and outburst control. These links include both private and government organisations, and a forum for discussion.

  18. Applying Supervised Opinion Mining Techniques on Online User Reviews

    Directory of Open Access Journals (Sweden)

    Ion SMEUREANU

    2012-01-01

    Full Text Available In recent years, the spectacular development of web technologies, lead to an enormous quantity of user generated information in online systems. This large amount of information on web platforms make them viable for use as data sources, in applications based on opinion mining and sentiment analysis. The paper proposes an algorithm for detecting sentiments on movie user reviews, based on naive Bayes classifier. We make an analysis of the opinion mining domain, techniques used in sentiment analysis and its applicability. We implemented the proposed algorithm and we tested its performance, and suggested directions of development.

  19. Environment: General; Grammar & Usage; Money Management; Music History; Web Page Creation & Design.

    Science.gov (United States)

    Web Feet, 2001

    2001-01-01

    Describes Web site resources for elementary and secondary education in the topics of: environment, grammar, money management, music history, and Web page creation and design. Each entry includes an illustration of a sample page on the site and an indication of the grade levels for which it is appropriate. (AEF)

  20. Applied data mining for business and industry

    CERN Document Server

    Giudici, Paolo

    2009-01-01

    The increasing availability of data in our current, information overloaded society has led to the need for valid tools for its modelling and analysis. Data mining and applied statistical methods are the appropriate tools to extract knowledge from such data. This book provides an accessible introduction to data mining methods in a consistent and application oriented statistical framework, using case studies drawn from real industry projects and highlighting the use of data mining methods in a variety of business applications. Introduces data mining methods and applications.Covers classical and Bayesian multivariate statistical methodology as well as machine learning and computational data mining methods.Includes many recent developments such as association and sequence rules, graphical Markov models, lifetime value modelling, credit risk, operational risk and web mining.Features detailed case studies based on applied projects within industry.Incorporates discussion of data mining software, with case studies a...

  1. Performance Issues Related to Web Service Usage for Remote Data Access

    International Nuclear Information System (INIS)

    Pais, V. F.; Stancalie, V.; Mihailescu, F. A.; Totolici, M. C.

    2008-01-01

    Web services are starting to be widely used in applications for remotely accessing data. This is of special interest for research based on small and medium scale fusion devices, since scientists participating remotely to experiments are accessing large amounts of data over the Internet. Recent tests were conducted to see how the new network traffic, generated by the use of web services, can be integrated in the existing infrastructure and what would be the impact over existing applications, especially those used in a remote participation scenario

  2. Analysis of mesenchymal stem cell differentiation in vitro using classification association rule mining.

    Science.gov (United States)

    Wang, Weiqi; Wang, Yanbo Justin; Bañares-Alcántara, René; Coenen, Frans; Cui, Zhanfeng

    2009-12-01

    In this paper, data mining is used to analyze the data on the differentiation of mammalian Mesenchymal Stem Cells (MSCs), aiming at discovering known and hidden rules governing MSC differentiation, following the establishment of a web-based public database containing experimental data on the MSC proliferation and differentiation. To this effect, a web-based public interactive database comprising the key parameters which influence the fate and destiny of mammalian MSCs has been constructed and analyzed using Classification Association Rule Mining (CARM) as a data-mining technique. The results show that the proposed approach is technically feasible and performs well with respect to the accuracy of (classification) prediction. Key rules mined from the constructed MSC database are consistent with experimental observations, indicating the validity of the method developed and the first step in the application of data mining to the study of MSCs.

  3. MB3-Miner: efficiently mining eMBedded subTREEs using Tree Model Guided candidate generation

    NARCIS (Netherlands)

    Tan, H.; Dillon, T.; Hadzic, F.; Chang, E.; Feng, L.

    2005-01-01

    Tree mining has many useful applications in areas such as Bioinformatics, XML mining, Web mining, etc. In general, most of the formally represented information in these domains is a tree structured form. In this paper we focus on mining frequent embedded subtrees from databases of rooted labeled

  4. Method for effective usage of Google Analytics tools

    Directory of Open Access Journals (Sweden)

    Ирина Николаевна Егорова

    2016-01-01

    Full Text Available Modern Google Analytics tools have been investigated against effective attraction channels for users and bottlenecks detection. Conducted investigation allowed to suggest modern method for effective usage of Google Analytics tools. The method is based on main traffic indicators analysis, as well as deep analysis of goals and their consecutive tweaking. Method allows to increase website conversion and might be useful for SEO and Web analytics specialists

  5. pubmed.mineR: an R package with text-mining algorithms to analyse PubMed abstracts.

    Science.gov (United States)

    Rani, Jyoti; Shah, A B Rauf; Ramachandran, Srinivasan

    2015-10-01

    The PubMed literature database is a valuable source of information for scientific research. It is rich in biomedical literature with more than 24 million citations. Data-mining of voluminous literature is a challenging task. Although several text-mining algorithms have been developed in recent years with focus on data visualization, they have limitations such as speed, are rigid and are not available in the open source. We have developed an R package, pubmed.mineR, wherein we have combined the advantages of existing algorithms, overcome their limitations, and offer user flexibility and link with other packages in Bioconductor and the Comprehensive R Network (CRAN) in order to expand the user capabilities for executing multifaceted approaches. Three case studies are presented, namely, 'Evolving role of diabetes educators', 'Cancer risk assessment' and 'Dynamic concepts on disease and comorbidity' to illustrate the use of pubmed.mineR. The package generally runs fast with small elapsed times in regular workstations even on large corpus sizes and with compute intensive functions. The pubmed.mineR is available at http://cran.rproject. org/web/packages/pubmed.mineR.

  6. LimTox: a web tool for applied text mining of adverse event and toxicity associations of compounds, drugs and genes.

    Science.gov (United States)

    Cañada, Andres; Capella-Gutierrez, Salvador; Rabal, Obdulia; Oyarzabal, Julen; Valencia, Alfonso; Krallinger, Martin

    2017-07-03

    A considerable effort has been devoted to retrieve systematically information for genes and proteins as well as relationships between them. Despite the importance of chemical compounds and drugs as a central bio-entity in pharmacological and biological research, only a limited number of freely available chemical text-mining/search engine technologies are currently accessible. Here we present LimTox (Literature Mining for Toxicology), a web-based online biomedical search tool with special focus on adverse hepatobiliary reactions. It integrates a range of text mining, named entity recognition and information extraction components. LimTox relies on machine-learning, rule-based, pattern-based and term lookup strategies. This system processes scientific abstracts, a set of full text articles and medical agency assessment reports. Although the main focus of LimTox is on adverse liver events, it enables also basic searches for other organ level toxicity associations (nephrotoxicity, cardiotoxicity, thyrotoxicity and phospholipidosis). This tool supports specialized search queries for: chemical compounds/drugs, genes (with additional emphasis on key enzymes in drug metabolism, namely P450 cytochromes-CYPs) and biochemical liver markers. The LimTox website is free and open to all users and there is no login requirement. LimTox can be accessed at: http://limtox.bioinfo.cnio.es. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  7. MouseMine: a new data warehouse for MGI.

    Science.gov (United States)

    Motenko, H; Neuhauser, S B; O'Keefe, M; Richardson, J E

    2015-08-01

    MouseMine (www.mousemine.org) is a new data warehouse for accessing mouse data from Mouse Genome Informatics (MGI). Based on the InterMine software framework, MouseMine supports powerful query, reporting, and analysis capabilities, the ability to save and combine results from different queries, easy integration into larger workflows, and a comprehensive Web Services layer. Through MouseMine, users can access a significant portion of MGI data in new and useful ways. Importantly, MouseMine is also a member of a growing community of online data resources based on InterMine, including those established by other model organism databases. Adopting common interfaces and collaborating on data representation standards are critical to fostering cross-species data analysis. This paper presents a general introduction to MouseMine, presents examples of its use, and discusses the potential for further integration into the MGI interface.

  8. CreatIng Web-based Math learnIng tool for TURKISH mIddle school students: Webquest

    Directory of Open Access Journals (Sweden)

    Aytac KURTULUS

    2009-04-01

    Full Text Available Internet is the most important product for the computer technology and it began to be used in many fields. Especially in the recent years, the usage of Internet has increased in the fields of communication, entertainment, advertisement, media, and technology. In Turkey, the usage of Internet is not used very common and active in primary and secondary education. The fast developments of the new technologies and the Web-Based Education Systems must be increased the importance of giving courses. In this study, the information to be aimed at is to introduce the WebQuest system, which was developed at San Diego State University by Bernie Dodge. A webQuest can be used web-based math learning tool for Turkish middle school students. Therefore, an example of geometry education WebQuest is given to introduce WebQuest system because WebQuest will be active in geometry teaching similar to the other subjects. An overview of WebQuest technology application and several resources for teachers and students interested in creating WebQuests can be found on The WebQutest Page (Dodge, 2001. Table 1 lists web sites that have many of these resources.

  9. Does Brief Telephone Support Improve Engagement With a Web-Based Weight Management Intervention? Randomized Controlled Trial

    OpenAIRE

    Dennison, Laura; Morrison, Leanne; Lloyd, Scott; Phillips, Dawn; Stuart, Beth; Williams, Sarah; Bradbury, Katherine; Roderick, Paul; Murray, Elizabeth; Michie, Susan; Little, Paul; Yardley, Lucy

    2014-01-01

    Background Recent reviews suggest Web-based interventions are promising approaches for weight management but they identify difficulties with suboptimal usage. The literature suggests that offering some degree of human support to website users may boost usage and outcomes. Objective We disseminated the POWeR (“Positive Online Weight Reduction”) Web-based weight management intervention in a community setting. POWeR consisted of weekly online sessions that emphasized self-monitoring, goal-settin...

  10. EFFICIENCY OF THE USE OF AUTHENTIC WEB-RESOURCES IN TRANSLATORS TRAINING

    OpenAIRE

    Iryna M. Drobit; Nataliia V. Rak

    2013-01-01

    The article deals with pedagogical assumptions and efficiency of the use of Information and Communication Technologies, especially authentic web-resources, while teaching language for specific purposes (translators and interpreters). Accuracy, content, and functionality of web-resource TED, which contains examples of authentic speech in English, have been outlined. It has been demonstrated that usage of multimedia and communication facilities of the TED web-resource provides favourable opport...

  11. WEB LOG EXPLORER – CONTROL OF MULTIDIMENSIONAL DYNAMICS OF WEB PAGES

    Directory of Open Access Journals (Sweden)

    Mislav Šimunić

    2012-07-01

    Full Text Available Demand markets dictate and pose increasingly more requirements to the supplymarket that are not easily satisfied. The supply market presenting its web pages to thedemand market should find the best and quickest ways to respond promptly to the changesdictated by the demand market. The question is how to do that in the most efficient andquickest way. The data on the usage of web pages on a specific web site are recorded in alog file. The data in a log file are stochastic and unordered and require systematicmonitoring, categorization, analyses, and weighing. From the data processed in this way, itis necessary to single out and sort the data by their importance that would be a basis for acontinuous generation of dynamics/changes to the web site pages in line with the criterionchosen. To perform those tasks successfully, a new software solution is required. For thatpurpose, the authors have developed the first version of the WLE (WebLogExplorersoftware solution, which is actually realization of web page multidimensionality and theweb site as a whole. The WebLogExplorer enables statistical and semantic analysis of a logfile and on the basis thereof, multidimensional control of the web page dynamics. Theexperimental part of the work was done within the web site of HTZ (Croatian NationalTourist Board being the main portal of the global tourist supply in the Republic of Croatia(on average, daily "log" consists of c. 600,000 sets, average size of log file is 127 Mb, andc. 7000-8000 daily visitors on the web site.

  12. Data Mining Application in Customer Relationship Management for Hospital Inpatients

    Science.gov (United States)

    2012-01-01

    Objectives This study aims to discover patients loyal to a hospital and model their medical service usage patterns. Consequently, this study proposes a data mining application in customer relationship management (CRM) for hospital inpatients. Methods A recency, frequency, monetary (RFM) model has been applied toward 14,072 patients discharged from a university hospital. Cluster analysis was conducted to segment customers, and it modeled the patterns of the loyal customers' medical services usage via a decision tree. Results Patients were divided into two groups according to the variables of the RFM model and the group which had significantly high frequency of medical use and expenses was defined as loyal customers, a target market. As a result of the decision tree, the predictable factors of the loyal clients were; length of stay, certainty of selectable treatment, surgery, number of accompanying treatments, kind of patient room, and department from which they were discharged. Particularly, this research showed that when a patient within the internal medicine department who did not have surgery stayed for more than 13.5 days, their probability of being a classified as a loyal customer was 70.0%. Conclusions To discover a hospital's loyal patients and model their medical usage patterns, the application of data-mining has been suggested. This paper suggests practical use of combining segmentation, targeting, positioning (STP) strategy and the RFM model with data-mining in CRM. PMID:23115740

  13. Modeling and clustering users with evolving profiles in usage streams

    KAUST Repository

    Zhang, Chongsheng; Masseglia, Florent; Zhang, Xiangliang

    2012-01-01

    Today, there is an increasing need of data stream mining technology to discover important patterns on the fly. Existing data stream models and algorithms commonly assume that users' records or profiles in data streams will not be updated or revised once they arrive. Nevertheless, in various applications such asWeb usage, the records/profiles of the users can evolve along time. This kind of streaming data evolves in two forms, the streaming of tuples or transactions as in the case of traditional data streams, and more importantly, the evolving of user records/profiles inside the streams. Such data streams bring difficulties on modeling and clustering for exploring users' behaviors. In this paper, we propose three models to summarize this kind of data streams, which are the batch model, the Evolving Objects (EO) model and the Dynamic Data Stream (DDS) model. Through creating, updating and deleting user profiles, these models summarize the behaviors of each user as a profile object. Based upon these models, clustering algorithms are employed to discover interesting user groups from the profile objects. We have evaluated all the proposed models on a large real-world data set, showing that the DDS model summarizes the data streams with evolving tuples more efficiently and effectively, and provides better basis for clustering users than the other two models. © 2012 IEEE.

  14. Modeling and clustering users with evolving profiles in usage streams

    KAUST Repository

    Zhang, Chongsheng

    2012-09-01

    Today, there is an increasing need of data stream mining technology to discover important patterns on the fly. Existing data stream models and algorithms commonly assume that users\\' records or profiles in data streams will not be updated or revised once they arrive. Nevertheless, in various applications such asWeb usage, the records/profiles of the users can evolve along time. This kind of streaming data evolves in two forms, the streaming of tuples or transactions as in the case of traditional data streams, and more importantly, the evolving of user records/profiles inside the streams. Such data streams bring difficulties on modeling and clustering for exploring users\\' behaviors. In this paper, we propose three models to summarize this kind of data streams, which are the batch model, the Evolving Objects (EO) model and the Dynamic Data Stream (DDS) model. Through creating, updating and deleting user profiles, these models summarize the behaviors of each user as a profile object. Based upon these models, clustering algorithms are employed to discover interesting user groups from the profile objects. We have evaluated all the proposed models on a large real-world data set, showing that the DDS model summarizes the data streams with evolving tuples more efficiently and effectively, and provides better basis for clustering users than the other two models. © 2012 IEEE.

  15. Stratification-Based Outlier Detection over the Deep Web

    OpenAIRE

    Xian, Xuefeng; Zhao, Pengpeng; Sheng, Victor S.; Fang, Ligang; Gu, Caidong; Yang, Yuanfeng; Cui, Zhiming

    2016-01-01

    For many applications, finding rare instances or outliers can be more interesting than finding common patterns. Existing work in outlier detection never considers the context of deep web. In this paper, we argue that, for many scenarios, it is more meaningful to detect outliers over deep web. In the context of deep web, users must submit queries through a query interface to retrieve corresponding data. Therefore, traditional data mining methods cannot be directly applied. The primary contribu...

  16. Tools and Databases of the KOMICS Web Portal for Preprocessing, Mining, and Dissemination of Metabolomics Data

    Directory of Open Access Journals (Sweden)

    Nozomu Sakurai

    2014-01-01

    Full Text Available A metabolome—the collection of comprehensive quantitative data on metabolites in an organism—has been increasingly utilized for applications such as data-intensive systems biology, disease diagnostics, biomarker discovery, and assessment of food quality. A considerable number of tools and databases have been developed to date for the analysis of data generated by various combinations of chromatography and mass spectrometry. We report here a web portal named KOMICS (The Kazusa Metabolomics Portal, where the tools and databases that we developed are available for free to academic users. KOMICS includes the tools and databases for preprocessing, mining, visualization, and publication of metabolomics data. Improvements in the annotation of unknown metabolites and dissemination of comprehensive metabolomic data are the primary aims behind the development of this portal. For this purpose, PowerGet and FragmentAlign include a manual curation function for the results of metabolite feature alignments. A metadata-specific wiki-based database, Metabolonote, functions as a hub of web resources related to the submitters' work. This feature is expected to increase citation of the submitters' work, thereby promoting data publication. As an example of the practical use of KOMICS, a workflow for a study on Jatropha curcas is presented. The tools and databases available at KOMICS should contribute to enhanced production, interpretation, and utilization of metabolomic Big Data.

  17. Tools and databases of the KOMICS web portal for preprocessing, mining, and dissemination of metabolomics data.

    Science.gov (United States)

    Sakurai, Nozomu; Ara, Takeshi; Enomoto, Mitsuo; Motegi, Takeshi; Morishita, Yoshihiko; Kurabayashi, Atsushi; Iijima, Yoko; Ogata, Yoshiyuki; Nakajima, Daisuke; Suzuki, Hideyuki; Shibata, Daisuke

    2014-01-01

    A metabolome--the collection of comprehensive quantitative data on metabolites in an organism--has been increasingly utilized for applications such as data-intensive systems biology, disease diagnostics, biomarker discovery, and assessment of food quality. A considerable number of tools and databases have been developed to date for the analysis of data generated by various combinations of chromatography and mass spectrometry. We report here a web portal named KOMICS (The Kazusa Metabolomics Portal), where the tools and databases that we developed are available for free to academic users. KOMICS includes the tools and databases for preprocessing, mining, visualization, and publication of metabolomics data. Improvements in the annotation of unknown metabolites and dissemination of comprehensive metabolomic data are the primary aims behind the development of this portal. For this purpose, PowerGet and FragmentAlign include a manual curation function for the results of metabolite feature alignments. A metadata-specific wiki-based database, Metabolonote, functions as a hub of web resources related to the submitters' work. This feature is expected to increase citation of the submitters' work, thereby promoting data publication. As an example of the practical use of KOMICS, a workflow for a study on Jatropha curcas is presented. The tools and databases available at KOMICS should contribute to enhanced production, interpretation, and utilization of metabolomic Big Data.

  18. A study on the personalization methods of the web | Hajighorbani ...

    African Journals Online (AJOL)

    ... methods of correct patterns and analyze them. Here we will discuss the basic concepts of web personalization and consider the three approaches of web personalization and we evaluated the methods belonging to each of them. Keywords: personalization, search engine, user preferences, data mining methods ...

  19. Mercury flow through an Asian rice-based food web

    International Nuclear Information System (INIS)

    Abeysinghe, Kasun S.; Qiu, Guangle; Goodale, Eben; Anderson, Christopher W.N.; Bishop, Kevin; Evers, David C.; Goodale, Morgan W.; Hintelmann, Holger; Liu, Shengjie

    2017-01-01

    Mercury (Hg) is a globally-distributed pollutant, toxic to humans and animals. Emissions are particularly high in Asia, and the source of exposure for humans there may also be different from other regions, including rice as well as fish consumption, particularly in contaminated areas. Yet the threats Asian wildlife face in rice-based ecosystems are as yet unclear. We sought to understand how Hg flows through rice-based food webs in historic mining and non-mining regions of Guizhou, China. We measured total Hg (THg) and methylmercury (MeHg) in soil, rice, 38 animal species (27 for MeHg) spanning multiple trophic levels, and examined the relationship between stable isotopes and Hg concentrations. Our results confirm biomagnification of THg/MeHg, with a high trophic magnification slope. Invertivorous songbirds had concentrations of THg in their feathers that were 15x and 3x the concentration reported to significantly impair reproduction, at mining and non-mining sites, respectively. High concentrations in specialist rice consumers and in granivorous birds, the later as high as in piscivorous birds, suggest rice is a primary source of exposure. Spiders had the highest THg concentrations among invertebrates and may represent a vector through which Hg is passed to vertebrates, especially songbirds. Our findings suggest there could be significant population level health effects and consequent biodiversity loss in sensitive ecosystems, like agricultural wetlands, across Asia, and invertivorous songbirds would be good subjects for further studies investigating this possibility. - Highlights: • Hg concentrations were measured across rice-based food webs in Guizhou, China. • Of 38 animal species, THg concentrations were highest for invertivorous songbirds. • High THg levels in rice pests and in granivorous birds suggest rice as a source. • Levels of THg in songbird feathers at mining site were among highest ever recorded. • Even at non-mining site, THg in such

  20. Ethical Issues of Social Media Usage in Healthcare

    OpenAIRE

    Denecke, Kerstin; Bamidis, Panagiotis D.; Bond, Carol; Gabarron, Elia; Househ, M; Lau, A. Y. S.; Mayer, Miguel A.; Merolli, Mark; Hansen, Margareth

    2015-01-01

    Accepted manuscript version. This article is not an exact copy of the original published article in The IMIA Yearbook of Medical Informatics. The definitive publisher-authenticated version of "Ethical Issues of Social Media Usage in Healthcare" is available online at http://doi.org/10.15265/IY-2015-001. OBJECTIVE: Social media, web and mobile technologies are increasingly used in healthcare and directly support patientcentered care. Patients benefit from disease self-management tools, ...

  1. Response of dandelion (Taraxacum officinale Web) to heavy metals from mine sites: micromorphology of leaves and roots.

    Science.gov (United States)

    Bini, Claudio; Maleci, Laura; Buffa, Gabriella; Wahsha, Mohammad; Fontana, Silvia

    2013-04-01

    Response of dandelion (Taraxacum officinale Web) to heavy metals from mine sites: micromorphology of leaves and roots. Maleci L.1 , Bini C.2, Buffa G. 2, Fontana S2., Wahsha M.3 1 - Dept of Biology, University of Florence, Italy. 2 - Dept of Environmental Sciences, Informatics and Statistics. Ca'Foscari University, Venice - Italy. 3 - Marine Science Centre - University of Jordan, Aqaba section, Jordan. Heavy metal accumulation is known to produce significant physiological and biochemical responses in vascular plants. Yet, metabolic and physiological responses of plants to heavy metal concentration can be viewed as potentially adaptive changes of the plants during stress. From this point of view, plants growing on abandoned mine sites are of particular interest, since they are genetically tolerant to high metal concentrations, and can be utilized in soil restoration. Among wild plants, the common dandelion (Taraxacum officinale Web) has received attention as bioindicator plant, and has been also suggested in remediation projects. Wild specimens of Taraxacum officinale Web, with their soil clod, were gathered from three sites with different contamination levels by heavy metals (Cd, Cr, Cu, Fe, Pb, Zn) in the abandoned Imperina Valley mine (Northeast Italy). A control plant was also gathered from a not contaminated site nearby. Plants were cultivated in pots for one year at HBF, and appeared macroscopically not affected by toxic signals (reduced growth, leaf necrosis) possibly induced by soil HM concentration. Leaves and roots taken at the same growing season were observed by LM and TEM. Light microscopy observations carried out on the leaf lamina show a clear difference in the cellular organization of not-contaminated and contaminated samples. The unpolluted samples present a well organized palisade tissue and spongy photosynthetic parenchyma. Samples from contaminated sites, instead, present a palisade parenchyma less organized, and a reduction of leaf thickness

  2. Social Media Usage for Patients and Healthcare Consumers: A Literature Review

    Directory of Open Access Journals (Sweden)

    Ariana-Anamaria Cordoş

    2017-04-01

    Full Text Available The evolution of Internet from static Web “publishing” to the highly participative, and data-driven, innovations of Web 2.0 has been influencing how people search for health-related information. This review included studies indexed in the PubMed electronic database that focused on social media analysis, examining relationships between participants (patients and healthcare consumers through social media usage. The obtained results showed that previous research regarding social media’s impact on patients and healthcare consumers aimed at a combination of platforms, but there is a penury of information about niche topics or its usage for retrieving medical information. Nevertheless, social media proved to be to be a promising tool in research mainly for recruitment purposes. The review has outlined that eHealth literacy is an attribute for populations that are female and relatively young and educated. Blogs share personal experiences, YouTube contains unregulated, high- and low-quality information that can mislead individuals, Facebook contains more marketing than health-related information, while Wikipedia is recommended for providing high-quality information. Despite healthcare practitioners’ and healthcare public institutions’ reluctance about the use of social media, this review demonstrates the usefulness of social media for patients and healthcare consumers in retrieving health-related information based on content availability and usage implications, and highlights gaps in knowledge that further research needs to fill.

  3. Developing an open source-based spatial data infrastructure for integrated monitoring of mining areas

    Science.gov (United States)

    Lahn, Florian; Knoth, Christian; Prinz, Torsten; Pebesma, Edzer

    2014-05-01

    In all phases of mining campaigns, comprehensive spatial information is an essential requirement in order to ensure economically efficient but also safe mining activities as well as to reduce environmental impacts. Earth observation data acquired from various sources like remote sensing or ground measurements is important e.g. for the exploration of mineral deposits, the monitoring of mining induced impacts on vegetation or the detection of ground subsidence. The GMES4Mining project aims at exploring new remote sensing techniques and developing analysis methods on various types of sensor data to provide comprehensive spatial information during mining campaigns (BENECKE et al. 2013). One important task in this project is the integration of the data gathered (e.g. hyperspectral images, spaceborne radar data and ground measurements) as well as results of the developed analysis methods within a web-accessible data source based on open source software. The main challenges here are to provide various types and formats of data from different sensors and to enable access to analysis and processing techniques without particular software or licensing requirements for users. Furthermore the high volume of the involved data (especially hyperspectral remote sensing images) makes data transfer a major issue in this use case. To engage these problems a spatial data infrastructure (SDI) including a web portal as user frontend is being developed which allows users to access not only the data but also several analysis methods. The Geoserver software is used for publishing the data, which is then accessed and visualized in a JavaScript-based web portal. In order to perform descriptive statistics and some straightforward image processing techniques on the raster data (e.g. band arithmetic or principal component analysis) the statistics software R is implemented on a server and connected via Rserve. The analysis is controlled and executed directly by the user through the web portal and

  4. Design and Implementation WebGIS for Improving the Quality of Exploration Decisions at Sin-Quyen Copper Mine, Northern Vietnam

    Science.gov (United States)

    Quang Truong, Xuan; Luan Truong, Xuan; Nguyen, Tuan Anh; Nguyen, Dinh Tuan; Cong Nguyen, Chi

    2017-12-01

    The objective of this study is to design and implement a WebGIS Decision Support System (WDSS) for reducing uncertainty and supporting to improve the quality of exploration decisions in the Sin-Quyen copper mine, northern Vietnam. The main distinctive feature of the Sin-Quyen deposit is an unusual composition of ores. Computer and software applied to the exploration problem have had a significant impact on the exploration process over the past 25 years, but up until now, no online system has been undertaken. The system was completely built on open source technology and the Open Geospatial Consortium Web Services (OWS). The input data includes remote sensing (RS), Geographical Information System (GIS) and data from drillhole explorations, the drillhole exploration data sets were designed as a geodatabase and stored in PostgreSQL. The WDSS must be able to processed exploration data and support users to access 2-dimensional (2D) or 3-dimensional (3D) cross-sections and map of boreholles exploration data and drill holes. The interface was designed in order to interact with based maps (e.g., Digital Elevation Model, Google Map, OpenStreetMap) and thematic maps (e.g., land use and land cover, administrative map, drillholes exploration map), and to provide GIS functions (such as creating a new map, updating an existing map, querying and statistical charts). In addition, the system provides geological cross-sections of ore bodies based on Inverse Distance Weighting (IDW), nearest neighbour interpolation and Kriging methods (e.g., Simple Kriging, Ordinary Kriging, Indicator Kriging and CoKriging). The results based on data available indicate that the best estimation method (of 23 borehole exploration data sets) for estimating geological cross-sections of ore bodies in Sin-Quyen copper mine is Ordinary Kriging. The WDSS could provide useful information to improve drilling efficiency in mineral exploration and for management decision making.

  5. Environmental management in North American mining sector.

    Science.gov (United States)

    Asif, Zunaira; Chen, Zhi

    2016-01-01

    This paper reviews the environmental issues and management practices in the mining sector in the North America. The sustainable measures on waste management are recognized as one of the most serious environmental concerns in the mining industry. For mining activities, it will be no surprise that the metal recovery reagents and acid effluents are a threat to the ecosystem as well as hazards to human health. In addition, poor air quality and ventilation in underground mines can lead to occupational illness and death of workers. Electricity usage and fuel consumption are major factors that contribute to greenhouse gases. On the other hand, many sustainability challenges are faced in the management of tailings and disposal of waste rock. This paper aims to highlight the problems that arise due to poor air quality and acid mine drainage. The paper also addresses some of the advantages and limitations of tailing and waste rock management that still have to be studied in context of the mining sector. This paper suggests that implementation of suitable environmental management tools like life cycle assessment (LCA), cleaner production technologies (CPTs), and multicriteria decision analysis (MCD) are important as it ultimately lead to improve environmental performance and enabling a mine to focus on the next stage of sustainability.

  6. A Combined Mining Approach and Application in Tax Administration

    OpenAIRE

    Arun Solanki; Dr. Ela Kumar

    2010-01-01

    This paper reports the development of a model for taxation. This model will work for the tax payers as well as for the administrator. It utilizes the technique of web mining, text mining, data mining and human experience knowledge for creating a knowledge base of taxation. All knowledge from each part is saved in knowledge base through a knowledge management platform. Using this knowledge management platform the administrator and tax payer can retrieve knowledge;send feedback on the basis of ...

  7. Anthropogenic and natural sources of acidity and metals and their influence on the structure of stream food webs

    International Nuclear Information System (INIS)

    Hogsden, Kristy L.; Harding, Jon S.

    2012-01-01

    We compared food web structure in 20 streams with either anthropogenic or natural sources of acidity and metals or circumneutral water chemistry in New Zealand. Community and diet analysis indicated that mining streams receiving anthropogenic inputs of acidic and metal-rich drainage had much simpler food webs (fewer species, shorter food chains, less links) than those in naturally acidic, naturally high metal, and circumneutral streams. Food webs of naturally high metal streams were structurally similar to those in mining streams, lacking fish predators and having few species. Whereas, webs in naturally acidic streams differed very little from those in circumneutral streams due to strong similarities in community composition and diets of secondary and top consumers. The combined negative effects of acidity and metals on stream food webs are clear. However, elevated metal concentrations, regardless of source, appear to play a more important role than acidity in driving food web structure. - Highlights: ► Food webs in acid mine drainage impacted streams are small and extremely simplified. ► Conductivity explained differences in food web properties between streams. ► Number of links and web size accounted for much dissimilarity between food webs. ► Food web structure was comparable in naturally acidic and circumneutral streams. - Food web structure differs in streams with anthropogenic and natural sources of acidity and metals.

  8. Endoparasite Community Differences in Sunfish (Lepomis spp.) Above and Below Coal Mine Effluent in Southern Illinois.

    Science.gov (United States)

    Claxton, Andrew; Laursen, Jeff

    2015-06-01

    Parasite assemblages acquired through trophic interactions in fish hosts are increasingly cited as a means to determine pollution effects on water quality and food web structure. We examined gastrointestinal parasite community changes above and below coal mine input from 597 individuals representing 3 species of sunfish: green sunfish ( Lepomis cyanellus ), bluegill ( L. macrochirus ), and longear sunfish ( L. megalotis ). Hosts were collected from 6 sites in or near the south fork of the Saline River Basin in southern Illinois in the spring and fall of 2006. Three sites received no known effluent from coal mines. An additional 3 sites received effluent termed acid mine drainage (AMD). We recovered 1,064 parasites from 12 genera. The parasite community in sunfish collected downstream nearest to the source of AMD was significantly different from 3 upstream sites. In addition, 2 sites farther downstream receiving AMD were different from 2 of 3 reference sites. However, there was also considerable variability in parasite assemblages between sites grouped as above or below coal mine effluent. Several parasite species responded to changes in water quality. Spinitectus sp. (Nematoda), which uses sensitive mayfly hosts to complete its life cycle, was less abundant at sites downstream of coal mine effluent in both green sunfish and bluegill. In contrast, 2 acanthocephalans ( Neoechinorhynchus sp. and Eocollis arcanus) and a nematode ( Spiroxys sp.) were found in green sunfish more frequently in areas downstream of AMD. This study further suggests metazoan parasites may be useful as indicators of water quality; however, variability among similar sites may limit their application. In addition, strong assemblage differences were found among the 3 sunfish species, suggesting variable habitat usage and potential resource partitioning among congeneric fish hosts in streams.

  9. Opinion Mining in Web 2.0

    OpenAIRE

    Pérez Gallego, Pablo José

    2012-01-01

    During the last years we are assisting to an intense Web transformation process. It is no longer a mere static information repository but a dynamic system in which users have become the main content contributors. They actively participate in sharing their opinions, thoughts and views about products, events and almost anything in social networks, forums, blogs, etc. With the latest advances in mobile technologies, users can actually interact anytime from anywhere; real time info...

  10. Asymmetric threat data mining and knowledge discovery

    Science.gov (United States)

    Gilmore, John F.; Pagels, Michael A.; Palk, Justin

    2001-03-01

    Asymmetric threats differ from the conventional force-on- force military encounters that the Defense Department has historically been trained to engage. Terrorism by its nature is now an operational activity that is neither easily detected or countered as its very existence depends on small covert attacks exploiting the element of surprise. But terrorism does have defined forms, motivations, tactics and organizational structure. Exploiting a terrorism taxonomy provides the opportunity to discover and assess knowledge of terrorist operations. This paper describes the Asymmetric Threat Terrorist Assessment, Countering, and Knowledge (ATTACK) system. ATTACK has been developed to (a) data mine open source intelligence (OSINT) information from web-based newspaper sources, video news web casts, and actual terrorist web sites, (b) evaluate this information against a terrorism taxonomy, (c) exploit country/region specific social, economic, political, and religious knowledge, and (d) discover and predict potential terrorist activities and association links. Details of the asymmetric threat structure and the ATTACK system architecture are presented with results of an actual terrorist data mining and knowledge discovery test case shown.

  11. Formal Model of Web Service Composition: An Actor-Based Approach to Unifying Orchestration and Choreography

    OpenAIRE

    Wang, Yong

    2013-01-01

    Web Service Composition creates new composite Web Services from the collection of existing ones to be composed further and embodies the added values and potential usages of Web Services. Web Service Composition includes two aspects: Web Service orchestration denoting a workflow-like composition pattern and Web Service choreography which represents an aggregate composition pattern. There were only a few works which give orchestration and choreography a relationship. In this paper, we introduce...

  12. Ensemble learned vaccination uptake prediction using web search queries

    DEFF Research Database (Denmark)

    Hansen, Niels Dalum; Lioma, Christina; Mølbak, Kåre

    2016-01-01

    We present a method that uses ensemble learning to combine clinical and web-mined time-series data in order to predict future vaccination uptake. The clinical data is official vaccination registries, and the web data is query frequencies collected from Google Trends. Experiments with official...... vaccine records show that our method predicts vaccination uptake eff?ectively (4.7 Root Mean Squared Error). Whereas performance is best when combining clinical and web data, using solely web data yields comparative performance. To our knowledge, this is the ?first study to predict vaccination uptake...

  13. Utilization of two web-based continuing education courses evaluated by Markov chain model.

    Science.gov (United States)

    Tian, Hao; Lin, Jin-Mann S; Reeves, William C

    2012-01-01

    To evaluate the web structure of two web-based continuing education courses, identify problems and assess the effects of web site modifications. Markov chain models were built from 2008 web usage data to evaluate the courses' web structure and navigation patterns. The web site was then modified to resolve identified design issues and the improvement in user activity over the subsequent 12 months was quantitatively evaluated. Web navigation paths were collected between 2008 and 2010. The probability of navigating from one web page to another was analyzed. The continuing education courses' sequential structure design was clearly reflected in the resulting actual web usage models, and none of the skip transitions provided was heavily used. The web navigation patterns of the two different continuing education courses were similar. Two possible design flaws were identified and fixed in only one of the two courses. Over the following 12 months, the drop-out rate in the modified course significantly decreased from 41% to 35%, but remained unchanged in the unmodified course. The web improvement effects were further verified via a second-order Markov chain model. The results imply that differences in web content have less impact than web structure design on how learners navigate through continuing education courses. Evaluation of user navigation can help identify web design flaws and guide modifications. This study showed that Markov chain models provide a valuable tool to evaluate web-based education courses. Both the results and techniques in this study would be very useful for public health education and research specialists.

  14. An analysis of technology usage for streaming digital video in support of a preclinical curriculum.

    Science.gov (United States)

    Dev, P; Rindfleisch, T C; Kush, S J; Stringer, J R

    2000-01-01

    Usage of streaming digital video of lectures in preclinical courses was measured by analysis of the data in the log file maintained on the web server. We observed that students use the video when it is available. They do not use it to replace classroom attendance but rather for review before examinations or when a class has been missed. Usage of video has not increased significantly for any course within the 18 month duration of this project.

  15. Usage, Barriers, and Training of Web 2.0 Technology Applications

    Science.gov (United States)

    Pritchett, Christopher G.; Pritchett, Christal C.; Wohleb, Elisha C.

    2013-01-01

    This research study was designed to determine the degree of use of Web 2.0 technology applications by certified education professionals and examine differences among various groups as well as reasons for these differences. A quantitative survey instrument was developed to gather demographic information and data. Participants reported they would be…

  16. Using a web-based orthopaedic clinic in the curricular teaching of a German university hospital: analysis of learning effect, student usage and reception.

    Science.gov (United States)

    Wünschel, Markus; Leichtle, Ulf; Wülker, Nikolaus; Kluba, Torsten

    2010-10-01

    Modern teaching concepts for undergraduate medical students in Germany include problem based learning as a major component of the new licensing regulations for physicians. Here we describe the usage of a web-based virtual outpatient clinic in the teaching curriculum of undergraduate medical students, its effect on learning success, and student reception. Fifth year medial students were requested to examine 7 virtual orthopaedic patients which had been created by the authors using the Inmedea-Simulator. They also had to take a multiple-choice examination on two different occasions and their utilisation of the simulator was analysed subjectively and objectively. One hundred and sixty students took part in the study. The average age was 24.9 years, 60% were female. Most of the participants studied on their own using their private computer with a fast internet-connection at home. The average usage time was 263 min, most of the students worked with the system in the afternoon, although a considerable number used it late in the night. Regarding learning success, we found that the examination results were significantly better after using the system (7.66 versus 8.37, plearning efficacy. The way the system was used by the students emphasises the advantages of the internet-like free time management and the implementation of multimedia-based content. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.

  17. Using Web-Based Technologies and Tools in Future Choreographers' Training: British Experience

    Science.gov (United States)

    Bidyuk, Dmytro

    2016-01-01

    In the paper the problem of using effective web-based technologies and tools in teaching choreography in British higher education institutions has been discussed. Researches on the usage of web-based technologies and tools for practical dance courses in choreographers' professional training at British higher education institutions by such British…

  18. Usage of Web Service in Mobile Application for Parents and Students in Binus School Serpong

    OpenAIRE

    Karto Iskandar; Andrew Thejo Putrantob

    2016-01-01

    A web service is a service offered by a device electronically to communicate with other electronic device using the World wide web. Smartphone is an electronic device that almost everyone has, especially student and parent for getting information about the school. In BINUS School Serpong mobile application, web services used for getting data from web server like student and menu data. Problem faced by BINUS School Serpong today is the time-consuming application update when using the native ap...

  19. Personalized links recommendation based on data mining in adaptive educational hypermedia systems

    NARCIS (Netherlands)

    Romero, C.; Ventura, S.; Delgado, J.A.; De Bra, P.M.E.; Duval, E.; Klamma, R.; Wolpers, M.

    2007-01-01

    In this paper, we describe a personalized recommender system that uses web mining techniques for recommending a student which (next) links to visit within an adaptable educational hypermedia system. We present a specific mining tool and a recommender engine that we have integrated in the AHA! system

  20. The Term cybrarian : Concept and The Arabic Usage

    Directory of Open Access Journals (Sweden)

    Mahmoud A.Sattar Khalifa

    2004-06-01

    Full Text Available A Study about the term cybrarian, dealing with its origin, definition in the public and specific dictionaries and gives comments for each one , then deals with the usage of term on the Arabic coverage which acted by appearing a printed pamphlet and discussion group entitled cybrarians, and a published study about this topic , also acted by establishing an Arabic web site with the same name, finally the study try to give an Arabic opposite to this term.

  1. Programming Collective Intelligence Building Smart Web 2.0 Applications

    CERN Document Server

    Segaran, Toby

    2008-01-01

    This fascinating book demonstrates how you can build web applications to mine the enormous amount of data created by people on the Internet. With the sophisticated algorithms in this book, you can write smart programs to access interesting datasets from other web sites, collect data from users of your own applications, and analyze and understand the data once you've found it.

  2. Mining Frequent Item Sets in Asynchronous Transactional Data Streams over Time Sensitive Sliding Windows Model

    International Nuclear Information System (INIS)

    Javaid, Q.; Memon, F.; Talpur, S.; Arif, M.; Awan, M.D.

    2016-01-01

    EPs (Extracting Frequent Patterns) from the continuous transactional data streams is a challenging and critical task in some of the applications, such as web mining, data analysis and retail market, prediction and network monitoring, or analysis of stock market exchange data. Many algorithms have been developed previously for mining FPs (Frequent Patterns) from a data stream. Such algorithms are currently highly required to develop new solutions and approaches to the precise handling of data streams. New techniques, solutions, or approaches are developed to address unbounded, ordered, and continuous sequences of data and for the generation of data at a rapid speed from data streams. Hence, extracting FPs using fresh or recent data involves the high-level analysis of data streams. We have suggested an efficient technique for the window sliding model; this technique extracts new and fresh FPs from high-speed data streams. In this study, a CPILT (Compacted Tree Compact Pattern Tree) is developed to capture the latest contents in the stream and to efficiently remove outdated contents from the data stream. The main concept introduced in this work on CPILT is the dynamic restructuring of a tree, which is helpful in producing a compacted tree and the frequency descending structure of a tree on runtime. With the help of the mining technique of FP growth, a complete list of new and fresh FPs is obtained from a CPILT using an existing window. The memory usage and time complexity of the latest FPs in high-speed data streams can efficiently be determined through proper experimentation and analysis. (author)

  3. Entomopathogenic nematode food webs in an ancient, mining pollution gradient in Spain.

    Science.gov (United States)

    Campos-Herrera, Raquel; Rodríguez Martín, José Antonio; Escuer, Miguel; García-González, María Teresa; Duncan, Larry W; Gutiérrez, Carmen

    2016-12-01

    Mining activities pollute the environment with by-products that cause unpredictable impacts in surrounding areas. Cartagena-La Unión mine (Southeastern-Spain) was active for >2500years. Despite its closure in 1991, high concentrations of metals and waste residues remain in this area. A previous study using nematodes suggested that high lead content diminished soil biodiversity. However, the effects of mine pollution on specific ecosystem services remain unknown. Entomopathogenic nematodes (EPN) play a major role in the biocontrol of insect pests. Because EPNs are widespread throughout the world, we speculated that EPNs would be present in the mined areas, but at increased incidence with distance from the pollution focus. We predicted that the natural enemies of nematodes would follow a similar spatial pattern. We used qPCR techniques to measure abundance of five EPN species, five nematophagous fungi species, two bacterial ectoparasites of EPNs and one group of free-living nematodes that compete for the insect-cadaver. The study comprised 193 soil samples taken from mining sites, natural areas and agricultural fields. The highest concentrations of iron and zinc were detected in the mined area as was previously described for lead, cadmium and nickel. Molecular tools detected very low numbers of EPNs in samples found to be negative by insect-baiting, demonstrating the importance of the approach. EPNs were detected at low numbers in 13% of the localities, without relationship to heavy-metal concentrations. Only Acrobeloides-group nematodes were inversely related to the pollution gradient. Factors associated with agricultural areas explained 98.35% of the biotic variability, including EPN association with agricultural areas. Our study suggests that EPNs have adapted to polluted habitats that might support arthropod hosts. By contrast, the relationship between abundance of Acrobeloides-group and heavy-metal levels, revealed these taxa as especially well suited bio

  4. Changes in host-parasitoid food web structure with elevation.

    Science.gov (United States)

    Maunsell, Sarah C; Kitching, Roger L; Burwell, Chris J; Morris, Rebecca J

    2015-03-01

    Gradients in elevation are increasingly used to investigate how species respond to changes in local climatic conditions. Whilst many studies have shown elevational patterns in species richness and turnover, little is known about how food web structure is affected by elevation. Contrasting responses of predator and prey species to elevation may lead to changes in food web structure. We investigated how the quantitative structure of a herbivore-parasitoid food web changes with elevation in an Australian subtropical rain forest. On four occasions, spread over 1 year, we hand-collected leaf miners at twelve sites, along three elevational gradients (between 493 m and 1159 m a.s.l). A total of 5030 insects, including 603 parasitoids, were reared, and summary food webs were created for each site. We also carried out a replicated manipulative experiment by translocating an abundant leaf-mining weevil Platynotocis sp., which largely escaped parasitism at high elevations (≥ 900 m a.s.l.), to lower, warmer elevations, to test if it would experience higher parasitism pressure. We found strong evidence that the environmental change that occurs with increasing elevation affects food web structure. Quantitative measures of generality, vulnerability and interaction evenness decreased significantly with increasing elevation (and decreasing temperature), whilst elevation did not have a significant effect on connectance. Mined plant composition also had a significant effect on generality and vulnerability, but not on interaction evenness. Several relatively abundant species of leaf miner appeared to escape parasitism at higher elevations, but contrary to our prediction, Platynotocis sp. did not experience greater levels of parasitism when translocated to lower elevations. Our study indicates that leaf-mining herbivores and their parasitoids respond differently to environmental conditions imposed by elevation, thus producing structural changes in their food webs. Increasing

  5. Error Checking for Chinese Query by Mining Web Log

    Directory of Open Access Journals (Sweden)

    Jianyong Duan

    2015-01-01

    Full Text Available For the search engine, error-input query is a common phenomenon. This paper uses web log as the training set for the query error checking. Through the n-gram language model that is trained by web log, the queries are analyzed and checked. Some features including query words and their number are introduced into the model. At the same time data smoothing algorithm is used to solve data sparseness problem. It will improve the overall accuracy of the n-gram model. The experimental results show that it is effective.

  6. Rule-based statistical data mining agents for an e-commerce application

    Science.gov (United States)

    Qin, Yi; Zhang, Yan-Qing; King, K. N.; Sunderraman, Rajshekhar

    2003-03-01

    Intelligent data mining techniques have useful e-Business applications. Because an e-Commerce application is related to multiple domains such as statistical analysis, market competition, price comparison, profit improvement and personal preferences, this paper presents a hybrid knowledge-based e-Commerce system fusing intelligent techniques, statistical data mining, and personal information to enhance QoS (Quality of Service) of e-Commerce. A Web-based e-Commerce application software system, eDVD Web Shopping Center, is successfully implemented uisng Java servlets and an Oracle81 database server. Simulation results have shown that the hybrid intelligent e-Commerce system is able to make smart decisions for different customers.

  7. Mining of the social network extraction

    Science.gov (United States)

    Nasution, M. K. M.; Hardi, M.; Syah, R.

    2017-01-01

    The use of Web as social media is steadily gaining ground in the study of social actor behaviour. However, information in Web can be interpreted in accordance with the ability of the method such as superficial methods for extracting social networks. Each method however has features and drawbacks: it cannot reveal the behaviour of social actors, but it has the hidden information about them. Therefore, this paper aims to reveal such information in the social networks mining. Social behaviour could be expressed through a set of words extracted from the list of snippets.

  8. Web Usage Mining Analysis of Federated Search Tools for Egyptian Scholars

    Science.gov (United States)

    Mohamed, Khaled A.; Hassan, Ahmed

    2008-01-01

    Purpose: This paper aims to examine the behaviour of the Egyptian scholars while accessing electronic resources through two federated search tools. The main purpose of this article is to provide guidance for federated search tool technicians and support teams about user issues, including the need for training. Design/methodology/approach: Log…

  9. Collecting conditions usage metadata to optimize current and future ATLAS software and processing

    CERN Document Server

    Barberis, Dario; The ATLAS collaboration; Gallas, Elizabeth; Oda, Susumu

    2016-01-01

    Conditions data (for example: alignment, calibration, data quality) are used extensively in the processing of real and simulated data in ATLAS. The volume and variety of the conditions data needed by different types of processing are quite diverse, so optimizing its access requires a careful understanding of conditions usage patterns. These patterns can be quantified by mining representative log files from each type of processing and gathering detailed information about conditions usage for that type of processing into a central repository. In this presentation, we describe the systems developed to collect this conditions usage metadata per job type and describe a few specific (but very different) ways in which it has been used. For example, it can be used to cull specific conditions data into a much more compact package to be used by jobs doing similar types of processing: these customized collections can then be shipped with jobs to be executed on isolated worker nodes (such as HPC farms) that have no netwo...

  10. Handling Dynamic Weights in Weighted Frequent Pattern Mining

    Science.gov (United States)

    Ahmed, Chowdhury Farhan; Tanbeer, Syed Khairuzzaman; Jeong, Byeong-Soo; Lee, Young-Koo

    Even though weighted frequent pattern (WFP) mining is more effective than traditional frequent pattern mining because it can consider different semantic significances (weights) of items, existing WFP algorithms assume that each item has a fixed weight. But in real world scenarios, the weight (price or significance) of an item can vary with time. Reflecting these changes in item weight is necessary in several mining applications, such as retail market data analysis and web click stream analysis. In this paper, we introduce the concept of a dynamic weight for each item, and propose an algorithm, DWFPM (dynamic weighted frequent pattern mining), that makes use of this concept. Our algorithm can address situations where the weight (price or significance) of an item varies dynamically. It exploits a pattern growth mining technique to avoid the level-wise candidate set generation-and-test methodology. Furthermore, it requires only one database scan, so it is eligible for use in stream data mining. An extensive performance analysis shows that our algorithm is efficient and scalable for WFP mining using dynamic weights.

  11. Clustering Educational Digital Library Usage Data: A Comparison of Latent Class Analysis and K-Means Algorithms

    Science.gov (United States)

    Xu, Beijie; Recker, Mimi; Qi, Xiaojun; Flann, Nicholas; Ye, Lei

    2013-01-01

    This article examines clustering as an educational data mining method. In particular, two clustering algorithms, the widely used K-means and the model-based Latent Class Analysis, are compared, using usage data from an educational digital library service, the Instructional Architect (IA.usu.edu). Using a multi-faceted approach and multiple data…

  12. Assessing the Effects of Participant Preference and Demographics in the Usage of Web-based Survey Questionnaires by Women Attending Screening Mammography in British Columbia.

    Science.gov (United States)

    Mlikotic, Rebecca; Parker, Brent; Rajapakshe, Rasika

    2016-03-22

    Increased usage of Internet applications has allowed for the collection of patient reported outcomes (PROs) and other health data through Web-based communication and questionnaires. While these Web platforms allow for increased speed and scope of communication delivery, there are certain limitations associated with this technology, as survey mode preferences vary across demographic groups. To investigate the impact of demographic factors and participant preferences on the use of a Web-based questionnaire in comparison with more traditional methods (mail and phone) for women participating in screening mammography in British Columbia, Canada. A sample of women attending the Screening Mammography Program of British Columbia (SMPBC) participated in a breast cancer risk assessment project. The study questionnaire was administered through one of three modes (ie, telephone, mail, or website platform). Survey mode preferences and actual methods of response were analyzed for participants recruited from Victoria General Hospital. Both univariate and multivariate analyses were used to investigate the association of demographic factors (ie, age, education level, and ethnicity) with certain survey response types. A total of 1192 women successfully completed the study questionnaire at Victoria General Hospital. Mail was stated as the most preferred survey mode (509/1192, 42.70%), followed by website platform (422/1192, 35.40%), and telephone (147/1192, 12.33%). Over 80% (955/1192) of participants completed the questionnaire in the mode previously specified as their most preferred; mail was the most common method of response (688/1192, 57.72%). Mail was also the most preferred type of questionnaire response method when participants responded in a mode other than their original preference. The average age of participants who responded via the Web-based platform (age 52.9, 95% confidence interval [CI] 52.1-53.7) was significantly lower than those who used mail and telephone methods

  13. Usage, attitudes and workload implications for a Web-based learning environment

    NARCIS (Netherlands)

    Collis, Betty; Messing, John

    2001-01-01

    At the University of Twente, a locally developed Web-based learning environment called the TeleTOP system is being implemented throughout the university after being first developed and used in the Faculty of Educational Science and Technology, followed by use in the Department of Telematics.

  14. Mining of high utility-probability sequential patterns from uncertain databases.

    Directory of Open Access Journals (Sweden)

    Binbin Zhang

    Full Text Available High-utility sequential pattern mining (HUSPM has become an important issue in the field of data mining. Several HUSPM algorithms have been designed to mine high-utility sequential patterns (HUPSPs. They have been applied in several real-life situations such as for consumer behavior analysis and event detection in sensor networks. Nonetheless, most studies on HUSPM have focused on mining HUPSPs in precise data. But in real-life, uncertainty is an important factor as data is collected using various types of sensors that are more or less accurate. Hence, data collected in a real-life database can be annotated with existing probabilities. This paper presents a novel pattern mining framework called high utility-probability sequential pattern mining (HUPSPM for mining high utility-probability sequential patterns (HUPSPs in uncertain sequence databases. A baseline algorithm with three optional pruning strategies is presented to mine HUPSPs. Moroever, to speed up the mining process, a projection mechanism is designed to create a database projection for each processed sequence, which is smaller than the original database. Thus, the number of unpromising candidates can be greatly reduced, as well as the execution time for mining HUPSPs. Substantial experiments both on real-life and synthetic datasets show that the designed algorithm performs well in terms of runtime, number of candidates, memory usage, and scalability for different minimum utility and minimum probability thresholds.

  15. Facebook usage among Indian businesses: A website content analysis

    OpenAIRE

    Rajwinder Saini

    2018-01-01

    The revolution of technologies in the era of internet has led to the new ways in which the companies communicate with their stakeholders. Facebook is the popular type of social media which is used by companies in these days as it promotes two- way communication. This study attempts to investigate the facebook usage among Indian business organization by using web content analysis method. A total of 50 business organizations were investigated and it was found that only 41 of them have their fac...

  16. BioServices: a common Python package to access biological Web Services programmatically.

    Science.gov (United States)

    Cokelaer, Thomas; Pultz, Dennis; Harder, Lea M; Serra-Musach, Jordi; Saez-Rodriguez, Julio

    2013-12-15

    Web interfaces provide access to numerous biological databases. Many can be accessed to in a programmatic way thanks to Web Services. Building applications that combine several of them would benefit from a single framework. BioServices is a comprehensive Python framework that provides programmatic access to major bioinformatics Web Services (e.g. KEGG, UniProt, BioModels, ChEMBLdb). Wrapping additional Web Services based either on Representational State Transfer or Simple Object Access Protocol/Web Services Description Language technologies is eased by the usage of object-oriented programming. BioServices releases and documentation are available at http://pypi.python.org/pypi/bioservices under a GPL-v3 license.

  17. Usage and applications of Semantic Web techniques and technologies to support chemistry research.

    Science.gov (United States)

    Borkum, Mark I; Frey, Jeremy G

    2014-01-01

    The drug discovery process is now highly dependent on the management, curation and integration of large amounts of potentially useful data. Semantics are necessary in order to interpret the information and derive knowledge. Advances in recent years have mitigated concerns that the lack of robust, usable tools has inhibited the adoption of methodologies based on semantics. THIS PAPER PRESENTS THREE EXAMPLES OF HOW SEMANTIC WEB TECHNIQUES AND TECHNOLOGIES CAN BE USED IN ORDER TO SUPPORT CHEMISTRY RESEARCH: a controlled vocabulary for quantities, units and symbols in physical chemistry; a controlled vocabulary for the classification and labelling of chemical substances and mixtures; and, a database of chemical identifiers. This paper also presents a Web-based service that uses the datasets in order to assist with the completion of risk assessment forms, along with a discussion of the legal implications and value-proposition for the use of such a service. We have introduced the Semantic Web concepts, technologies, and methodologies that can be used to support chemistry research, and have demonstrated the application of those techniques in three areas very relevant to modern chemistry research, generating three new datasets that we offer as exemplars of an extensible portfolio of advanced data integration facilities. We have thereby established the importance of Semantic Web techniques and technologies for meeting Wild's fourth "grand challenge".

  18. The ATLAS Public Web Pages: Online Management of HEP External Communication Content

    CERN Document Server

    Goldfarb, Steven; Phoboo, Abha Eli; Shaw, Kate

    2015-01-01

    The ATLAS Education and Outreach Group is in the process of migrating its public online content to a professionally designed set of web pages built on the Drupal content management system. Development of the front-end design passed through several key stages, including audience surveys, stakeholder interviews, usage analytics, and a series of fast design iterations, called sprints. Implementation of the web site involves application of the html design using Drupal templates, refined development iterations, and the overall population of the site with content. We present the design and development processes and share the lessons learned along the way, including the results of the data-driven discovery studies. We also demonstrate the advantages of selecting a back-end supported by content management, with a focus on workflow. Finally, we discuss usage of the new public web pages to implement outreach strategy through implementation of clearly presented themes, consistent audience targeting and messaging, and th...

  19. Potential influence of Web 2.0 usage and security practices of online users on information management

    Directory of Open Access Journals (Sweden)

    R.J. Rudman

    2009-02-01

    Full Text Available The proliferation of Web 2.0 applications was the impetus for this survey-based research into practices that online users currently employ when using Web 2.0 sites. As part of the study, the popularity of Web 2.0 technologies and sites among online users at a university was investigated to determine the extent of the potential threat to corporate security, arising from Web 2.0 use and access. The results of this study indicate that the use of Web 2.0 sites is very popular among students, as a proxy for the potential future business users, and that users are not necessarily aware of the risks associated with these sites. The respondents indicated that they regularly visit Web 2.0 sites, and that they post personal information on these sites. This is of concern in protecting arguably the most valuable asset of a business.

  20. Evaluating the Utility of Web-Based Consumer Support Tools Using Rough Sets

    Science.gov (United States)

    Maciag, Timothy; Hepting, Daryl H.; Slezak, Dominik; Hilderman, Robert J.

    On the Web, many popular e-commerce sites provide consumers with decision support tools to assist them in their commerce-related decision-making. Many consumers will rank the utility of these tools quite highly. Data obtained from web usage mining analyses, which may provide knowledge about a user's online experiences, could help indicate the utility of these tools. This type of analysis could provide insight into whether provided tools are adequately assisting consumers in conducting their online shopping activities or if new or additional enhancements need consideration. Although some research in this regard has been described in previous literature, there is still much that can be done. The authors of this paper hypothesize that a measurement of consumer decision accuracy, i.e. a measurement preferences, could help indicate the utility of these tools. This paper describes a procedure developed towards this goal using elements of rough set theory. The authors evaluated the procedure using two support tools, one based on a tool developed by the US-EPA and the other developed by one of the authors called cogito. Results from the evaluation did provide interesting insights on the utility of both support tools. Although it was shown that the cogito tool obtained slightly higher decision accuracy, both tools could be improved from additional enhancements. Details of the procedure developed and results obtained from the evaluation will be provided. Opportunities for future work are also discussed.

  1. EFFICIENCY OF THE USE OF AUTHENTIC WEB-RESOURCES IN TRANSLATORS TRAINING

    Directory of Open Access Journals (Sweden)

    Iryna M. Drobit

    2013-06-01

    Full Text Available The article deals with pedagogical assumptions and efficiency of the use of Information and Communication Technologies, especially authentic web-resources, while teaching language for specific purposes (translators and interpreters. Accuracy, content, and functionality of web-resource TED, which contains examples of authentic speech in English, have been outlined. It has been demonstrated that usage of multimedia and communication facilities of the TED web-resource provides favourable opportunity to involve students in such professional activities as translation and proofreading, and also to improve the level of their language skills.

  2. Usage of the www.2aida.org AIDA diabetes software Website: a pilot study.

    Science.gov (United States)

    Lehmann, Eldon D

    2003-01-01

    AIDA is a diabetes-computing program freely available from www.2aida.org on the Web. The software is intended to serve as an educational support tool, and can be used by anyone who has an interest in diabetes, whether they be patients, relatives, health-care professionals, or students. In previous "Diabetes Information Technology & WebWatch" columns various indicators of usage of the AIDA program have been reviewed, and various comments from users of the software have been documented. One aspect of AIDA, though, that has been of considerable interest has been to investigate its Web-based distribution as a wider paradigm for more general medically related usage of the Internet. In this respect we have been keen to understand in general terms: (1) why people are turning to the Web for health-care/diabetes information; (2) more specifically, what sort of people are making use of the AIDA software; and (3) what benefits they feel might accrue from using the program. To answer these types of questions we have been conducting a series of audits/surveys via the AIDA Website, and via the software program itself, to learn as much as possible about who the AIDA end users really are. The rationale for this work is that, in this way, it should be possible to improve the program as well as tailor future versions of the software to the interests and needs of its users. However, a recurring observation is that data collection is easiest if it is as unobtrusive and innocuous as possible. One aspect of learning as much as possible about diabetes Website visitors and users may be to apply techniques that do not necessitate any visitor or user interaction. There are various programs that can monitor what pages visitors are viewing at a site. As these programs do not require visitors to do anything special, over time some interesting insights into Website usage may be obtained. For the current study we have reviewed anonymous logstats data, which are automatically collected at many

  3. Stochastic Modeling of Usage Patterns in a Web-Based Information System.

    Science.gov (United States)

    Chen, Hui-Min; Cooper, Michael D.

    2002-01-01

    Uses continuous-time stochastic models, mainly based on semi-Markov chains, to derive user state transition patterns, both in rates and in probabilities, in a Web-based information system. Describes search sessions from transaction logs of the University of California's MELVYL library catalog system and discusses sequential dependency. (Author/LRW)

  4. A New Look at Data Usage by Using Metadata Attributes as Indicators of Data Quality

    Science.gov (United States)

    Won, Y. I.; Wanchoo, L.; Behnke, J.

    2016-12-01

    NASA's Earth Observing System Data and Information System (EOSDIS) stores and distributes data from EOS satellites, as well as ancillary, airborne, in-situ, and socio-economic data. Twelve EOSDIS data centers support different scientific disciplines by providing products and services tailored to specific science communities. Although discipline oriented, these data centers provide common data management functions of ingest, archive and distribution, as well as documentation of their data and services on their web-sites. The Earth Science Data and Information System (ESDIS) Project collects these metrics from the EOSDIS data centers on a daily basis through a tool called the ESDIS Metrics System (EMS). These metrics are used in this study. The implementation of the Earthdata Login - formerly known as the User Registration System (URS) - across the various NASA data centers provides the EMS additional information about users obtaining data products from EOSDIS data centers. These additional user attributes collected by the Earthdata login, such as the user's primary area of study can augment the understanding of data usage, which in turn can help the EOSDIS program better understand the users' needs. This study will review the key metrics (users, distributed volume, and files) in multiple ways to gain an understanding of the significance of the metadata. Characterizing the usability of data by key metadata elements such as discipline and study area, will assist in understanding how the users have evolved over time. The data usage pattern based on version numbers may also provide some insight into the level of data quality. In addition, the data metrics by various services such as the Open-source Project for a Network Data Access Protocol (OPeNDAP), Web Map Service (WMS), Web Coverage Service (WCS), and subsets, will address how these services have extended the usage of data. Over-all, this study will present the usage of data and metadata by metrics analyses and will

  5. Brief Report: Web-based Management of Adolescent Chronic Pain: Development and Usability Testing of an Online Family Cognitive Behavioral Therapy Program

    Science.gov (United States)

    Palermo, Tonya M.

    2009-01-01

    Objectives This study evaluates the usability and feasibility of a Web-based intervention (Web-MAP) to deliver cognitive behavioral therapy (CBT) to adolescents with chronic pain and their parents. Methods The Web site was evaluated in two stages. In stage one, recovered adolescents and parents (n = 5 dyads), who had completed office-based CBT through a pediatric pain management clinic, completed ratings of Web site content, usability, appearance, and theme. In stage two, treatment-seeking adolescents and their parents (n = 6 dyads) completed the full-length Web program. Program usage data were obtained to assess interaction with the Web site. Results Participants rated moderate to strong acceptability of the program. Usage data indicated that participants interacted with the site and used communication features. Conclusions Feedback from usability testing provided important information in the process of designing a feasible Web-based treatment for adolescents with chronic pain for use in a randomized controlled trial. PMID:18669578

  6. Traffic-based feedback on the web.

    Science.gov (United States)

    Aizen, Jonathan; Huttenlocher, Daniel; Kleinberg, Jon; Novak, Antal

    2004-04-06

    Usage data at a high-traffic web site can expose information about external events and surges in popularity that may not be accessible solely from analyses of content and link structure. We consider sites that are organized around a set of items available for purchase or download, consider, for example, an e-commerce site or collection of online research papers, and we study a simple indicator of collective user interest in an item, the batting average, defined as the fraction of visits to an item's description that result in an acquisition of that item. We develop a stochastic model for identifying points in time at which an item's batting average experiences significant change. In experiments with usage data from the Internet Archive, we find that such changes often occur in an abrupt, discrete fashion, and that these changes can be closely aligned with events such as the highlighting of an item on the site or the appearance of a link from an active external referrer. In this way, analyzing the dynamics of item popularity at an active web site can help characterize the impact of a range of events taking place both on and off the site.

  7. Analysing Customer Opinions with Text Mining Algorithms

    Science.gov (United States)

    Consoli, Domenico

    2009-08-01

    Knowing what the customer thinks of a particular product/service helps top management to introduce improvements in processes and products, thus differentiating the company from their competitors and gain competitive advantages. The customers, with their preferences, determine the success or failure of a company. In order to know opinions of the customers we can use technologies available from the web 2.0 (blog, wiki, forums, chat, social networking, social commerce). From these web sites, useful information must be extracted, for strategic purposes, using techniques of sentiment analysis or opinion mining.

  8. An Exploratory Study on Small Business Website Creation and Usage

    OpenAIRE

    Chuleeporn Changchit; Tim Klaus

    2015-01-01

    This study aims at exploring the factors related to the implementation of E-commerce websites by small business owners. While large organizations often consider E-commerce as a fundamental piece of their business strategy, small businesses place varying degrees of importance on E-commerce as a strategic tool to business success. Through a survey of small businesses, this study examines the creation and usage of E-commerce websites for small businesses. For companies with only a web presence, ...

  9. Do College Faculty Embrace Web 2.0 Technology?

    Science.gov (United States)

    Siha, Samia M.; Bell, Reginald Lamar; Roebuck, Deborah

    2016-01-01

    The authors sought to determine if Rogers's Innovation Decision Process model could analyze Web 2.0 usage within the collegiate environment. The key independent variables studied in relationship to this model were gender, faculty rank, course content delivery method, and age. Chi-square nonparametric tests on the independent variables across…

  10. Web multimedia information retrieval using improved Bayesian algorithm.

    Science.gov (United States)

    Yu, Yi-Jun; Chen, Chun; Yu, Yi-Min; Lin, Huai-Zhong

    2003-01-01

    The main thrust of this paper is application of a novel data mining approach on the log of user's feedback to improve web multimedia information retrieval performance. A user space model was constructed based on data mining, and then integrated into the original information space model to improve the accuracy of the new information space model. It can remove clutter and irrelevant text information and help to eliminate mismatch between the page author's expression and the user's understanding and expectation. User space model was also utilized to discover the relationship between high-level and low-level features for assigning weight. The authors proposed improved Bayesian algorithm for data mining. Experiment proved that the authors' proposed algorithm was efficient.

  11. What Are the Usage Conditions of Web 2.0 Tools Faculty of Education Students?

    Science.gov (United States)

    Agir, Ahmet

    2014-01-01

    As a result of advances in technology and then the emergence of using Internet in every step of life, web that provides access to the documents such as picture, audio, animation and text in Internet started to be used. At first, web consists of only visual and text pages that couldn't enable to make user's interaction. However, it is seen that not…

  12. Extracting Baseline Electricity Usage Using Gradient Tree Boosting

    Energy Technology Data Exchange (ETDEWEB)

    Kim, Taehoon [Ulsan Nat. Inst. of Sci. & Tech., Ulsan (South Korea); Lee, Dongeun [Ulsan Nat. Inst. of Sci. & Tech., Ulsan (South Korea); Choi, Jaesik [Ulsan Nat. Inst. of Sci. & Tech., Ulsan (South Korea); Spurlock, Anna [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States); Sim, Alex [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States); Todd, Annika [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States); Wu, Kesheng [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)

    2016-05-05

    To understand how specific interventions affect a process observed over time, we need to control for the other factors that influence outcomes. Such a model that captures all factors other than the one of interest is generally known as a baseline. In our study of how different pricing schemes affect residential electricity consumption, the baseline would need to capture the impact of outdoor temperature along with many other factors. In this work, we examine a number of different data mining techniques and demonstrate Gradient Tree Boosting (GTB) to be an effective method to build the baseline. We train GTB on data prior to the introduction of new pricing schemes, and apply the known temperature following the introduction of new pricing schemes to predict electricity usage with the expected temperature correction. Our experiments and analyses show that the baseline models generated by GTB capture the core characteristics over the two years with the new pricing schemes. In contrast to the majority of regression based techniques which fail to capture the lag between the peak of daily temperature and the peak of electricity usage, the GTB generated baselines are able to correctly capture the delay between the temperature peak and the electricity peak. Furthermore, subtracting this temperature-adjusted baseline from the observed electricity usage, we find that the resulting values are more amenable to interpretation, which demonstrates that the temperature-adjusted baseline is indeed effective.

  13. Turkish University Students’ Perceptions of the World Wide Web as a Learning Tool: An Investigation Based on Gender, Socio-Economic Background, and Web Experience

    Directory of Open Access Journals (Sweden)

    Erkan Tekinarslan

    2009-04-01

    Full Text Available The main purpose of the study is to investigate Turkish undergraduate students’ perceptions of the Web as a learning tool and to analyze whether their perceptions differ significantly based on gender, socio-economic background, and Web experience. Data obtained from 722 undergraduate students (331 males and 391 females were used in the analyses. The findings indicated significant differences based on gender, socio-economic background, and Web experience. The students from higher socio-economic backgrounds indicated significantly higher attitude scores on the self-efficacy subscale of the Web attitude scale. Similarly, the male students indicated significantly higher scores on the self-efficacy subscale than the females. Also, the students with higher Web experience in terms of usage frequency indicated higher scores on all subscales (i.e., self-efficacy, affective, usefulness, Web-based learning. Moreover, the two-way ANOVA results indicated that the student’s PC ownership has significant main effects on their Web attitudes and on the usefulness, self-efficacy, and affective subscales.

  14. metabolicMine: an integrated genomics, genetics and proteomics data warehouse for common metabolic disease research.

    Science.gov (United States)

    Lyne, Mike; Smith, Richard N; Lyne, Rachel; Aleksic, Jelena; Hu, Fengyuan; Kalderimis, Alex; Stepan, Radek; Micklem, Gos

    2013-01-01

    Common metabolic and endocrine diseases such as diabetes affect millions of people worldwide and have a major health impact, frequently leading to complications and mortality. In a search for better prevention and treatment, there is ongoing research into the underlying molecular and genetic bases of these complex human diseases, as well as into the links with risk factors such as obesity. Although an increasing number of relevant genomic and proteomic data sets have become available, the quantity and diversity of the data make their efficient exploitation challenging. Here, we present metabolicMine, a data warehouse with a specific focus on the genomics, genetics and proteomics of common metabolic diseases. Developed in collaboration with leading UK metabolic disease groups, metabolicMine integrates data sets from a range of experiments and model organisms alongside tools for exploring them. The current version brings together information covering genes, proteins, orthologues, interactions, gene expression, pathways, ontologies, diseases, genome-wide association studies and single nucleotide polymorphisms. Although the emphasis is on human data, key data sets from mouse and rat are included. These are complemented by interoperation with the RatMine rat genomics database, with a corresponding mouse version under development by the Mouse Genome Informatics (MGI) group. The web interface contains a number of features including keyword search, a library of Search Forms, the QueryBuilder and list analysis tools. This provides researchers with many different ways to analyse, view and flexibly export data. Programming interfaces and automatic code generation in several languages are supported, and many of the features of the web interface are available through web services. The combination of diverse data sets integrated with analysis tools and a powerful query system makes metabolicMine a valuable research resource. The web interface makes it accessible to first

  15. Navigation, findability and the usage of cultural heritage on the web

    DEFF Research Database (Denmark)

    Fransson, Jonas

    2014-01-01

    . On average cultural heritage objects are viewed in half of the session. In the analysis of the web survey answers two groups of users’ are distinguished, the professional user in a work context and users in a hobby or leisure context. School or study as a context is prominent in Guaman Poma, the Inca...

  16. Web-based Data Mining to Systematically Determine Data Quality From the EarthScope USArray Seismic Observatory Project

    Science.gov (United States)

    Newman, R. L.; Lindquist, K. G.; Hansen, T. S.; Vernon, F. L.; Eakins, J.; Foley, S.

    2004-12-01

    When fully operational, the Transportable Array (TA) and Flexible Array (FA) components of the continent-scale EarthScope USArray seismic observatory project will provide telemetered real-time data from up to 600 stations. By the fifth year of the deployment the predicted total amount of data production for the TA and FA will be approximately 1500 Gb/yr and approximately 1000 Gb/yr respectively. In addition to delivering the data to the IRIS Data Management Center (DMC) for permanent archiving, the Array Network Facility (ANF) is charged with real-time data quality control, calibration, metadata storage and retrieval, network monitoring and local archiving. The Antelope real-time processing software provides the back-bone to this effort, supported by the Storage Resource Broker data replication/archiving system and the Nagios network monitoring tool. Real-time, web-based data mining, with support for multiple database schemas, is provided by an Antelope interface to both Perl and PHP scripting languages. This allows embedding of database functions in HTML. A suite of online tools allows query and graphical display of dynamic real-time sensor network parameters such as data latency, network topologies, and data return rates. Data and metadata are also web-accessible, for example XML trees of seismic data and graphical display of instrument response functions. The purpose of these tools is to provide the ANF, IRIS and end-users of USArray data with a real-time systematic method of determining data quality for the spatio-temporal area of interest. The tools are accessible at http://anf.ucsd.edu

  17. Collection Usage Pre- and Post-Summon Implementation at the University of Manitoba

    Directory of Open Access Journals (Sweden)

    Lisa O’Hara

    2012-12-01

    Full Text Available Objectives – This study examines the use of print and electronic collections bothbefore and after implementation of Summon at the University of Manitoba Libraries.Summon is a web-scale discovery service which allows discovery of all of thematerials the library owns or has access to from a simple search box on the library’sweb page.Methods – COUNTER statistics were used to determine database, e-journal, and ebookstatistics, including database search statistics (DR1 from the COUNTERDatabase Report 1, full-text article downloads from the COUNTER Journal Report 1(JR1, and successful section search requests from the COUNTER Book Report 2 (BR2for electronic resources. Sirsi, the University of Manitoba’s integrated library system,provided statistics on checkouts for the libraries’ circulating print monograph andserial collections. The percentage change from the pre-Summon implementationperiod to the post-Summon implementation period was calculated and these numberswere used to determine whether usage had increased or decreased for both print andelectronic collections.Results – As expected, searches in citation databases decreased because searches wereno longer being carried out in the native database as the metadata from the databaseis included in Summon. E-journal usage increased dramatically and e-book usage alsoincreased for four of six providers examined. Print usage decreased, but the resultswere inconclusive.Conclusions – Summon implementation had a favourable impact on collection usage.

  18. Wer geht ins Netz? Web of Knowledge - Nutzungszahlen österreichischer Universitäten 2005

    Directory of Open Access Journals (Sweden)

    Dollfuß, Helmut

    2006-09-01

    Full Text Available Web of Knowledge (Thomson/ISI is licenced by a consortium of Austrian institutes. 2005 usage was analysed based on COUNTER compliant reports from the ISI Usage Reporting System. The article concentrates on the five databases which where most frequently used (SCI, SSCI, AHCI, CCC, JCR. The distribution of the number of subsessions for each institute is shown graphically. Session numbers where calculated against numbers of Full Time Equivalents (FTEs. Big institutes use the databases more frequently in regard to usage numbers. Institutes with a focus on biomedicine and smaller institutes in general use the databases better in respect to usage per FTE.

  19. Web X-Ray: Developing and Adopting Web Best Practices in Enterprises

    Directory of Open Access Journals (Sweden)

    Reinaldo Ferreira

    2016-12-01

    Full Text Available The adoption of Semantic Web technologies constitutes a promising approach to data structuring and integration, both for public and private usage. While these technologies have been around for some time, their adoption is behind overall expectations, particularly in the case of Enterprises. Having that in mind, we developed a Semantic Web Implementation Model that measures and facilitates the implementation of the technology. The advantages of using the model proposed are two-fold: the model serves as a guide for driving the implementation of the Semantic Web as well as it helps to evaluate the impact of the introduction of the technology. The model was adopted by 19 enterprises in an Action Research intervention of one year with promising results: according to the model's scale, in average, all enterprises evolved from a 6% evaluation to 46% during that period. Furthermore, practical implementation recommendations, a typical consulting tool, were developed and adopted during the project by all enterprises, providing important guidelines for the identification of a development path that may be adopted on a larger scale. Meanwhile, the project also outlined that most enterprises were interested in an even broader scope of the Implementation Model and the ambition of a "All Web Technologies" approach arose. One model that could embrace the observable overlapping of different Web generations, namely the Web of Documents, the Social Web, the Web of Data and, ultimately, the Web of Context. One model that could combine the evaluation and guidance for all enterprises to follow. That's the goal of the undergoing "Project Web X-ray" that aims to involve 200 enterprises in the adoption of best practices that may lead to their business development based on Web technologies. This paper presents a case of how Action Research promoted the simultaneous advancement of academic research and enterprise development and introduces the framework and opportunities

  20. Why can’t users choose their identity providers on the web?

    Directory of Open Access Journals (Sweden)

    Corre Kevin

    2017-07-01

    Full Text Available Authentication delegation is a major function of the modern web. Identity Providers (IdP acquired a central role by providing this function to other web services. By knowing which web services or web applications access its service, an IdP can violate the enduser privacy by discovering information that the user did not want to share with its IdP. For instance, WebRTC introduces a new field of usage as authentication delegation happens during the call session establishment, between two users. As a result, an IdP can easily discover that Bob has a meeting with Alice. A second issue that increases the privacy violation is the lack of choice for the end-user to select its own IdP. Indeed, on many web-applications, the end-user can only select between a subset of IdPs, in most cases Facebook or Google. In this paper, we analyze this phenomena, in particular why the end-user cannot easily select its preferred IdP, though there exists standards in this field such as OpenID Connect and OAuth 2? To lead this analysis, we conduct three investigations. The first one is a field survey on OAuth 2 and OpenID Connect scope usage by web sites to understand if scopes requested by websites could allow for user defined IdPs. The second one tries to understand whether the problem comes from the OAuth 2 protocol or its implementations by IdP. The last one tries to understand if trust relations between websites and IdP could prevent the end user to select its own IdP. Finally, we sketch possible architecture for web browser based identity management, and report on the implementation of a prototype.

  1. Web-based health interventions for family caregivers of elderly individuals: A Scoping Review.

    Science.gov (United States)

    Wasilewski, Marina B; Stinson, Jennifer N; Cameron, Jill I

    2017-07-01

    For the growing proportion of elders globally, aging-related illnesses are primary causes of morbidity causing reliance on family members for support in the community. Family caregivers experience poorer physical and mental health than their non-caregiving counterparts. Web-based interventions can provide accessible support to family caregivers to offset declines in their health and well-being. Existing reviews focused on web-based interventions for caregivers have been limited to single illness populations and have mostly focused on the efficacy of the interventions. We therefore have limited insight into how web-based interventions for family caregiver have been developed, implemented and evaluated across aging-related illness. To describe: a) theoretical underpinnings of the literature; b) development, content and delivery of web-based interventions; c) caregiver usage of web-based interventions; d) caregiver experience with web-based interventions and e) impact of web-based interventions on caregivers' health outcomes. We followed Arksey and O'Malley's methodological framework for conducting scoping reviews which entails setting research questions, selecting relevant studies, charting the data and synthesizing the results in a report. Fifty-three publications representing 32 unique web-based interventions were included. Over half of the interventions were targeted at dementia caregivers, with the rest targeting caregivers to the stroke, cancer, diabetes and general frailty populations. Studies used theory across the intervention trajectory. Interventions aimed to improve a range of health outcomes for caregivers through static and interactive delivery methods Caregivers were satisfied with the usability and accessibility of the websites but usage was generally low and declined over time. Depression and caregiver burden were the most common outcomes evaluated. The interventions ranged in their impact on health and social outcomes but reductions in perception of

  2. Literature Mining Methods for Toxicology and Construction of ...

    Science.gov (United States)

    Webinar Presentation on text-mining methodologies in use at NCCT and how they can be used to assist with the OECD Retinoid project. Presentation to 1st Workshop/Scientific Expert Group meeting on the OECD Retinoid Project - April 26, 2016 –Brussels, Presented remotely via web.

  3. A simplified approach to the PROMETHEE method for priority setting in management of mine action projects

    Directory of Open Access Journals (Sweden)

    Marko Mladineo

    2016-12-01

    Full Text Available In the last 20 years, priority setting in mine actions, i.e. in humanitarian demining, has become an increasingly important topic. Given that mine action projects require management and decision-making based on a multi -criteria approach, multi-criteria decision-making methods like PROMETHEE and AHP have been used worldwide for priority setting. However, from the aspect of mine action, where stakeholders in the decision-making process for priority setting are project managers, local politicians, leaders of different humanitarian organizations, or similar, applying these methods can be difficult. Therefore, a specialized web-based decision support system (Web DSS for priority setting, developed as part of the FP7 project TIRAMISU, has been extended using a module for developing custom priority setting scenarios in line with an exceptionally easy, user-friendly approach. The idea behind this research is to simplify the multi-criteria analysis based on the PROMETHEE method. Therefore, a simplified PROMETHEE method based on statistical analysis for automated suggestions of parameters such as preference function thresholds, interactive selection of criteria weights, and easy input of criteria evaluations is presented in this paper. The result is web-based DSS that can be applied worldwide for priority setting in mine action. Additionally, the management of mine action projects is supported using modules for providing spatial data based on the geographic information system (GIS. In this paper, the benefits and limitations of a simplified PROMETHEE method are presented using a case study involving mine action projects, and subsequently, certain proposals are given for the further research.

  4. Anthropogenic and natural sources of acidity and metals and their influence on the structure of stream food webs.

    Science.gov (United States)

    Hogsden, Kristy L; Harding, Jon S

    2012-03-01

    We compared food web structure in 20 streams with either anthropogenic or natural sources of acidity and metals or circumneutral water chemistry in New Zealand. Community and diet analysis indicated that mining streams receiving anthropogenic inputs of acidic and metal-rich drainage had much simpler food webs (fewer species, shorter food chains, less links) than those in naturally acidic, naturally high metal, and circumneutral streams. Food webs of naturally high metal streams were structurally similar to those in mining streams, lacking fish predators and having few species. Whereas, webs in naturally acidic streams differed very little from those in circumneutral streams due to strong similarities in community composition and diets of secondary and top consumers. The combined negative effects of acidity and metals on stream food webs are clear. However, elevated metal concentrations, regardless of source, appear to play a more important role than acidity in driving food web structure. Copyright © 2011 Elsevier Ltd. All rights reserved.

  5. MRI usage in a pediatric emergency department: an analysis of usage and usage trends over 5 years

    Energy Technology Data Exchange (ETDEWEB)

    Scheinfeld, Meir H. [Montefiore Medical Center, Albert Einstein College of Medicine, Department of Radiology, Division of Emergency Radiology, Bronx, NY (United States); Moon, Jee-Young; Wang, Dan [Albert Einstein College of Medicine, Department of Epidemiology and Population Health, Bronx, NY (United States); Fagan, Michele J. [Montefiore Medical Center, Albert Einstein College of Medicine, Department of Pediatrics, Division of Emergency Medicine, Bronx, NY (United States); Davoudzadeh, Reubin [Montefiore Medical Center, Department of Radiology, Bronx, NY (United States); Taragin, Benjamin H. [Montefiore Medical Center, Albert Einstein College of Medicine, Department of Radiology, Division of Pediatric Radiology, Bronx, NY (United States)

    2017-03-15

    Magnetic resonance imaging (MRI) usage has anecdotally increased due to the principles of ALARA and the desire to Image Gently. Aside from a single abstract in the emergency medicine literature, pediatric emergency department MRI usage has not been described. Our objective was to determine whether MRI use is indeed increasing at a high-volume urban pediatric emergency department with 24/7 MRI availability. Also, we sought to determine which exams, time periods and demographics influenced the trend. Institutional Review Board exemption was obtained. Emergency department patient visit and exam data were obtained from the hospital database for the 2011-2015 time period. MRI usage data were normalized using emergency department patient visit data to determine usage rates. The z-test was used to compare MRI use by gender. The chi-square test was used to test for trends in MRI usage during the study period and in patient age. MRI usage for each hour and each weekday were tabulated to determine peak and trough usage times. MRI usage rate per emergency department patient visit was 0.36%. Headache, pain and rule-out appendicitis were the most common indications for neuroradiology, musculoskeletal and trunk exams, respectively. Usage in female patients was significantly greater than in males (0.42% vs. 0.29%, respectively, P<0.001). Usage significantly increased during the 5-year period (P<0.001). Use significantly increased from age 3 to 17 (0.011% to 1.1%, respectively, P<0.001). Sixty percent of exams were performed after-hours, the highest volume during the 10 p.m. hour and lowest between 4 a.m. and 9 a.m. MRI use was highest on Thursdays and lowest on Sundays (MRI on 0.45% and 0.22% of patients, respectively). MRI use in children increased during the study period, most notably in females, on weekdays and after-hours. (orig.)

  6. SOAP based web services and their future role in VO projects

    Science.gov (United States)

    Topf, F.; Jacquey, C.; Génot, V.; Cecconi, B.; André, N.; Zhang, T. L.; Kallio, E.; Lammer, H.; Facsko, G.; Stöckler, R.; Khodachenko, M.

    2011-10-01

    Modern state-of-the-art web services are from crucial importance for the interoperability of different VO tools existing in the planetary community. SOAP based web services assure the interconnectability between different data sources and tools by providing a common protocol for communication. This paper will point out a best practice approach with the Automated Multi-Dataset Analysis Tool (AMDA) developed by CDPP, Toulouse and the provision of VEX/MAG data from a remote database located at IWF, Graz. Furthermore a new FP7 project IMPEx will be introduced with a potential usage example of AMDA web services in conjunction with simulation models.

  7. Polytechnic Students? Perceptions of Youtube Usage in the English Oral Communication Classroom

    OpenAIRE

    Gunadevi K. Jeevi Subramaniam; Fathimah Pathma Abdullah; Raja Nor Safinas Raja Harun

    2013-01-01

    A new creative classroom technique to promote learning environment in English oral communication lesson is important. Integrating and adopting multimedia and web technologies can motivate and engage the new generation learners. YouTube usage in the English oral communication classroom is one of the strategies which will have more flexible, effective instructional materials to the learners in making the students involve in active communication. The inclusion of multimedia technologies into the...

  8. Arabic web pages clustering and annotation using semantic class features

    Directory of Open Access Journals (Sweden)

    Hanan M. Alghamdi

    2014-12-01

    Full Text Available To effectively manage the great amount of data on Arabic web pages and to enable the classification of relevant information are very important research problems. Studies on sentiment text mining have been very limited in the Arabic language because they need to involve deep semantic processing. Therefore, in this paper, we aim to retrieve machine-understandable data with the help of a Web content mining technique to detect covert knowledge within these data. We propose an approach to achieve clustering with semantic similarities. This approach comprises integrating k-means document clustering with semantic feature extraction and document vectorization to group Arabic web pages according to semantic similarities and then show the semantic annotation. The document vectorization helps to transform text documents into a semantic class probability distribution or semantic class density. To reach semantic similarities, the approach extracts the semantic class features and integrates them into the similarity weighting schema. The quality of the clustering result has evaluated the use of the purity and the mean intra-cluster distance (MICD evaluation measures. We have evaluated the proposed approach on a set of common Arabic news web pages. We have acquired favorable clustering results that are effective in minimizing the MICD, expanding the purity and lowering the runtime.

  9. Comparison of SOAP and REST Based Web Services Using Software Evaluation Metrics

    Directory of Open Access Journals (Sweden)

    Tihomirovs Juris

    2016-12-01

    Full Text Available The usage of Web services has recently increased. Therefore, it is important to select right type of Web services at the project design stage. The most common implementations are based on SOAP (Simple Object Access Protocol and REST (Representational State Transfer Protocol styles. Maintainability of REST and SOAP Web services has become an important issue as popularity of Web services is increasing. Choice of the right approach is not an easy decision since it is influenced by development requirements and maintenance considerations. In the present research, we present the comparison of SOAP and REST based Web services using software evaluation metrics. To achieve this aim, a systematic literature review will be made to compare REST and SOAP Web services in terms of the software evaluation metrics.

  10. Near-line Archive Data Mining at the Goddard Distributed Active Archive Center

    Science.gov (United States)

    Pham, L.; Mack, R.; Eng, E.; Lynnes, C.

    2002-12-01

    NASA's Earth Observing System (EOS) is generating immense volumes of data, in some cases too much to provide to users with data-intensive needs. As an alternative to moving the data to the user and his/her research algorithms, we are providing a means to move the algorithms to the data. The Near-line Archive Data Mining (NADM) system is the Goddard Earth Sciences Distributed Active Archive Center's (GES DAAC) web data mining portal to the EOS Data and Information System (EOSDIS) data pool, a 50-TB online disk cache. The NADM web portal enables registered users to submit and execute data mining algorithm codes on the data in the EOSDIS data pool. A web interface allows the user to access the NADM system. The users first develops personalized data mining code on their home platform and then uploads them to the NADM system. The C, FORTRAN and IDL languages are currently supported. The user developed code is automatically audited for any potential security problems before it is installed within the NADM system and made available to the user. Once the code has been installed the user is provided a test environment where he/she can test the execution of the software against data sets of the user's choosing. When the user is satisfied with the results, he/she can promote their code to the "operational" environment. From here the user can interactively run his/her code on the data available in the EOSDIS data pool. The user can also set up a processing subscription. The subscription will automatically process new data as it becomes available in the EOSDIS data pool. The generated mined data products are then made available for FTP pickup. The NADM system uses the GES DAAC-developed Simple Scalable Script-based Science Processor (S4P) to automate tasks and perform the actual data processing. Users will also have the option of selecting a DAAC-provided data mining algorithm and using it to process the data of their choice.

  11. minepath.org: a free interactive pathway analysis web server.

    Science.gov (United States)

    Koumakis, Lefteris; Roussos, Panos; Potamias, George

    2017-07-03

    ( www.minepath.org ) is a web-based platform that elaborates on, and radically extends the identification of differentially expressed sub-paths in molecular pathways. Besides the network topology, the underlying MinePath algorithmic processes exploit exact gene-gene molecular relationships (e.g. activation, inhibition) and are able to identify differentially expressed pathway parts. Each pathway is decomposed into all its constituent sub-paths, which in turn are matched with corresponding gene expression profiles. The highly ranked, and phenotype inclined sub-paths are kept. Apart from the pathway analysis algorithm, the fundamental innovation of the MinePath web-server concerns its advanced visualization and interactive capabilities. To our knowledge, this is the first pathway analysis server that introduces and offers visualization of the underlying and active pathway regulatory mechanisms instead of genes. Other features include live interaction, immediate visualization of functional sub-paths per phenotype and dynamic linked annotations for the engaged genes and molecular relations. The user can download not only the results but also the corresponding web viewer framework of the performed analysis. This feature provides the flexibility to immediately publish results without publishing source/expression data, and get all the functionality of a web based pathway analysis viewer. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  12. Implementasi Web Service Dan Analisis Kinerja Algoritma Klasifikasi Data Mining Untuk Memprediksi Diabetes Mellitus

    Directory of Open Access Journals (Sweden)

    Doni Setyawan

    2017-11-01

    Full Text Available Salah satu penyakit yang ditimbulkan akibat kesalahan pola gaya hidup adalah Diabetes Mellitus (DM. Gejala penyakit diabetes sering dilalaikan oleh kebanyakan orang, sehingga mereka cenderung untuk mengabaikannya dan tidak mau melakukan medical check up. Di Indonesia jumlah penderita DM terus mengalami peningkatan dari tahun ke tahun. World Health Organization (WHO memperkirakan jumlah penderita DM tipe 2 di Indonesia akan mengalami peningkatan secara signifikan hingga 21,3 juta jiwa pada tahun 2030 mendatang. Ternyata dengan bantuan ilmu data mining, data pasien diabetes dapat digunakan untuk memprediksi apakah sesorang positif diabetes atau tidak. Tahapan awal dilakukan preprocessing data untuk menangani missing dan non numeric values. Kemudian traning dan testing menggunakan k-fold cross validation dengan algoritma K-Nearest Neighbors (KNN, random forest dan naive bayesian. Pengujian dilakukan dengan menghitung accuracy, sensitivity dan specificity. Dari hasil uji 10-fold cross validation diperoleh rata-rata akurasi tertinggi ketika menggunakan naive bayesian yaitu 75,65%, sedangkan KNN 75,53% dan random forest 73,69%. Perhitungan sensitivity dan specificity dengan membagi 786 data menjadi 594 data training dan 192 data testing. Untuk KNN diperoleh sensitivity 56,72% dan specificity 78,68%, random forest diperoleh sensitivity 53,73% dan specificity 86,4%, sedangkan naive bayesian diperoleh sensitivity 62,69% dan specificity 84%. Implementasi restful web service diterapkan pada model dengan akurasi tertinggi yaitu naive bayesian dengan format json sebagai return value.

  13. The PETfold and PETcofold web servers for intra- and intermolecular structures of multiple RNA sequences

    DEFF Research Database (Denmark)

    Seemann, Ernst Stefan; Menzel, Karl Peter; Backofen, Rolf

    2011-01-01

    gene. We present web servers to analyze multiple RNA sequences for common RNA structure and for RNA interaction sites. The web servers are based on the recent PET (Probabilistic Evolutionary and Thermodynamic) models PETfold and PETcofold, but add user friendly features ranging from a graphical layer...... to interactive usage of the predictors. Additionally, the web servers provide direct access to annotated RNA alignments, such as the Rfam 10.0 database and multiple alignments of 16 vertebrate genomes with human. The web servers are freely available at: http://rth.dk/resources/petfold/...

  14. Comparison of Turkish and US Pre-Service Teachers' Web 2.0 Tools Usage Characteristics

    Science.gov (United States)

    Kiyici, Mubin; Akyeampong, Albert; Balkan Kiyici, Fatime

    2013-01-01

    As the Internet and computer develop, the world is changing dramatically and fantastically. Usage of technological tools is increased day by day in daily life besides ICT. All the technological tools shape individual behavior, life style and learning style as well as individual lives. Today's child use different tools and different way to…

  15. Adherence to a Web-Based Physical Activity Intervention for Patients With Knee and/or Hip Osteoarthritis: A Mixed Method Study

    NARCIS (Netherlands)

    Bossen, D.; Buskermolen, M.; Veenhof, C.; de Bakker, D.H.; Dekker, J.

    2013-01-01

    Background: Web-based interventions show promise in promoting a healthy lifestyle, but their effectiveness is hampered by high rates of nonusage. Predictors and reasons for (non)usage are not well known. Identifying which factors are related to usage contributes to the recognition of subgroups who

  16. Uncovering obfuscated web tracking

    OpenAIRE

    Espuña Buxó, Álvaro

    2016-01-01

    En este proyecto creamos una plataforma para detectar automáticamente y de forma dinámica si en una cierta página web se esta usando "canvas fingerprinting" y si el uso de éste está siendo ofuscado. Además analizamos las páginas más visitadas según Alexa y exponemos los resultado obtenidos. In this project we develop a framework that tries to detect automatically and dynamically if a website is using canvas fingerprinting and if its usage is being obfuscated. We also analyze the top ranked...

  17. DATA MINING AND STATISTICS METHODS USAGE FOR ADVANCED TRAINING COURSES QUALITY MEASUREMENT: CASE STUDY

    Directory of Open Access Journals (Sweden)

    Maxim I. Galchenko

    2014-01-01

    Full Text Available In the article we consider a case of the analysis of the data connected with educational statistics, namely – result of professional development courses students survey with specialized software usage. Need for expanded statistical results processing, the scheme of carrying out the analysis is shown. Conclusions on a studied case are presented. 

  18. Big Data Mining of Energy Time Series for Behavioral Analytics and Energy Consumption Forecasting

    Directory of Open Access Journals (Sweden)

    Shailendra Singh

    2018-02-01

    Full Text Available Responsible, efficient and environmentally aware energy consumption behavior is becoming a necessity for the reliable modern electricity grid. In this paper, we present an intelligent data mining model to analyze, forecast and visualize energy time series to uncover various temporal energy consumption patterns. These patterns define the appliance usage in terms of association with time such as hour of the day, period of the day, weekday, week, month and season of the year as well as appliance-appliance associations in a household, which are key factors to infer and analyze the impact of consumers’ energy consumption behavior and energy forecasting trend. This is challenging since it is not trivial to determine the multiple relationships among different appliances usage from concurrent streams of data. Also, it is difficult to derive accurate relationships between interval-based events where multiple appliance usages persist for some duration. To overcome these challenges, we propose unsupervised data clustering and frequent pattern mining analysis on energy time series, and Bayesian network prediction for energy usage forecasting. We perform extensive experiments using real-world context-rich smart meter datasets. The accuracy results of identifying appliance usage patterns using the proposed model outperformed Support Vector Machine (SVM and Multi-Layer Perceptron (MLP at each stage while attaining a combined accuracy of 81.82%, 85.90%, 89.58% for 25%, 50% and 75% of the training data size respectively. Moreover, we achieved energy consumption forecast accuracies of 81.89% for short-term (hourly and 75.88%, 79.23%, 74.74%, and 72.81% for the long-term; i.e., day, week, month, and season respectively.

  19. Implementation of E-Service Intelligence in the Field of Web Mining

    OpenAIRE

    PROF. MS. S. P. SHINDE,; PROF. V.P.DESHMUKH

    2011-01-01

    The World Wide Web is a popular and interactive medium to disseminate information today .The web is huge, diverse, dynamic, widely distributed global information service centre. We are familiar with the terms like e-commerce, e-governance, e-market, e-finance, e-learning, e-banking etc. These terms come under online services called e-service applications. E-services involve various types of delivery systems, advanced information technologies, methodologies and applications of online services....

  20. Efficient Web Vulnerability Detection Tool for Sleeping Giant-Cross Site Request Forgery

    Science.gov (United States)

    Parimala, G.; Sangeetha, M.; AndalPriyadharsini, R.

    2018-04-01

    Now day’s web applications are very high in the rate of usage due to their user friendly environment and getting any information via internet but these web applications are affected by lot of threats. CSRF attack is one of the serious threats to web applications which is based on the vulnerabilities present in the normal web request and response of HTTP protocol. It is hard to detect but hence still it is present in most of the existing web applications. In CSRF attack, without user knowledge the unwanted actions on a reliable websites are forced to happen. So it is placed in OWASP’s top 10 Web Application attacks list. My proposed work is to do a real time scan of CSRF vulnerability attack in given URL of the web applications as well as local host address for any organization using python language. Client side detection of CSRF is depended on Form count which is presented in that given web site.

  1. Improving occupational safety in Kuzbass mines

    Energy Technology Data Exchange (ETDEWEB)

    Evseev, V S

    1986-08-01

    Some achievements of VostNII are listed in improving occupational safety in Kuzbass mines. Methane is a major problem: 90.6% of mines is in category III or supercategory; over 21% has an absolute methane emission of 30 m/sup 3//min or more. Another problem is spontaneous fires, which cost 2 million t of coal per year. One method of preventing these is injection of antipyrogens (urea and diammonium phosphate); another is the creation of gel (water glass, ammonium chloride and water) barriers in goaf areas. High pressure water jets are also used. Various methods of improving ventilation systems to match increased coal output are proposed, including drilling large diameter ventilation boreholes from the surface. In Leninskugol' mines the useful air is only 55.8% of the total delivered. More attention should be given to degassing (currently producing 130 million m/sup 3//y of methane). Dust levels are increasing due to the advent of narrow web cutter loaders (100% of coal cutter loaders in Kuzbass mines in 1984). Water injection and spraying are partially effective at dust suppression. Some electrical safety devices developed by VostNII are described.

  2. Social Web Identity Established upon Trust and Reputations

    Directory of Open Access Journals (Sweden)

    Rajni Goel

    2014-11-01

    Full Text Available Online social networks have become a seamless and critical online communication platform for personal interactions. They are a powerful tool that businesses are using to expand among domestic markets. The increase in participation in online social networking can and has caused damage to individuals and organizations, and the issuance of trust has become a concern on the social web. The factors determining the reputation of persons (customers in the real world may relate to the factors of reputation on the social web, though relative to how trust is established in the physical world, establishing trust on the social web can be fairly difficult. Determining how to trust another individual’s online social profile becomes critical in initiating any interaction on the social web. Rather than focusing on content on the social network page, this research proposes and examines the application of user reputations to determine whether the trust should be issued on the social web. A top-level framework to establish trust in an identity on the Social Network Sites (SNS as a function of the users’ associations, usage patterns and reputation on the social web is presented.

  3. An End User Development Approach for Mobile Web Augmentation

    Directory of Open Access Journals (Sweden)

    Gabriela Bosetti

    2017-01-01

    Full Text Available The trend towards mobile devices usage has made it possible for the Web to be conceived not only as an information space but also as a ubiquitous platform where users perform all kinds of tasks. In some cases, users access the Web with native mobile applications developed for well-known sites, such as, LinkedIn, Facebook, and Twitter. These native applications might offer further (e.g., location-based functionalities to their users in comparison with their corresponding Web sites because they were developed with mobile features in mind. However, many Web applications have no native counterpart and users access them using a mobile Web browser. Although the access to context information is not a complex issue nowadays, not all Web applications adapt themselves according to it or diversely improve the user experience by listening to a wide range of sensors. At some point, users might want to add mobile features to these Web sites, even if those features were not originally supported. In this paper, we present a novel approach to allow end users to augment their preferred Web sites with mobile features. We support our claims by presenting a framework for mobile Web augmentation, an authoring tool, and an evaluation with 21 end users.

  4. Application of the coal-mining waste in building ceramics production

    Directory of Open Access Journals (Sweden)

    Vaysman Yakov Iosifovich

    Full Text Available In the process of construction ceramics production a substantial quantity of non-renewable natural resources - clays - are used. One of the ways of science development in building materials production is investigation of the possibility of regular materials production using technogenic waste. Application of coal-mining waste (technogenic raw material in charge composition for production of ceramic products provides rational use of fuel, contributes to implementation of resource saving technologies on construction materials production enterprises. Though science development on revealing new raw material sources should be conducted with account for safety, reliability, technical, ecological and economical sides of the problem, which is especially current. The article deals with the problem of coal-mining waste usage in building ceramics production instead of fresh primary component (clay, fluxes, thinning agents and combustible additives. The interdependence between the density and shrinkage of the ceramic products and the amount and quality of coal-mining waste in its composition was established. The optimal proportion of coal-mining waste and clay in building ceramics production was estimated.

  5. E-Journal Metrics for Collection Management: Exploring Disciplinary Usage Differences in Scopus and Web of Science

    Directory of Open Access Journals (Sweden)

    Katherine Chew

    2016-04-01

    Full Text Available Objective – The purpose was to determine whether a relationship exists between journal downloads and either faculty authoring venue or citations to these faculty, or whether a relationship exists between journal rankings and local authoring venues or citations. A related purpose was to determine if any such relationship varied between or within disciplines. A final purpose was to determine if specific tools for ranking journals or indexing authorship and citation were demonstrably better than alternatives. Methods – Multiple years of journal usage, ranking, and citation data for twelve disciplines were combined in Excel, and the strength of relationships were determined using rank correlation coefficients. Results – The results illustrated marked disciplinary variation as to the degree that faculty decisions to download a journal article can be used as a proxy to predict which journals they will publish in or which journals will cite faculty’s work. While journal access requests show moderate to strong relationships with the journals in which faculty publish, as well as journals whose articles cite local faculty, the data suggest that Scopus may be the better resource to find such information for these journals in the health sciences and Web of Science may be the better resource for all other disciplines analyzed. The same can be said for the ability of external ranking mechanisms to predict faculty publishing behaviours. Eigenfactor is more predictive for both authoring and citing-by-others across most of the representative disciplines in the social sciences as well as the physical and natural sciences. With the health sciences, no clear pattern emerges. Conclusion – Collecting and correlating authorship and citation data allows patterns of use to emerge, resulting in a more accurate picture of use activity than the commonly used cost-per-use method. To find the best information on authoring activity by local faculty for subscribed

  6. Media Multitasking across Generations: Simultaneous Mobile Internet and Television Usage Behaviors and Motives

    OpenAIRE

    Yuhmiin Chang

    2015-01-01

    Simultaneous mobile internet and television usage has been getting very popular. Few, if any, studies explicated generational differences in this type of media multitasking behaviors. This study is the first to examine whether different generations have different behaviors and motives in the mobile internet-television media multitasking context. A national face-to-face survey with the probability proportional to size random sampling method was employed. The results showed that Web generation ...

  7. Safety concerning the alteration in fuel material usage (new installation of the uranium enrichment pilot plant) at Ningyo Pass Mine of Power Reactor and Nuclear Fuel Development Corporation

    International Nuclear Information System (INIS)

    1978-01-01

    A report of the Committee on Examination of Nuclear Fuel Safety was presented to the Atomic Energy Commission of Japan, which is concerned with the safety in the alteration of fuel material usage (new installation of the uranium enrichment pilot plant) at the Ningyo Pass Mine. Its safety was confirmed. The alteration, i.e. installation of the uranium enrichment pilot plant, is as follows. Intended for the overall test of centrifugal uranium enrichment technology, the pilot plant includes a two-storied main building of about 9,000 m 2 floor space, containing centrifuges, UF 6 equipment, etc., a uranium storage of about 1,000 m 2 floor space, and a waste water treatment facility, two-storied with about 300 m 2 floor space. The contents of the examination are safety of the facilities, criticality control, radiation control, waste treatment, and effects of accidents on the surrounding environment. (Mori, K

  8. Barriers to green supply chain management in Indian mining industries

    DEFF Research Database (Denmark)

    Muduli, K.; Govindan, Kannan; Barve, A.

    2013-01-01

    industries are increasingly implementing environmental management systems (EMS), cleaner production (CP), and adopting green supply chain management (GSCM) practices. GSCM focuses on a reduction of the adverse impacts of supply chain activities as well as a minimization of energy and material usage......A country's mining industry, despite its significant contributions to the country's economic growth, generally has a very poor public image because it is considered as a major environmental polluter. To acquire an improved social image, as well as to comply with government regulations, mining...... strength of the barriers will help decision makers rank them and decide a course of action that will make an optimum utilization of available resources during times of resource scarcity....

  9. Online Persistence in Higher Education Web-Supported Courses

    Science.gov (United States)

    Hershkovitz, Arnon; Nachmias, Rafi

    2011-01-01

    This research consists of an empirical study of online persistence in Web-supported courses in higher education, using Data Mining techniques. Log files of 58 Moodle websites accompanying Tel Aviv University courses were drawn, recording the activity of 1189 students in 1897 course enrollments during the academic year 2008/9, and were analyzed…

  10. Coal mine safety achievements in the USA and the contribution of NIOSH research

    Energy Technology Data Exchange (ETDEWEB)

    Esterhuizen, G.S.; Gurtunca, R.G. [NIOSH, Washington, DC (United States)

    2006-12-15

    Over the past century coal miner safety and health have seen tremendous improvements: the fatality and injury rates continue to decrease while productivity continues to increase. Many of the hazards that plagued miners in the past, such as coal bumps, methane and coal dust explosions, ground fall accidents and health issues have been significantly reduced. The contribution of NIOSH research includes products for prevention and survival of mine fires, methane control measures, design procedure for underground coal mines, methods for excavation surface controls, methods and procedures for blasting, laser usage in underground mines and prevention of electrocution from overhead power lines that have reduced accidents and injuries in underground coal mines. Health research has produced products such as the personal dust monitor, noise abating technologies and ergonomic solutions for equipment operators. Research priorities at NIOSH are set by considering surveillance statistics, stakeholder inputs and loss control principles. Future research in coal mining is directed towards respiratory diseases, noise-induced hearing loss, repetitive musculoskeletal injuries, traumatic injuries, falls of ground and mine disasters. The recent spate of accidents in coal mines resulted in the Miner Act of 2006, which includes a specific role for NIOSH in future mine safety research and development. The mine safety achievements in the USA reflect the commitment of industry, labour, government and research organizations to improving the safety of the mine worker.

  11. MinePath: Mining for Phenotype Differential Sub-paths in Molecular Pathways

    Science.gov (United States)

    Koumakis, Lefteris; Kartsaki, Evgenia; Chatzimina, Maria; Zervakis, Michalis; Vassou, Despoina; Marias, Kostas; Moustakis, Vassilis; Potamias, George

    2016-01-01

    Pathway analysis methodologies couple traditional gene expression analysis with knowledge encoded in established molecular pathway networks, offering a promising approach towards the biological interpretation of phenotype differentiating genes. Early pathway analysis methodologies, named as gene set analysis (GSA), view pathways just as plain lists of genes without taking into account either the underlying pathway network topology or the involved gene regulatory relations. These approaches, even if they achieve computational efficiency and simplicity, consider pathways that involve the same genes as equivalent in terms of their gene enrichment characteristics. Most recent pathway analysis approaches take into account the underlying gene regulatory relations by examining their consistency with gene expression profiles and computing a score for each profile. Even with this approach, assessing and scoring single-relations limits the ability to reveal key gene regulation mechanisms hidden in longer pathway sub-paths. We introduce MinePath, a pathway analysis methodology that addresses and overcomes the aforementioned problems. MinePath facilitates the decomposition of pathways into their constituent sub-paths. Decomposition leads to the transformation of single-relations to complex regulation sub-paths. Regulation sub-paths are then matched with gene expression sample profiles in order to evaluate their functional status and to assess phenotype differential power. Assessment of differential power supports the identification of the most discriminant profiles. In addition, MinePath assess the significance of the pathways as a whole, ranking them by their p-values. Comparison results with state-of-the-art pathway analysis systems are indicative for the soundness and reliability of the MinePath approach. In contrast with many pathway analysis tools, MinePath is a web-based system (www.minepath.org) offering dynamic and rich pathway visualization functionality, with the

  12. Influence of Information Product Quality on Informing Users: A Web Portal Context

    OpenAIRE

    Junghyun Nam

    2016-01-01

    Web portals have been used as information products to deliver personalized, feature-rich, and flexible information needs to Internet users. However, all portals are not equal. Most of them have relatively a small number of visitors, while a few capture the majority of surfers. This study seeks to uncover the factors that contribute the perceived quality of a general portal. Based on 21 factors derived from an extensive literature review on Information Product Quality (IPQ), web usage, and med...

  13. Genomics Portals: integrative web-platform for mining genomics data.

    Science.gov (United States)

    Shinde, Kaustubh; Phatak, Mukta; Johannes, Freudenberg M; Chen, Jing; Li, Qian; Vineet, Joshi K; Hu, Zhen; Ghosh, Krishnendu; Meller, Jaroslaw; Medvedovic, Mario

    2010-01-13

    A large amount of experimental data generated by modern high-throughput technologies is available through various public repositories. Our knowledge about molecular interaction networks, functional biological pathways and transcriptional regulatory modules is rapidly expanding, and is being organized in lists of functionally related genes. Jointly, these two sources of information hold a tremendous potential for gaining new insights into functioning of living systems. Genomics Portals platform integrates access to an extensive knowledge base and a large database of human, mouse, and rat genomics data with basic analytical visualization tools. It provides the context for analyzing and interpreting new experimental data and the tool for effective mining of a large number of publicly available genomics datasets stored in the back-end databases. The uniqueness of this platform lies in the volume and the diversity of genomics data that can be accessed and analyzed (gene expression, ChIP-chip, ChIP-seq, epigenomics, computationally predicted binding sites, etc), and the integration with an extensive knowledge base that can be used in such analysis. The integrated access to primary genomics data, functional knowledge and analytical tools makes Genomics Portals platform a unique tool for interpreting results of new genomics experiments and for mining the vast amount of data stored in the Genomics Portals backend databases. Genomics Portals can be accessed and used freely at http://GenomicsPortals.org.

  14. Patient Continued Use of Online Health Care Communities: Web Mining of Patient-Doctor Communication.

    Science.gov (United States)

    Wu, Bing

    2018-04-16

    In practice, online health communities have passed the adoption stage and reached the diffusion phase of development. In this phase, patients equipped with knowledge regarding the issues involved in health care are capable of switching between different communities to maximize their online health community activities. Online health communities employ doctors to answer patient questions, and high quality online health communities are more likely to be acknowledged by patients. Therefore, the factors that motivate patients to maintain ongoing relationships with online health communities must be addressed. However, this has received limited scholarly attention. The purpose of this study was to identify the factors that drive patients to continue their use of online health communities where doctor-patient communication occurs. This was achieved by integrating the information system success model with online health community features. A Web spider was used to download and extract data from one of the most authoritative Chinese online health communities in which communication occurs between doctors and patients. The time span analyzed in this study was from January 2017 to March 2017. A sample of 469 valid anonymous patients with 9667 posts was obtained (the equivalent of 469 respondents in survey research). A combination of Web mining and structural equation modeling was then conducted to test the research hypotheses. The results show that the research framework for integrating the information system success model and online health community features contributes to our understanding of the factors that drive patients' relationships with online health communities. The primary findings are as follows: (1) perceived usefulness is found to be significantly determined by three exogenous variables (ie, social support, information quality, and service quality; R 2 =0.88). These variables explain 87.6% of the variance in perceived usefulness of online health communities; (2

  15. Competence and Usage of Web 2.0 Technologies by Higher Education Faculty

    Science.gov (United States)

    Soomro, Kamal Ahmed; Zai, Sajid Yousuf; Jafri, Iftikhar Hussain

    2015-01-01

    Literature on Web 2.0 experiences of higher education faculty in developing countries such as Pakistan is very limited. An insight on awareness and practices of higher education faculty with these tools can be helpful to map strategies and plan of action for adopting latest technologies to support teaching-learning processes in higher education of…

  16. WISE-MD usage among millennial medical students.

    Science.gov (United States)

    Phitayakorn, Roy; Nick, Michael W; Alseidi, Adnan; Lind, David Scott; Sudan, Ranjan; Isenberg, Gerald; Capella, Jeannette; Hopkins, Mary A; Petrusa, Emil R

    2015-01-01

    E-learning is increasingly common in undergraduate medical education. Internet-based multimedia materials should be designed with millennial learner utilization preferences in mind for maximal impact. Medical students used all 20 Web Initiative for Surgical Education of Medical Doctors modules from July 1, 2013 to October 1, 2013. Data were analyzed for topic frequency, time and week day, and access to questions. Three thousand five hundred eighty-seven students completed 35,848 modules. Students accessed modules for average of 51 minutes. Most frequent use occurred on Sunday (23.1%), Saturday (15.4%), and Monday (14.3%). Friday had the least use (8.2%). A predominance of students accessed the modules between 7 and 10 PM (34.4%). About 80.4% of students accessed questions for at least one module. They completed an average of 40 ± 30 of the questions. Only 827 students (2.3%) repeated the questions. Web Initiative for Surgical Education of Medical Doctors has peak usage during the weekend and evenings. Most frequently used modules reflect core surgical problems. Multiple factors influence the manner module questions are accessed. Copyright © 2015 Elsevier Inc. All rights reserved.

  17. QuakeSim: a Web Service Environment for Productive Investigations with Earth Surface Sensor Data

    Science.gov (United States)

    Parker, J. W.; Donnellan, A.; Granat, R. A.; Lyzenga, G. A.; Glasscoe, M. T.; McLeod, D.; Al-Ghanmi, R.; Pierce, M.; Fox, G.; Grant Ludwig, L.; Rundle, J. B.

    2011-12-01

    The QuakeSim science gateway environment includes a visually rich portal interface, web service access to data and data processing operations, and the QuakeTables ontology-based database of fault models and sensor data. The integrated tools and services are designed to assist investigators by covering the entire earthquake cycle of strain accumulation and release. The Web interface now includes Drupal-based access to diverse and changing content, with new ability to access data and data processing directly from the public page, as well as the traditional project management areas that require password access. The system is designed to make initial browsing of fault models and deformation data particularly engaging for new users. Popular data and data processing include GPS time series with data mining techniques to find anomalies in time and space, experimental forecasting methods based on catalogue seismicity, faulted deformation models (both half-space and finite element), and model-based inversion of sensor data. The fault models include the CGS and UCERF 2.0 faults of California and are easily augmented with self-consistent fault models from other regions. The QuakeTables deformation data include the comprehensive set of UAVSAR interferograms as well as a growing collection of satellite InSAR data.. Fault interaction simulations are also being incorporated in the web environment based on Virtual California. A sample usage scenario is presented which follows an investigation of UAVSAR data from viewing as an overlay in Google Maps, to selection of an area of interest via a polygon tool, to fast extraction of the relevant correlation and phase information from large data files, to a model inversion of fault slip followed by calculation and display of a synthetic model interferogram.

  18. From Sensor to Observation Web with Environmental Enablers in the Future Internet

    Science.gov (United States)

    Havlik, Denis; Schade, Sven; Sabeur, Zoheir A.; Mazzetti, Paolo; Watson, Kym; Berre, Arne J.; Mon, Jose Lorenzo

    2011-01-01

    This paper outlines the grand challenges in global sustainability research and the objectives of the FP7 Future Internet PPP program within the Digital Agenda for Europe. Large user communities are generating significant amounts of valuable environmental observations at local and regional scales using the devices and services of the Future Internet. These communities’ environmental observations represent a wealth of information which is currently hardly used or used only in isolation and therefore in need of integration with other information sources. Indeed, this very integration will lead to a paradigm shift from a mere Sensor Web to an Observation Web with semantically enriched content emanating from sensors, environmental simulations and citizens. The paper also describes the research challenges to realize the Observation Web and the associated environmental enablers for the Future Internet. Such an environmental enabler could for instance be an electronic sensing device, a web-service application, or even a social networking group affording or facilitating the capability of the Future Internet applications to consume, produce, and use environmental observations in cross-domain applications. The term “envirofied” Future Internet is coined to describe this overall target that forms a cornerstone of work in the Environmental Usage Area within the Future Internet PPP program. Relevant trends described in the paper are the usage of ubiquitous sensors (anywhere), the provision and generation of information by citizens, and the convergence of real and virtual realities to convey understanding of environmental observations. The paper addresses the technical challenges in the Environmental Usage Area and the need for designing multi-style service oriented architecture. Key topics are the mapping of requirements to capabilities, providing scalability and robustness with implementing context aware information retrieval. Another essential research topic is handling

  19. From Sensor to Observation Web with environmental enablers in the Future Internet.

    Science.gov (United States)

    Havlik, Denis; Schade, Sven; Sabeur, Zoheir A; Mazzetti, Paolo; Watson, Kym; Berre, Arne J; Mon, Jose Lorenzo

    2011-01-01

    This paper outlines the grand challenges in global sustainability research and the objectives of the FP7 Future Internet PPP program within the Digital Agenda for Europe. Large user communities are generating significant amounts of valuable environmental observations at local and regional scales using the devices and services of the Future Internet. These communities' environmental observations represent a wealth of information which is currently hardly used or used only in isolation and therefore in need of integration with other information sources. Indeed, this very integration will lead to a paradigm shift from a mere Sensor Web to an Observation Web with semantically enriched content emanating from sensors, environmental simulations and citizens. The paper also describes the research challenges to realize the Observation Web and the associated environmental enablers for the Future Internet. Such an environmental enabler could for instance be an electronic sensing device, a web-service application, or even a social networking group affording or facilitating the capability of the Future Internet applications to consume, produce, and use environmental observations in cross-domain applications. The term "envirofied" Future Internet is coined to describe this overall target that forms a cornerstone of work in the Environmental Usage Area within the Future Internet PPP program. Relevant trends described in the paper are the usage of ubiquitous sensors (anywhere), the provision and generation of information by citizens, and the convergence of real and virtual realities to convey understanding of environmental observations. The paper addresses the technical challenges in the Environmental Usage Area and the need for designing multi-style service oriented architecture. Key topics are the mapping of requirements to capabilities, providing scalability and robustness with implementing context aware information retrieval. Another essential research topic is handling data

  20. From Sensor to Observation Web with Environmental Enablers in the Future Internet

    Directory of Open Access Journals (Sweden)

    Jose Lorenzo Mon

    2011-03-01

    Full Text Available This paper outlines the grand challenges in global sustainability research and the objectives of the FP7 Future Internet PPP program within the Digital Agenda for Europe. Large user communities are generating significant amounts of valuable environmental observations at local and regional scales using the devices and services of the Future Internet. These communities’ environmental observations represent a wealth of information which is currently hardly used or used only in isolation and therefore in need of integration with other information sources. Indeed, this very integration will lead to a paradigm shift from a mere Sensor Web to an Observation Web with semantically enriched content emanating from sensors, environmental simulations and citizens. The paper also describes the research challenges to realize the Observation Web and the associated environmental enablers for the Future Internet. Such an environmental enabler could for instance be an electronic sensing device, a web-service application, or even a social networking group affording or facilitating the capability of the Future Internet applications to consume, produce, and use environmental observations in cross-domain applications. The term “envirofied” Future Internet is coined to describe this overall target that forms a cornerstone of work in the Environmental Usage Area within the Future Internet PPP program. Relevant trends described in the paper are the usage of ubiquitous sensors (anywhere, the provision and generation of information by citizens, and the convergence of real and virtual realities to convey understanding of environmental observations. The paper addresses the technical challenges in the Environmental Usage Area and the need for designing multi-style service oriented architecture. Key topics are the mapping of requirements to capabilities, providing scalability and robustness with implementing context aware information retrieval. Another essential research

  1. The Inclusion of African-American Study Participants in Web-Based Research Studies: Viewpoint

    OpenAIRE

    Watson, Bekeela; Robinson, Dana H.Z; Harker, Laura; Arriola, Kimberly R. Jacob

    2016-01-01

    The use of Web-based methods for research recruitment and intervention delivery has greatly increased as Internet usage continues to grow. These Internet-based strategies allow for researchers to quickly reach more people. African-Americans are underrepresented in health research studies. Due to this, African-Americans get less benefit from important research that could address the disproportionate health outcomes they face. Web-based research studies are one promising way to engage more Afri...

  2. Towards the Development of Web-based Business intelligence Tools

    DEFF Research Database (Denmark)

    Georgiev, Lachezar; Tanev, Stoyan

    2011-01-01

    This paper focuses on using web search techniques in examining the co-creation strategies of technology driven firms. It does not focus on the co-creation results but describes the implementation of a software tool using data mining techniques to analyze the content on firms’ websites. The tool...

  3. A Study of the Demographics of Web-Based Health-Related Social Media Users.

    Science.gov (United States)

    Sadah, Shouq A; Shahbazi, Moloud; Wiley, Matthew T; Hristidis, Vagelis

    2015-08-06

    The rapid spread of Web-based social media in recent years has impacted how patients share health-related information. However, little work has studied the demographics of these users. Our aim was to study the demographics of users who participate in health-related Web-based social outlets to identify possible links to health care disparities. We analyze and compare three different types of health-related social outlets: (1) general Web-based social networks, Twitter and Google+, (2) drug review websites, and (3) health Web forums. We focus on the following demographic attributes: age, gender, ethnicity, location, and writing level. We build and evaluate domain-specific classifiers to infer missing data where possible. The estimated demographic statistics are compared against various baselines, such as Internet and social networks usage of the population. We found that (1) drug review websites and health Web forums are dominated by female users, (2) the participants of health-related social outlets are generally older with the exception of the 65+ years bracket, (3) blacks are underrepresented in health-related social networks, (4) users in areas with better access to health care participate more in Web-based health-related social outlets, and (5) the writing level of users in health-related social outlets is significantly lower than the reading level of the population. We identified interesting and actionable disparities in the participation of various demographic groups to various types of health-related social outlets. These disparities are significantly distinct from the disparities in Internet usage or general social outlets participation.

  4. PERANCANGAN SISTEM PREDIKSI CHURN PELANGGAN PT. TELEKOMUNIKASI SELULER DENGAN MEMANFAATKAN PROSES DATA MINING

    Directory of Open Access Journals (Sweden)

    Rajesri Govindaraju

    2008-01-01

    Full Text Available The purpose of this research is to design a customer churn prediction system using data mining approach. This system is able to perform data integration, data cleaning, data transformation, sampling and data splitting, prediction model building, predicting customer churn, and show the results in certain agreed forms. Churn prediction variables were identified based on earlier research reports that include customer information, payment method, call pattern, complaint data, telecommunication services usage and change of telecommunication services usage behavior data. The preferred mining technique used is the classification with decision tree algorithm. The decision tree can present visual model which represents customer churn and non churn pattern behavior. This system was tested using Kartu Halo customer data in Bandung area and testing result showed 70,94% accuracy of the prediction model. Abstract in Bahasa Indonesia : Penelitian ini bertujuan merancang sistem prediksi churn pelanggan yang memanfaatkan proses data mining. Sistem yang dihasilkan dapat melakukan integrasi data, pembersihan data, transformasi data, sampling dan pemisahan data, konstruksi model prediksi, memprediksi churn pelanggan dan menampilkan hasil prediksi dalam format laporan tertentu yang diperlukan. Identifikasi variabel-variabel prediksi churn dilakukan berdasarkan model prediksi churn yang telah dikembangkan pada penelitian terdahulu yang antara lain mencakup informasi mengenai pelanggan, metode pembayaran, data percakapan, data penggunaan jenis-jenis layanan telekomunikasi dan data yang menggambarkan perubahan perilaku penggunaan layanan telekomunikasi tersebut. Teknik mining yang dipilih adalah teknik klasifikasi dengan algoritma decision tree. Decision tree menghasilkan model visual yang merepresentasikan pola perilaku pelanggan yang churn dan tidak churn. Uji coba sistem yang dilakukan menggunakan data pelanggan Kartu Halo daerah Bandung menghasilkan tingkat akurasi

  5. A Balanced Approach to Capturing User Requirements in Business- to- Consumer Web Information Systems

    Directory of Open Access Journals (Sweden)

    M. S. Lane

    2001-11-01

    Full Text Available The development of business-to-consumer web information systems pose special challenges in the requirements analysis phase. It is difficult to capture user requirements given that users are relatively autonomous and anonymous and there are no major incentives for users to become involved in the development of a web information system. The researchers reviewed traditional requirement elicitation techniques, marketing research techniques and web usage analysis techniques. Current practice was assessed and the findings suggest that a balanced approach to user requirements capture will result in more complete and user centred requirements. This approach should lead to more effective business-to consumer web information systems.

  6. Technical note: real-time web-based wireless visual guidance system for radiotherapy.

    Science.gov (United States)

    Lee, Danny; Kim, Siyong; Palta, Jatinder R; Kim, Taeho

    2017-06-01

    Describe a Web-based wireless visual guidance system that mitigates issues associated with hard-wired audio-visual aided patient interactive motion management systems that are cumbersome to use in routine clinical practice. Web-based wireless visual display duplicates an existing visual display of a respiratory-motion management system for visual guidance. The visual display of the existing system is sent to legacy Web clients over a private wireless network, thereby allowing a wireless setting for real-time visual guidance. In this study, active breathing coordinator (ABC) trace was used as an input for visual display, which captured and transmitted to Web clients. Virtual reality goggles require two (left and right eye view) images for visual display. We investigated the performance of Web-based wireless visual guidance by quantifying (1) the network latency of visual displays between an ABC computer display and Web clients of a laptop, an iPad mini 2 and an iPhone 6, and (2) the frame rate of visual display on the Web clients in frames per second (fps). The network latency of visual display between the ABC computer and Web clients was about 100 ms and the frame rate was 14.0 fps (laptop), 9.2 fps (iPad mini 2) and 11.2 fps (iPhone 6). In addition, visual display for virtual reality goggles was successfully shown on the iPhone 6 with 100 ms and 11.2 fps. A high network security was maintained by utilizing the private network configuration. This study demonstrated that a Web-based wireless visual guidance can be a promising technique for clinical motion management systems, which require real-time visual display of their outputs. Based on the results of this study, our approach has the potential to reduce clutter associated with wired-systems, reduce space requirements, and extend the use of medical devices from static usage to interactive and dynamic usage in a radiotherapy treatment vault.

  7. Development of Database for Accident Analysis in Indian Mines

    Science.gov (United States)

    Tripathy, Debi Prasad; Guru Raghavendra Reddy, K.

    2016-10-01

    Mining is a hazardous industry and high accident rates associated with underground mining is a cause of deep concern. Technological developments notwithstanding, rate of fatal accidents and reportable incidents have not shown corresponding levels of decline. This paper argues that adoption of appropriate safety standards by both mine management and the government may result in appreciable reduction in accident frequency. This can be achieved by using the technology in improving the working conditions, sensitising workers and managers about causes and prevention of accidents. Inputs required for a detailed analysis of an accident include information on location, time, type, cost of accident, victim, nature of injury, personal and environmental factors etc. Such information can be generated from data available in the standard coded accident report form. This paper presents a web based application for accident analysis in Indian mines during 2001-2013. An accident database (SafeStat) prototype based on Intranet of the TCP/IP agreement, as developed by the authors, is also discussed.

  8. Lessons learned: the effect of prior technology use on Web-based interventions.

    Science.gov (United States)

    Carey, Joanne C; Wade, Shari L; Wolfe, Christopher R

    2008-04-01

    This study examined the role of regular prior technology use in treatment response to an online family problem-solving (OFPS) intervention and an Internet resource intervention (IRI) for pediatric traumatic brain injury (TBI). Participants were 150 individuals in 40 families of children with TBI randomly assigned to OFPS intervention or an IRI. All families received free computers and Internet access to TBI resources. OFPS families received Web-based sessions and therapist-guided synchronous videoconferences focusing on problem solving, communication skills, and behavior management. All participants completed measures of depression, anxiety, and computer usage. OFPS participants rated treatment satisfaction, therapeutic alliance, and Web site and technology comfort. With the OFPS intervention, depression and anxiety improved significantly more among technology using parents (n = 14) than nontechnology users (n = 6). Technology users reported increasing comfort with technology over time, and this change was predictive of depression at followup. Satisfaction and ease-of-use ratings did not differ by technology usage. Lack of regular prior home computer usage and nonadherence were predictive of anxiety at followup. The IRI was not globally effective. However, controlling for prior depression, age, and technology at work, there was a significant effect of technology at home for depression. Families with technology experience at home (n = 11) reported significantly greater improvements in depression than families without prior technology experience at home (n = 8). Although Web-based OFPS was effective in improving caregiver functioning, individuals with limited computer experience may benefit less from an online intervention due to increased nonadherence.

  9. Usage Center

    DEFF Research Database (Denmark)

    Kleinaltenkamp, Michael; Plewa, Carolin; Gudergan, Siegfried

    2017-01-01

    Purpose: The purpose of this paper is to advance extant theorizing around resourceintegration by conceptualizing and delineating the notion of a usage center. Ausage center consists of a combination of interdependent actors that draw onresources across their individual usage processes to create v...

  10. Nonblocking Scheduling for Web Service Transactions

    DEFF Research Database (Denmark)

    Alrifai, Mohammad; Balke, Wolf-Tilo; Dolog, Peter

    2007-01-01

    . In this paper, we propose a novel nonblocking scheduling mechanism that is used prior to the actual service invocations. Its aim is to reach an agreement between the client and all participating providers on what transaction processing times have to be expected, accepted, and guaranteed. This enables service......For improved flexibility and concurrent usage existing transaction management models for Web services relax the isolation property of Web service-based transactions. Correctness of the concurrent execution then has to be ensured by commit order-preserving transaction schedulers. However, local...... schedulers of service providers typically do take into account neither time constraints for committing the whole transaction, nor the individual services' constraints when scheduling decisions are made. This often leads to an unnecessary blocking of transactions by (possibly long-running) others...

  11. 78 FR 77706 - Notice of Intent To Prepare an Environmental Impact Statement for the Proposed Gemfield Mine...

    Science.gov (United States)

    2013-12-24

    ... gold mine and associated processing and ancillary facilities. The project would be located on public... media, newspapers and the BLM Web site at: http://www.blm.gov/nv/st/en/fo/battle_mountain_field.html... to construct, operate, reclaim, and close an open pit, heap leach, gold mining operation known as the...

  12. Genomics Portals: integrative web-platform for mining genomics data

    Directory of Open Access Journals (Sweden)

    Ghosh Krishnendu

    2010-01-01

    Full Text Available Abstract Background A large amount of experimental data generated by modern high-throughput technologies is available through various public repositories. Our knowledge about molecular interaction networks, functional biological pathways and transcriptional regulatory modules is rapidly expanding, and is being organized in lists of functionally related genes. Jointly, these two sources of information hold a tremendous potential for gaining new insights into functioning of living systems. Results Genomics Portals platform integrates access to an extensive knowledge base and a large database of human, mouse, and rat genomics data with basic analytical visualization tools. It provides the context for analyzing and interpreting new experimental data and the tool for effective mining of a large number of publicly available genomics datasets stored in the back-end databases. The uniqueness of this platform lies in the volume and the diversity of genomics data that can be accessed and analyzed (gene expression, ChIP-chip, ChIP-seq, epigenomics, computationally predicted binding sites, etc, and the integration with an extensive knowledge base that can be used in such analysis. Conclusion The integrated access to primary genomics data, functional knowledge and analytical tools makes Genomics Portals platform a unique tool for interpreting results of new genomics experiments and for mining the vast amount of data stored in the Genomics Portals backend databases. Genomics Portals can be accessed and used freely at http://GenomicsPortals.org.

  13. TDCCREC: AN EFFICIENT AND SCALABLE WEB-BASED RECOMMENDATION SYSTEM

    Directory of Open Access Journals (Sweden)

    K.Latha

    2010-10-01

    Full Text Available Web browsers are provided with complex information space where the volume of information available to them is huge. There comes the Recommender system which effectively recommends web pages that are related to the current webpage, to provide the user with further customized reading material. To enhance the performance of the recommender systems, we include an elegant proposed web based recommendation system; Truth Discovery based Content and Collaborative RECommender (TDCCREC which is capable of addressing scalability. Existing approaches such as Learning automata deals with usage and navigational patterns of users. On the other hand, Weighted Association Rule is applied for recommending web pages by assigning weights to each page in all the transactions. Both of them have their own disadvantages. The websites recommended by the search engines have no guarantee for information correctness and often delivers conflicting information. To solve them, content based filtering and collaborative filtering techniques are introduced for recommending web pages to the active user along with the trustworthiness of the website and confidence of facts which outperforms the existing methods. Our results show how the proposed recommender system performs better in predicting the next request of web users.

  14. Data Mining Thesis Topics in Finland

    OpenAIRE

    Bajo Rouvinen, Ari

    2017-01-01

    The Theseus open repository contains metadata about more than 100,000 thesis publications from the different universities of applied sciences in Finland. Different data mining techniques were applied to the Theseus dataset to build a web application to explore thesis topics and degree programmes using different libraries in Python and JavaScript. Thesis topics were extracted from manually annotated keywords by the authors and curated subjects by the librarians. During the project, the quality...

  15. Energy Monitoring System Berbasis Web

    Directory of Open Access Journals (Sweden)

    Novan Zulkarnain

    2013-12-01

    Full Text Available Government through the Ministry of Energy and Mineral Resources (ESDM encourages the energy savings at whole buildings in Indonesia. Energy Monitoring System (EMS is a web-based solution to monitor energy usage in a building. The research methods used are the analysis, prototype design and testing. EMSconsists of hardware which consists of electrical sensors, temperature-humidity sensor, and a computer. Data on EMS are designed using Modbus protocol, stored in MySQL database application, and displayed on charts through Dashboard on LED TV using PHP programming.

  16. Content and Form Anaysis of the Web Sites of University Libraries: A study on the Case in Turkey

    Directory of Open Access Journals (Sweden)

    Mesut Kurulgan

    2006-06-01

    Full Text Available Internet is an important medium in the process of development of information and information technologies. University library web sites are used by many users to reach information. The speed, ease and efficiency of library web site usage contributes to users' satisfaction. This study compares library web sites of state universities to the foundation universities in terms ofform and content. Evaluation criteria obtained through content analysis is measured by visiting each library Web site and measures are given as frequency distribution and percentage analysis. The study concludes that library web sites of state universities use the Internet opportunities more effectively than the library web sites of foundation universities.

  17. mORCA: ubiquitous access to life science web services.

    Science.gov (United States)

    Diaz-Del-Pino, Sergio; Trelles, Oswaldo; Falgueras, Juan

    2018-01-16

    Technical advances in mobile devices such as smartphones and tablets have produced an extraordinary increase in their use around the world and have become part of our daily lives. The possibility of carrying these devices in a pocket, particularly mobile phones, has enabled ubiquitous access to Internet resources. Furthermore, in the life sciences world there has been a vast proliferation of data types and services that finish as Web Services. This suggests the need for research into mobile clients to deal with life sciences applications for effective usage and exploitation. Analysing the current features in existing bioinformatics applications managing Web Services, we have devised, implemented, and deployed an easy-to-use web-based lightweight mobile client. This client is able to browse, select, compose parameters, invoke, and monitor the execution of Web Services stored in catalogues or central repositories. The client is also able to deal with huge amounts of data between external storage mounts. In addition, we also present a validation use case, which illustrates the usage of the application while executing, monitoring, and exploring the results of a registered workflow. The software its available in the Apple Store and Android Market and the source code is publicly available in Github. Mobile devices are becoming increasingly important in the scientific world due to their strong potential impact on scientific applications. Bioinformatics should not fall behind this trend. We present an original software client that deals with the intrinsic limitations of such devices and propose different guidelines to provide location-independent access to computational resources in bioinformatics and biomedicine. Its modular design makes it easily expandable with the inclusion of new repositories, tools, types of visualization, etc.

  18. Understanding usage of a hybrid website and smartphone app for weight management: a mixed-methods study.

    Science.gov (United States)

    Morrison, Leanne G; Hargood, Charlie; Lin, Sharon Xiaowen; Dennison, Laura; Joseph, Judith; Hughes, Stephanie; Michaelides, Danius T; Johnston, Derek; Johnston, Marie; Michie, Susan; Little, Paul; Smith, Peter Wf; Weal, Mark J; Yardley, Lucy

    2014-10-22

    Advancements in mobile phone technology offer huge potential for enhancing the timely delivery of health behavior change interventions. The development of smartphone-based health interventions (apps) is a rapidly growing field of research, yet there have been few longitudinal examinations of how people experience and use these apps within their day-to-day routines, particularly within the context of a hybrid Web- and app-based intervention. This study used an in-depth mixed-methods design to examine individual variation in (1) impact on self-reported goal engagement (ie, motivation, self-efficacy, awareness, effort, achievement) of access to a weight management app (POWeR Tracker) when provided alongside a Web-based weight management intervention (POWeR) and (2) usage and views of POWeR Tracker. Thirteen adults were provided access to POWeR and were monitored over a 4-week period. Access to POWeR Tracker was provided in 2 alternate weeks (ie, weeks 1 and 3 or weeks 2 and 4). Participants' goal engagement was measured daily via self-report. Mixed effects models were used to examine change in goal engagement between the weeks when POWeR Tracker was and was not available and whether the extent of change in goal engagement varied between individual participants. Usage of POWeR and POWeR Tracker was automatically recorded for each participant. Telephone interviews were conducted and analyzed using inductive thematic analysis to further explore participants' experiences using POWeR and POWeR Tracker. Access to POWeR Tracker was associated with a significant increase in participants' awareness of their eating (β1=0.31, P=.04) and physical activity goals (β1=0.28, P=.03). The level of increase varied between individual participants. Usage data showed that participants used the POWeR website for similar amounts of time during the weeks when POWeR Tracker was (mean 29 minutes, SD 31 minutes) and was not available (mean 27 minutes, SD 33 minutes). POWeR Tracker was mostly

  19. The Quality of Academic Library Building Improvements Has a Positive Impact on Library Usage. A review of: Shill, Harold B. and Shawn Tonner. “Does the Building Still Matter? Usage Patterns in New, Expanded, and Renovated Libraries, 1995‐2002.” College & Research Libraries 65.2 (Mar.2004): 123-150.

    OpenAIRE

    Julie McKenna

    2006-01-01

    Objective – To measure the impact of academic library facility improvements on physical library usage. Design – The facility improvement data used for this study were previously collected through a 68-item Web survey for the companion article “Creating a Better Place: Physical Improvements in Academic Libraries, 1995-2002” (Shill and Tonner). The measurement of library usage was by exit gate counts before and after library improvements. Setting – American academic libraries in wh...

  20. Design and development of a web-enabled data mining system ...

    Indian Academy of Sciences (India)

    Abstract. With the advent of cost effective storage systems and high speed net- ... All the other advantages of a web-based application such as security, reliability and ..... Fowler M 2004 Inversion of control containers and the injection pattern.

  1. PLAN: a web platform for automating high-throughput BLAST searches and for managing and mining results.

    Science.gov (United States)

    He, Ji; Dai, Xinbin; Zhao, Xuechun

    2007-02-09

    BLAST searches are widely used for sequence alignment. The search results are commonly adopted for various functional and comparative genomics tasks such as annotating unknown sequences, investigating gene models and comparing two sequence sets. Advances in sequencing technologies pose challenges for high-throughput analysis of large-scale sequence data. A number of programs and hardware solutions exist for efficient BLAST searching, but there is a lack of generic software solutions for mining and personalized management of the results. Systematically reviewing the results and identifying information of interest remains tedious and time-consuming. Personal BLAST Navigator (PLAN) is a versatile web platform that helps users to carry out various personalized pre- and post-BLAST tasks, including: (1) query and target sequence database management, (2) automated high-throughput BLAST searching, (3) indexing and searching of results, (4) filtering results online, (5) managing results of personal interest in favorite categories, (6) automated sequence annotation (such as NCBI NR and ontology-based annotation). PLAN integrates, by default, the Decypher hardware-based BLAST solution provided by Active Motif Inc. with a greatly improved efficiency over conventional BLAST software. BLAST results are visualized by spreadsheets and graphs and are full-text searchable. BLAST results and sequence annotations can be exported, in part or in full, in various formats including Microsoft Excel and FASTA. Sequences and BLAST results are organized in projects, the data publication levels of which are controlled by the registered project owners. In addition, all analytical functions are provided to public users without registration. PLAN has proved a valuable addition to the community for automated high-throughput BLAST searches, and, more importantly, for knowledge discovery, management and sharing based on sequence alignment results. The PLAN web interface is platform

  2. PLAN: a web platform for automating high-throughput BLAST searches and for managing and mining results

    Directory of Open Access Journals (Sweden)

    Zhao Xuechun

    2007-02-01

    Full Text Available Abstract Background BLAST searches are widely used for sequence alignment. The search results are commonly adopted for various functional and comparative genomics tasks such as annotating unknown sequences, investigating gene models and comparing two sequence sets. Advances in sequencing technologies pose challenges for high-throughput analysis of large-scale sequence data. A number of programs and hardware solutions exist for efficient BLAST searching, but there is a lack of generic software solutions for mining and personalized management of the results. Systematically reviewing the results and identifying information of interest remains tedious and time-consuming. Results Personal BLAST Navigator (PLAN is a versatile web platform that helps users to carry out various personalized pre- and post-BLAST tasks, including: (1 query and target sequence database management, (2 automated high-throughput BLAST searching, (3 indexing and searching of results, (4 filtering results online, (5 managing results of personal interest in favorite categories, (6 automated sequence annotation (such as NCBI NR and ontology-based annotation. PLAN integrates, by default, the Decypher hardware-based BLAST solution provided by Active Motif Inc. with a greatly improved efficiency over conventional BLAST software. BLAST results are visualized by spreadsheets and graphs and are full-text searchable. BLAST results and sequence annotations can be exported, in part or in full, in various formats including Microsoft Excel and FASTA. Sequences and BLAST results are organized in projects, the data publication levels of which are controlled by the registered project owners. In addition, all analytical functions are provided to public users without registration. Conclusion PLAN has proved a valuable addition to the community for automated high-throughput BLAST searches, and, more importantly, for knowledge discovery, management and sharing based on sequence alignment results

  3. Using Web 2.0 applications to promote health-related physical activity: findings from the WALK 2.0 randomised controlled trial.

    Science.gov (United States)

    Kolt, Gregory S; Rosenkranz, Richard R; Vandelanotte, Corneel; Caperchione, Cristina M; Maeder, Anthony J; Tague, Rhys; Savage, Trevor N; Van, Itallie Anetta; Mummery, W Kerry; Oldmeadow, Christopher; Duncan, Mitch J

    2017-10-01

    Web 2.0 internet technology has great potential in promoting physical activity. This trial investigated the effectiveness of a Web 2.0-based intervention on physical activity behaviour, and the impact on website usage and engagement. 504 (328 women, 126 men) insufficiently active adult participants were randomly allocated to one of two web-based interventions or a paper-based Logbook group. The Web 1.0 group participated in the existing 10 000 Steps programme, while the Web 2.0 group participated in a Web 2.0-enabled physical activity intervention including user-to-user interaction through social networking capabilities. ActiGraph GT3X activity monitors were used to assess physical activity at four points across the intervention (0, 3, 12 and 18 months), and usage and engagement were assessed continuously through website usage statistics. Treatment groups differed significantly in trajectories of minutes/day of physical activity (p=0.0198), through a greater change at 3 months for Web 2.0 than Web 1.0 (7.3 min/day, 95% CI 2.4 to 12.3). In the Web 2.0 group, physical activity increased at 3 (mean change 6.8 min/day, 95% CI 3.9 to 9.6) and 12 months (3.8 min/day, 95% CI 0.5 to 7.0), but not 18 months. The Logbook group also increased physical activity at 3 (4.8 min/day, 95% CI 1.8 to 7.7) and 12 months (4.9 min/day, 95% CI 0.7 to 9.1), but not 18 months. The Web 1.0 group increased physical activity at 12 months only (4.9 min/day, 95% CI 0.5 to 9.3). The Web 2.0 group demonstrated higher levels of website engagement (p=0.3964). In comparison to a Web 1.0 intervention, a more interactive Web 2.0 intervention, as well as the paper-based Logbook intervention, improved physical activity in the short term, but that effect reduced over time, despite higher levels of engagement of the Web 2.0 group. ACTRN12611000157976. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to

  4. Extraction panel guidelines for high production underground auger mining in Australian conditions

    Energy Technology Data Exchange (ETDEWEB)

    Paul Buddery; David Hill [Strata Engineering (Australia)

    2004-09-15

    The project involved monitoring ground behaviour during augering, with the intention of monitoring several sites with varying geotechnical environments and developing guidelines from these to assist in future layout design. This approach is appropriate where the mining layout involves the complex interaction of several components that cannot be readily simplified to the extent necessary for numerical or physical models to play the primary role. Only one site was secured within the project time frame. Consequently, the project has utilised the results from a Southern Colliery augering trial, coupled to the outcomes of numerical and physical modelling tests. The auger mining operations themselves were carried out by a Joint Venture (Coal Recovery Australia Pty Ltd) between Cutting Edge Technology Pty Ltd and SBD Services Pty Ltd. The underground trial indicated that empirical design methodologies involving pillar strength equations coupled to abutment angle models can be used to design stable augering layouts. Although the designed hole configuration was not fully achieved, there is, a suggestion that a layout so determined will be conservative, holding out the possibility of future optimisation on the basis of actual performance. Monitoring and re-appraisal in the context of a formal strata management process are critical to the success of any such approach, particularly in terms of optimisation. The two-dimensional UDEC numerical modelling code was used to model augering webs, but seemed to underestimate the stability of an auger mining panel, while over -estimating the strength of individual auger webs. Physical tests appeared to give a realistic quantification of the size effect. The tests suggest that determining the strength of an hourglass web by increasing the strength of an equivalent rectangular web by 25% would be a justifiable step at this stage.

  5. Data Mining Application in Customer Relationship Management for Hospital Inpatients

    OpenAIRE

    Lee, Eun Whan

    2012-01-01

    Objectives This study aims to discover patients loyal to a hospital and model their medical service usage patterns. Consequently, this study proposes a data mining application in customer relationship management (CRM) for hospital inpatients. Methods A recency, frequency, monetary (RFM) model has been applied toward 14,072 patients discharged from a university hospital. Cluster analysis was conducted to segment customers, and it modeled the patterns of the loyal customers' medical services us...

  6. Impacts of gold mine waste disposal on a tropical pelagic ecosystem

    International Nuclear Information System (INIS)

    Brewer, D.T.; Morello, E.B.; Griffiths, S.; Fry, G.; Heales, D.; Apte, S.C.; Venables, W.N.; Rothlisberg, P.C.; Moeseneder, C.; Lansdell, M.; Pendrey, R.; Coman, F.

    2012-01-01

    Highlights: ► We investigate the impact of gold mine tailings disposal into the sea. ► We use a comparative impact-control approach. ► Similar abundance and diversity of zooplankton and micronekton at mine and control. ► High metal concentrations and biomagnification evident in lower trophic levels only. ► No differences in metal concentrations of fish at mine and control. - Abstract: We used a comparative approach to investigate the impact of the disposal of gold mine tailings into the ocean near the Lihir mine (Niolam Island, Papua New Guinea). We found abundance and diversity of zooplankton, micronekton and pelagic fish to be similar or higher in the mine region compared to the reference site. We also found relatively high trace metal concentrations in lower trophic level groups, especially zooplankton, near the mine discharge, but few differences in tissue concentrations of micronekton, baitfish and pelagic fish between the two regions. Biomagnification of some trace metals by micronekton, and of mercury by fish was evident in both regions. We conclude that ocean mine waste disposal at Niolam Island has a local impact on the smaller and less mobile pelagic communities in terms of trace metal concentrations, but has little effect on the abundance and biodiversity of the local food web.

  7. CoP Sensing Framework on Web-Based Environment

    Science.gov (United States)

    Mustapha, S. M. F. D. Syed

    The Web technologies and Web applications have shown similar high growth rate in terms of daily usages and user acceptance. The Web applications have not only penetrated in the traditional domains such as education and business but have also encroached into areas such as politics, social, lifestyle, and culture. The emergence of Web technologies has enabled Web access even to the person on the move through PDAs or mobile phones that are connected using Wi-Fi, HSDPA, or other communication protocols. These two phenomena are the inducement factors toward the need of building Web-based systems as the supporting tools in fulfilling many mundane activities. In doing this, one of the many focuses in research has been to look at the implementation challenges in building Web-based support systems in different types of environment. This chapter describes the implementation issues in building the community learning framework that can be supported on the Web-based platform. The Community of Practice (CoP) has been chosen as the community learning theory to be the case study and analysis as it challenges the creativity of the architectural design of the Web system in order to capture the presence of learning activities. The details of this chapter describe the characteristics of the CoP to understand the inherent intricacies in modeling in the Web-based environment, the evidences of CoP that need to be traced automatically in a slick manner such that the evidence-capturing process is unobtrusive, and the technologies needed to embrace a full adoption of Web-based support system for the community learning framework.

  8. Feature Usage Explorer: Usage Monitoring and Visualization Tool in HTML5 Based Applications

    Directory of Open Access Journals (Sweden)

    Sarunas Marciuska

    2013-10-01

    Full Text Available Feature Usage Explorer is a JavaScript library, which automatically detects features in HTML5 based applications and monitors their usage. The collected information can be visualized in a Feature Usage Diagram, which is automatically generated from an input json file. Currently, the users of Feature Usage Explorer have to design their own tool in order to generate the json file from collected usage information. This option remains viable when using the library in order not to constraint the user’s choice of preferred data storage. Feature Usage Explorer can be reused in any HTML5 based applications where an understanding of how users interact with the system is required (i.e. user experience and usability studies, human computer interaction field, or requirement prioritization area.

  9. Analyzing engagement in a web-based intervention platform through visualizing log-data.

    Science.gov (United States)

    Morrison, Cecily; Doherty, Gavin

    2014-11-13

    Engagement has emerged as a significant cross-cutting concern within the development of Web-based interventions. There have been calls to institute a more rigorous approach to the design of Web-based interventions, to increase both the quantity and quality of engagement. One approach would be to use log-data to better understand the process of engagement and patterns of use. However, an important challenge lies in organizing log-data for productive analysis. Our aim was to conduct an initial exploration of the use of visualizations of log-data to enhance understanding of engagement with Web-based interventions. We applied exploratory sequential data analysis to highlight sequential aspects of the log data, such as time or module number, to provide insights into engagement. After applying a number of processing steps, a range of visualizations were generated from the log-data. We then examined the usefulness of these visualizations for understanding the engagement of individual users and the engagement of cohorts of users. The visualizations created are illustrated with two datasets drawn from studies using the SilverCloud Platform: (1) a small, detailed dataset with interviews (n=19) and (2) a large dataset (n=326) with 44,838 logged events. We present four exploratory visualizations of user engagement with a Web-based intervention, including Navigation Graph, Stripe Graph, Start-Finish Graph, and Next Action Heat Map. The first represents individual usage and the last three, specific aspects of cohort usage. We provide examples of each with a discussion of salient features. Log-data analysis through data visualization is an alternative way of exploring user engagement with Web-based interventions, which can yield different insights than more commonly used summative measures. We describe how understanding the process of engagement through visualizations can support the development and evaluation of Web-based interventions. Specifically, we show how visualizations

  10. PDBj Mine: design and implementation of relational database interface for Protein Data Bank Japan.

    Science.gov (United States)

    Kinjo, Akira R; Yamashita, Reiko; Nakamura, Haruki

    2010-08-25

    This article is a tutorial for PDBj Mine, a new database and its interface for Protein Data Bank Japan (PDBj). In PDBj Mine, data are loaded from files in the PDBMLplus format (an extension of PDBML, PDB's canonical XML format, enriched with annotations), which are then served for the user of PDBj via the worldwide web (WWW). We describe the basic design of the relational database (RDB) and web interfaces of PDBj Mine. The contents of PDBMLplus files are first broken into XPath entities, and these paths and data are indexed in the way that reflects the hierarchical structure of the XML files. The data for each XPath type are saved into the corresponding relational table that is named as the XPath itself. The generation of table definitions from the PDBMLplus XML schema is fully automated. For efficient search, frequently queried terms are compiled into a brief summary table. Casual users can perform simple keyword search, and 'Advanced Search' which can specify various conditions on the entries. More experienced users can query the database using SQL statements which can be constructed in a uniform manner. Thus, PDBj Mine achieves a combination of the flexibility of XML documents and the robustness of the RDB. Database URL: http://www.pdbj.org/

  11. Mine railway equipments management information system

    Energy Technology Data Exchange (ETDEWEB)

    Zhang, X.; Han, K.; Duan, T.; Liu, Z.; Lu, H. [China University of Mining and Technology, Xuzhou (China)

    2007-06-15

    Based on client/server and browser/server models, the management information system described realized the entire life-cycle management of mine railway equipment which included universal equipment and special equipment in the locomotive depot, track maintenance division, electrical depot and car depot. The system has other online functions such as transmitting reports, graphics management, statistics, searches, graphics wizard and web propaganda. It was applied in Pingdingshan Coal Co. Ltd.'s Railway Transport Department. 5 refs., 4 figs.

  12. Virtual Reality in Presentation of the Underground Mine Technological Process

    Directory of Open Access Journals (Sweden)

    Kodym Oldøich

    2003-09-01

    Full Text Available Virtual Reality in Presentation of the Underground Mine Technological Process focuses on methods of presentation of an underground mine technologies in intranet technology. It shows usage of platform independent VRML client for presentation of static and dynamic information about technological process. Bi-directional interactions between client and process information database are solved.Based on analysis of technological process of underground mine a database structure was designed. It is skeleton for storing all information about any underground mine. This skeleton can be modified in any direction. Data in this "static model" of underground mine can be applied for visualization in VRML environment. In this way it is possible to simplify and unify a user's front-end for all kinds of tasks.All designed scenes can be interactively displayed in full view or in any detail view, so that a user is able to recognize every important part of installed equipment, its stage, technical parameters and other information. If manufacturers of mining equipment will supply VRML model of their real products everybody would be able to place it into VRML scene and learn everything about it.This work explores and tries to enlighten some of the areas and available approaches compliant with VRML 97 specification of modifying static scene by its browser. Concepts of animation pipeline, inside and outside scripting in scene displayed and authoring of VRML targeted geometry are discussed including database connectivity.

  13. An Improved Algorithm Research on the PrefixSpan Based on the Server Session Constraint

    Directory of Open Access Journals (Sweden)

    Cai Hong-Guo

    2017-01-01

    Full Text Available When we mine long sequential pattern and discover knowledge by the PrefixSpan algorithm in Web Usage Mining (WUM.The elements and the suffix sequences are much more may cause the problem of the calculation, such as the space explosion. To further solve the problem a more effective way is that. Firstly, a server session-based server log file format is proposed. Then the improved algorithm on the PrefixSpan based on server session constraint is discussed for mining frequent Sequential patterns on the website. Finally, the validity and superiority of the method are presented by the experiment in the paper.

  14. The Important of the Usage of Information Technology during the Local Services: Special Provincial Administration of Kırşehir

    Directory of Open Access Journals (Sweden)

    Mustafa KOCAOĞLU

    2014-06-01

    Full Text Available The development of information and communication technologies has accelerated public service delivery through the application of information technologies in the world. In addition to these improvements, as a consequence of the reform efforts in the 2000s, important changes have occurred in the quality and quantity of the duties of local governments. It is expressed that the usage of information technologies for public service provision is making important contribution to local governments to fulfill their duties and responsibilities. In this paper aims the analyze that the usage of information technology during the local services delivery at The Special Provincial Administration of Kırşehir. Survey and interview were used as a method of the field research and additionally, the web site of the institute was analyzed. The results of the survey revealed that it has made progress in the efforts of computerization and web site development. The institute is expected to show progress on online service delivery and online management.

  15. Strategic Implications of Water Usage: an Analysis in Brazilian Mining Industries

    Directory of Open Access Journals (Sweden)

    Roberto Schoproni Bichueti

    2014-04-01

    Full Text Available This study aims at identifying the practices of water use management and the business performance in industries in the Brazilian mineral sector. To this end, a descriptive and quantitative study was developed, using the survey method, in industries associated with the Brazilian Mining Institute – IBRAM. The water use management practices were identified based in a model addressing the following aspects: water accounting, risk assessment, direct operations, supply chain, and stakeholders engagement. The business performance was measured from a model involving the following dimensions: economic, environmental and social. Among the results, the risks assessment involved and the direct operations practices stand out, in order to reduce the amount of water used and waste discharges. The need for greater engagement of industries with the stakeholders and the supply chain, through a more integrated and collaborative management, was also evident.

  16. Alkemio: association of chemicals with biomedical topics by text and data mining.

    Science.gov (United States)

    Gijón-Correas, José A; Andrade-Navarro, Miguel A; Fontaine, Jean F

    2014-07-01

    The PubMed® database of biomedical citations allows the retrieval of scientific articles studying the function of chemicals in biology and medicine. Mining millions of available citations to search reported associations between chemicals and topics of interest would require substantial human time. We have implemented the Alkemio text mining web tool and SOAP web service to help in this task. The tool uses biomedical articles discussing chemicals (including drugs), predicts their relatedness to the query topic with a naïve Bayesian classifier and ranks all chemicals by P-values computed from random simulations. Benchmarks on seven human pathways showed good retrieval performance (areas under the receiver operating characteristic curves ranged from 73.6 to 94.5%). Comparison with existing tools to retrieve chemicals associated to eight diseases showed the higher precision and recall of Alkemio when considering the top 10 candidate chemicals. Alkemio is a high performing web tool ranking chemicals for any biomedical topics and it is free to non-commercial users. http://cbdm.mdc-berlin.de/∼medlineranker/cms/alkemio. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  17. A New Unified Intrusion Anomaly Detection in Identifying Unseen Web Attacks

    Directory of Open Access Journals (Sweden)

    Muhammad Hilmi Kamarudin

    2017-01-01

    Full Text Available The global usage of more sophisticated web-based application systems is obviously growing very rapidly. Major usage includes the storing and transporting of sensitive data over the Internet. The growth has consequently opened up a serious need for more secured network and application security protection devices. Security experts normally equip their databases with a large number of signatures to help in the detection of known web-based threats. In reality, it is almost impossible to keep updating the database with the newly identified web vulnerabilities. As such, new attacks are invisible. This research presents a novel approach of Intrusion Detection System (IDS in detecting unknown attacks on web servers using the Unified Intrusion Anomaly Detection (UIAD approach. The unified approach consists of three components (preprocessing, statistical analysis, and classification. Initially, the process starts with the removal of irrelevant and redundant features using a novel hybrid feature selection method. Thereafter, the process continues with the application of a statistical approach to identifying traffic abnormality. We performed Relative Percentage Ratio (RPR coupled with Euclidean Distance Analysis (EDA and the Chebyshev Inequality Theorem (CIT to calculate the normality score and generate a finest threshold. Finally, Logitboost (LB is employed alongside Random Forest (RF as a weak classifier, with the aim of minimising the final false alarm rate. The experiment has demonstrated that our approach has successfully identified unknown attacks with greater than a 95% detection rate and less than a 1% false alarm rate for both the DARPA 1999 and the ISCX 2012 datasets.

  18. Web Browser History Detection as a Real-World Privacy Threat

    CERN Document Server

    Janc, A

    2010-01-01

    Web browser history detection using CSS $visited$ styles has long been dismissed as an issue of marginal impact. However, due to recent changes in Web usage patterns, coupled with browser performance improvements, the long-standing issue has now become a significant threat to the privacy of Internet users. In this paper we analyze the impact of CSS-based history detection and demonstrate the feasibility of conducting practical attacks with minimal resources. We analyze Web browser behavior and detectability of content loaded via standard protocols and with various HTTP response codes. We develop an algorithm for efficient examination of large link sets and evaluate its performance in modern browsers. Compared to existing methods our approach is up to 6 times faster, and is able to detect up to 30,000 visited links per second. We present a novel Web application capable of effectively detecting clients’ browsing histories and discuss real-world results obtained from 271,576 Internet users. Our results indicat...

  19. Carbon dynamics, food web structure and reclamation strategies in Athabasca oil sands wetlands (CFRAW) : overview and progress

    International Nuclear Information System (INIS)

    Ciborowski, J.; Dixon, D.G.; Foote, L.; Liber, K.; Smits, J.E.

    2009-01-01

    Seven oil sand mining partners and 5 university labs have joined forces to study the effects of mine tailings and process waters on development, health and function of wetland communities formed in post-mining landscapes. The collaborative effort, know as the carbon dynamics, food web structure and reclamation strategies in Athabasca oil sands wetlands (CRFAW), aims to identify the materials and strategies most effective and economical in producing a functioning reclamation landscape. This presentation reported on part of the study that tested predictions about how quickly wetlands amended with reclamation materials approach the conditions of reference wetland systems. It provided a conceptual model of carbon pathways and budgets to assess how the allocation of carbon among compartments changes as newly formed wetlands mature in the boreal system. It was assumed that stockpiling constructed wetlands with peat or topsoil would accelerate succession and community development. Although the bitumen and the naphthenic acids found in constructed wetlands are initially toxic, they may serve as an alternate source of carbon once they degrade. This study also assessed the sources, biological uptake, pathways, and movement through the food web of materials used by the biota in constructed wetlands. Additional studies are examining how the productivity of new wetlands is maintained. Net ecosystem productivity is being monitored along with rates of organic carbon accumulation from microbial, algal, and macrophyte production, and influx of outside materials. The rates of leaf litter breakdown and microbial respiration are being compared to determine how constituents speed or slow food web processes of young and older wetlands. Carbon and nitrogen isotope values in food web compartments indicate which sources are incorporated into the food web as wetlands age. The values are used to determine how this influences community development, food web structure and complexity, and the

  20. Process evaluation of a web-based intervention aimed at empowerment of disability benefit claimants

    NARCIS (Netherlands)

    Samoocha, D.; Snels, I.A.K.; Bruinvels, D.J.; Anema, J.R.; Kowalczyk, W.J.; van der Beek, A.J.

    2011-01-01

    Background: The objective of this process evaluation study was to gain insight into the reach, compliance, appreciation, usage barriers, and users' perceived effectiveness of a web-based intervention http://www.wiagesprek.nl. This intervention was aimed at empowerment of disability claimants, prior

  1. Process evaluation of a web-based intervention aimed at empowerment of disability benefit claimants

    NARCIS (Netherlands)

    Samoocha, David; Snels, Ingrid A. K.; Bruinvels, David J.; Anema, Johannes R.; Kowalczyk, Wojtek; van der Beek, Allard J.

    2011-01-01

    The objective of this process evaluation study was to gain insight into the reach, compliance, appreciation, usage barriers, and users' perceived effectiveness of a web-based intervention http://www.wiagesprek.nl. This intervention was aimed at empowerment of disability claimants, prior to the

  2. Tangled in the breast cancer web: an evaluation of the usage of web-based information resources by breast cancer patients.

    Science.gov (United States)

    Nguyen, Sonia Kim Anh; Ingledew, Paris-Ann

    2013-12-01

    This study describes Internet use by breast cancer patients highlighting search patterns and examining the impact of web-based information on the clinical encounter. From September 2011 to January 2012, breast cancer patients at a cancer center completed a survey. Answers were closed and open-ended. Eighty-one patients were approached and 56 completed the survey. Forty-five (80 %) respondents used the Internet and 32 (71 %) searched for breast cancer information. All used Google as their principal search engine. To evaluate quality, 47 % referred to author credentials and 41 % examined references. Most sought information with respect to treatment or prognosis. Eighty percent felt that the information increased their knowledge and influenced treatment decision making for 53 %. This study highlights search patterns and factors used by breast cancer patients in seeking web-based information. Physicians must appreciate that patients use the Internet and address discrepancies between information sought and that which is available.

  3. Text mining for adverse drug events: the promise, challenges, and state of the art.

    Science.gov (United States)

    Harpaz, Rave; Callahan, Alison; Tamang, Suzanne; Low, Yen; Odgers, David; Finlayson, Sam; Jung, Kenneth; LePendu, Paea; Shah, Nigam H

    2014-10-01

    Text mining is the computational process of extracting meaningful information from large amounts of unstructured text. It is emerging as a tool to leverage underutilized data sources that can improve pharmacovigilance, including the objective of adverse drug event (ADE) detection and assessment. This article provides an overview of recent advances in pharmacovigilance driven by the application of text mining, and discusses several data sources-such as biomedical literature, clinical narratives, product labeling, social media, and Web search logs-that are amenable to text mining for pharmacovigilance. Given the state of the art, it appears text mining can be applied to extract useful ADE-related information from multiple textual sources. Nonetheless, further research is required to address remaining technical challenges associated with the text mining methodologies, and to conclusively determine the relative contribution of each textual source to improving pharmacovigilance.

  4. Research on parallel algorithm for sequential pattern mining

    Science.gov (United States)

    Zhou, Lijuan; Qin, Bai; Wang, Yu; Hao, Zhongxiao

    2008-03-01

    Sequential pattern mining is the mining of frequent sequences related to time or other orders from the sequence database. Its initial motivation is to discover the laws of customer purchasing in a time section by finding the frequent sequences. In recent years, sequential pattern mining has become an important direction of data mining, and its application field has not been confined to the business database and has extended to new data sources such as Web and advanced science fields such as DNA analysis. The data of sequential pattern mining has characteristics as follows: mass data amount and distributed storage. Most existing sequential pattern mining algorithms haven't considered the above-mentioned characteristics synthetically. According to the traits mentioned above and combining the parallel theory, this paper puts forward a new distributed parallel algorithm SPP(Sequential Pattern Parallel). The algorithm abides by the principal of pattern reduction and utilizes the divide-and-conquer strategy for parallelization. The first parallel task is to construct frequent item sets applying frequent concept and search space partition theory and the second task is to structure frequent sequences using the depth-first search method at each processor. The algorithm only needs to access the database twice and doesn't generate the candidated sequences, which abates the access time and improves the mining efficiency. Based on the random data generation procedure and different information structure designed, this paper simulated the SPP algorithm in a concrete parallel environment and implemented the AprioriAll algorithm. The experiments demonstrate that compared with AprioriAll, the SPP algorithm had excellent speedup factor and efficiency.

  5. Media Usage in Post-Secondary Education and Implications for Teaching and Learning

    Directory of Open Access Journals (Sweden)

    G. Gidion

    2014-12-01

    Full Text Available The Web 2.0 has permeated academic life. The use of online information services in post-secondary education has led to dramatic changes in faculty teaching methods as well as in the learning and study behavior of students. At the same time, traditional information media, such as textbooks and printed handouts, still form the basic pillars of teaching and learning. This paper reports the results of a survey about media usage in teaching and learning conducted with Western University students and instructors, highlighting trends in the usage of new and traditional media in higher education by instructors and students. In addition, the survey comprises part of an international research program in which 20 universities from 10 countries are currently participating. Further, the study will hopefully become a part of the ongoing discussion of practices and policies that purport to advance the effective use of media in teaching and learning.

  6. Mine drivage in hydraulic mines

    Energy Technology Data Exchange (ETDEWEB)

    Ehkber, B Ya

    1983-09-01

    From 20 to 25% of labor cost in hydraulic coal mines falls on mine drivage. Range of mine drivage is high due to the large number of shortwalls mined by hydraulic monitors. Reducing mining cost in hydraulic mines depends on lowering drivage cost by use of new drivage systems or by increasing efficiency of drivage systems used at present. The following drivage methods used in hydraulic mines are compared: heading machines with hydraulic haulage of cut rocks and coal, hydraulic monitors with hydraulic haulage, drilling and blasting with hydraulic haulage of blasted rocks. Mining and geologic conditions which influence selection of the optimum mine drivage system are analyzed. Standardized cross sections of mine roadways driven by the 3 methods are shown in schemes. Support systems used in mine roadways are compared: timber supports, roof bolts, roof bolts with steel elements, and roadways driven in rocks without a support system. Heading machines (K-56MG, GPKG, 4PU, PK-3M) and hydraulic monitors (GMDTs-3M, 12GD-2) used for mine drivage are described. Data on mine drivage in hydraulic coal mines in the Kuzbass are discussed. From 40 to 46% of roadways are driven by heading machines with hydraulic haulage and from 12 to 15% by hydraulic monitors with hydraulic haulage.

  7. A Parallel Approach for Frequent Subgraph Mining in a Single Large Graph Using Spark

    Directory of Open Access Journals (Sweden)

    Fengcai Qiao

    2018-02-01

    Full Text Available Frequent subgraph mining (FSM plays an important role in graph mining, attracting a great deal of attention in many areas, such as bioinformatics, web data mining and social networks. In this paper, we propose SSiGraM (Spark based Single Graph Mining, a Spark based parallel frequent subgraph mining algorithm in a single large graph. Aiming to approach the two computational challenges of FSM, we conduct the subgraph extension and support evaluation parallel across all the distributed cluster worker nodes. In addition, we also employ a heuristic search strategy and three novel optimizations: load balancing, pre-search pruning and top-down pruning in the support evaluation process, which significantly improve the performance. Extensive experiments with four different real-world datasets demonstrate that the proposed algorithm outperforms the existing GraMi (Graph Mining algorithm by an order of magnitude for all datasets and can work with a lower support threshold.

  8. Data mining concepts and techniques

    CERN Document Server

    Han, Jiawei

    2005-01-01

    Our ability to generate and collect data has been increasing rapidly. Not only are all of our business, scientific, and government transactions now computerized, but the widespread use of digital cameras, publication tools, and bar codes also generate data. On the collection side, scanned text and image platforms, satellite remote sensing systems, and the World Wide Web have flooded us with a tremendous amount of data. This explosive growth has generated an even more urgent need for new techniques and automated tools that can help us transform this data into useful information and knowledge.Like the first edition, voted the most popular data mining book by KD Nuggets readers, this book explores concepts and techniques for the discovery of patterns hidden in large data sets, focusing on issues relating to their feasibility, usefulness, effectiveness, and scalability. However, since the publication of the first edition, great progress has been made in the development of new data mining methods, systems, and app...

  9. The SADI Personal Health Lens: A Web Browser-Based System for Identifying Personally Relevant Drug Interactions.

    Science.gov (United States)

    Vandervalk, Ben; McCarthy, E Luke; Cruz-Toledo, José; Klein, Artjom; Baker, Christopher J O; Dumontier, Michel; Wilkinson, Mark D

    2013-04-05

    The Web provides widespread access to vast quantities of health-related information that can improve quality-of-life through better understanding of personal symptoms, medical conditions, and available treatments. Unfortunately, identifying a credible and personally relevant subset of information can be a time-consuming and challenging task for users without a medical background. The objective of the Personal Health Lens system is to aid users when reading health-related webpages by providing warnings about personally relevant drug interactions. More broadly, we wish to present a prototype for a novel, generalizable approach to facilitating interactions between a patient, their practitioner(s), and the Web. We utilized a distributed, Semantic Web-based architecture for recognizing personally dangerous drugs consisting of: (1) a private, local triple store of personal health information, (2) Semantic Web services, following the Semantic Automated Discovery and Integration (SADI) design pattern, for text mining and identifying substance interactions, (3) a bookmarklet to trigger analysis of a webpage and annotate it with personalized warnings, and (4) a semantic query that acts as an abstract template of the analytical workflow to be enacted by the system. A prototype implementation of the system is provided in the form of a Java standalone executable JAR file. The JAR file bundles all components of the system: the personal health database, locally-running versions of the SADI services, and a javascript bookmarklet that triggers analysis of a webpage. In addition, the demonstration includes a hypothetical personal health profile, allowing the system to be used immediately without configuration. Usage instructions are provided. The main strength of the Personal Health Lens system is its ability to organize medical information and to present it to the user in a personalized and contextually relevant manner. While this prototype was limited to a single knowledge domain

  10. The SADI Personal Health Lens: A Web Browser-Based System for Identifying Personally Relevant Drug Interactions

    Science.gov (United States)

    Vandervalk, Ben; McCarthy, E Luke; Cruz-Toledo, José; Klein, Artjom; Baker, Christopher J O; Dumontier, Michel

    2013-01-01

    Background The Web provides widespread access to vast quantities of health-related information that can improve quality-of-life through better understanding of personal symptoms, medical conditions, and available treatments. Unfortunately, identifying a credible and personally relevant subset of information can be a time-consuming and challenging task for users without a medical background. Objective The objective of the Personal Health Lens system is to aid users when reading health-related webpages by providing warnings about personally relevant drug interactions. More broadly, we wish to present a prototype for a novel, generalizable approach to facilitating interactions between a patient, their practitioner(s), and the Web. Methods We utilized a distributed, Semantic Web-based architecture for recognizing personally dangerous drugs consisting of: (1) a private, local triple store of personal health information, (2) Semantic Web services, following the Semantic Automated Discovery and Integration (SADI) design pattern, for text mining and identifying substance interactions, (3) a bookmarklet to trigger analysis of a webpage and annotate it with personalized warnings, and (4) a semantic query that acts as an abstract template of the analytical workflow to be enacted by the system. Results A prototype implementation of the system is provided in the form of a Java standalone executable JAR file. The JAR file bundles all components of the system: the personal health database, locally-running versions of the SADI services, and a javascript bookmarklet that triggers analysis of a webpage. In addition, the demonstration includes a hypothetical personal health profile, allowing the system to be used immediately without configuration. Usage instructions are provided. Conclusions The main strength of the Personal Health Lens system is its ability to organize medical information and to present it to the user in a personalized and contextually relevant manner. While this

  11. Rethinking the dose-response relationship between usage and outcome in an online intervention for depression: randomized controlled trial.

    Science.gov (United States)

    Donkin, Liesje; Hickie, Ian B; Christensen, Helen; Naismith, Sharon L; Neal, Bruce; Cockayne, Nicole L; Glozier, Nick

    2013-10-17

    There is now substantial evidence that Web-based interventions can be effective at changing behavior and successfully treating psychological disorders. However, interest in the impact of usage on intervention outcomes has only been developed recently. To date, persistence with or completion of the intervention has been the most commonly reported metric of use, but this does not adequately describe user behavior online. Analysis of alternative measures of usage and their relationship to outcome may help to understand how much of the intervention users may need to obtain a clinically significant benefit from the program. The objective of this study was to determine which usage metrics, if any, are associated with outcome in an online depression treatment trial. Cardiovascular Risk E-couch Depression Outcome (CREDO) is a randomized controlled trial evaluating an unguided Web-based program (E-couch) based on cognitive behavioral therapy and interpersonal therapy for people with depression and cardiovascular disease. In all, 280 participants in the active arm of the trial commenced the program, delivered in 12 modules containing pages of text and activities. Usage data (eg, number of log-ins, modules completed, time spent online, and activities completed) were captured automatically by the program interface. We estimated the association of these and composite metrics with the outcome of a clinically significant improvement in depression score on the Patient Health Questionnaire (PHQ-9) of ≥ 5 points. In all, 214/280 (76.4%) participants provided outcome data at the end of the 12-week period and were included in the analysis. Of these, 94 (43.9%) participants obtained clinically significant improvement. Participants logged into the program an average of 18.7 times (SD 8.3) with most (62.1%, 133/214) completing all 12 modules. Average time spent online per log-in was 17.3 minutes (SD 10.5). Participants completed an average of 9 of 18 activities available within the

  12. Nagra technical report 14-02, geological basics - Dossier VII - Usage conflicts

    International Nuclear Information System (INIS)

    Gautschi, A.; Becker, J.; Traber, D.; Leu, W.

    2014-01-01

    This dossier is the seventh of a series of eight reports concerning the safety and technical aspects of locations for the disposal of radioactive wastes in Switzerland. It discusses possible conflicts with respect to the use of rock strata below or above the proposed host rock layers. Possible usage could include the extraction of salt, coal or other hydrocarbons. Other possible conflicting uses include the mining of stone, ores and minerals as well as the extraction of mineral water and thermal water. The construction of deep boreholes, for example for geothermal probes, could also cause conflicts with any nuclear waste depositories. The storage of natural gas or carbon sequestration, however, is not considered likely

  13. GPU-Accelerated Text Mining

    International Nuclear Information System (INIS)

    Cui, X.; Mueller, F.; Zhang, Y.; Potok, Thomas E.

    2009-01-01

    Accelerating hardware devices represent a novel promise for improving the performance for many problem domains but it is not clear for which domains what accelerators are suitable. While there is no room in general-purpose processor design to significantly increase the processor frequency, developers are instead resorting to multi-core chips duplicating conventional computing capabilities on a single die. Yet, accelerators offer more radical designs with a much higher level of parallelism and novel programming environments. This present work assesses the viability of text mining on CUDA. Text mining is one of the key concepts that has become prominent as an effective means to index the Internet, but its applications range beyond this scope and extend to providing document similarity metrics, the subject of this work. We have developed and optimized text search algorithms for GPUs to exploit their potential for massive data processing. We discuss the algorithmic challenges of parallelization for text search problems on GPUs and demonstrate the potential of these devices in experiments by reporting significant speedups. Our study may be one of the first to assess more complex text search problems for suitability for GPU devices, and it may also be one of the first to exploit and report on atomic instruction usage that have recently become available in NVIDIA devices

  14. Co-clustering for Weblogs in Semantic Space

    DEFF Research Database (Denmark)

    Zong, Yu; Xu, Guandong; Dolog, Peter

    2010-01-01

    Web clustering is an approach for aggregating web objects into various groups according to underlying relationships among them. Finding co-clusters of web objects in semantic space is an interesting topic in the context of web usage mining, which is able to capture the underlying user navigational...... interest and content preference simultaneously. In this paper we will present a novel web co-clustering algorithm named Co-Clustering in Semantic space (COCS) to simultaneously partition web users and pages via a latent semantic analysis approach. In COCS, we first, train the latent semantic space...... of weblog data by using Probabilistic Latent Semantic Analysis (PLSA) model, and then, project all weblog data objects into this semantic space with probability distribution to capture the relationship among web pages and web users, at last, propose a clustering algorithm to generate the co...

  15. Role of behavioural factors in green supply chain management implementation in Indian mining industries

    DEFF Research Database (Denmark)

    Muduli, K.; Govindan, Kannan; Barve, A.

    2013-01-01

    Green supply chain management (GSCM) integrates ecological concepts with those of supply chain management in order to minimize energy and material usage and to reduce adverse impacts of supply chain activities on the environment. GSCM implementation in mining industries depends largely upon certain...... be taken as a reference by the decision makers while deciding the hierarchy of action necessary for effective implementation of green practices in mining supply chains. The present research attempts to explore various behavioural factors affecting GCSM practices and their interactions which help to attain...... green-enabled needs. Interpretive structural modelling (ISM) is employed in this research to extract the interrelationships among the identified behavioural factors....

  16. Web-based tools for data analysis and quality assurance on a life-history trait database of plants of Northwest Europe

    NARCIS (Netherlands)

    Stadler, Michael; Ahlers, Dirk; Bekker, Rene M.; Finke, Jens; Kunzmann, Dierk; Sonnenschein, Michael

    2006-01-01

    Most data mining techniques have rarely been used in ecology. To address the specific needs of scientists analysing data from a plant trait database developed during the LEDA project, a web-based data mining tool has been developed. This paper presents the DIONE data miner and the project it has

  17. HC StratoMineR: A web-based tool for the rapid analysis of high content datasets

    NARCIS (Netherlands)

    Omta, W.; Heesbeen, R. van; Pagliero, R.; Velden, L. van der; Lelieveld, D.; Nellen, M.; Kramer, M.; Yeong, M.; Saeidi, A.; Medema, R.; Spruit, M.; Brinkkemper, S.; Klumperman, J.; Egan, D.

    2016-01-01

    High-content screening (HCS) can generate large multidimensional datasets and when aligned with the appropriate data mining tools, it can yield valuable insights into the mechanism of action of bioactive molecules. However, easy-to-use data mining tools are not widely available, with the result that

  18. HC StratoMineR : A Web-Based Tool for the Rapid Analysis of High-Content Datasets

    NARCIS (Netherlands)

    Omta, Wienand A; van Heesbeen, Roy G; Pagliero, Romina J; van der Velden, Lieke M; Lelieveld, Daphne; Nellen, Mehdi; Kramer, Maik; Yeong, Marley; Saeidi, Amir M; Medema, Rene H; Spruit, Marco; Brinkkemper, Sjaak; Klumperman, Judith; Egan, David A

    2016-01-01

    High-content screening (HCS) can generate large multidimensional datasets and when aligned with the appropriate data mining tools, it can yield valuable insights into the mechanism of action of bioactive molecules. However, easy-to-use data mining tools are not widely available, with the result that

  19. Mining Trust Relationships from Online Social Networks

    Institute of Scientific and Technical Information of China (English)

    Yu Zhang; Tong Yu

    2012-01-01

    With the growing popularity of online social network,trust plays a more and more important role in connecting people to each other.We rely on our personal trust to accept recommendations,to make purchase decisions and to select transaction partners in the online community.Therefore,how to obtain trust relationships through mining online social networks becomes an important research topic.There are several shortcomings of existing trust mining methods.First,trust is category-dependent.However,most of the methods overlook the category attribute of trust relationships,which leads to low accuracy in trust calculation.Second,since the data in online social networks cannot be understood and processed by machines directly,traditional mining methods require much human effort and are not easily applied to other applications.To solve the above problems,we propose a semantic-based trust reasoning mechanism to mine trust relationships from online social networks automatically.We emphasize the category attribute of pairwise relationships and utilize Semantic Web technologies to build a domain ontology for data communication and knowledge sharing.We exploit role-based and behavior-based reasoning functions to infer implicit trust relationships and category-specific trust relationships.We make use of path expressions to extend reasoning rules so that the mining process can be done directly without much human effort.We perform experiments on real-life data extracted from Epinions.The experimental results verify the effectiveness and wide application use of our proposed method.

  20. Measurement of Self-Monitoring Web Technology Acceptance and Use in an e-Health Weight-Loss Trial

    OpenAIRE

    Ma, Jun; Xiao, Lan; Blonstein, Andrea C.

    2013-01-01

    Background: Research on technology acceptance and use in e-health weight-loss interventions is limited. Using data from a randomized controlled trial of two e-health interventions, we evaluated the acceptance and use of a self-monitoring Web site for weight loss. Materials and Methods: We examined eight theoretical constructs about technology acceptance using adapted 5-point Likert scales and the association of measured Web site usage and weight loss. Results: All scales had hi...

  1. Recent advancements on the development of web-based applications for the implementation of seismic analysis and surveillance systems

    Science.gov (United States)

    Friberg, P. A.; Luis, R. S.; Quintiliani, M.; Lisowski, S.; Hunter, S.

    2014-12-01

    Recently, a novel set of modules has been included in the Open Source Earthworm seismic data processing system, supporting the use of web applications. These include the Mole sub-system, for storing relevant event data in a MySQL database (see M. Quintiliani and S. Pintore, SRL, 2013), and an embedded webserver, Moleserv, for serving such data to web clients in QuakeML format. These modules have enabled, for the first time using Earthworm, the use of web applications for seismic data processing. These can greatly simplify the operation and maintenance of seismic data processing centers by having one or more servers providing the relevant data as well as the data processing applications themselves to client machines running arbitrary operating systems.Web applications with secure online web access allow operators to work anywhere, without the often cumbersome and bandwidth hungry use of secure shell or virtual private networks. Furthermore, web applications can seamlessly access third party data repositories to acquire additional information, such as maps. Finally, the usage of HTML email brought the possibility of specialized web applications, to be used in email clients. This is the case of EWHTMLEmail, which produces event notification emails that are in fact simple web applications for plotting relevant seismic data.Providing web services as part of Earthworm has enabled a number of other tools as well. One is ISTI's EZ Earthworm, a web based command and control system for an otherwise command line driven system; another is a waveform web service. The waveform web service serves Earthworm data to additional web clients for plotting, picking, and other web-based processing tools. The current Earthworm waveform web service hosts an advanced plotting capability for providing views of event-based waveforms from a Mole database served by Moleserve.The current trend towards the usage of cloud services supported by web applications is driving improvements in Java

  2. Using text-mining techniques in electronic patient records to identify ADRs from medicine use

    DEFF Research Database (Denmark)

    Warrer, Pernille; Hansen, Ebba Holme; Jensen, Lars Juhl

    2012-01-01

    This literature review included studies that use text-mining techniques in narrative documents stored in electronic patient records (EPRs) to investigate ADRs. We searched PubMed, Embase, Web of Science and International Pharmaceutical Abstracts without restrictions from origin until July 2011. We...... included empirically based studies on text mining of electronic patient records (EPRs) that focused on detecting ADRs, excluding those that investigated adverse events not related to medicine use. We extracted information on study populations, EPR data sources, frequencies and types of the identified ADRs......, medicines associated with ADRs, text-mining algorithms used and their performance. Seven studies, all from the United States, were eligible for inclusion in the review. Studies were published from 2001, the majority between 2009 and 2010. Text-mining techniques varied over time from simple free text...

  3. Mining social networks and security informatics

    CERN Document Server

    Özyer, Tansel; Rokne, Jon; Khoury, Suheil

    2013-01-01

    Crime, terrorism and security are in the forefront of current societal concerns. This edited volume presents research based on social network techniques showing how data from crime and terror networks can be analyzed and how information can be extracted. The topics covered include crime data mining and visualization; organized crime detection; crime network visualization; computational criminology; aspects of terror network analyses and threat prediction including cyberterrorism and the related area of dark web; privacy issues in social networks; security informatics; graph algorithms for soci

  4. Data mining application in industrial energy audit for lighting

    Energy Technology Data Exchange (ETDEWEB)

    Maricar, N.M.; Kim, G.C.; Jamal, N. [Kolej Univ., Melaka (Malaysia). Faculty of Electrical Engineering

    2005-07-01

    A data mining application for lighting energy audits at industrial sites was presented. Data collection was based on the parameters needed for the analysis part of the audit. Data collection included the activity for which the room was used; its dimension; light level readings in lux; the number of luminaries; the number of lamps per luminaries; lamp fixtures; and lamp wattage. The lumen method was used to calculate the recommended numbers of luminaries in the room. The number was then compared with the existing system's luminaries. The installed load efficacy ratio (ILER) was then used to determine proper retrofit action to maximize energy usage. The difference between the calculated lux and the standard lux was used to create data subsets. A data mining algorithm was used to determine that the ILER plays an important role in calculating the efficiency of lighting systems. It was also concluded that the method can be used to minimize the time needed to analyze large amounts of lighting data. The results of case studies were also used to show that the combined data mining algorithm provided accurate assessments using existing calculated data. 7 refs., 8 tabs., 5 figs.

  5. Learning System of Web Navigation Patterns through Hypertext Probabilistic Grammars

    Science.gov (United States)

    Cortes Vasquez, Augusto

    2015-01-01

    One issue of real interest in the area of web data mining is to capture users' activities during connection and extract behavior patterns that help define their preferences in order to improve the design of future pages adapting websites interfaces to individual users. This research is intended to provide, first of all, a presentation of the…

  6. Learning System of Web Navigation Patterns through Hypertext Probabilistic Grammars

    Directory of Open Access Journals (Sweden)

    Augusto Cortez Vasquez

    2015-01-01

    Full Text Available One issue of real interest in the area of web data mining is to capture users’ activities during connection and extract behavior patterns that help define their preferences in order to improve the design of future pages adapting websites interfaces to individual users. This research is intended to provide, first of all, a presentation of the methodological foundations of the use of probabilistic languages to identify relevant or most visited websites. Secondly, the web sessions are represented by graphs and probabilistic context-free grammars so that the sessions that have the highest probabilities are considered the most visited and most preferred, therefore, the most important in relation to a particular topic. It aims to develop a tool for processing web sessions obtained from a log server represented by probabilistic context-free grammars.

  7. DISEASES: text mining and data integration of disease-gene associations.

    Science.gov (United States)

    Pletscher-Frankild, Sune; Pallejà, Albert; Tsafou, Kalliopi; Binder, Janos X; Jensen, Lars Juhl

    2015-03-01

    Text mining is a flexible technology that can be applied to numerous different tasks in biology and medicine. We present a system for extracting disease-gene associations from biomedical abstracts. The system consists of a highly efficient dictionary-based tagger for named entity recognition of human genes and diseases, which we combine with a scoring scheme that takes into account co-occurrences both within and between sentences. We show that this approach is able to extract half of all manually curated associations with a false positive rate of only 0.16%. Nonetheless, text mining should not stand alone, but be combined with other types of evidence. For this reason, we have developed the DISEASES resource, which integrates the results from text mining with manually curated disease-gene associations, cancer mutation data, and genome-wide association studies from existing databases. The DISEASES resource is accessible through a web interface at http://diseases.jensenlab.org/, where the text-mining software and all associations are also freely available for download. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.

  8. Usage Record Format Recommendation

    CERN Document Server

    Nilsen, J.K.; Muller-Pfeerkorn, R

    2013-01-01

    For resources to be shared, sites must be able to exchange basic accounting and usage data in a common format. This document describes a common format which enables the exchange of basic accounting and usage data from different resources. This record format is intended to facilitate the sharing of usage information, particularly in the area of the accounting of jobs, computing, memory, storage and cloud usage but with a structure that allows an easy extension to other resources. This document describes the Usage Record components both in natural language form and annotated XML. This document does not address how these records should be used, nor does it attempt to dictate the format in which the accounting records are stored. Instead, it denes a common exchange format. Furthermore, nothing is said regarding the communication mechanisms employed to exchange the records, i.e. transport layer, framing, authentication, integrity, etc.

  9. Carbon dynamics, food web structure and reclamation strategies in Athabasca oil sands wetlands (CFRAW)

    International Nuclear Information System (INIS)

    Ciborowski, J.J.; Dixon, G.; Foote, L.; Liber, K.; Smits, J.E.

    2007-01-01

    The remediation and ecology of oilsands constructed wetlands was discussed with reference to a project known as the Carbon dynamics, Food web structure and Reclamation strategies in Athabasca oil sands Wetlands (CFRAW). This joint project between 7 mining partners and 5 universities documents how tailings in constructed wetlands modify maturation leading to natural conditions in a reclaimed landscape. Since wetlands are expected to make up 20-50 per cent of the final reclamation landscape of areas surface mined for oil sands in northeastern Alberta, the project focuses on how quickly wetlands amended with reclamation materials approach the conditions seen in reference wetland systems. This study provided a conceptual model of carbon pathways and budgets to evaluate how the allocation of carbon among compartments changes as newly formed wetlands mature in the boreal system. It is likely that succession and community development will accelerate if constructed wetlands are supplemented with stockpiled peat or topsoil. The bitumens and naphthenic acids found in wetlands constructed with mine tailings materials are initially toxic, but may ultimately serve as an alternate source of carbon once they degrade or are metabolized by bacteria. This study evaluated the sources, biological uptake, pathways, and movement through the food web of materials used by the biota in constructed wetlands, with particular reference to how productivity of new wetlands is maintained. Net ecosystem productivity is being monitored along with rates of organic carbon accumulation from microbial, algal, and macrophyte production, and influx of outside materials. The rates of leaf litter breakdown and microbial respiration are also being monitored to determine how constituents speed or slow food web processes of young and older wetlands. Carbon and nitrogen stable isotope measurements indicate which sources are incorporated into the food web as wetlands age, and how this influences community

  10. Simple, Scalable, Script-based, Science Processor for Measurements - Data Mining Edition (S4PM-DME)

    Science.gov (United States)

    Pham, L. B.; Eng, E. K.; Lynnes, C. S.; Berrick, S. W.; Vollmer, B. E.

    2005-12-01

    The S4PM-DME is the Goddard Earth Sciences Distributed Active Archive Center's (GES DAAC) web-based data mining environment. The S4PM-DME replaces the Near-line Archive Data Mining (NADM) system with a better web environment and a richer set of production rules. S4PM-DME enables registered users to submit and execute custom data mining algorithms. The S4PM-DME system uses the GES DAAC developed Simple Scalable Script-based Science Processor for Measurements (S4PM) to automate tasks and perform the actual data processing. A web interface allows the user to access the S4PM-DME system. The user first develops personalized data mining algorithm on his/her home platform and then uploads them to the S4PM-DME system. Algorithms in C and FORTRAN languages are currently supported. The user developed algorithm is automatically audited for any potential security problems before it is installed within the S4PM-DME system and made available to the user. Once the algorithm has been installed the user can promote the algorithm to the "operational" environment. From here the user can search and order the data available in the GES DAAC archive for his/her science algorithm. The user can also set up a processing subscription. The subscription will automatically process new data as it becomes available in the GES DAAC archive. The generated mined data products are then made available for FTP pickup. The benefits of using S4PM-DME are 1) to decrease the downloading time it typically takes a user to transfer the GES DAAC data to his/her system thus off-load the heavy network traffic, 2) to free-up the load on their system, and last 3) to utilize the rich and abundance ocean, atmosphere data from the MODIS and AIRS instruments available from the GES DAAC.

  11. A Javascript GIS Platform Based on Invocable Geospatial Web Services

    Directory of Open Access Journals (Sweden)

    Konstantinos Evangelidis

    2018-04-01

    Full Text Available Semantic Web technologies are being increasingly adopted by the geospatial community during last decade through the utilization of open standards for expressing and serving geospatial data. This was also dramatically assisted by the ever-increasing access and usage of geographic mapping and location-based services via smart devices in people’s daily activities. In this paper, we explore the developmental framework of a pure JavaScript client-side GIS platform exclusively based on invocable geospatial Web services. We also extend JavaScript utilization on the server side by deploying a node server acting as a bridge between open source WPS libraries and popular geoprocessing engines. The vehicle for such an exploration is a cross platform Web browser capable of interpreting JavaScript commands to achieve interaction with geospatial providers. The tool is a generic Web interface providing capabilities of acquiring spatial datasets, composing layouts and applying geospatial processes. In an ideal form the end-user will have to identify those services, which satisfy a geo-related need and put them in the appropriate row. The final output may act as a potential collector of freely available geospatial web services. Its server-side components may exploit geospatial processing suppliers composing that way a light-weight fully transparent open Web GIS platform.

  12. Astroinformatics, data mining and the future of astronomical research

    Energy Technology Data Exchange (ETDEWEB)

    Brescia, Massimo, E-mail: longo@na.infn.it [INAF, Astronomical Obs. of Capodimonte, Via Moiariello 16, I-80131 Napoli (Italy); Longo, Giuseppe [Department of Physics, University Federico II, Via Cintia 6, 80126 Napoli (Italy); Department of Astronomy, Caltech, Pasadena (United States)

    2013-08-21

    Astronomy, as many other scientific disciplines, is facing a true data deluge which is bound to change both the praxis and the methodology of every day research work. The emerging field of astroinformatics, while on the one end appears crucial to face the technological challenges, on the other is opening new exciting perspectives for new astronomical discoveries through the implementation of advanced data mining procedures. The complexity of astronomical data and the variety of scientific problems, however, call for innovative algorithms and methods as well as for an extreme usage of ICT technologies.

  13. Astroinformatics, data mining and the future of astronomical research

    International Nuclear Information System (INIS)

    Brescia, Massimo; Longo, Giuseppe

    2013-01-01

    Astronomy, as many other scientific disciplines, is facing a true data deluge which is bound to change both the praxis and the methodology of every day research work. The emerging field of astroinformatics, while on the one end appears crucial to face the technological challenges, on the other is opening new exciting perspectives for new astronomical discoveries through the implementation of advanced data mining procedures. The complexity of astronomical data and the variety of scientific problems, however, call for innovative algorithms and methods as well as for an extreme usage of ICT technologies

  14. The prospects of Web 2.0 technologies in teaching and learning in higher learning institutes: The case study of the Sokoine University of Agriculture in Tanzania

    Directory of Open Access Journals (Sweden)

    Wulystan Pius Mtega

    2013-12-01

    Full Text Available The study investigated the perceptions of students and lecturers on Web 2.0 as learning and teaching tools. It identified the commonly used web 2.0 tools; determined how the tools facilitate teaching and learning; assessed the appropriateness of features of the commonly used web 2.0 tools in teaching and learning and; determined the challenges associated with the usage of the tools in teaching and learning in higher education environments. The study was conducted at the Sokoine University of Agriculture (SUA in Tanzania; it employed combined research designs where both qualitative and quantitative designs were used. Stratified sampling techniques were employed to select respondents from the different strata namely students (undergraduate and postgraduate and teaching staff. Structured questionnaires were distributed to 120 students and 50 teaching staff who were randomly selected from each stratum. Findings show that blogs, Facebook, Wikis, Google drive and YouTube were used for teaching and learning at SUA. However, the level of usage of Web 2.0 tools for non academic activities was higher than for academic purposes. It is concluded that that not all tools and applications were suitable for teaching and learning. It is recommended that students and staff should be trained on how to use Web 2.0 tools in teaching and learning. Institutes should promote the usage of such tools because some of them have suitable applications for teaching and learning. Developers of Web 2.o tools should incorporate more applications that may help teaching staff to supervise and assist students in the learning process.

  15. Mining and mining authorities in Saarland 2016. Mining economy, mining technology, occupational safety, environmental protection, statistics, mining authority activities. Annual report

    International Nuclear Information System (INIS)

    2016-01-01

    The annual report of the Saarland Upper Mining Authority provides an insight into the activities of mining authorities. Especially, the development of the black coal mining, safety and technology of mining as well as the correlation between mining and environment are stressed.

  16. Annotating images by mining image search results.

    Science.gov (United States)

    Wang, Xin-Jing; Zhang, Lei; Li, Xirong; Ma, Wei-Ying

    2008-11-01

    Although it has been studied for years by the computer vision and machine learning communities, image annotation is still far from practical. In this paper, we propose a novel attempt at model-free image annotation, which is a data-driven approach that annotates images by mining their search results. Some 2.4 million images with their surrounding text are collected from a few photo forums to support this approach. The entire process is formulated in a divide-and-conquer framework where a query keyword is provided along with the uncaptioned image to improve both the effectiveness and efficiency. This is helpful when the collected data set is not dense everywhere. In this sense, our approach contains three steps: 1) the search process to discover visually and semantically similar search results, 2) the mining process to identify salient terms from textual descriptions of the search results, and 3) the annotation rejection process to filter out noisy terms yielded by Step 2. To ensure real-time annotation, two key techniques are leveraged-one is to map the high-dimensional image visual features into hash codes, the other is to implement it as a distributed system, of which the search and mining processes are provided as Web services. As a typical result, the entire process finishes in less than 1 second. Since no training data set is required, our approach enables annotating with unlimited vocabulary and is highly scalable and robust to outliers. Experimental results on both real Web images and a benchmark image data set show the effectiveness and efficiency of the proposed algorithm. It is also worth noting that, although the entire approach is illustrated within the divide-and conquer framework, a query keyword is not crucial to our current implementation. We provide experimental results to prove this.

  17. MetaRanker 2.0: a web server for prioritization of genetic variation data

    DEFF Research Database (Denmark)

    Pers, Tune Hannes; Dworzynski, Piotr; Thomas, Cecilia Engel

    2013-01-01

    MetaRanker 2.0 is a web server for prioritization of common and rare frequency genetic variation data. Based on heterogeneous data sets including genetic association data, protein–protein interactions, large-scale text-mining data, copy number variation data and gene expression experiments, Meta...

  18. Automated data mining: an innovative and efficient web-based approach to maintaining resident case logs.

    Science.gov (United States)

    Bhattacharya, Pratik; Van Stavern, Renee; Madhavan, Ramesh

    2010-12-01

    Use of resident case logs has been considered by the Residency Review Committee for Neurology of the Accreditation Council for Graduate Medical Education (ACGME). This study explores the effectiveness of a data-mining program for creating resident logs and compares the results to a manual data-entry system. Other potential applications of data mining to enhancing resident education are also explored. Patient notes dictated by residents were extracted from the Hospital Information System and analyzed using an unstructured mining program. History, examination and ICD codes were obtained and compared to the existing manual log. The automated data History, examination, and ICD codes were gathered for a 30-day period and compared to manual case logs. The automated method extracted all resident dictations with the dates of encounter and transcription. The automated data-miner processed information from all 19 residents, while only 4 residents logged manually. The manual method identified only broad categories of diseases; the major categories were stroke or vascular disorder 53 (27.6%), epilepsy 28 (14.7%), and pain syndromes 26 (13.5%). In the automated method, epilepsy 114 (21.1%), cerebral atherosclerosis 114 (21.1%), and headache 105 (19.4%) were the most frequent primary diagnoses, and headache 89 (16.5%), seizures 94 (17.4%), and low back pain 47 (9%) were the most common chief complaints. More detailed patient information such as tobacco use 227 (42%), alcohol use 205 (38%), and drug use 38 (7%) were extracted by the data-mining method. Manual case logs are time-consuming, provide limited information, and may be unpopular with residents. Data mining is a time-effective tool that may aid in the assessment of resident experience or the ACGME core competencies or in resident clinical research. More study of this method in larger numbers of residency programs is needed.

  19. Social media mining with R

    CERN Document Server

    Heimann, Richard

    2014-01-01

    A concise, hands-on guide with many practical examples and a detailed treatise on inference and social science research that will help you in mining data in the real world. Whether you are an undergraduate who wishes to get hands-on experience working with social data from the Web, a practitioner wishing to expand your competencies and learn unsupervised sentiment analysis, or you are simply interested in social data analysis, this book will prove to be an essential asset. No previous experience with R or statistics is required, though having knowledge of both will enrich your experience.

  20. EnvMine: A text-mining system for the automatic extraction of contextual information

    Directory of Open Access Journals (Sweden)

    de Lorenzo Victor

    2010-06-01

    Full Text Available Abstract Background For ecological studies, it is crucial to count on adequate descriptions of the environments and samples being studied. Such a description must be done in terms of their physicochemical characteristics, allowing a direct comparison between different environments that would be difficult to do otherwise. Also the characterization must include the precise geographical location, to make possible the study of geographical distributions and biogeographical patterns. Currently, there is no schema for annotating these environmental features, and these data have to be extracted from textual sources (published articles. So far, this had to be performed by manual inspection of the corresponding documents. To facilitate this task, we have developed EnvMine, a set of text-mining tools devoted to retrieve contextual information (physicochemical variables and geographical locations from textual sources of any kind. Results EnvMine is capable of retrieving the physicochemical variables cited in the text, by means of the accurate identification of their associated units of measurement. In this task, the system achieves a recall (percentage of items retrieved of 92% with less than 1% error. Also a Bayesian classifier was tested for distinguishing parts of the text describing environmental characteristics from others dealing with, for instance, experimental settings. Regarding the identification of geographical locations, the system takes advantage of existing databases such as GeoNames to achieve 86% recall with 92% precision. The identification of a location includes also the determination of its exact coordinates (latitude and longitude, thus allowing the calculation of distance between the individual locations. Conclusion EnvMine is a very efficient method for extracting contextual information from different text sources, like published articles or web pages. This tool can help in determining the precise location and physicochemical

  1. Queensland Mines plant trials with Caro's acid

    International Nuclear Information System (INIS)

    Lucas, G.C.; Fulton, E.J.; Vautier, F.E.; Waters, D.J.; Ring, R.J.

    1983-01-01

    Laboratory leach tests have been carried out to compare the effectiveness of Caro's acid (permonosulphuric acid) as an alternative oxidant to pyrolusite in the leaching of uranium ores. Results demonstrated that Caro's acid reduced acid consumption in leaching and the time required for neutralisation of tailings liquor. The uranium extraction was unaffected by choice of oxidant. A plant trial confirmed that significant savings in acid and lime usage can be achieved under plant conditions. Plant operations also demonstrated that Caro's acid has a number of significant operating advantages over pyrolusite. Queensland Mines Ltd. have recently decided to convert their leaching process from pyrolusite to Caro's acid

  2. A Semantic Web-based System for Mining Genetic Mutations in Cancer Clinical Trials.

    Science.gov (United States)

    Priya, Sambhawa; Jiang, Guoqian; Dasari, Surendra; Zimmermann, Michael T; Wang, Chen; Heflin, Jeff; Chute, Christopher G

    2015-01-01

    Textual eligibility criteria in clinical trial protocols contain important information about potential clinically relevant pharmacogenomic events. Manual curation for harvesting this evidence is intractable as it is error prone and time consuming. In this paper, we develop and evaluate a Semantic Web-based system that captures and manages mutation evidences and related contextual information from cancer clinical trials. The system has 2 main components: an NLP-based annotator and a Semantic Web ontology-based annotation manager. We evaluated the performance of the annotator in terms of precision and recall. We demonstrated the usefulness of the system by conducting case studies in retrieving relevant clinical trials using a collection of mutations identified from TCGA Leukemia patients and Atlas of Genetics and Cytogenetics in Oncology and Haematology. In conclusion, our system using Semantic Web technologies provides an effective framework for extraction, annotation, standardization and management of genetic mutations in cancer clinical trials.

  3. CMS data quality monitoring web service

    Energy Technology Data Exchange (ETDEWEB)

    Tuura, L; Eulisse, G [Northeastern University, Boston, MA (United States); Meyer, A, E-mail: lat@cern.c, E-mail: giulio.eulisse@cern.c, E-mail: andreas.meyer@cern.c [DESY, Hamburg (Germany)

    2010-04-01

    A central component of the data quality monitoring system of the CMS experiment at the Large Hadron Collider is a web site for browsing data quality histograms. The production servers in data taking provide access to several hundred thousand histograms per run, both live in online as well as for up to several terabytes of archived histograms for the online data taking, Tier-0 prompt reconstruction, prompt calibration and analysis activities, for re-reconstruction at Tier-1s and for release validation. At the present usage level the servers currently handle in total around a million authenticated HTTP requests per day. We describe the main features and components of the system, our implementation for web-based interactive rendering, and the server design. We give an overview of the deployment and maintenance procedures. We discuss the main technical challenges and our solutions to them, with emphasis on functionality, long-term robustness and performance.

  4. CMS data quality monitoring web service

    International Nuclear Information System (INIS)

    Tuura, L; Eulisse, G; Meyer, A

    2010-01-01

    A central component of the data quality monitoring system of the CMS experiment at the Large Hadron Collider is a web site for browsing data quality histograms. The production servers in data taking provide access to several hundred thousand histograms per run, both live in online as well as for up to several terabytes of archived histograms for the online data taking, Tier-0 prompt reconstruction, prompt calibration and analysis activities, for re-reconstruction at Tier-1s and for release validation. At the present usage level the servers currently handle in total around a million authenticated HTTP requests per day. We describe the main features and components of the system, our implementation for web-based interactive rendering, and the server design. We give an overview of the deployment and maintenance procedures. We discuss the main technical challenges and our solutions to them, with emphasis on functionality, long-term robustness and performance.

  5. Anonymous communication networks protecting privacy on the web

    CERN Document Server

    Peng, Kun

    2014-01-01

    In today's interactive network environment, where various types of organizations are eager to monitor and track Internet use, anonymity is one of the most powerful resources available to counterbalance the threat of unknown spectators and to ensure Internet privacy.Addressing the demand for authoritative information on anonymous Internet usage, Anonymous Communication Networks: Protecting Privacy on the Web examines anonymous communication networks as a solution to Internet privacy concerns. It explains how anonymous communication networks make it possible for participants to communicate with

  6. antiSMASH 2.0-a versatile platform for genome mining of secondary metabolite producers

    NARCIS (Netherlands)

    Blin, Kai; Medema, Marnix H.; Kazempour, Daniyal; Fischbach, Michael A.; Breitling, Rainer; Takano, Eriko; Weber, Tilmann

    Microbial secondary metabolites are a potent source of antibiotics and other pharmaceuticals. Genome mining of their biosynthetic gene clusters has become a key method to accelerate their identification and characterization. In 2011, we developed antiSMASH, a web-based analysis platform that

  7. A Clustering Methodology of Web Log Data for Learning Management Systems

    Science.gov (United States)

    Valsamidis, Stavros; Kontogiannis, Sotirios; Kazanidis, Ioannis; Theodosiou, Theodosios; Karakos, Alexandros

    2012-01-01

    Learning Management Systems (LMS) collect large amounts of data. Data mining techniques can be applied to analyse their web data log files. The instructors may use this data for assessing and measuring their courses. In this respect, we have proposed a methodology for analysing LMS courses and students' activity. This methodology uses a Markov…

  8. Utilisation du site Web | CRDI - Centre de recherches pour le ...

    International Development Research Centre (IDRC) Digital Library (Canada)

    Ce site Web fournit des espaces de discussion et donne accès à divers mécanismes de communication. L'utilisateur s'engage à en faire un usage approprié et pertinent à l'objet des discussions. Il s'engage en outre à ne diffuser aucun message diffamatoire, illégal, obscène ou menaçant et à ne pas télécharger ou joindre ...

  9. AHCODA-DB: a data repository with web-based mining tools for the analysis of automated high-content mouse phenomics data.

    Science.gov (United States)

    Koopmans, Bastijn; Smit, August B; Verhage, Matthijs; Loos, Maarten

    2017-04-04

    Systematic, standardized and in-depth phenotyping and data analyses of rodent behaviour empowers gene-function studies, drug testing and therapy design. However, no data repositories are currently available for standardized quality control, data analysis and mining at the resolution of individual mice. Here, we present AHCODA-DB, a public data repository with standardized quality control and exclusion criteria aimed to enhance robustness of data, enabled with web-based mining tools for the analysis of individually and group-wise collected mouse phenotypic data. AHCODA-DB allows monitoring in vivo effects of compounds collected from conventional behavioural tests and from automated home-cage experiments assessing spontaneous behaviour, anxiety and cognition without human interference. AHCODA-DB includes such data from mutant mice (transgenics, knock-out, knock-in), (recombinant) inbred strains, and compound effects in wildtype mice and disease models. AHCODA-DB provides real time statistical analyses with single mouse resolution and versatile suite of data presentation tools. On March 9th, 2017 AHCODA-DB contained 650 k data points on 2419 parameters from 1563 mice. AHCODA-DB provides users with tools to systematically explore mouse behavioural data, both with positive and negative outcome, published and unpublished, across time and experiments with single mouse resolution. The standardized (automated) experimental settings and the large current dataset (1563 mice) in AHCODA-DB provide a unique framework for the interpretation of behavioural data and drug effects. The use of common ontologies allows data export to other databases such as the Mouse Phenome Database. Unbiased presentation of positive and negative data obtained under the highly standardized screening conditions increase cost efficiency of publicly funded mouse screening projects and help to reach consensus conclusions on drug responses and mouse behavioural phenotypes. The website is publicly

  10. Critical Success Factors for Adoption of Web-Based Learning Management Systems in Tanzania

    Science.gov (United States)

    Lwoga, Edda Tandi

    2014-01-01

    This paper examines factors that predict students' continual usage intention of web-based learning content management systems in Tanzania, with a specific focus at Muhimbili University of Health and Allied Science (MUHAS). This study sent a questionnaire surveys to 408 first year undergraduate students, with a rate of return of 66.7. This study…

  11. Military Parents' Personal Technology Usage and Interest in e-Health Information for Obesity Prevention.

    Science.gov (United States)

    Jai, Tun-Min; McCool, Barent N; Reed, Debra B

    2016-03-01

    U.S. military families are experiencing high obesity rates similar to the civilian population. The Department of Defense's Military Health System (MHS) is one of the largest healthcare providers in the United States, serving approximately 9.2 million active duty service members, retirees, spouses, and children. The annual cost to the MHS for morbidities associated with being overweight exceeds $1 billion. The preschool age has been suggested as an opportune time to intervene for the prevention of obesity. Thus, this study investigated the current level of technology usage by military service member families and assessed their needs and interests in health/nutrition information. This needs assessment is crucial for researchers/educators to design further studies and intervention programs for obesity prevention in military families with young children. In total, 288 military parents (233 Army and 55 Air Force) at two military bases whose children were enrolled in military childcare centers in the southwestern United States participated in a Technology Usage in Military Family (TUMF) survey in 2013. Overall, both bases presented similar technology usage patterns in terms of computer and mobile device usage on the Internet. Air Force base parents had a slightly higher knowledge level of nutrition/health information than Army base parents. The TUMF survey suggested practical ways such as mobile applications/Web sites, social networks, games, etc., that health educators can use to disseminate nutrition/health information for obesity prevention among military families with young children.

  12. Carbon dynamics, food web structure and reclamation strategies in Athabasca oil sands wetlands (CRFAW)

    International Nuclear Information System (INIS)

    Ciborowski, J.; Dixon, G.; Foote, L.; Liber, K.; Smits, J.

    2010-01-01

    This abstract provided details of the Carbon Dynamics, Food Web Structure and Reclamation Strategies in Athabasca Oil Sands Wetlands (CFRAW) program, a collaboration between oil sands industry partners and university laboratories. CFRAW researchers are investigating the effects of mine tailings and process waters on the development, health, and function of wetland communities in post-mining landscapes. The aim of the program is to accurately predict how quickly the reclaimed wetlands will approach conditions seen in reference wetland systems. The program is also examining the effects of hydrocarbons as a surrogate source of carbon after they are metabolized by bacteria. The biological uptake, pathways, and movement through the food web of materials used by the biota in constructed wetlands are also being studied. Flux estimates will be used to determine if wetlands amended with peat will maintain their productivity. A conceptual model of carbon pathways and budgets is also being developed.

  13. Proton Pump Inhibitor Usage and the Risk of Myocardial Infarction in the General Population.

    Directory of Open Access Journals (Sweden)

    Nigam H Shah

    Full Text Available Proton pump inhibitors (PPIs have been associated with adverse clinical outcomes amongst clopidogrel users after an acute coronary syndrome. Recent pre-clinical results suggest that this risk might extend to subjects without any prior history of cardiovascular disease. We explore this potential risk in the general population via data-mining approaches.Using a novel approach for mining clinical data for pharmacovigilance, we queried over 16 million clinical documents on 2.9 million individuals to examine whether PPI usage was associated with cardiovascular risk in the general population.In multiple data sources, we found gastroesophageal reflux disease (GERD patients exposed to PPIs to have a 1.16 fold increased association (95% CI 1.09-1.24 with myocardial infarction (MI. Survival analysis in a prospective cohort found a two-fold (HR = 2.00; 95% CI 1.07-3.78; P = 0.031 increase in association with cardiovascular mortality. We found that this association exists regardless of clopidogrel use. We also found that H2 blockers, an alternate treatment for GERD, were not associated with increased cardiovascular risk; had they been in place, such pharmacovigilance algorithms could have flagged this risk as early as the year 2000.Consistent with our pre-clinical findings that PPIs may adversely impact vascular function, our data-mining study supports the association of PPI exposure with risk for MI in the general population. These data provide an example of how a combination of experimental studies and data-mining approaches can be applied to prioritize drug safety signals for further investigation.

  14. Enjeux communicationnels du Web 2.0 pour les Relations Publiques/Professional and organisational issues of Web 2.0 Public Relations

    Directory of Open Access Journals (Sweden)

    Francine Charest

    2012-12-01

    Full Text Available The Web 2.0, which includes Facebook, Twitter, Youtube and other social medias, is considered as to be one of the strongest communication tools of the early 21st century. TheWeb evolutionhas changed deeply the way Public relations agents operate. In 2009, Charest and Bédard have shown that the Web 2.0 was in fact a reclaim by the internet users of the Web as it was first imagined by Tim Berners-Lee in November 1993 : a tool to exchange and share information. The Web first generation has instead been used by the administrators for dissemination and promotion. Today, in order to appropriate themselves these new medias, PR agents have to findnew business models, even new ways to communicate.RésuméLe Web 2.0, regroupant les Facebook, Twitter, YouTube et autres médias sociaux, est considéré comme l’un des plus puissants outils de communication, en ce début de XX1 siècle. C’est sous l’angle de mutations qu’il induit sur les pratiques des professionnels en relations publiques, qu’il nous intéresse d’étudier les enjeux de l’évolution des usages du Web 2.0. Charest et Bédard ont montré en 2009 que le Web 2.0 était la revanche des internautes qui tentent de se réapproprier le Web tel qu’il avait été conçu par Tim Berners-Lee en novembre 1993, soit comme un outil d’échange et de partage d’information. Il a été clairement montré que la première générationde Web a plutôt été utilisée par les gestionnaires à des fins de diffusion et de promotion. L’appropriation de ces nouveaux médias par les relationnistes passe nécessairement par de nouveaux modèles d’affaires, voire de nouvelles façons de communiquer.

  15. Mine Water Treatment in Hongai Coal Mines

    Science.gov (United States)

    Dang, Phuong Thao; Dang, Vu Chi

    2018-03-01

    Acid mine drainage (AMD) is recognized as one of the most serious environmental problem associated with mining industry. Acid water, also known as acid mine drainage forms when iron sulfide minerals found in the rock of coal seams are exposed to oxidizing conditions in coal mining. Until 2009, mine drainage in Hongai coal mines was not treated, leading to harmful effects on humans, animals and aquatic ecosystem. This report has examined acid mine drainage problem and techniques for acid mine drainage treatment in Hongai coal mines. In addition, selection and criteria for the design of the treatment systems have been presented.

  16. Mine Water Treatment in Hongai Coal Mines

    OpenAIRE

    Dang Phuong Thao; Dang Vu Chi

    2018-01-01

    Acid mine drainage (AMD) is recognized as one of the most serious environmental problem associated with mining industry. Acid water, also known as acid mine drainage forms when iron sulfide minerals found in the rock of coal seams are exposed to oxidizing conditions in coal mining. Until 2009, mine drainage in Hongai coal mines was not treated, leading to harmful effects on humans, animals and aquatic ecosystem. This report has examined acid mine drainage problem and techniques for acid mine ...

  17. ThaleMine: A Warehouse for Arabidopsis Data Integration and Discovery.

    Science.gov (United States)

    Krishnakumar, Vivek; Contrino, Sergio; Cheng, Chia-Yi; Belyaeva, Irina; Ferlanti, Erik S; Miller, Jason R; Vaughn, Matthew W; Micklem, Gos; Town, Christopher D; Chan, Agnes P

    2017-01-01

    ThaleMine (https://apps.araport.org/thalemine/) is a comprehensive data warehouse that integrates a wide array of genomic information of the model plant Arabidopsis thaliana. The data collection currently includes the latest structural and functional annotation from the Araport11 update, the Col-0 genome sequence, RNA-seq and array expression, co-expression, protein interactions, homologs, pathways, publications, alleles, germplasm and phenotypes. The data are collected from a wide variety of public resources. Users can browse gene-specific data through Gene Report pages, identify and create gene lists based on experiments or indexed keywords, and run GO enrichment analysis to investigate the biological significance of selected gene sets. Developed by the Arabidopsis Information Portal project (Araport, https://www.araport.org/), ThaleMine uses the InterMine software framework, which builds well-structured data, and provides powerful data query and analysis functionality. The warehoused data can be accessed by users via graphical interfaces, as well as programmatically via web-services. Here we describe recent developments in ThaleMine including new features and extensions, and discuss future improvements. InterMine has been broadly adopted by the model organism research community including nematode, rat, mouse, zebrafish, budding yeast, the modENCODE project, as well as being used for human data. ThaleMine is the first InterMine developed for a plant model. As additional new plant InterMines are developed by the legume and other plant research communities, the potential of cross-organism integrative data analysis will be further enabled. © The Author 2016. Published by Oxford University Press on behalf of Japanese Society of Plant Physiologists. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  18. Improving Web Learning through model Optimization using Bootstrap for a Tour-Guide Robot

    Directory of Open Access Journals (Sweden)

    Rafael León

    2012-09-01

    Full Text Available We perform a review of Web Mining techniques and we describe a Bootstrap Statistics methodology applied to pattern model classifier optimization and verification for Supervised Learning for Tour-Guide Robot knowledge repository management. It is virtually impossible to test thoroughly Web Page Classifiers and many other Internet Applications with pure empirical data, due to the need for human intervention to generate training sets and test sets. We propose using the computer-based Bootstrap paradigm to design a test environment where they are checked with better reliability

  19. Mine Water Treatment in Hongai Coal Mines

    Directory of Open Access Journals (Sweden)

    Dang Phuong Thao

    2018-01-01

    Full Text Available Acid mine drainage (AMD is recognized as one of the most serious environmental problem associated with mining industry. Acid water, also known as acid mine drainage forms when iron sulfide minerals found in the rock of coal seams are exposed to oxidizing conditions in coal mining. Until 2009, mine drainage in Hongai coal mines was not treated, leading to harmful effects on humans, animals and aquatic ecosystem. This report has examined acid mine drainage problem and techniques for acid mine drainage treatment in Hongai coal mines. In addition, selection and criteria for the design of the treatment systems have been presented.

  20. Attitudes and awareness of web-based self-care resources in the military: a preliminary survey study.

    Science.gov (United States)

    Luxton, David D; Armstrong, Christina M; Fantelli, Emily E; Thomas, Elissa K

    2011-09-01

    Web-based self-care resources have a number of potential benefits for military service members (SMs) and their families such as convenience, anonymity, and immediate 24/7 access to useful information. There is limited data available, however, regarding SM and military healthcare provider use of online self-care resources. Our goal with this study was to conduct a preliminary survey assessment of self-care Web site awareness, general attitudes about use, and usage behaviors of Web-based self-care resources among SMs and military healthcare providers. Results show that the majority of SMs and providers use the Internet often, use Internet self-care resources, and are willing to use additional Web-based resources and capabilities. SMs and providers also indicated a preference for Web-based self-care resources as adjunct tools to face-to-face/in-person care. Data from this preliminary study are useful for informing additional research and best practices for integrating Web-based self-care for the military community.

  1. Environmental Management Practices and Firm Performance in a South African Mining Firm

    Directory of Open Access Journals (Sweden)

    Gibson Nyirenda

    2013-09-01

    Full Text Available This paper examines the impact of environmental management practices on the financial performance of a South African mining firm. The major aim of this paper is to investigate whether such practices have a close relationship with the mining firm’s financial performance (represented by return on equity [ROE]. The approach is a case study of a South African mining firm listed under the socially responsible index (SRI of the Johannesburg Stock Exchange (JSE. It uses Green-Steel sa (pseudonym used in place of the real name as a case study. Using multiple regression statistics, the return on equity of Green-Steel sa is regressed on three environmental management practices of Green- Steel (carbon reduction, energy efficiency, and water usage. The result shows there is no significant relationship between the variables and this lends credence to information gathered from Green-Steel environmental reports that Green-Steel’s environmental management practices are driven mostly by a desire to abide by regulations and also by a moral obligation to use environmental management practices to mitigate climate change impact.

  2. WALK 2.0 - using Web 2.0 applications to promote health-related physical activity: a randomised controlled trial protocol.

    Science.gov (United States)

    Kolt, Gregory S; Rosenkranz, Richard R; Savage, Trevor N; Maeder, Anthony J; Vandelanotte, Corneel; Duncan, Mitch J; Caperchione, Cristina M; Tague, Rhys; Hooker, Cindy; Mummery, W Kerry

    2013-05-03

    Physical inactivity is one of the leading modifiable causes of death and disease in Australia. National surveys indicate less than half of the Australian adult population are sufficiently active to obtain health benefits. The Internet is a potentially important medium for successfully communicating health messages to the general population and enabling individual behaviour change. Internet-based interventions have proven efficacy; however, intervention studies describing website usage objectively have reported a strong decline in usage, and high attrition rate, over the course of the interventions. Web 2.0 applications give users control over web content generated and present innovative possibilities to improve user engagement. There is, however, a need to assess the effectiveness of these applications in the general population. The Walk 2.0 project is a 3-arm randomised controlled trial investigating the effects of "next generation" web-based applications on engagement, retention, and subsequent physical activity behaviour change. 504 individuals will be recruited from two sites in Australia, randomly allocated to one of two web-based interventions (Web 1.0 or Web 2.0) or a control group, and provided with a pedometer to monitor physical activity. The Web 1.0 intervention will provide participants with access to an existing physical activity website with limited interactivity. The Web 2.0 intervention will provide access to a website featuring Web 2.0 content, including social networking, blogs, and virtual walking groups. Control participants will receive a logbook to record their steps. All groups will receive similar educational material on setting goals and increasing physical activity. The primary outcomes are objectively measured physical activity and website engagement and retention. Other outcomes measured include quality of life, psychosocial correlates, and anthropometric measurements. Outcomes will be measured at baseline, 3, 12 and 18 months. The

  3. The Smallest Valid Extension-Based Efficient, Rare Graph Pattern Mining, Considering Length-Decreasing Support Constraints and Symmetry Characteristics of Graphs

    Directory of Open Access Journals (Sweden)

    Unil Yun

    2016-05-01

    Full Text Available Frequent graph mining has been proposed to find interesting patterns (i.e., frequent sub-graphs from databases composed of graph transaction data, which can effectively express complex and large data in the real world. In addition, various applications for graph mining have been suggested. Traditional graph pattern mining methods use a single minimum support threshold factor in order to check whether or not mined patterns are interesting. However, it is not a sufficient factor that can consider valuable characteristics of graphs such as graph sizes and features of graph elements. That is, previous methods cannot consider such important characteristics in their mining operations since they only use a fixed minimum support threshold in the mining process. For this reason, in this paper, we propose a novel graph mining algorithm that can consider various multiple, minimum support constraints according to the types of graph elements and changeable minimum support conditions, depending on lengths of graph patterns. In addition, the proposed algorithm performs in mining operations more efficiently because it can minimize duplicated operations and computational overheads by considering symmetry features of graphs. Experimental results provided in this paper demonstrate that the proposed algorithm outperforms previous mining approaches in terms of pattern generation, runtime and memory usage.

  4. Leveraging Bibliographic RDF Data for Keyword Prediction with Association Rule Mining (ARM

    Directory of Open Access Journals (Sweden)

    Nidhi Kushwaha

    2014-11-01

    Full Text Available The Semantic Web (Web 3.0 has been proposed as an efficient way to access the increasingly large amounts of data on the internet. The Linked Open Data Cloud project at present is the major effort to implement the concepts of the Seamtic Web, addressing the problems of inhomogeneity and large data volumes. RKBExplorer is one of many repositories implementing Open Data and contains considerable bibliographic information. This paper discusses bibliographic data, an important part of cloud data. Effective searching of bibiographic datasets can be a challenge as many of the papers residing in these databases do not have sufficient or comprehensive keyword information. In these cases however, a search engine based on RKBExplorer is only able to use information to retrieve papers based on author names and title of papers without keywords. In this paper we attempt to address this problem by using the data mining algorithm Association Rule Mining (ARM to develop keywords based on features retrieved from Resource Description Framework (RDF data within a bibliographic citation. We have demonstrate the applicability of this method for predicting missing keywords for bibliographic entries in several typical databases. −−−−− Paper presented at 1st International Symposium on Big Data and Cloud Computing Challenges (ISBCC-2014 March 27-28, 2014. Organized by VIT University, Chennai, India. Sponsored by BRNS.

  5. Extending DoD Modeling and Simulation with Web 2.0, Ajax and X3D

    Science.gov (United States)

    2007-09-01

    support for service-oriented architectures • JSF is currently integrated into NetBeans 6, so if developers are already using the IDE it will integrate...required the exclusive usage of open source technologies. The web application was developed in the NetBeans 5.5 IDE running on Apache Tomcat 5.5

  6. OSCAR4: a flexible architecture for chemical text-mining

    Directory of Open Access Journals (Sweden)

    Jessop David M

    2011-10-01

    Full Text Available Abstract The Open-Source Chemistry Analysis Routines (OSCAR software, a toolkit for the recognition of named entities and data in chemistry publications, has been developed since 2002. Recent work has resulted in the separation of the core OSCAR functionality and its release as the OSCAR4 library. This library features a modular API (based on reduction of surface coupling that permits client programmers to easily incorporate it into external applications. OSCAR4 offers a domain-independent architecture upon which chemistry specific text-mining tools can be built, and its development and usage are discussed.

  7. Identification of mine rescue equipment reduction gears technical condition

    Science.gov (United States)

    Gerike, B. L.; Klishin, V. I.; Kuzin, E. G.

    2017-09-01

    The article presents the reasons for adopting intelligent service of mine belt conveyer drives concerning evaluation of their technical condition based on the diagnostic techniques instead of regular preventative maintenance. The article reveals the diagnostic results of belt conveyer drive reduction gears condition taking into account the parameters of lubricating oil, vibration and temperature. Usage of a complex approach to evaluate technical conditions allows reliability of the forecast to be improved, which makes it possible not only to prevent accidental breakdowns and eliminate unscheduled downtime, but also to bring sufficient economic benefits through reduction of the term and scope of work during overhauls.

  8. Trajectories of 12-Month Usage Patterns for Two Smoking Cessation Websites: Exploring How Users Engage Over Time.

    Science.gov (United States)

    Bricker, Jonathan B; Sridharan, Vasundhara; Zhu, Yifan; Mull, Kristin E; Heffner, Jaimee L; Watson, Noreen L; McClure, Jennifer B; Di, Chongzhi

    2018-04-20

    Little is known about how individuals engage with electronic health (eHealth) interventions over time and whether this engagement predicts health outcomes. The objectives of this study, by using the example of a specific type of eHealth intervention (ie, websites for smoking cessation), were to determine (1) distinct groups of log-in trajectories over a 12-month period, (2) their association with smoking cessation, and (3) baseline user characteristics that predict trajectory group membership. We conducted a functional clustering analysis of 365 consecutive days of log-in data from both arms of a large (N=2637) randomized trial of 2 website interventions for smoking cessation (WebQuit and Smokefree), with a primary outcome of 30-day point prevalence smoking abstinence at 12 months. We conducted analyses for each website separately. A total of 3 distinct trajectory groups emerged for each website. For WebQuit, participants were clustered into 3 groups: 1-week users (682/1240, 55.00% of the sample), 5-week users (399/1240, 32.18%), and 52-week users (159/1240, 12.82%). Compared with the 1-week users, the 5- and 52-week users had 57% higher odds (odds ratio [OR] 1.57, 95% CI 1.13-2.17; P=.007) and 124% higher odds (OR 2.24, 95% CI 1.45-3.43; Pusers were clustered into 3 groups: 1-week users (645/1309, 49.27% of the sample), 4-week users (395/1309, 30.18%), and 5-week users (269/1309, 20.55%). Compared with the 1-week users, 5-week users (but not 4-week users; P=.99) had 48% higher odds (OR 1.48, 95% CI 1.05-2.07; P=.02) of being abstinent at 12 months. In general, the WebQuit intervention had a greater number of weekly log-ins within each of the 3 trajectory groups as compared with those of the Smokefree intervention. Baseline characteristics associated with trajectory group membership varied between websites. Patterns of 1-, 4-, and 5-week usage of websites may be common for how people engage in eHealth interventions. The 5-week usage of either website, and 52-week

  9. Review of the "Web:How2SolveIt"Website

    OpenAIRE

    Pamela, Stanworth; 白田, 由香利

    2013-01-01

    Web:How2SolveIt is a website provided for Gakushuin University to help students understand maths concepts,mainly in the areas of Economics and Business.The site offers a practical collection of quality study material,with useful videos.\\ The site has been evaluated by "walking through" the student interface,applying some typical user cases,which are typical usage stories of student users.We recommend a few changes to the site,which would help students to find what they need easily, making it ...

  10. Client-side Web Mining for Community Formation in Peer-to-Peer Environments

    Data.gov (United States)

    National Aeronautics and Space Administration — In this paper we present a framework for forming interests-based Peer-to-Peer communities using client-side web browsing history. At the heart of this framework is...

  11. Identifying Engineering Students' English Sentence Reading Comprehension Errors: Applying a Data Mining Technique

    Science.gov (United States)

    Tsai, Yea-Ru; Ouyang, Chen-Sen; Chang, Yukon

    2016-01-01

    The purpose of this study is to propose a diagnostic approach to identify engineering students' English reading comprehension errors. Student data were collected during the process of reading texts of English for science and technology on a web-based cumulative sentence analysis system. For the analysis, the association-rule, data mining technique…

  12. Application of Learning Analytics Using Clustering Data Mining for Students' Disposition Analysis

    Science.gov (United States)

    Bharara, Sanyam; Sabitha, Sai; Bansal, Abhay

    2018-01-01

    Learning Analytics (LA) is an emerging field in which sophisticated analytic tools are used to improve learning and education. It draws from, and is closely tied to, a series of other fields of study like business intelligence, web analytics, academic analytics, educational data mining, and action analytics. The main objective of this research…

  13. Occupational Health and Safety Management and Turnover Intention in the Ghanaian Mining Sector.

    Science.gov (United States)

    Amponsah-Tawiah, Kwesi; Ntow, Michael Akomeah Ofori; Mensah, Justice

    2016-03-01

    The mining industry is considered as one of the most dangerous and hazardous industries and the need for effective and efficient occupational health and safety management is critical to safeguard workers and the industry. Despite the dangers and hazards present in the mining industry, only few studies have focused on how occupational health and safety and turnover intentions in the mines. The study suing a cross-sectional survey design collected quantitative data from the 255 mine workers that were conveniently sampled from the Ghanaian mining industry. The data collection tools were standardized questionnaires that measured occupational health and safety management and turnover intentions. These scales were also pretested before their usage in actual data collection. The correlation coefficient showed that a negative relationship existed between dimensions of occupational health and safety management and turnover intention; safety leadership (r = -0.33, p safety facilities and equipment (r = -0.32, p safety procedure (r = -0.27, p safety leadership and safety facility were significant predictors of turnover intention, (β = -0.28, p safety leadership in ensuring the effective formulation of policies and supervision of occupational health and safety at the workplace. The present study demonstrates that safety leadership is crucial in the administration of occupational health and safety and reducing turnover intention in organizations.

  14. Educational data mining: a sample of review and study case

    Directory of Open Access Journals (Sweden)

    Alejandro Pena, Rafael Domínguez, Jose de Jesus Medel

    2009-12-01

    Full Text Available The aim of this work is to encourage the research in a novel merged field: Educational data mining (EDM. Thereby, twosubjects are outlined: The first one corresponds to a review of data mining (DM methods and EDM applications. Thesecond topic represents an EDM study case. As a result of the application of DM in Web-based Education Systems (WBES,stratified groups of students were found during a trial. Such groups reveal key attributes of volunteers that deserted orremained during a WBES experiment. This kind of discovered knowledge inspires the statement of correlational hypothesisto set relations between attributes and behavioral patterns of WBES users. We concluded that: When EDM findings aretaken into account for designing and managing WBES, the learning objectives are improved

  15. Improving entrepreneurial opportunity recognition through web content analytics

    Science.gov (United States)

    Bakar, Muhamad Shahbani Abu; Azmi, Azwiyati

    2017-10-01

    The ability to recognize and develop an opportunity into a venture defines an entrepreneur. Research in opportunity recognition has been robust and focuses more on explaining the processes involved in opportunity recognition. Factors such as prior knowledge, cognitive and creative capabilities are shown to affect opportunity recognition in entrepreneurs. Prior knowledge in areas such as customer problems, ways to serve the market, and technology has been shows in various studies to be a factor that facilitates entrepreneurs to identify and recognize opportunities. Findings from research also shows that experienced entrepreneurs search and scan for information to discover opportunities. Searching and scanning for information has also been shown to help novice entrepreneurs who lack prior knowledge to narrow this gap and enable them to better identify and recognize opportunities. There is less focus in research on finding empirically proven techniques and methods to develop and enhance opportunity recognition in student entrepreneurs. This is important as the country pushes for more graduate entrepreneurs that can drive the economy. This paper aims to discuss Opportunity Recognition Support System (ORSS), an information support system to help especially student entrepreneurs in identifying and recognizing business opportunities. The ORSS aims to provide the necessary knowledge to student entrepreneurs to be able to better identify and recognize opportunities. Applying design research, theories in opportunity recognition are applied to identify the requirements for the support system and the requirements in turn dictate the design of the support system. The paper proposes the use of web content mining and analytics as two core components and techniques for the support system. Web content mining can mine the vast knowledge repositories available on the internet and analytics can provide entrepreneurs with further insights into the information needed to recognize

  16. A Web-Based Nuclear Criticality Safety Bibliographic Database

    International Nuclear Information System (INIS)

    Koponen, B L; Huang, S

    2007-01-01

    A bibliographic criticality safety database of over 13,000 records is available on the Internet as part of the U.S. Department of Energy's (DOE) Nuclear Criticality Safety Program (NCSP) website. This database is easy to access via the Internet and gets substantial daily usage. This database and other criticality safety resources are available at ncsp.llnl.gov. The web database has evolved from more than thirty years of effort at Lawrence Livermore National Laboratory (LLNL), beginning with compilations of critical experiment reports and American Nuclear Society Transactions

  17. UMineAR: Mobile-Tablet-Based Abandoned Mine Hazard Site Investigation Support System Using Augmented Reality

    Directory of Open Access Journals (Sweden)

    Jangwon Suh

    2017-10-01

    Full Text Available Conventional mine site investigation has difficulties in fostering location awareness and understanding the subsurface environment; moreover, it produces a large amount of hardcopy data. To overcome these limitations, the UMineAR mobile tablet application was developed. It enables users to rapidly identify underground mine objects (drifts, entrances, boreholes, hazards and intuitively visualize them in 3D using a mobile augmented reality (AR technique. To design UMineAR, South Korean georeferenced standard-mine geographic information system (GIS databases were employed. A web database system was designed to access via a tablet groundwater-level data measured every hour by sensors installed in boreholes. UMineAR consists of search, AR, map, and database modules. The search module provides data retrieval and visualization options/functions. The AR module provides 3D interactive visualization of mine GIS data and camera imagery on the tablet screen. The map module shows the locations of corresponding borehole data on a 2D map. The database module provides mine GIS database management functions. A case study showed that the proposed application is suitable for onsite visualization of high-volume mine GIS data based on geolocations; no specialized equipment or skills are required to understand the underground mine environment. UMineAR can be used to support abandoned-mine hazard site investigations.

  18. Advanced Query and Data Mining Capabilities for MaROS

    Science.gov (United States)

    Wang, Paul; Wallick, Michael N.; Allard, Daniel A.; Gladden, Roy E.; Hy, Franklin H.

    2013-01-01

    The Mars Relay Operational Service (MaROS) comprises a number of tools to coordinate, plan, and visualize various aspects of the Mars Relay network. These levels include a Web-based user interface, a back-end "ReSTlet" built in Java, and databases that store the data as it is received from the network. As part of MaROS, the innovators have developed and implemented a feature set that operates on several levels of the software architecture. This new feature is an advanced querying capability through either the Web-based user interface, or through a back-end REST interface to access all of the data gathered from the network. This software is not meant to replace the REST interface, but to augment and expand the range of available data. The current REST interface provides specific data that is used by the MaROS Web application to display and visualize the information; however, the returned information from the REST interface has typically been pre-processed to return only a subset of the entire information within the repository, particularly only the information that is of interest to the GUI (graphical user interface). The new, advanced query and data mining capabilities allow users to retrieve the raw data and/or to perform their own data processing. The query language used to access the repository is a restricted subset of the structured query language (SQL) that can be built safely from the Web user interface, or entered as freeform SQL by a user. The results are returned in a CSV (Comma Separated Values) format for easy exporting to third party tools and applications that can be used for data mining or user-defined visualization and interpretation. This is the first time that a service is capable of providing access to all cross-project relay data from a single Web resource. Because MaROS contains the data for a variety of missions from the Mars network, which span both NASA and ESA, the software also establishes an access control list (ACL) on each data record

  19. Current status of thin seam longwall mining in the US

    Energy Technology Data Exchange (ETDEWEB)

    Peng, S.S. [West Virginia Univ., Morgantown, WV (United States); Orndorff, A.

    1996-12-31

    Thin seams in this paper refers to those seams the economic mining height of which is below 50-55 in. that are traditionally considered to be the proprietary of plowing and present a whole net set of problems for longwall mining. In thin seams it is difficult to design and manufacture an efficient high capacity cutting machine for maintenance and production operations. Thin seam mining by longwall plowing began in the late fifties in southern West Virginia, and continues to the present time. In the seventies when longwall mining began to take off a large percentage of U.S. longwalls were operating in the thin seams. Tables 1 and 2 show the historical trends of cutting machines used for seams less than 55 in and 50 in, respectively. In addition to the plow system, the single-ended fixed drum and single-ended ranging drum shearers were introduced in the mid and late seventies and operated continuously until 2-4 years ago. The double-ended ranging drum shearers have also been employed for thin seam longwall mining during this period including several in-web (or off-pan) shearers between late seventies and early eighties. In this paper three thin-seam longwalls in three states employing the latest thin-seam longwall technology will be reviewed. However only two of them are still in operation while the third one ceased operation recently.

  20. A genotypic method for determining HIV-2 coreceptor usage enables epidemiological studies and clinical decision support.

    Science.gov (United States)

    Döring, Matthias; Borrego, Pedro; Büch, Joachim; Martins, Andreia; Friedrich, Georg; Camacho, Ricardo Jorge; Eberle, Josef; Kaiser, Rolf; Lengauer, Thomas; Taveira, Nuno; Pfeifer, Nico

    2016-12-20

    CCR5-coreceptor antagonists can be used for treating HIV-2 infected individuals. Before initiating treatment with coreceptor antagonists, viral coreceptor usage should be determined to ensure that the virus can use only the CCR5 coreceptor (R5) and cannot evade the drug by using the CXCR4 coreceptor (X4-capable). However, until now, no online tool for the genotypic identification of HIV-2 coreceptor usage had been available. Furthermore, there is a lack of knowledge on the determinants of HIV-2 coreceptor usage. Therefore, we developed a data-driven web service for the prediction of HIV-2 coreceptor usage from the V3 loop of the HIV-2 glycoprotein and used the tool to identify novel discriminatory features of X4-capable variants. Using 10 runs of tenfold cross validation, we selected a linear support vector machine (SVM) as the model for geno2pheno[coreceptor-hiv2], because it outperformed the other SVMs with an area under the ROC curve (AUC) of 0.95. We found that SVMs were highly accurate in identifying HIV-2 coreceptor usage, attaining sensitivities of 73.5% and specificities of 96% during tenfold nested cross validation. The predictive performance of SVMs was not significantly different (p value 0.37) from an existing rules-based approach. Moreover, geno2pheno[coreceptor-hiv2] achieved a predictive accuracy of 100% and outperformed the existing approach on an independent data set containing nine new isolates with corresponding phenotypic measurements of coreceptor usage. geno2pheno[coreceptor-hiv2] could not only reproduce the established markers of CXCR4-usage, but also revealed novel markers: the substitutions 27K, 15G, and 8S were significantly predictive of CXCR4 usage. Furthermore, SVMs trained on the amino-acid sequences of the V1 and V2 loops were also quite accurate in predicting coreceptor usage (AUCs of 0.84 and 0.65, respectively). In this study, we developed geno2pheno[coreceptor-hiv2], the first online tool for the prediction of HIV-2 coreceptor

  1. Gamification and Adherence to Web-Based Mental Health Interventions: A Systematic Review.

    Science.gov (United States)

    Brown, Menna; O'Neill, Noelle; van Woerden, Hugo; Eslambolchilar, Parisa; Jones, Matt; John, Ann

    2016-08-24

    Adherence to effective Web-based interventions for common mental disorders (CMDs) and well-being remains a critical issue, with clear potential to increase effectiveness. Continued identification and examination of "active" technological components within Web-based interventions has been called for. Gamification is the use of game design elements and features in nongame contexts. Health and lifestyle interventions have implemented a variety of game features in their design in an effort to encourage engagement and increase program adherence. The potential influence of gamification on program adherence has not been examined in the context of Web-based interventions designed to manage CMDs and well-being. This study seeks to review the literature to examine whether gaming features predict or influence reported rates of program adherence in Web-based interventions designed to manage CMDs and well-being. A systematic review was conducted of peer-reviewed randomized controlled trials (RCTs) designed to manage CMDs or well-being and incorporated gamification features. Seven electronic databases were searched. A total of 61 RCTs met the inclusion criteria and 47 different intervention programs were identified. The majority were designed to manage depression using cognitive behavioral therapy. Eight of 10 popular gamification features reviewed were in use. The majority of studies utilized only one gamification feature (n=58) with a maximum of three features. The most commonly used feature was story/theme. Levels and game leaders were not used in this context. No studies explicitly examined the role of gamification features on program adherence. Usage data were not commonly reported. Interventions intended to be 10 weeks in duration had higher mean adherence than those intended to be 6 or 8 weeks in duration. Gamification features have been incorporated into the design of interventions designed to treat CMD and well-being. Further research is needed to improve understanding

  2. International mining forum 2004, new technologies in underground mining, safety in mines proceedings

    Energy Technology Data Exchange (ETDEWEB)

    Jerzy Kicki; Eugeniusz Sobczyk (eds.)

    2004-01-15

    The book comprises technical papers that were presented at the International Mining Forum 2004. This event aims to bring together scientists and engineers in mining, rock mechanics, and computer engineering, with a view to explore and discuss international developments in the field. Topics discussed in this book are: trends in the mining industry; new solutions and tendencies in underground mines; rock engineering problems in underground mines; utilization and exploitation of methane; prevention measures for the control of rock bursts in Polish mines; and current problems in Ukrainian coal mines.

  3. High Level of Integration in Integrated Disease Management Leads to Higher Usage in the e-Vita Study: Self-Management of Chronic Obstructive Pulmonary Disease With Web-Based Platforms in a Parallel Cohort Design.

    Science.gov (United States)

    Talboom-Kamp, Esther Pwa; Verdijk, Noortje A; Kasteleyn, Marise J; Harmans, Lara M; Talboom, Irvin Jsh; Numans, Mattijs E; Chavannes, Niels H

    2017-05-31

    Worldwide, nearly 3 million people die of chronic obstructive pulmonary disease (COPD) every year. Integrated disease management (IDM) improves disease-specific quality of life and exercise capacity for people with COPD, but can also reduce hospital admissions and hospital days. Self-management of COPD through eHealth interventions has shown to be an effective method to improve the quality and efficiency of IDM in several settings, but it remains unknown which factors influence usage of eHealth and change in behavior of patients. Our study, e-Vita COPD, compares different levels of integration of Web-based self-management platforms in IDM in three primary care settings. The main aim of this study is to analyze the factors that successfully promote the use of a self-management platform for COPD patients. The e-Vita COPD study compares three different approaches to incorporating eHealth via Web-based self-management platforms into IDM of COPD using a parallel cohort design. Three groups integrated the platforms to different levels. In groups 1 (high integration) and 2 (medium integration), randomization was performed to two levels of personal assistance for patients (high and low assistance); in group 3 there was no integration into disease management (none integration). Every visit to the e-Vita and Zorgdraad COPD Web platforms was tracked objectively by collecting log data (sessions and services). At the first log-in, patients completed a baseline questionnaire. Baseline characteristics were automatically extracted from the log files including age, gender, education level, scores on the Clinical COPD Questionnaire (CCQ), dyspnea scale (MRC), and quality of life questionnaire (EQ5D). To predict the use of the platforms, multiple linear regression analyses for the different independent variables were performed: integration in IDM (high, medium, none), personal assistance for the participants (high vs low), educational level, and self-efficacy level (General Self

  4. Generic HTML Form Processor: A versatile PHP script to save web-collected data into a MySQL database.

    Science.gov (United States)

    Göritz, Anja S; Birnbaum, Michael H

    2005-11-01

    The customizable PHP script Generic HTML Form Processor is intended to assist researchers and students in quickly setting up surveys and experiments that can be administered via the Web. This script relieves researchers from the burdens of writing new CGI scripts and building databases for each Web study. Generic HTML Form Processor processes any syntactically correct HTML forminput and saves it into a dynamically created open-source database. We describe five modes for usage of the script that allow increasing functionality but require increasing levels of knowledge of PHP and Web servers: The first two modes require no previous knowledge, and the fifth requires PHP programming expertise. Use of Generic HTML Form Processor is free for academic purposes, and its Web address is www.goeritz.net/brmic.

  5. Enhanced DIII-D Data Management Through a Relational Database

    Science.gov (United States)

    Burruss, J. R.; Peng, Q.; Schachter, J.; Schissel, D. P.; Terpstra, T. B.

    2000-10-01

    A relational database is being used to serve data about DIII-D experiments. The database is optimized for queries across multiple shots, allowing for rapid data mining by SQL-literate researchers. The relational database relates different experiments and datasets, thus providing a big picture of DIII-D operations. Users are encouraged to add their own tables to the database. Summary physics quantities about DIII-D discharges are collected and stored in the database automatically. Meta-data about code runs, MDSplus usage, and visualization tool usage are collected, stored in the database, and later analyzed to improve computing. Documentation on the database may be accessed through programming languages such as C, Java, and IDL, or through ODBC compliant applications such as Excel and Access. A database-driven web page also provides a convenient means for viewing database quantities through the World Wide Web. Demonstrations will be given at the poster.

  6. Differences in smartphone usage

    DEFF Research Database (Denmark)

    Gustarini, Mattia; Scipioni, Marcello Paolo; Fanourakis, Marios

    2016-01-01

    We analyze the users’ intimacy to investigate the differences in smartphone usage, considering the user’s location and number and kind of people physically around the user. With a first user study we (1) validate the intimacy concept, (2) evaluate its correlation to smartphone usage features and ...

  7. Digital Workflows for a 3d Semantic Representation of AN Ancient Mining Landscape

    Science.gov (United States)

    Hiebel, G.; Hanke, K.

    2017-08-01

    The ancient mining landscape of Schwaz/Brixlegg in the Tyrol, Austria witnessed mining from prehistoric times to modern times creating a first order cultural landscape when it comes to one of the most important inventions in human history: the production of metal. In 1991 a part of this landscape was lost due to an enormous landslide that reshaped part of the mountain. With our work we want to propose a digital workflow to create a 3D semantic representation of this ancient mining landscape with its mining structures to preserve it for posterity. First, we define a conceptual model to integrate the data. It is based on the CIDOC CRM ontology and CRMgeo for geometric data. To transform our information sources to a formal representation of the classes and properties of the ontology we applied semantic web technologies and created a knowledge graph in RDF (Resource Description Framework). Through the CRMgeo extension coordinate information of mining features can be integrated into the RDF graph and thus related to the detailed digital elevation model that may be visualized together with the mining structures using Geoinformation systems or 3D visualization tools. The RDF network of the triple store can be queried using the SPARQL query language. We created a snapshot of mining, settlement and burial sites in the Bronze Age. The results of the query were loaded into a Geoinformation system and a visualization of known bronze age sites related to mining, settlement and burial activities was created.

  8. The EU-ADR Web Platform: delivering advanced pharmacovigilance tools.

    Science.gov (United States)

    Oliveira, José Luis; Lopes, Pedro; Nunes, Tiago; Campos, David; Boyer, Scott; Ahlberg, Ernst; van Mulligen, Erik M; Kors, Jan A; Singh, Bharat; Furlong, Laura I; Sanz, Ferran; Bauer-Mehren, Anna; Carrascosa, Maria C; Mestres, Jordi; Avillach, Paul; Diallo, Gayo; Díaz Acedo, Carlos; van der Lei, Johan

    2013-05-01

    Pharmacovigilance methods have advanced greatly during the last decades, making post-market drug assessment an essential drug evaluation component. These methods mainly rely on the use of spontaneous reporting systems and health information databases to collect expertise from huge amounts of real-world reports. The EU-ADR Web Platform was built to further facilitate accessing, monitoring and exploring these data, enabling an in-depth analysis of adverse drug reactions risks. The EU-ADR Web Platform exploits the wealth of data collected within a large-scale European initiative, the EU-ADR project. Millions of electronic health records, provided by national health agencies, are mined for specific drug events, which are correlated with literature, protein and pathway data, resulting in a rich drug-event dataset. Next, advanced distributed computing methods are tailored to coordinate the execution of data-mining and statistical analysis tasks. This permits obtaining a ranked drug-event list, removing spurious entries and highlighting relationships with high risk potential. The EU-ADR Web Platform is an open workspace for the integrated analysis of pharmacovigilance datasets. Using this software, researchers can access a variety of tools provided by distinct partners in a single centralized environment. Besides performing standalone drug-event assessments, they can also control the pipeline for an improved batch analysis of custom datasets. Drug-event pairs can be substantiated and statistically analysed within the platform's innovative working environment. A pioneering workspace that helps in explaining the biological path of adverse drug reactions was developed within the EU-ADR project consortium. This tool, targeted at the pharmacovigilance community, is available online at https://bioinformatics.ua.pt/euadr/. Copyright © 2012 John Wiley & Sons, Ltd.

  9. Sustainable Mining Environment: Technical Review of Post-mining Plans

    Directory of Open Access Journals (Sweden)

    Restu Juniah

    2017-12-01

    Full Text Available The mining industry exists because humans need mining commodities to meet their daily needs such as motor vehicles, mobile phones, electronic equipment and others. Mining commodities as mentioned in Government Regulation No. 23 of 2010 on Implementation of Mineral and Coal Mining Business Activities are radioactive minerals, metal minerals, nonmetallic minerals, rocks and coal. Mineral and coal mining is conducted to obtain the mining commodities through production operations. Mining and coal mining companies have an obligation to ensure that the mining environment in particular after the post production operation or post mining continues. The survey research aims to examine technically the post-mining plan in coal mining of PT Samantaka Batubara in Indragiri Hulu Regency of Riau Province towards the sustainability of the mining environment. The results indicate that the post-mining plan of PT Samantaka Batubara has met the technical aspects required in post mining planning for a sustainable mining environment. Postponement of post-mining land of PT Samantaka Batubara for garden and forest zone. The results of this study are expected to be useful and can be used by stakeholders, academics, researchers, practitioners and associations of mining, and the environment.

  10. World wide developments in shortwall and wide web mining techniques

    Energy Technology Data Exchange (ETDEWEB)

    Pollard, T

    1975-11-01

    The paper describes the progress to date with continuous pillar extraction, and how the typical longwall powered support has been modified to be both strong enough and stable enough to provide roof support for very wide webs. It also describes the operating systems which have been specially designed. The next stages of development are discussed, particularly the provision of continuous conveyor haulage in place of the present-day shuttle car. The author suggests that marrying American coal-getting technology and British roof support technology might increase productivity.

  11. CERN Web Application Detection. Refactoring and release as open source software

    CERN Document Server

    Lizonczyk, Piotr

    2015-01-01

    This paper covers my work during my assignment as participant of CERN Summer Students 2015 programme. The project was aimed at refactoring and publication of the Web Application Detection tool, which was developed at CERN and priorly used internally by the Computer Security team. The range of tasks performed include initial refactoring of code, which was developed like a script rather than a Python package, through extracting components that were not specific to CERN usage, the subsequent final release of the source code on GitHub and the integration with third-party software i.e. the w3af tool. Ultimately, Web Application Detection software received positive responses, being downloaded ca. 1500 times at the time of writing this report.

  12. DelPhiForce web server: electrostatic forces and energy calculations and visualization.

    Science.gov (United States)

    Li, Lin; Jia, Zhe; Peng, Yunhui; Chakravorty, Arghya; Sun, Lexuan; Alexov, Emil

    2017-11-15

    Electrostatic force is an essential component of the total force acting between atoms and macromolecules. Therefore, accurate calculations of electrostatic forces are crucial for revealing the mechanisms of many biological processes. We developed a DelPhiForce web server to calculate and visualize the electrostatic forces at molecular level. DelPhiForce web server enables modeling of electrostatic forces on individual atoms, residues, domains and molecules, and generates an output that can be visualized by VMD software. Here we demonstrate the usage of the server for various biological problems including protein-cofactor, domain-domain, protein-protein, protein-DNA and protein-RNA interactions. The DelPhiForce web server is available at: http://compbio.clemson.edu/delphi-force. delphi@clemson.edu. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  13. Multiphase simulation of mine waters and aqueous leaching processes

    Directory of Open Access Journals (Sweden)

    Pajarre Risto

    2016-01-01

    Full Text Available Managing of large amounts of water in mining and mineral processing sites remains a concern in both actively operated and closed mining areas. When the mining site with its metal or concentrate producing units is operational, the challenge is to find either ways for economical processing with maximum yields, while minimizing the environmental impact of the water usage and waste salt treatments. For safe closure of the site, the environmental control of possible drainage will be needed. For both challenges, the present-day multiphase process simulations tools can be used to provide improved accuracy and better economy in controlling the smooth and environmentally sound operation of the plant. One of the pioneering studies in using the multiphase thermodynamic software in simulation of hydrometallurgical processes was that of Koukkari et al. [1]. The study covered the use of Solgasmix equilibrium software for a number of practical acid digesters. The models were made for sulfuric acid treatments in titania pigment production and in NPK fertilizer manufacturing. During the past two decades the extensive data assessment has taken place particularly in geochemistry and a new versions of geochemical multiphase equilibrium software has been developed. On the other hand, there has been some progress in development of the process simulation software in all the aforementioned fields. Thus, the thermodynamic simulation has become a tool of great importance in development of hydrometallurgical processes. The presentation will cover three example cases of either true pilot or industrial systems including a South African acid mine water drainage treatment, hydrometallurgical extraction of rare earths from uranium leachate in Russia and a multistage process simulation of a Finnish heap leaching mine with its subsequent water treatment system.

  14. Contract Mining versus Owner Mining

    African Journals Online (AJOL)

    Owner

    mining companies can concentrate on their core businesses while using specialists for ... 2 Definition of Contract and Owner. Mining ... equipment maintenance, scheduling and budgeting ..... No. Region. Amount Spent on. Contract Mining. ($ billion). Percent of. Total. 1 ... cost and productivity data based on a large range.

  15. Optimization of mining design of Hongwei uranium mine

    International Nuclear Information System (INIS)

    Wu Sanmao; Yuan Baixiang

    2012-01-01

    Combined with the mining conditions of Hongwei uranium mine, optimization schemes for hoisting cage, mine drainge,ore transport, mine wastewater treatment, power-supply system,etc are put forward in the mining design of the mine. Optimized effects are analyzed from the aspects of technique, economy, and energy saving and reducing emissions. (authors)

  16. Improving web site performance using commercially available analytical tools.

    Science.gov (United States)

    Ogle, James A

    2010-10-01

    It is easy to accurately measure web site usage and to quantify key parameters such as page views, site visits, and more complex variables using commercially available tools that analyze web site log files and search engine use. This information can be used strategically to guide the design or redesign of a web site (templates, look-and-feel, and navigation infrastructure) to improve overall usability. The data can also be used tactically to assess the popularity and use of new pages and modules that are added and to rectify problems that surface. This paper describes software tools used to: (1) inventory search terms that lead to available content; (2) propose synonyms for commonly used search terms; (3) evaluate the effectiveness of calls to action; (4) conduct path analyses to targeted content. The American Academy of Orthopaedic Surgeons (AAOS) uses SurfRay's Behavior Tracking software (Santa Clara CA, USA, and Copenhagen, Denmark) to capture and archive the search terms that have been entered into the site's Google Mini search engine. The AAOS also uses Unica's NetInsight program to analyze its web site log files. These tools provide the AAOS with information that quantifies how well its web sites are operating and insights for making improvements to them. Although it is easy to quantify many aspects of an association's web presence, it also takes human involvement to analyze the results and then recommend changes. Without a dedicated resource to do this, the work often is accomplished only sporadically and on an ad hoc basis.

  17. Processing biological literature with customizable Web services supporting interoperable formats.

    Science.gov (United States)

    Rak, Rafal; Batista-Navarro, Riza Theresa; Carter, Jacob; Rowley, Andrew; Ananiadou, Sophia

    2014-01-01

    Web services have become a popular means of interconnecting solutions for processing a body of scientific literature. This has fuelled research on high-level data exchange formats suitable for a given domain and ensuring the interoperability of Web services. In this article, we focus on the biological domain and consider four interoperability formats, BioC, BioNLP, XMI and RDF, that represent domain-specific and generic representations and include well-established as well as emerging specifications. We use the formats in the context of customizable Web services created in our Web-based, text-mining workbench Argo that features an ever-growing library of elementary analytics and capabilities to build and deploy Web services straight from a convenient graphical user interface. We demonstrate a 2-fold customization of Web services: by building task-specific processing pipelines from a repository of available analytics, and by configuring services to accept and produce a combination of input and output data interchange formats. We provide qualitative evaluation of the formats as well as quantitative evaluation of automatic analytics. The latter was carried out as part of our participation in the fourth edition of the BioCreative challenge. Our analytics built into Web services for recognizing biochemical concepts in BioC collections achieved the highest combined scores out of 10 participating teams. Database URL: http://argo.nactem.ac.uk. © The Author(s) 2014. Published by Oxford University Press.

  18. Gaming Device Usage Patterns Predict Internet Gaming Disorder: Comparison across Different Gaming Device Usage Patterns

    OpenAIRE

    Soo-Hyun Paik; Hyun Cho; Ji-Won Chun; Jo-Eun Jeong; Dai-Jin Kim

    2017-01-01

    Gaming behaviors have been significantly influenced by smartphones. This study was designed to explore gaming behaviors and clinical characteristics across different gaming device usage patterns and the role of the patterns on Internet gaming disorder (IGD). Responders of an online survey regarding smartphone and online game usage were classified by different gaming device usage patterns: (1) individuals who played only computer games; (2) individuals who played computer games more than smart...

  19. RCrawler: An R package for parallel web crawling and scraping

    Directory of Open Access Journals (Sweden)

    Salim Khalil

    2017-01-01

    Full Text Available RCrawler is a contributed R package for domain-based web crawling and content scraping. As the first implementation of a parallel web crawler in the R environment, RCrawler can crawl, parse, store pages, extract contents, and produce data that can be directly employed for web content mining applications. However, it is also flexible, and could be adapted to other applications. The main features of RCrawler are multi-threaded crawling, content extraction, and duplicate content detection. In addition, it includes functionalities such as URL and content-type filtering, depth level controlling, and a robot.txt parser. Our crawler has a highly optimized system, and can download a large number of pages per second while being robust against certain crashes and spider traps. In this paper, we describe the design and functionality of RCrawler, and report on our experience of implementing it in an R environment, including different optimizations that handle the limitations of R. Finally, we discuss our experimental results.

  20. The Geogenomic Mutational Atlas of Pathogens (GoMAP web system.

    Directory of Open Access Journals (Sweden)

    David P Sargeant

    Full Text Available We present a new approach for pathogen surveillance we call Geogenomics. Geogenomics examines the geographic distribution of the genomes of pathogens, with a particular emphasis on those mutations that give rise to drug resistance. We engineered a new web system called Geogenomic Mutational Atlas of Pathogens (GoMAP that enables investigation of the global distribution of individual drug resistance mutations. As a test case we examined mutations associated with HIV resistance to FDA-approved antiretroviral drugs. GoMAP-HIV makes use of existing public drug resistance and HIV protein sequence data to examine the distribution of 872 drug resistance mutations in ∼ 502,000 sequences for many countries in the world. We also implemented a broadened classification scheme for HIV drug resistance mutations. Several patterns for geographic distributions of resistance mutations were identified by visual mining using this web tool. GoMAP-HIV is an open access web application available at http://www.bio-toolkit.com/GoMap/project/