WorldWideScience

Sample records for context mining tool

  1. Big Data Mining: Tools & Algorithms

    Directory of Open Access Journals (Sweden)

    Adeel Shiraz Hashmi

    2016-03-01

    We are now in the Big Data era, and there is a growing demand for tools that can process and analyze it. Big data analytics deals with extracting valuable information from complex data that cannot be handled by traditional data mining tools. This paper surveys the available tools that can handle large volumes of data as well as evolving data streams. The data mining tools and algorithms that can handle big data are also summarized, and one of the tools is used to mine large datasets using distributed algorithms.

  2. Sequential Pattern Mining Using Formal Language Tools

    Directory of Open Access Journals (Sweden)

    R. S. Jadon

    2012-09-01

    In the present scenario almost every system and workflow is computerized, so information and data are stored in computers, and huge collections of data are emerging. Retrieving untouched, hidden and important information from this data is tedious work. Data mining is a technological solution that extracts such information from vast databases in order to uncover noteworthy knowledge in the data warehouse. An important problem in data mining is discovering patterns in fields such as medical science, the World Wide Web and telecommunications. Sequential pattern mining is a data mining method that retrieves hidden patterns linked with instants or other sequences. It extracts those sequential patterns whose support count is greater than or equal to a given minimum support threshold. Users are currently interested only in specific, interesting patterns rather than in every possible sequential pattern. To control the exploration space, users can apply heuristics that can be represented as constraints. Many constraint-based mining algorithms have been developed that generate patterns according to user expectations. The present work explores and enhances regular expression constraints. A regular expression is one such constraint, and a number of sequential pattern mining algorithms use regular expressions as constraints. Some constraints are neither regular nor context free, such as the cross-serial pattern a^n b^m c^n d^m found in Swiss German data. No equivalent deterministic finite automaton (DFA) or pushdown automaton (PDA) can be constructed for such patterns. We propose a new algorithm, PMFLT (Pattern Mining using Formal Language Tools), for sequential pattern mining using formal language tools as constraints. The proposed algorithm finds only user specific frequent sequence in efficient
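
    The abstract above combines two filters: a minimum support threshold and a regular expression constraint. The sketch below illustrates only that combination with a brute-force search over toy data. It is not the PMFLT algorithm, and the names `is_subsequence`, `frequent_patterns`, `max_len` and the sample database are assumptions made for illustration.

```python
import re
from itertools import combinations


def is_subsequence(pattern, sequence):
    """True if the items of pattern occur in sequence in order (gaps allowed)."""
    it = iter(sequence)
    return all(item in it for item in pattern)


def frequent_patterns(db, min_support, constraint, max_len=3):
    """Brute-force sketch: enumerate subsequences up to max_len items,
    keep those that fully match the regex and meet the support threshold."""
    regex = re.compile(constraint)
    candidates = set()
    for seq in db:
        for k in range(1, max_len + 1):
            for idx in combinations(range(len(seq)), k):
                candidates.add("".join(seq[i] for i in idx))
    result = {}
    for cand in candidates:
        if not regex.fullmatch(cand):
            continue  # violates the user's regular expression constraint
        support = sum(is_subsequence(cand, seq) for seq in db)
        if support >= min_support:
            result[cand] = support
    return result


# Hypothetical database: each string is one sequence of single-character items.
db = ["abcd", "acbd", "abd", "bcd"]
patterns = frequent_patterns(db, min_support=3, constraint=r"a.*d")
print(patterns)
```

    A regular expression cannot express the cross-serial pattern mentioned in the abstract, which is the gap the proposed formal language constraints are meant to address.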

  3. A Collaborative Educational Association Rule Mining Tool

    Science.gov (United States)

    Garcia, Enrique; Romero, Cristobal; Ventura, Sebastian; de Castro, Carlos

    2011-01-01

    This paper describes a collaborative educational data mining tool based on association rule mining for the ongoing improvement of e-learning courses and allowing teachers with similar course profiles to share and score the discovered information. The mining tool is oriented to be used by non-expert instructors in data mining so its internal…

  4. Collaborative Data Mining Tool for Education

    Science.gov (United States)

    Garcia, Enrique; Romero, Cristobal; Ventura, Sebastian; Gea, Miguel; de Castro, Carlos

    2009-01-01

    This paper describes a collaborative educational data mining tool based on association rule mining for the continuous improvement of e-learning courses, allowing teachers with similar course profiles to share and score the discovered information. This mining tool is oriented to be used by instructors who are not experts in data mining such that, its…

  5. New solutions of mining tools for hard rock mining

    Energy Technology Data Exchange (ETDEWEB)

    Kotwica, K.; Dasgupta, S. [University of Mining and Metallurgy, Cracow (Poland). Dept. of Mining, Dressing and Transportation Machines

    2002-12-01

    This article presents new solutions of mining tools for hard rock mining and the results of tests on the laboratory stand constructed at the University of Mining and Metallurgy in Cracow for cutting artificial rock samples with new mining tools. New solutions of rotary picks and non-symmetric disc cutters have been used. During the studies, the pick edge wear, force and mining effect were measured, using several selected mining parameters. Results obtained with new bell-type pick and disc cutters have proved very encouraging. 2 refs., 12 figs., 1 tab.

  6. Data Mining Tools for Malware Detection

    CERN Document Server

    Masud, Mehedy; Thuraisingham, Bhavani; Andreasson, Kim J

    2011-01-01

    Although the use of data mining for security and malware detection is quickly on the rise, most books on the subject provide high-level theoretical discussions to the near exclusion of the practical aspects. Breaking the mold, Data Mining Tools for Malware Detection provides a step-by-step breakdown of how to develop data mining tools for malware detection. Integrating theory with practical techniques and experimental results, it focuses on malware detection applications for email worms, malicious code, remote exploits, and botnets. The authors describe the systems they have designed and devel

  7. Tools for Educational Data Mining: A Review

    Science.gov (United States)

    Slater, Stefan; Joksimovic, Srecko; Kovanovic, Vitomir; Baker, Ryan S.; Gasevic, Dragan

    2017-01-01

    In recent years, a wide array of tools have emerged for the purposes of conducting educational data mining (EDM) and/or learning analytics (LA) research. In this article, we hope to highlight some of the most widely used, most accessible, and most powerful tools available for the researcher interested in conducting EDM/LA research. We will…

  9. On data mining in context : cases, fusion and evaluation

    NARCIS (Netherlands)

    Putten, Petrus Wilhelmus Henricus van der

    2010-01-01

    Data mining can be seen as a process, with modeling as the core step. However, other steps such as planning, data preparation, evaluation and deployment are of key importance for applications. This thesis studies data mining in the context of these other steps with the goal of improving data mining

  11. IT Data Mining Tool Uses in Aerospace

    Science.gov (United States)

    Monroe, Gilena A.; Freeman, Kenneth; Jones, Kevin L.

    2012-01-01

    Data mining has a broad spectrum of uses throughout the realms of aerospace and information technology. Each of these areas has useful methods for processing, distributing, and storing its corresponding data. This paper focuses on ways to leverage the data mining tools and resources used in NASA's information technology area to meet the similar data mining needs of aviation and aerospace domains. This paper details the searching, alerting, reporting, and application functionalities of the Splunk system, used by NASA's Security Operations Center (SOC), and their potential shared solutions to address aircraft and spacecraft flight and ground systems data mining requirements. This paper also touches on capacity and security requirements when addressing sizeable amounts of data across a large data infrastructure.

  12. Mechanisms of gas migration in flooding post-mining context

    Energy Technology Data Exchange (ETDEWEB)

    Le Gal, N.; Lagneau, V. [MINES ParisTech, Fontainebleau (France). Geosciences Center, Hydrodynamic and Reactions Unit; Charmoille, A. [INERIS, Verneuil-en-Halatte (France). Dept. of Ground and Underground Risks, Groundwater and Gas Emission Unit

    2010-07-01

    This paper described a study being conducted by the French National Institute for Industrial Environment and Risks (INERIS) and the European Flooding Management for Underground Coal Mines Considering Regional Mining Networks (FLOMINET) program to analyze gas migration during and after mine flooding. In situ flow measurements and laboratory studies were used to characterize gas transfer and the influence of hydrostatic pressure on coal methane releases. A sorption capacity under high hydrostatic pressure during rock flooding (CASPER) device was used to evaluate gas sorption and gas releases by reproducing depth conditions in a water-saturated context in an autoclave cell. Several models are currently also being developed to characterize mine methane migration. A reactive transport model was used to simulate methane sorption constants under high hydrostatic pressure. Mines in the Lorraine basin in northeastern France were considered. Results of the research study will be used to prevent hazards associated with surface gas emissions from abandoned mines. 10 refs., 3 figs.

  13. Design and implementation of data mining tools

    CERN Document Server

    Thuraisingham, Bhavani; Awad, Mamoun

    2009-01-01

    DATA MINING TECHNIQUES AND APPLICATIONS; Introduction; Trends; Data Mining Techniques and Applications; Data Mining for Cyber Security: Intrusion Detection; Data Mining for Web: Web Page Surfing Prediction; Data Mining for Multimedia: Image Classification; Organization of This Book; Next Steps; Data Mining Techniques; Introduction; Overview of Data Mining Tasks and Techniques; Artificial Neural Networks; Support Vector Machines; Markov Model; Association Rule Mining (ARM); Multiclass Problem; Image Mining; Summary; Data Mining Applications; Introduction; Intrusion Detection; Web Page Surfing Prediction; Image Classification; Summary; DATA MI

  14. Data Mining: Applications, tools, learning types and other subtopics

    Directory of Open Access Journals (Sweden)

    Deborah Ribeiro Carvalho

    2015-03-01

    Experts in the field of data mining present concepts, features, limitations and possibilities of the data mining process, including the indication of tools available, links to artificial intelligence, and the implications of its use in business intelligence.

  15. A Visualization Tool for Integrating Research Results at an Underground Mine

    Science.gov (United States)

    Boltz, S.; Macdonald, B. D.; Orr, T.; Johnson, W.; Benton, D. J.

    2016-12-01

    Researchers with the National Institute for Occupational Safety and Health are conducting research at a deep, underground metal mine in Idaho to develop improvements in ground control technologies that reduce the effects of dynamic loading on mine workings, thereby decreasing the risk to miners. This research is multifaceted and includes: photogrammetry, microseismic monitoring, geotechnical instrumentation, and numerical modeling. When managing research involving such a wide range of data, understanding how the data relate to each other and to the mining activity quickly becomes a daunting task. In an effort to combine this diverse research data into a single, easy-to-use system, a three-dimensional visualization tool was developed. The tool was created using the Unity3d video gaming engine and includes the mine development entries, production stopes, important geologic structures, and user-input research data. The tool provides the user with a first-person, interactive experience where they are able to walk through the mine as well as navigate the rock mass surrounding the mine to view and interpret the imported data in the context of the mine and as a function of time. The tool was developed using data from a single mine; however, it is intended to be a generic tool that can be easily extended to other mines. For example, a similar visualization tool is being developed for an underground coal mine in Colorado. The ultimate goal is for NIOSH researchers and mine personnel to be able to use the visualization tool to identify trends that may not otherwise be apparent when viewing the data separately. This presentation highlights the features and capabilities of the mine visualization tool and explains how it may be used to more effectively interpret data and reduce the risk of ground fall hazards to underground miners.

  16. Data Mining and Optimization Tools for Developing Engine Parameters Tools

    Science.gov (United States)

    Dhawan, Atam P.

    1998-01-01

    This project was awarded for understanding the problem and developing a plan for Data Mining tools for use in designing and implementing an Engine Condition Monitoring System. Tricia Erhardt and I studied the problem domain for developing an Engine Condition Monitoring system using the sparse and non-standardized datasets to be available through a consortium at NASA Lewis Research Center. We visited NASA three times to discuss additional issues related to the dataset, which was not made available to us. We discussed and developed a general framework of data mining and optimization tools to extract useful information from sparse and non-standard datasets. These discussions led to the training of Tricia Erhardt to develop Genetic Algorithm (GA) based search programs, which were written in C++ and used to demonstrate the capability of the GA in searching for an optimal solution in noisy datasets. From the study and discussion with NASA LeRC personnel, we then prepared a proposal, which is being submitted to NASA for future work for the development of data mining algorithms for engine conditional monitoring. The proposed set of algorithms uses wavelet processing for creating multi-resolution pyramid of tile data for GA based multi-resolution optimal search.

  18. Software tool for data mining and its applications

    Science.gov (United States)

    Yang, Jie; Ye, Chenzhou; Chen, Nianyi

    2002-03-01

    A software tool for data mining is introduced, which integrates pattern recognition (PCA, Fisher, clustering, hyperenvelop, regression), artificial intelligence (knowledge representation, decision trees), statistical learning (rough set, support vector machine) and computational intelligence (neural network, genetic algorithm, fuzzy systems). It consists of nine function models: pattern recognition, decision trees, association rule, fuzzy rule, neural network, genetic algorithm, Hyper Envelop, support vector machine, visualization. The principle and knowledge representation of some function models of data mining are described. The software tool is implemented in Visual C++ under Windows 2000. Nonmonotony in data mining is dealt with by concept hierarchy and layered mining. The software tool has been satisfactorily applied to the prediction of regularities in the formation of ternary intermetallic compounds in alloy systems, and to the diagnosis of brain glioma.

  19. PubMatrix: a tool for multiplex literature mining

    Directory of Open Access Journals (Sweden)

    Bright Tiffani J

    2003-12-01

    Background: Molecular experiments using multiplex strategies such as cDNA microarrays or proteomic approaches generate large datasets requiring biological interpretation. Text based data mining tools have recently been developed to query large biological datasets of this type. PubMatrix is a web-based tool that allows simple text based mining of the NCBI literature search service PubMed using any two lists of keyword terms, resulting in a frequency matrix of term co-occurrence. Results: For example, a simple term selection procedure allows automatic pair-wise comparisons of approximately 1–100 search terms versus approximately 1–10 modifier terms, resulting in up to 1,000 pair-wise comparisons. The matrix table of pair-wise comparisons can then be surveyed, queried individually, and archived. Lists of keywords can include any terms currently capable of being searched in PubMed. In the context of cDNA microarray studies, this may be used for the annotation of gene lists from clusters of genes that are expressed coordinately. An associated PubMatrix public archive provides previous searches using common useful lists of keyword terms. Conclusions: In this way, lists of terms, such as gene names or functional assignments, can be assigned genetic, biological, or clinical relevance in a rapid, flexible, systematic fashion. http://pubmatrix.grc.nia.nih.gov/
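
    The frequency matrix of term co-occurrence described above can be illustrated on a small local corpus. The sketch below does not query PubMed and is not PubMatrix code: the function `cooccurrence_matrix`, the plain substring matching and the four sample titles are all assumptions made for illustration.

```python
def cooccurrence_matrix(docs, search_terms, modifier_terms):
    """Count, for every (search term, modifier term) pair, the number of
    documents that mention both. Matching is a case-insensitive substring test."""
    lowered = [d.lower() for d in docs]
    return {
        s: {
            m: sum(s.lower() in d and m.lower() in d for d in lowered)
            for m in modifier_terms
        }
        for s in search_terms
    }


# Invented titles standing in for literature search hits.
docs = [
    "BRCA1 mutation and breast cancer risk",
    "TP53 in apoptosis and cancer",
    "BRCA1 role in DNA repair",
    "TP53 apoptosis pathway",
]
matrix = cooccurrence_matrix(docs, ["BRCA1", "TP53"], ["cancer", "apoptosis"])
for term, row in matrix.items():
    print(term, row)
```

    With 100 search terms and 10 modifier terms the same nested loop yields the 1,000 pair-wise comparisons mentioned in the abstract.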

  20. GLIDER: Free tool imagery data visualization, analysis and mining

    Science.gov (United States)

    Ramachandran, R.; Graves, S. J.; Berendes, T.; Maskey, M.; Chidambaram, C.; Hogan, P.; Gaskin, T.

    2009-12-01

    tools generally do not provide a good interface between visualization and analysis, especially a 3D view, and GLIDER fills this gap. This feature gives the users extremely useful spatial context to their data and analysis/mining results. This presentation will demonstrate the latest version of GLIDER and also describe its supporting documentation such as video tutorial, online resources etc.

  1. Working with text tools, techniques and approaches for text mining

    CERN Document Server

    Tourte, Gregory J L

    2016-01-01

    Text mining tools and technologies have long been a part of the repository world, where they have been applied to a variety of purposes, from pragmatic aims to support tools. Research areas as diverse as biology, chemistry, sociology and criminology have seen effective use made of text mining technologies. Working With Text collects a subset of the best contributions from the 'Working with text: Tools, techniques and approaches for text mining' workshop, alongside contributions from experts in the area. Text mining tools and technologies in support of academic research include supporting research on the basis of a large body of documents, facilitating access to and reuse of extant work, and bridging between the formal academic world and areas such as traditional and social media. Jisc have funded a number of projects, including NaCTem (the National Centre for Text Mining) and the ResDis programme. Contents are developed from workshop submissions and invited contributions, including: Legal considerations in te...

  2. Applying data mining in the context of Industrial Internet

    Directory of Open Access Journals (Sweden)

    Oliviu Matei

    2016-01-01

    Nowadays, (industrial) companies invest more and more in connecting with their clients and with the machines deployed at the clients. Mining all the collected data brings up several technical challenges, but doing so yields a lot of insight useful for improving equipment. We define two approaches to mining data in the context of the Industrial Internet, applied to one of the leading companies in shoe production lines, but easily extendible to any producer. For each approach, various machine learning algorithms are applied along with a voting system. This leads to a robust model that is easy to adapt to any machine.

  3. Human Behavior Analysis by Means of Multimodal Context Mining.

    Science.gov (United States)

    Banos, Oresti; Villalonga, Claudia; Bang, Jaehun; Hur, Taeho; Kang, Donguk; Park, Sangbeom; Huynh-The, Thien; Le-Ba, Vui; Amin, Muhammad Bilal; Razzaq, Muhammad Asif; Khan, Wahajat Ali; Hong, Choong Seon; Lee, Sungyoung

    2016-01-01

    There is sufficient evidence proving the impact that negative lifestyle choices have on people's health and wellness. Changing unhealthy behaviours requires raising people's self-awareness and also providing healthcare experts with a thorough and continuous description of the user's conduct. Several monitoring techniques have been proposed in the past to track users' behaviour; however, these approaches are either subjective and prone to misreporting, such as questionnaires, or only focus on a specific component of context, such as activity counters. This work presents an innovative multimodal context mining framework to inspect and infer human behaviour in a more holistic fashion. The proposed approach extends beyond the state-of-the-art, since it not only explores a sole type of context, but also combines diverse levels of context in an integral manner. Namely, low-level contexts, including activities, emotions and locations, are identified from heterogeneous sensory data through machine learning techniques. Low-level contexts are combined using ontological mechanisms to derive a more abstract representation of the user's context, here referred to as high-level context. An initial implementation of the proposed framework supporting real-time context identification is also presented. The developed system is evaluated for various realistic scenarios making use of a novel multimodal context open dataset and data on-the-go, demonstrating prominent context-aware capabilities at both low and high levels.

  4. Comparative genomics using data mining tools

    Indian Academy of Sciences (India)

    Tannistha Nandi; Chandrika B-Rao; Srinivasan Ramachandran

    2002-02-01

    We have analysed the genomes of representatives of three kingdoms of life, namely, archaea, eubacteria and eukaryota using data mining tools based on compositional analyses of the protein sequences. The representatives chosen in this analysis were Methanococcus jannaschii, Haemophilus influenzae and Saccharomyces cerevisiae. We have identified the common and different features between the three genomes in the protein evolution patterns. M. jannaschii has been seen to have a greater number of proteins with more charged amino acids whereas S. cerevisiae has been observed to have a greater number of hydrophilic proteins. Despite the differences in intrinsic compositional characteristics between the proteins from the different genomes we have also identified certain common characteristics. We have carried out exploratory Principal Component Analysis of the multivariate data on the proteins of each organism in an effort to classify the proteins into clusters. Interestingly, we found that most of the proteins in each organism cluster closely together, but there are a few ‘outliers’. We focus on the outliers for the functional investigations, which may aid in revealing any unique features of the biology of the respective organisms.

  5. Active Learning in Context-Driven Stream Mining With an Application to Image Mining.

    Science.gov (United States)

    Tekin, Cem; van der Schaar, Mihaela

    2015-11-01

    We propose an image stream mining method in which images arrive with contexts (metadata) and need to be processed in real time by the image mining system (IMS), which needs to make predictions and derive actionable intelligence from these streams. After extracting the features of the image by preprocessing, IMS determines online the classifier to use on the extracted features to make a prediction using the context of the image. A key challenge associated with stream mining is that the prediction accuracy of the classifiers is unknown, since the image source is unknown; thus, these accuracies need to be learned online. Another key challenge of stream mining is that learning can only be done by observing the true label, but this is costly to obtain. To address these challenges, we model the image stream mining problem as an active, online contextual experts problem, where the context of the image is used to guide the classifier selection decision. We develop an active learning algorithm and show that it achieves regret sublinear in the number of images that have been observed so far. To further illustrate and assess the performance of our proposed methods, we apply them to diagnose breast cancer from the images of cellular samples obtained from the fine needle aspirate of breast mass. Our findings show that very high diagnosis accuracy can be achieved by actively obtaining only a small fraction of true labels through surgical biopsies. Other applications include video surveillance and video traffic monitoring.

  6. Tourist Behavior Pattern Mining Model Based on Context

    Directory of Open Access Journals (Sweden)

    Dong-sheng Liu

    2013-01-01

    Personalized travel experience and service for tourists has been a hot research topic in the tourism service supply chain. In this paper, we take context into consideration and propose a context-based method for analyzing tourists. First, we analyze the context that influences tourist behavior patterns, select the main context factors, and construct a tourist behavior pattern model based on them. Then we calculate the interest degree of each tourist behavior pattern and mine the rules with a high interest degree using an association rule algorithm, so that we can make recommendations that give the tourist a better personalized travel experience and services. Finally, we run an experiment to show the feasibility and effectiveness of our method.
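
    The abstract does not define its interest degree, so the sketch below substitutes the standard support and confidence measures of association rule mining as a stand-in. The function `mine_rules`, the thresholds and the toy records pairing a context factor with a behavior are assumptions for illustration only.

```python
from itertools import permutations


def mine_rules(transactions, min_support, min_confidence):
    """Return single-item rules a -> b as (a, b, support, confidence).
    support = P(a and b); confidence = P(b | a)."""
    n = len(transactions)
    items = sorted(set().union(*transactions))
    rules = []
    for a, b in permutations(items, 2):
        both = sum(a in t and b in t for t in transactions)
        has_a = sum(a in t for t in transactions)
        support = both / n
        confidence = both / has_a
        if support >= min_support and confidence >= min_confidence:
            rules.append((a, b, support, confidence))
    return rules


# Invented records: each set pairs a context factor with an observed behavior.
transactions = [
    {"rainy", "museum"},
    {"rainy", "museum"},
    {"sunny", "beach"},
    {"rainy", "cafe"},
]
found = mine_rules(transactions, min_support=0.5, min_confidence=0.6)
for a, b, s, c in found:
    print(f"{a} -> {b}: support={s:.2f}, confidence={c:.2f}")
```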

  7. Data Mining in the Context of Monitoring Mt Etna, Italy

    Science.gov (United States)

    Aliotta, Marco; Cassisi, Carmelo; D'Agostino, Marcello; Falsaperla, Susanna; Ferrari, Ferruccio; Langer, Horst; Messina, Alfio; Montalto, Placido; Reitano, Danilo; Spampinato, Salvatore

    2015-04-01

    The persistent volcanic activity of Mt Etna makes the continuous monitoring of multidisciplinary data a first-class issue. Indeed, the monitoring systems rapidly accumulate huge quantities of data, raising specific problems of handling and interpretation. In order to respond to these problems, the INGV staff has developed a number of software tools for data mining. These tools aim to identify structures in the data that can be related to volcanic activity, furnishing criteria for the identification of precursory scenarios. In particular, we use methods of clustering and classification in which data are divided into groups according to a-priori-defined measures of similarity or distance. Data groups may assume various shapes, such as convex clouds or complex concave bodies. The "KKAnalysis" software package is a basket of clustering methods. Currently, it is one of the key techniques of the tremor-based automatic alarm systems of INGV Osservatorio Etneo. It exploits both Self-Organizing Maps and Fuzzy Clustering. Besides seismic data, the software has been applied to the geochemical composition of eruptive products as well as to a combined analysis of gas-emission (radon) and seismic data. The "DBSCAN" package is based on density-based clustering. This method allows discovering clusters with arbitrary shape. Clusters are defined as dense regions of objects in the data space separated by regions of low density. In DBSCAN a cluster grows as long as the density within a group of objects exceeds some threshold. In the context of volcano monitoring, the method is particularly promising in the recognition of ash particles as they have a rather irregular shape. The "MOTIF" software allows us to identify typical waveforms in time series, outperforming methods like cross-correlation that entail a high computational effort. MOTIF can recognize the non-similarity of two patterns on a small number of data points without going through the whole length of
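
    The density-based growth rule described above (a cluster keeps growing while the local density exceeds a threshold) can be sketched in a few lines. This is a textbook-style DBSCAN written for illustration, not the INGV "DBSCAN" package. The parameter names `eps` and `min_pts` and the sample points are assumptions.

```python
import math


def dbscan(points, eps, min_pts):
    """Return one label per point: a cluster id (0, 1, ...) or -1 for noise.
    A point is a core point if at least min_pts points (itself included)
    lie within distance eps of it."""
    NOISE = -1
    labels = [None] * len(points)  # None marks an unvisited point

    def neighbors(i):
        return [j for j, q in enumerate(points) if math.dist(points[i], q) <= eps]

    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        seeds = neighbors(i)
        if len(seeds) < min_pts:
            labels[i] = NOISE  # may later be relabeled as a border point
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in seeds if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == NOISE:
                labels[j] = cluster  # border point reached from a core point
            if labels[j] is not None:
                continue
            labels[j] = cluster
            reachable = neighbors(j)
            if len(reachable) >= min_pts:
                queue.extend(reachable)  # j is a core point, so keep growing
    return labels


# Two dense groups and one isolated point (invented coordinates).
points = [(0, 0), (0, 1), (1, 0), (1, 1), (10, 10), (10, 11), (11, 10), (50, 50)]
labels = dbscan(points, eps=1.5, min_pts=3)
print(labels)
```

    Because clusters are built by chaining neighbouring core points, they can take arbitrary shapes, which is the property the abstract highlights for irregular ash particles.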

  8. An Experiential Survey on Image Mining Tools, Techniques and Applications

    Directory of Open Access Journals (Sweden)

    C. Lakshmi Devasena

    2011-03-01

    Digitization in every sector leads to tremendous growth in digital data. Digital data are available not only as text but also as images, audio and video. Decision makers in every field, such as business, the public sector and hospitals, are trying to get useful and implicit information from existing digital databases. Image mining is the concept used to extract implicit and useful data from images stored in large databases. Image mining is used in a variety of fields such as medical diagnosis, space research, remote sensing, agriculture, industry and even education. This paper elaborates on the research already done in image mining and also summarizes the different tools developed, the algorithms that have emerged and the applications of image mining used to extract useful images in various fields.

  9. Accounting and Financial Data Analysis Data Mining Tools

    Directory of Open Access Journals (Sweden)

    Diana Elena Codreanu

    2011-05-01

    Computerized accounting systems have grown in complexity in recent years because of the competitive economic environment. Data analysis solutions such as OLAP and data mining make multidimensional data analysis possible, can detect fraud and can discover knowledge hidden in data, ensuring that such information is useful for decision making within the organization. The literature offers many definitions of data mining, but they all boil down to the same idea: a process that extracts new information from large data collections, information that would be very difficult to obtain without the aid of data mining tools. Information obtained by the data mining process has the advantage that it not only answers the question of what happens but also argues and shows why certain things are happening. In this paper we present advanced techniques for the analysis and exploitation of data stored in a multidimensional database.

  10. Data Mining Tools Make Flights Safer, More Efficient

    Science.gov (United States)

    2014-01-01

    A small data mining team at Ames Research Center developed a set of algorithms ideal for combing through flight data to find anomalies. Dallas-based Southwest Airlines Co. signed a Space Act Agreement with Ames in 2011 to access the tools, helping the company refine its safety practices, improve its safety reviews, and increase flight efficiencies.

  11. Astronomical Images and Data Mining in the International Virtual Observatory Context

    Science.gov (United States)

    Pasian, F.; Brescia, M.; Longo, G.

    2012-12-01

    In the past ten years, the concept of Virtual Observatory (VObs) has increasingly gained importance in the domain of astrophysics, as a way of seamlessly accessing data in different wavelength domains stored in digital archives. There are many reasons why the VObs is useful for the development of science: to monitor time variability of phenomena, to compare phenomena in different bands, to increase return for investment (by fostering data re-use for scientific, educational and outreach purposes), to perform statistical analysis and mining on large quantities of data. The International Virtual Observatory Alliance (IVOA) has paved the way for the VObs to become a really useful tool for the scientific community, by promoting standards, by defining data interoperability methods, by fostering the needed coordination among data providers. But the VObs is more than just archives and standards: it is also infrastructure, basic software tools, advanced applications, evolution of methods and techniques, cross-fertilization with other communities. Discovering information in wide-field images and mining large archives are key items towards the use of the VObs as a tool for developing science. Data mining, or knowledge discovery in databases, while being the main methodology to extract the scientific information contained in Massive Data Sets (MDS), needs to tackle crucial problems since it has to orchestrate complex challenges posed by transparent access to different computing environments, scalability of algorithms, reusability of resources. To achieve a leap forward for the progress of astrophysics in the data avalanche era, the community needs to implement an infrastructure capable of performing data access, processing and mining in a distributed but integrated context.

  12. PALM-IST: Pathway Assembly from Literature Mining - an Information Search Tool

    Science.gov (United States)

    Mandloi, Sapan; Chakrabarti, Saikat

    2015-01-01

    Manual curation of biomedical literature has become an extremely tedious process due to its exponential growth in recent years. To extract meaningful information from such large and unstructured text, newer and more efficient mining tools are required. Here, we introduce PALM-IST, a computational platform that not only allows users to explore biomedical abstracts using keyword-based text mining but also extracts biological entity (e.g., gene/protein, drug, disease, biological processes, cellular component, etc.) information from the extracted text and subsequently mines various databases to provide their comprehensive inter-relation (e.g., interaction, expression, etc.). PALM-IST constructs protein interaction networks and pathway information relevant to the text search using multiple data mining tools and assembles them to create a meta-interaction network. It also analyzes scientific collaboration by extracting and creating a "co-authorship network" for a given search context. Hence, this useful combination of literature and data mining provided in PALM-IST can be used to extract novel protein-protein interactions (PPI), to generate meta-pathways and further to identify key crosstalk and bottleneck proteins. PALM-IST is available at www.hpppi.iicb.res.in/ctm. PMID:25989388

  13. WEB-BASED DATA MINING TOOLS: PERFORMING FEEDBACK ANALYSIS AND ASSOCIATION RULE MINING

    Directory of Open Access Journals (Sweden)

    Pratiyush Guleria

    2015-11-01

    Full Text Available This paper explains web-enabled tools for educational data mining. The proposed web-based tools, developed using the ASP.NET framework and PHP, can help universities or institutions that offer students elective courses, as well as improve academic activities based on feedback collected from students. The ASP.NET tool uses association rule mining with the Apriori algorithm, whereas the PHP-based Feedback Analytical Tool collects feedback on faculty and institutional infrastructure from students and, based on that feedback, reports the performance of the faculty and the institution. Using these data, management can improve in-house training and learn about the educational trends that faculty should follow to improve the effectiveness of courses and teaching skills.
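
As an illustration of the association rule mining mentioned in this abstract, the following is a minimal Python sketch of the Apriori idea (level-wise candidate generation with subset pruning). It is not the authors' ASP.NET implementation; the course names, the `apriori` and `confidence` helpers, and the 0.5 support threshold are all assumptions made for the example.

```python
from itertools import combinations

def apriori(transactions, min_support):
    """Return {itemset: support} for every itemset with support >= min_support."""
    n = len(transactions)
    transactions = [frozenset(t) for t in transactions]
    # Level 1 candidates: every single item seen in the data.
    candidates = {frozenset([item]) for t in transactions for item in t}
    frequent = {}
    k = 1
    while candidates:
        # Count how many transactions contain each candidate itemset.
        counts = {c: sum(1 for t in transactions if c <= t) for c in candidates}
        level = {c: cnt / n for c, cnt in counts.items() if cnt / n >= min_support}
        frequent.update(level)
        # Join step: merge frequent k-itemsets into (k+1)-item candidates.
        k += 1
        joined = {a | b for a in level for b in level if len(a | b) == k}
        # Prune step: keep a candidate only if all its (k-1)-subsets are frequent.
        candidates = {c for c in joined
                      if all(frozenset(s) in level for s in combinations(c, k - 1))}
    return frequent

def confidence(frequent, antecedent, consequent):
    """Confidence of the rule antecedent -> consequent."""
    a = frozenset(antecedent)
    return frequent[a | frozenset(consequent)] / frequent[a]

# Hypothetical elective-course selections, one set per student.
choices = [
    {"AI", "Data Mining", "Cloud"},
    {"AI", "Data Mining"},
    {"Data Mining", "Cloud"},
    {"AI", "Data Mining", "Networks"},
]
frequent = apriori(choices, min_support=0.5)
rule_conf = confidence(frequent, {"AI"}, {"Data Mining"})
```

In this toy data every student who chose "AI" also chose "Data Mining", so the rule AI -> Data Mining has full confidence, while the pair {AI, Cloud} falls below the support threshold and is never extended.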

  14. Statistical and Visualization Data Mining Tools for Foundry Production

    Directory of Open Access Journals (Sweden)

    M. Perzyk

    2007-07-01

    Full Text Available In recent years, rapid development of a new, interdisciplinary knowledge area called data mining has been observed. Its main task is extracting useful information from large amounts of previously collected data. The main possibilities and potential applications of data mining in the manufacturing industry are characterized. The main types of data mining techniques are briefly discussed, including statistical, artificial intelligence, database and visualization tools. The statistical and visualization methods are presented in more detail, showing their general possibilities and advantages as well as characteristic examples of applications in foundry production. Results of the author's research are presented, aimed at validation of selected statistical tools which can be easily and effectively used in the manufacturing industry. A performance analysis of methods based on ANOVA and contingency tables, dedicated to determining the most significant process parameters and to detecting possible interactions among them, has been made. Several numerical tests have been performed using simulated data sets with assumed hidden relationships, as well as some real data related to the strength of ductile cast iron, collected in a foundry. It is concluded that the statistical methods offer relatively easy and fairly reliable tools for extraction of that type of knowledge about foundry manufacturing processes. However, further research is needed, aimed at explanation of some imperfections of the investigated tools as well as assessment of their validity for more complex tasks.
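
To make the ANOVA-based significance check mentioned in this abstract concrete, here is a small sketch that computes the one-way ANOVA F statistic by hand. The strength values and the grouping by a process setting are invented for illustration and are not taken from the paper; a large F suggests that the process parameter affects the measured property.

```python
def one_way_anova_f(groups):
    """Return the F statistic of a one-way ANOVA over a list of samples."""
    all_values = [x for g in groups for x in g]
    grand_mean = sum(all_values) / len(all_values)
    means = [sum(g) / len(g) for g in groups]
    # Variation explained by the group means (between-group sum of squares).
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    # Residual variation inside each group (within-group sum of squares).
    ss_within = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    return (ss_between / df_between) / (ss_within / df_within)

# Hypothetical tensile strength readings (MPa) at three levels of one
# process parameter; the numbers are made up for this example.
strength_by_level = [
    [401.0, 402.0, 403.0],
    [402.0, 403.0, 404.0],
    [405.0, 406.0, 407.0],
]
f_stat = one_way_anova_f(strength_by_level)
```

The F value would then be compared with the critical value of the F distribution for (2, 6) degrees of freedom at the chosen significance level.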

  15. Context-Sensitive Opinion Mining using Polarity Patterns

    Directory of Open Access Journals (Sweden)

    Saeedeh Sadat Sadidpour

    2016-09-01

    Full Text Available The growth of Web 2.0 has made a huge amount of information available, and analysing this information can be very useful in various fields. In this regard, opinion mining and sentiment analysis are among the most interesting tasks and have attracted many researchers over the last two decades. However, the task involves several challenges, an important one being that the polarity of a word differs across domains and contexts. Word polarity is an important feature in determining review polarity through sentiment analysis. Existing studies have proposed the n-gram technique as a solution, which allows selected words to be matched against the lexicon. However, identifying word polarity with the standard n-gram method has limitations, as it ignores word placement and its effect according to the contextual domain. Therefore, this study proposes a linguistic-based model that extracts word adjacency patterns to determine review polarity. The results reflect the superiority of the proposed model over other benchmark approaches.
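
The contrast between context-free lexicon lookup and adjacency-aware polarity can be sketched as follows. This is only a toy illustration of the general idea, not the model proposed in the paper: the `LEXICON` and `PATTERNS` tables and the `review_polarity` function are hypothetical.

```python
# Context-free word polarities (hypothetical lexicon).
LEXICON = {"good": 1, "great": 1, "bad": -1, "unpredictable": -1}

# Adjacency patterns: a pair of neighbouring words whose joint polarity
# overrides the lexicon value of the individual words (hypothetical).
PATTERNS = {
    ("unpredictable", "plot"): 1,   # positive in a film-review context
    ("not", "good"): -1,            # negation flips the polarity
}

def review_polarity(text):
    """Sum word polarities, preferring adjacency patterns over single words."""
    words = text.lower().split()
    score = 0
    i = 0
    while i < len(words):
        pair = tuple(words[i:i + 2])
        if pair in PATTERNS:
            score += PATTERNS[pair]
            i += 2                   # both words are consumed by the pattern
        else:
            score += LEXICON.get(words[i], 0)
            i += 1
    return score

film = review_polarity("an unpredictable plot")
car = review_polarity("unpredictable steering")
negated = review_polarity("not good")
```

The same word, "unpredictable", contributes a positive score next to "plot" and a negative score elsewhere, which is the kind of context sensitivity a plain word-by-word lookup cannot express.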

  16. Context-dependent tool use in New Caledonian crows.

    Science.gov (United States)

    Taylor, Alex H; Hunt, Gavin R; Gray, Russell D

    2012-04-23

    Humans and chimpanzees both exhibit context-dependent tool use. That is, both species choose to use tools when food is within reach, but the context is potentially hazardous. Here, we show that New Caledonian crows used tools more frequently when food was positioned next to a novel model snake than when food was positioned next to a novel teddy bear or a familiar food bowl. However, the crows showed no significant difference in their neophobic reactions towards the teddy bear and the model snake. Therefore, the crows used tools more in response to a risky object resembling a natural predator than to a less-threatening object that provoked a comparable level of neophobia. These results show that New Caledonian crows, like humans and chimpanzees, are capable of context-dependent tool use.

  17. Mathematical tools for data mining set theory, partial orders, combinatorics

    CERN Document Server

    Simovici, Dan A

    2014-01-01

    Data mining essentially relies on several mathematical disciplines, many of which are presented in this second edition of this book. Topics include partially ordered sets, combinatorics, general topology, metric spaces, linear spaces, graph theory. To motivate the reader a significant number of applications of these mathematical tools are included ranging from association rules, clustering algorithms, classification, data constraints, logical data analysis, etc. The book is intended as a reference for researchers and graduate students. The current edition is a significant expansion of the firs

  18. Mining Cancer Transcriptomes: Bioinformatic Tools and the Remaining Challenges.

    Science.gov (United States)

    Milan, Thomas; Wilhelm, Brian T

    2017-02-22

    The development of next-generation sequencing technologies has had a profound impact on the field of cancer genomics. With the enormous quantities of data being generated from tumor samples, researchers have had to rapidly adapt tools or develop new ones to analyse the raw data to maximize its value. While much of this effort has been focused on improving specific algorithms to get faster and more precise results, the accessibility of the final data for the research community remains a significant problem. Large amounts of data exist but are not easily available to researchers who lack the resources and experience to download and reanalyze them. In this article, we focus on RNA-seq analysis in the context of cancer genomics and discuss the bioinformatic tools available to explore these data. We also highlight the importance of developing new and more intuitive tools to provide easier access to public data and discuss the related issues of data sharing and patient privacy.

  19. Using the HSE stress indicator tool in a military context.

    Science.gov (United States)

    Bridger, Robert S; Dobson, Karen; Davison, Hannah

    2016-01-01

    An assessment of the Health and Safety Executive's (HSE) stress indicator tool was conducted to determine whether it was suitable for use with Ministry of Defence (MoD) personnel. A total of 1031 respondents from the Army, Navy, Air Force and MoD civilians completed a questionnaire containing the HSE tool and supplementary measures of work-life balance, engagement, deployment, leave taken and hours of work. Six measures of adverse reaction to the stressors were also reported: perceptions of job stress, job stress and health, psychological strain, fatigue after work, work ability and quality of working life. The stressor scales, particularly the 'demands' and 'relationships' scales, were associated with adverse outcomes as was the work-life balance scale. The HSE tool had some validity when used with MoD personnel, but its content was too narrow. The content validity of the tool can be improved for use in a military context with the addition of a 'work-life balance' scale. The HSE stress tool was tested with a mixed sample of MoD employees. The 'demands' and 'relationships' scales were associated with adverse outcomes. An additional 'work-life balance' scale improved the content validity, demonstrating the importance of assessing the psychometric qualities of scales when used within particular contexts to ensure validity.

  20. Development and assessment of the Alberta Context Tool

    Directory of Open Access Journals (Sweden)

    Birdsell Judy M

    2009-12-01

    Full Text Available Abstract Background: The context of healthcare organizations such as hospitals is increasingly accepted as having the potential to influence the use of new knowledge. However, the mechanisms by which the organizational context influences evidence-based practices are not well understood. Current measures of organizational context lack a theory-informed approach, lack construct clarity and generally have modest psychometric properties. This paper presents the development and initial psychometric validation of the Alberta Context Tool (ACT), an eight-dimension measure of organizational context for healthcare settings. Methods: Three principles guided the development of the ACT: substantive theory, brevity, and modifiability. The Promoting Action on Research Implementation in Health Services (PARiHS) framework and related literature were used to guide selection of items in the ACT. The ACT was required to be brief enough to be tolerated in busy and resource-stretched work settings and to assess concepts of organizational context that were potentially modifiable. The English version of the ACT was completed by 764 nurses (752 valid responses) working in seven Canadian pediatric care hospitals as part of its initial validation. Cronbach's alpha, exploratory factor analysis, analysis of variance, and tests of association were used to assess instrument reliability and validity. Results: Factor analysis indicated a 13-factor solution (accounting for 59.26% of the variance in 'organizational context'). The composition of the factors was similar to those originally conceptualized. Cronbach's alpha for the 13 factors ranged from .54 to .91 with 4 factors performing below the commonly accepted alpha cut off of .70. Bivariate associations between instrumental research utilization levels (which the ACT was developed to predict) and the ACT's 13 factors were statistically significant at the 5% level for 12 of the 13 factors. Each factor also showed a trend of
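
For readers unfamiliar with the reliability statistic reported here, the sketch below computes Cronbach's alpha from a respondents-by-items score matrix using the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The response data are invented for illustration and have nothing to do with the ACT survey itself.

```python
from statistics import pvariance

def cronbach_alpha(rows):
    """rows: one list of item scores per respondent (all the same length)."""
    k = len(rows[0])                       # number of items in the scale
    items = list(zip(*rows))               # transpose to one tuple per item
    item_variance_sum = sum(pvariance(col) for col in items)
    total_variance = pvariance([sum(r) for r in rows])
    return k / (k - 1) * (1 - item_variance_sum / total_variance)

# Two made-up two-item scales answered by a handful of respondents.
consistent = [[1, 2], [2, 3], [3, 4]]          # items move together
unrelated = [[1, 1], [2, 1], [1, 2], [2, 2]]   # items are uncorrelated

alpha_consistent = cronbach_alpha(consistent)
alpha_unrelated = cronbach_alpha(unrelated)
```

Perfectly co-varying items give an alpha of 1 and uncorrelated items give 0; a factor would be flagged against the .70 cut-off mentioned in the abstract by comparing its alpha with that threshold.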

  1. Application of Modern Tools and Techniques for Mine Safety & Disaster Management

    Science.gov (United States)

    Kumar, Dheeraj

    2016-04-01

    The implementation of novel systems and adoption of improvised equipment in mines help mining companies in two important ways: enhanced mine productivity and improved worker safety. There is a substantial need for adoption of state-of-the-art automation technologies in mines to ensure the safety and protect the health of mine workers. With the advent of new autonomous equipment used in mines, inefficiencies are reduced by limiting human inconsistencies and error. The desired increase in productivity at a mine can sometimes be achieved by changing only a few simple variables. Significant developments have been made in the areas of surface and underground communication, robotics, smart sensors, tracking systems, mine gas monitoring systems, ground movements, etc. Advances in information technology in the form of the internet, GIS, remote sensing, satellite communication, etc. have proved to be important tools for hazard reduction and disaster management. This paper focuses mainly on issues pertaining to mine safety and disaster management and on some of the recent innovations in mine automation that could be deployed in mines for safe mining operations and for avoiding unforeseen mine disasters.

  2. A Spreadsheet Auditing Tool Evaluated in an Industrial Context

    CERN Document Server

    Clermont, Markus; Mittermeir, Roland T

    2008-01-01

    Amongst the large number of write-and-throw-away spreadsheets developed for one-time use there is a rather neglected proportion of spreadsheets that are huge, periodically used, and submitted to regular update-cycles like any conventionally evolving valuable legacy application software. However, due to the very nature of spreadsheets, their evolution is particularly tricky and therefore error-prone. In our effort to develop tools and methodologies to improve spreadsheet quality, we analysed consolidation spreadsheets of an internationally operating company for the errors they contain. The paper presents the results of the field audit, involving 78 spreadsheets with 60,446 non-empty cells. As a by-product, the study also served to validate our analysis tools in an industrial context. The evaluated auditing tool offers the auditor a new view on the formula structure of the spreadsheet by grouping similar formulas into equivalence classes. Our auditing approach defines three similarity criteria between fo...
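
The idea of grouping similar formulas into equivalence classes can be sketched as below. The abstract is truncated before it names its three similarity criteria, so the single criterion used here (two formulas are equivalent if they are identical once cell references are rewritten as offsets relative to the formula's own cell) is an assumption for illustration, as are all function names. The regular expression is deliberately naive and would, for example, also match function names that end in digits.

```python
import re
from collections import defaultdict

CELL_REF = re.compile(r"\b([A-Z]+)([0-9]+)\b")

def col_to_index(col):
    """Convert a column label such as 'A' or 'AB' to a 1-based index."""
    n = 0
    for ch in col:
        n = n * 26 + (ord(ch) - ord("A") + 1)
    return n

def relative_form(formula, cell):
    """Rewrite A1-style references as row/column offsets from `cell`."""
    home = CELL_REF.fullmatch(cell)
    home_col, home_row = col_to_index(home.group(1)), int(home.group(2))

    def to_offset(match):
        dc = col_to_index(match.group(1)) - home_col
        dr = int(match.group(2)) - home_row
        return f"R[{dr}]C[{dc}]"

    return CELL_REF.sub(to_offset, formula)

def equivalence_classes(sheet):
    """Group cells whose formulas share the same relative form."""
    classes = defaultdict(list)
    for cell, formula in sheet.items():
        classes[relative_form(formula, cell)].append(cell)
    return dict(classes)

# Hypothetical column of formulas; C3 breaks the pattern of its neighbours.
sheet = {"C1": "=A1+B1", "C2": "=A2+B2", "C3": "=A3*B3", "C4": "=A4+B4"}
classes = equivalence_classes(sheet)
```

A cell that sits alone in its class inside an otherwise regular region, like `C3` here, is the kind of irregularity an auditor would want to inspect.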

  3. Definition of redox and pH influence in the AMD mine system using a fuzzy qualitative tool (Iberian Pyrite Belt, SW Spain).

    Science.gov (United States)

    de la Torre, M L; Grande, J A; Valente, T; Perez-Ostalé, E; Santisteban, M; Aroba, J; Ramos, I

    2016-03-01

    Poderosa Mine is an abandoned pyrite mine, located in the Iberian Pyrite Belt, which pours its acid mine drainage (AMD) waters into the Odiel river (South-West Spain). This work focuses on establishing possible reasons for interdependence between the redox potential and pH and the load of metals and sulfates, as well as a set of variables that define the physical chemistry of the water (conductivity, temperature, TDS, and dissolved oxygen) transported by a channel from Poderosa mine affected by acid mine drainage, through the use of artificial intelligence techniques: fuzzy logic and data mining. The sampling campaign was carried out in May 2012. There were a total of 16 sites, the first inside the tunnel and the last at the mouth of the river Odiel, with a distance of approximately 10 m between each pair of measuring stations. While the tools of classical statistics, which are widely used in this context, prove useful for defining proximity ratios between variables based on Pearson's correlations, in addition to making it easier to handle large volumes of data and producing easier-to-understand graphs, the use of fuzzy logic tools and data mining results in better definition of the variations produced by external stimuli on the set of variables. This tool is adaptable and can be extrapolated to any system polluted by acid mine drainage using simple, intuitive reasoning.
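
The classical baseline that this abstract compares against, Pearson's correlation between pairs of water-chemistry variables, can be computed as in the sketch below. The pH and sulfate readings are invented for the example and are not measurements from the Poderosa study.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length samples."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)

# Hypothetical readings along a channel: pH rises while sulfate load falls.
ph = [2.5, 2.75, 3.0, 3.5]
sulfate = [10.0 - 2.0 * value for value in ph]   # made-up linear relation

r = pearson_r(ph, sulfate)
```

A coefficient near -1 or +1 captures only a linear relationship between two variables, which is one reason the authors turn to fuzzy logic for a more qualitative description of how the whole set of variables responds.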

  4. A Tools-Based Approach to Teaching Data Mining Methods

    Science.gov (United States)

    Jafar, Musa J.

    2010-01-01

    Data mining is an emerging field of study in Information Systems programs. Although the course content has been streamlined, the underlying technology is still in a state of flux. The purpose of this paper is to describe how we utilized Microsoft Excel's data mining add-ins as a front-end to Microsoft's Cloud Computing and SQL Server 2008 Business…

  6. Data Mining for Secure Software Engineering – Source Code Management Tool Case Study

    Directory of Open Access Journals (Sweden)

    A.V. Krishna Prasad

    2010-07-01

    Full Text Available As data mining for secure software engineering improves software productivity and quality, software engineers are increasingly applying data mining algorithms to various software engineering tasks. However, mining software engineering data poses several challenges, requiring various algorithms to effectively mine sequences, graphs and text from such data. Software engineering data include code bases, execution traces, historical code changes, mailing lists and bug databases. They contain a wealth of information about a project's status, progress and evolution. Using well-established data mining techniques, practitioners and researchers can explore the potential of this valuable data in order to better manage their projects and to produce higher-quality software systems that are delivered on time and within budget. Data mining can be used for gathering and extracting latent security requirements, extracting algorithms and business rules from code, mining legacy applications for requirements and business rules for new projects, etc. Mining algorithms for software engineering fall into four main categories: frequent pattern mining (finding commonly occurring patterns); pattern matching (finding data instances for given patterns); clustering (grouping data into clusters); and classification (predicting labels of data based on already labeled data). In this paper, we give an overview of strategies for data mining for secure software engineering, with the implementation of a case study of text mining for a source code management tool.

  7. Mining biomarkers in human sera using proteomic tools.

    Science.gov (United States)

    Zhang, Rulin; Barker, Lisa; Pinchev, Deborah; Marshall, John; Rasamoelisolo, Michèle; Smith, Chris; Kupchak, Peter; Kireeva, Inga; Ingratta, Leslee; Jackowski, George

    2004-01-01

    One of the major difficulties in mining low abundance biomarkers from serum or plasma is due to the fact that a small number of proteins such as albumin, alpha2-macroglobulin, transferrin, and immunoglobulins, may represent as much as 80% of the total serum protein. The large quantity of these proteins makes it difficult to identify low abundance proteins in serum using traditional 2-dimensional electrophoresis. We recently used a combination of multidimensional liquid chromatography and gel electrophoresis coupled to matrix-assisted laser desorption/ionization-quadrupole-time of flight and Ion Trap liquid chromatography-tandem mass spectrometry to identify protein markers in sera of Alzheimer's disease (AD), insulin resistance/type-2 diabetes (IR/D2), and congestive heart failure (CHF) patients. We identified 8 proteins that exhibit higher levels in control sera and 36 proteins that exhibit higher levels in disease sera. For example, haptoglobin and hemoglobin are elevated in sera of AD, IR/D2, and CHF patients. The levels of several other proteins including fibrinogen and its fragments, alpha 2-macroglobulin, transthyretin, pro-platelet basic protein, protease inhibitors clade A and C, as well as proteins involved in the classical complement pathway such as complement C3, C4, and C1 inhibitor, were found to differ between IR/D2 and control sera. The sera levels of proteins, such as the 10 kDa subunit of vitronectin, alpha 1-acid glycoprotein, apolipoprotein B100, fragment of factor H, and histidine-rich glycoprotein were observed to be different between AD and controls. The differences observed in these biomarker candidates were confirmed by Western blot and the enzyme-linked immunosorbent assay. The biological meaning of the proteomic changes in the disease states and the potential use of these changes as diagnostic tools or for therapeutic intervention will be discussed.

  8. The South African dysphagia screening tool (SADS): A screening tool for a developing context

    Directory of Open Access Journals (Sweden)

    Calli Ostrofsky

    2016-02-01

    Full Text Available Background: Notwithstanding its value, there are challenges and limitations to implementing a dysphagia screening tool from a developed context in a developing context. The need for a reliable and valid screening tool for dysphagia that considers context, systemic rules and resources was identified to prevent further medical compromise, optimise dysphagia prognosis and ultimately hasten patients' return to home or work. Methodology: To establish the validity and reliability of the South African dysphagia screening tool (SADS) for acute stroke patients accessing government hospital services. The study was a quantitative, non-experimental, correlational cross-sectional design with a retrospective component. Convenience sampling was used to recruit 18 speech-language therapists and 63 acute stroke patients from three South African government hospitals. The SADS consists of 20 test items and was administered by speech-language therapists. Screening was followed by a diagnostic dysphagia assessment. The administrator of the tool was not involved in completing the diagnostic assessment, to eliminate bias and prevent contamination of results from screener to diagnostic assessment. Sensitivity, validity and efficacy of the screening tool were evaluated against the results of the diagnostic dysphagia assessment. Cohen's kappa measures determined inter-rater agreement between the results of the SADS and the diagnostic assessment. Results and conclusion: The SADS was proven to be valid and reliable. Cohen's kappa indicated a high inter-rater reliability and showed high sensitivity and adequate specificity in detecting dysphagia amongst acute stroke patients who were at risk for dysphagia. The SADS was characterised by concurrent, content and face validity. As a first step in establishing contextual appropriateness, the SADS is a valid and reliable screening tool that is sensitive in identifying stroke patients at risk for dysphagia within government
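
The three agreement measures named in this abstract (sensitivity, specificity and Cohen's kappa) can be computed from paired screening and diagnostic outcomes as sketched below. The ten paired outcomes are fabricated purely to exercise the formulas and do not reflect the study's 63 patients.

```python
def screening_metrics(screen, reference):
    """Compare screening results with a reference diagnosis.

    Both arguments are parallel lists of booleans (True = dysphagia flagged).
    Returns (sensitivity, specificity, cohen_kappa).
    """
    pairs = list(zip(screen, reference))
    tp = sum(1 for s, r in pairs if s and r)
    tn = sum(1 for s, r in pairs if not s and not r)
    fp = sum(1 for s, r in pairs if s and not r)
    fn = sum(1 for s, r in pairs if not s and r)
    n = len(pairs)

    sensitivity = tp / (tp + fn)      # share of true cases the screen catches
    specificity = tn / (tn + fp)      # share of non-cases the screen clears

    p_observed = (tp + tn) / n
    # Agreement expected by chance, from the marginal totals of each rater.
    p_expected = ((tp + fp) * (tp + fn) + (tn + fn) * (tn + fp)) / n ** 2
    kappa = (p_observed - p_expected) / (1 - p_expected)
    return sensitivity, specificity, kappa

# Made-up outcomes: 4 true positives, 1 miss, 1 false alarm, 4 true negatives.
screen = [True] * 4 + [False] + [True] + [False] * 4
reference = [True] * 4 + [True] + [False] + [False] * 4

sensitivity, specificity, kappa = screening_metrics(screen, reference)
```

Kappa corrects raw agreement for the agreement two raters would reach by chance, which is why it is lower than the 80% observed agreement in this toy data.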

  9. New tools for radon progeny hazard prevention in mines

    Energy Technology Data Exchange (ETDEWEB)

    Skowronek, J.; Kajdasz, R.; Strzesniewicz, Z. [Central Mining Institute, Katowice (Poland). Laboratory of Radiometry

    2001-07-01

    The described project is an example of prevention actions performed under real conditions in a Polish coal mine. The project, titled Prevention Of The Short Lived Radon Progeny Hazard, was carried out by 'Nadwiślańska Spółka Węglowa' S.A., 'Ziemowit' Colliery and the Central Mining Institute. The scientific part of the research was financed by the Scientific Research Committee in Warsaw and the institutional part by Mining Plant 'Ziemowit'. The aim of the project was to give the mining industry a set of instruments that would enable it to influence the level of the radon progeny hazard at every stage of work, from planning the exploitation and preventing radon release from the rock mass to eliminating radon from the air by means of filtration. The project was completed successfully and its results have been implemented at Nadwiślańska Spółka Węglowa S.A. and Mining Plant 'Ziemowit'. The research was carried out during 1994-96, and the investments were made during 1996-97. 8 refs., 1 tab.

  10. Uncoolness factor of collaborative Web Mining Tools (WMT)

    Directory of Open Access Journals (Sweden)

    Juan Luis Chulilla

    2009-12-01

    Full Text Available The recent development of social mining is a useful and direct analogy for talking about the less visible part of the adoption of successive waves of social software. The striking fact that visibility decreases as each type of social software matures should be taken into account in any comprehensive analysis of the relation between collectives and Internet technologies. One of the main results of this relation is the social data mining of the Internet, which both gives sense to virtual communities and produces content via feedback. We are just at the beginning of the adoption of new ways of social data mining, which will be significant when they mature and become invisible.

  11. Text mining and visualization case studies using open-source tools

    CERN Document Server

    Chisholm, Andrew

    2016-01-01

    Text Mining and Visualization: Case Studies Using Open-Source Tools provides an introduction to text mining using some of the most popular and powerful open-source tools: KNIME, RapidMiner, Weka, R, and Python. The contributors, all highly experienced with text mining and open-source software, explain how text data are gathered and processed from a wide variety of sources, including books, server access logs, websites, social media sites, and message boards. Each chapter presents a case study that you can follow as part of a step-by-step, reproducible example. You can also easily apply and extend the techniques to other problems. All the examples are available on a supplementary website. The book shows you how to exploit your text data, offering successful application examples and blueprints for you to tackle your text mining tasks and benefit from open and freely available tools. It gets you up to date on the latest and most powerful tools, the data mining process, and specific text mining activities.

  12. A Platform for Supporting Knowledge Mining and Reuse Based on Context Information of a Project

    Directory of Open Access Journals (Sweden)

    I-Chin Wu

    2010-06-01

    Full Text Available Organizations implement Knowledge Management Systems (KMS) to maximize the effectiveness and reuse of knowledge assets in order to increase productivity and profitability. Thus, effective project management can place great demands on knowledge management solutions designed to support and streamline the execution of project-related tasks. Accordingly, in this work we extract knowledge from historical projects, design a project-in-context (PIC) meta-model, and deploy a platform that facilitates the capture and reuse of project-specific information based on the context. The research areas addressed in the work are as follows. (1) Knowledge acquisition: analyzing the type of project and its associated attributes and defining general, but essential, project context information based on the PIC model. (2) Knowledge discovery: the use of text mining and data mining techniques to extract knowledge items needed by workers, and discover the relationships between various knowledge items. (3) Knowledge utilization based on the context: with the proposed model and methods, several applications related to the reuse of project knowledge by pull- and push-based knowledge management strategies are developed to achieve effective project management. From the perspective of project management, the proposed model and system can help knowledge workers understand information about a current research project and resolve problems effectively. [Article content in Chinese; Extended abstract in English]

  13. Strategic aspects of management organizational structure improvement in the context of ore mining companies

    Directory of Open Access Journals (Sweden)

    L.M. Varava

    2016-12-01

    management and development of scientifically proven investment programs and projects, it is recommended that the marketing functions be expanded through appropriate restructuring and creation of a marketing and sales department. The authors have come up with a functional direction of marketing activity in the context of today’s strategic orientation of ore mining companies towards the domestic market. The strategic activity is associated with a lot of information coming from external and internal sources and being processed at the company. In order to enhance the efficiency of the strategic management's information support, it is proposed to set up, within the company's Information Center, an information subsystem for strategic management, which would make it possible to collect, to process and to distribute the relevant bodies of information. The strategic information gathered, which matches the quality level set, can be used as analytical and predictive tools for developing strategic alternatives, choosing strategies and monitoring their implementation. Conclusions and directions of further researches. The present MOS makes it possible to avoid the disadvantages of other strategic-type structures, e. g.: dual subordination, duplication of the same-purpose functions and services when implementing different projects, difficulties in forming and managing project teams. The advantages of this organizational structure are: a high level of decentralization of production divisions as per areas of activity; the absence of centralized administration of financial resources' distribution; the possibility of downsizing central management personnel by reducing the number of their functions. In further research, it is advisable to focus attention on ensuring compliance of the organizational structure with implementation peculiarities of strategies being adopted by the company.

  14. Maternal vaccination and preterm birth: using data mining as a screening tool

    DEFF Research Database (Denmark)

    Orozova-Bekkevold, Ivanka; Jensen, Henrik; Stensballe, Lone

    2007-01-01

    Objective The main purpose of this study was to identify possible associations between medicines used in pregnancy and preterm deliveries using data mining as a screening tool. Settings Prospective cohort study. Methods We used data mining to identify possible correlates between preterm delivery...... measure Preterm birth, a delivery occurring before the 259th day of gestation (i.e., less than 37 full weeks). Results Data mining had indicated that maternal vaccination (among other factors) might be related to preterm birth. The following regression analysis showed that the women who reported being...... further studies. Data mining, especially with additional refinements, may be a valuable and very efficient tool to screen large databases for relevant information which can be used in clinical and public health research....

  15. Reversed mining and reversed-reversed mining: the irrational context of geological disposal of nuclear waste

    Science.gov (United States)

    van Loon, A. J.

    2000-06-01

    Man does not only extract material from the Earth but increasingly uses the underground for storage and disposal purposes. One of the materials that might be disposed of this way is high-level nuclear waste. The development of safe disposal procedures, the choice of suitable host rocks, and the design of underground facilities have taken much time and money, but commissions in several countries have presented reports showing that — and how — safe geological disposal will be possible in such a way that definite isolation from the biosphere is achieved. Political views have changed in the past few years, however, and there is a strong tendency now to require that the high-level waste disposed of will be retrievable. Considering the underlying arguments for isolation from the biosphere, and also considering waste policy in general, this provides an irrational context. The development of new procedures and the design of new disposal facilities that allow retrieval will take much time again. A consequence may be that the high-active, heat-generating nuclear waste will be stored temporarily for a much longer time than objectively desirable. The delay in disposal and the counterproductive requirement of retrievability are partly due to the fact that earth-science organisations have failed to communicate in the way they should, possibly fearing public (and financial) reactions if taking a position that is (was?) considered as politically incorrect. Such an attitude should not be maintained in modern society, which has the right to be informed reliably by the scientific community.

  16. Data mining practical machine learning tools and techniques

    CERN Document Server

    Witten, Ian H

    2005-01-01

    As with any burgeoning technology that enjoys commercial attention, the use of data mining is surrounded by a great deal of hype. Exaggerated reports tell of secrets that can be uncovered by setting algorithms loose on oceans of data. But there is no magic in machine learning, no hidden power, no alchemy. Instead there is an identifiable body of practical techniques that can extract useful information from raw data. This book describes these techniques and shows how they work. The book is a major revision of the first edition that appeared in 1999. While the basic core remains the same

  17. Application of Quality Management Tools for Evaluating the Failure Frequency of Cutter-Loader and Plough Mining Systems

    Science.gov (United States)

    Biały, Witold

    2017-06-01

    Failure frequency in the mining process, with a focus on the mining machine, has been presented and illustrated by the example of two coal-mines. Two mining systems have been subjected to analysis: a cutter-loader and a plough system. In order to reduce costs generated by failures, maintenance teams should regularly make sure that the machines are used and operated in a rational and effective way. Such activities will allow downtimes to be reduced, and, in consequence, will increase the effectiveness of a mining plant. The evaluation of mining machines' failure frequency contained in this study has been based on one of the traditional quality management tools - the Pareto chart.
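
    The Pareto analysis named in this abstract can be illustrated with a minimal sketch. The failure categories and counts below are invented for illustration and are not data from the study, and the 80% cutoff is the conventional Pareto threshold, assumed here rather than taken from the paper.

```python
def pareto_table(failure_counts):
    """Return (cause, count, cumulative_percent) rows sorted by descending count."""
    total = sum(failure_counts.values())
    rows = []
    running = 0
    ranked = sorted(failure_counts.items(), key=lambda kv: kv[1], reverse=True)
    for cause, count in ranked:
        running += count
        rows.append((cause, count, 100.0 * running / total))
    return rows


def vital_few(failure_counts, cutoff=80.0):
    """Causes, in rank order, needed for the cumulative share to reach the cutoff."""
    selected = []
    for cause, _count, cumulative in pareto_table(failure_counts):
        selected.append(cause)
        if cumulative >= cutoff:
            break
    return selected


# Hypothetical failure counts (illustrative only, not data from the paper).
example_counts = {"electrical": 50, "mechanical": 30, "hydraulic": 15, "other": 5}

for cause, count, cumulative in pareto_table(example_counts):
    print(f"{cause:<12}{count:>4}{cumulative:>8.1f}%")
```

    Sorting by frequency and accumulating the share is all a Pareto chart encodes; plotting the bars and the cumulative line is left out to keep the sketch dependency-free.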

  18. A software tool for determination of breast cancer treatment methods using data mining approach.

    Science.gov (United States)

    Cakır, Abdülkadir; Demirel, Burçin

    2011-12-01

    In this work, breast cancer treatment methods are determined using data mining. For this purpose, software was developed to help oncologists suggest treatment methods for breast cancer patients. Data from 462 breast cancer patients, obtained from Ankara Oncology Hospital, are used to determine treatment methods for new patients. This dataset is processed with the Weka data mining tool. Classification algorithms are applied to the dataset one by one and the results are compared to find the proper treatment method. The developed software, called "Treatment Assistant", was built with the Java NetBeans interface and uses different algorithms (IB1, Multilayer Perceptron and Decision Table) to find out which one gives the better result for each attribute to be predicted. Treatment methods are determined for breast cancer patients after surgery using this software tool. At the modeling step of the data mining process, different Weka algorithms are used for the output attributes. IB1 shows the best accuracy for the hormonotherapy output, Multilayer Perceptron for the tamoxifen and radiotherapy outputs, and Decision Table for the chemotherapy output. In conclusion, this work shows that a data mining approach can be a useful tool for medical applications, particularly at the treatment decision step. Data mining helps the doctor to decide in a short time.

  19. HC StratoMineR: A web-based tool for the rapid analysis of high content datasets

    NARCIS (Netherlands)

    Omta, W.; Heesbeen, R. van; Pagliero, R.; Velden, L. van der; Lelieveld, D.; Nellen, M.; Kramer, M.; Yeong, M.; Saeidi, A.; Medema, R.; Spruit, M.; Brinkkemper, S.; Klumperman, J.; Egan, D.

    2016-01-01

    High-content screening (HCS) can generate large multidimensional datasets and when aligned with the appropriate data mining tools, it can yield valuable insights into the mechanism of action of bioactive molecules. However, easy-to-use data mining tools are not widely available, with the result that

  20. HC StratoMineR : A Web-Based Tool for the Rapid Analysis of High-Content Datasets

    NARCIS (Netherlands)

    Omta, Wienand A; van Heesbeen, Roy G; Pagliero, Romina J; van der Velden, Lieke M; Lelieveld, Daphne; Nellen, Mehdi; Kramer, Maik; Yeong, Marley; Saeidi, Amir M; Medema, Rene H; Spruit, Marco; Brinkkemper, Sjaak; Klumperman, Judith; Egan, David A

    2016-01-01

    High-content screening (HCS) can generate large multidimensional datasets and when aligned with the appropriate data mining tools, it can yield valuable insights into the mechanism of action of bioactive molecules. However, easy-to-use data mining tools are not widely available, with the result that

  1. Multiple Criteria Decision-Making Preprocessing Using Data Mining Tools

    CERN Document Server

    Mosavi, A

    2010-01-01

    Real-life engineering optimization problems need Multiobjective Optimization (MOO) tools. These problems are highly nonlinear. As the process of Multiple Criteria Decision-Making (MCDM) has been much expanded, most MOO problems in different disciplines can be classified on the basis of it. Thus MCDM methods have gained wide popularity in different sciences and applications. Meanwhile, the increasing number of components, variables, parameters, constraints and objectives involved has made the process very complicated. Although the new generation of MOO tools has made the optimization process more automated, initializing the process, setting the initial values of simulation tools, and identifying the effective input variables and objectives in order to reach a smaller design space remain complicated. In this situation adding a preprocessing step into the MCDM procedure could make a huge difference in terms of organizing the input variables according to their effects on the optimizati...

  2. PPI finder: a mining tool for human protein-protein interactions.

    Directory of Open Access Journals (Sweden)

    Min He

    Full Text Available BACKGROUND: The exponential increase of published biomedical literature prompts the use of text mining tools to manage the information overload automatically. One of the most common applications is to mine protein-protein interactions (PPIs) from PubMed abstracts. Currently, most tools for mining PPIs from the literature use co-occurrence-based approaches or rule-based approaches. Hybrid methods (frame-based approaches) that combine these two methods may have better performance in predicting PPIs. However, the PPIs predicted by these methods are rarely evaluated against known PPI databases and co-occurring terms in the Gene Ontology (GO) database. METHODOLOGY/PRINCIPAL FINDINGS: We here developed a web-based tool, PPI Finder, to mine human PPIs from PubMed abstracts based on their co-occurrences and interaction words, followed by evidence in human PPI databases and shared terms in the GO database. Only 28% of the co-occurring pairs in PubMed abstracts appeared in any of the commonly used human PPI databases (HPRD, BioGRID and BIND). On the other hand, of the known PPIs in HPRD, 69% showed co-occurrences in the literature, and 65% shared GO terms. CONCLUSIONS: PPI Finder provides a useful tool for biologists to uncover potential novel PPIs. It is freely accessible at http://liweilab.genetics.ac.cn/tm/.
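
    The co-occurrence step described in this abstract can be sketched roughly as below. This is not PPI Finder's implementation: the gene vocabulary, the interaction-word list, the sentence-level window and the sample abstracts are all assumptions made for illustration.

```python
import re
from collections import Counter
from itertools import combinations

# Hypothetical vocabularies; a real tool would use curated gene and
# interaction-word lists rather than these hand-picked examples.
GENES = {"TP53", "MDM2", "BRCA1", "BARD1"}
INTERACTION_WORDS = {"binds", "interacts", "phosphorylates", "inhibits"}


def cooccurring_pairs(abstracts):
    """Count gene pairs that share a sentence with at least one interaction word."""
    counts = Counter()
    for abstract in abstracts:
        for sentence in re.split(r"(?<=[.!?])\s+", abstract):
            tokens = set(re.findall(r"[A-Za-z0-9]+", sentence))
            genes = sorted(tokens & GENES)
            if any(token.lower() in INTERACTION_WORDS for token in tokens):
                for pair in combinations(genes, 2):
                    counts[pair] += 1
    return counts


def fraction_known(pairs, known_ppis):
    """Share of mined pairs that also appear in a reference set of known PPIs."""
    if not pairs:
        return 0.0
    known = {tuple(sorted(pair)) for pair in known_ppis}
    return sum(1 for pair in pairs if pair in known) / len(pairs)


# Invented sentences, used only to exercise the functions.
abstracts = [
    "MDM2 binds TP53 and inhibits its activity. BRCA1 was also measured.",
    "BRCA1 interacts with BARD1 in the nucleus.",
    "TP53 and BRCA1 were both sequenced.",
]
mined = cooccurring_pairs(abstracts)
print(mined)
print(fraction_known(mined, [("TP53", "MDM2")]))
```

    The second function mirrors the kind of check the abstract reports, namely what share of co-occurring pairs is already present in a reference database.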

  3. CMIN — a CRISP-DM-based case tool for supporting data mining projects

    Directory of Open Access Journals (Sweden)

    Carlos Cobos

    2010-10-01

    Full Text Available This paper introduces CMIN, an integrated computer aided software engineering (CASE) tool based on the cross-industry standard process for data mining (CRISP-DM 1.0), designed to support carrying out data mining projects. It is “integrated” in the sense that it supports all phases of a process. A general overview of how CMIN works is presented first, including a treatment of processes, templates and project management. CMIN’s capacity for easily and intuitively monitoring projects is highlighted, as is the manner in which CMIN allows a user to increase knowledge regarding using CRISP-DM or any other process defined in the CASE tool through the help and information presented in each step. Next, it is shown how CMIN can bind new data mining algorithms at runtime (without the need to recompile the tool) to support modelling tasks (based on a workflow) and evaluate data mining projects. Finally, the results of two evaluations of the tool, some conclusions and suggestions for future work are presented.

  4. Big Data Mining: Challenges, Technologies, Tools and Applications

    OpenAIRE

    Asha M. PAWAR

    2016-01-01

    Big data is data of large size, meaning it has large volume, velocity and variety. Nowadays big data is expanding in various science and engineering fields, and so there are many challenges in managing and analysing big data using various tools. This paper introduces big data and its characteristic concepts, and the next section elaborates on the challenges in big data. In particular, we discuss the technologies used in big data analysis and which tools are mainly used to analyse t...

  5. tmBioC: improving interoperability of text-mining tools with BioC.

    Science.gov (United States)

    Khare, Ritu; Wei, Chih-Hsuan; Mao, Yuqing; Leaman, Robert; Lu, Zhiyong

    2014-01-01

    The lack of interoperability among biomedical text-mining tools is a major bottleneck in creating more complex applications. Despite the availability of numerous methods and techniques for various text-mining tasks, combining different tools requires substantial effort and time owing to heterogeneity and variety in data formats. In response, BioC is a recent proposal that offers a minimalistic approach to tool interoperability by stipulating minimal changes to existing tools and applications. BioC is a family of XML formats that define how to present text documents and annotations, and also provides easy-to-use functions to read/write documents in the BioC format. In this study, we introduce our text-mining toolkit, which is designed to perform several challenging and significant tasks in the biomedical domain, and repackage the toolkit into BioC to enhance its interoperability. Our toolkit consists of six state-of-the-art tools for named-entity recognition, normalization and annotation (PubTator) of genes (GenNorm), diseases (DNorm), mutations (tmVar), species (SR4GN) and chemicals (tmChem). Although developed within the same group, each tool is designed to process input articles and output annotations in a different format. We modify these tools and enable them to read/write data in the proposed BioC format. We find that, using the BioC family of formats and functions, only minimal changes were required to build the newer versions of the tools. The resulting BioC wrapped toolkit, which we have named tmBioC, consists of our tools in BioC, an annotated full-text corpus in BioC, and a format detection and conversion tool. Furthermore, through participation in the 2013 BioCreative IV Interoperability Track, we empirically demonstrate that the tools in tmBioC can be more efficiently integrated with each other as well as with external tools: our experimental results show that using BioC reduces the lines of code needed for text-mining tool integration by >60%.
The tmBioC toolkit

  6. Visualizing data mining results with the Brede tools

    DEFF Research Database (Denmark)

    Nielsen, Finn Årup

    2009-01-01

    presence of the Brede Database is established by a single script executing a workflow involving these steps together with a final generation of Web pages with embedded visualizations and links to interactive three-dimensional models in the Virtual Reality Modeling Language. Apart from the Brede tools I...

  7. Data mining of text as a tool in authorship attribution

    Science.gov (United States)

    Visa, Ari J. E.; Toivonen, Jarmo; Autio, Sami; Maekinen, Jarno; Back, Barbro; Vanharanta, Hannu

    2001-03-01

    It is common for text documents to be characterized and classified by keywords that the authors assign to them. Visa et al. have developed a new methodology based on prototype matching. The prototype is an interesting document or a part of an extracted, interesting text. This prototype is matched with the document database of the monitored document flow. The new methodology is capable of extracting the meaning of the document to a certain degree. Our claim is that the new methodology is also capable of authenticating the authorship. To verify this claim two tests were designed. The test hypothesis was that the words and the word order in the sentences could authenticate the author. In the first test three authors were selected: William Shakespeare, Edgar Allan Poe, and George Bernard Shaw. Three texts from each author were examined. Every text was used, one by one, as a prototype. The two nearest matches with the prototype were noted. The second test uses the Reuters-21578 financial news database. A group of 25 short financial news reports from five different authors is examined. Our new methodology and the interesting results from the two tests are reported in this paper. In the first test, all cases were successful for Shakespeare and for Poe. For Shaw, one text was confused with Poe. In the second test the Reuters-21578 financial news reports were identified by author relatively well. The conclusion is that our text mining methodology seems to be capable of authorship attribution.
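
    The evaluation protocol in this abstract (use one text as the prototype and note its two nearest matches) can be sketched as follows. The abstract does not describe how the authors encode texts, so this sketch swaps in a simple stand-in: cosine similarity over word-bigram counts, chosen only because bigrams capture both words and word order. The function names and sample texts are hypothetical.

```python
import math
import re
from collections import Counter


def bigram_profile(text):
    """Count adjacent word pairs, a crude stand-in for 'words and word order'."""
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(zip(words, words[1:]))


def cosine(a, b):
    """Cosine similarity between two bigram count profiles."""
    dot = sum(count * b[key] for key, count in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def nearest_matches(prototype, documents, k=2):
    """Return the names of the k documents most similar to the prototype text."""
    proto = bigram_profile(prototype)
    scored = [(cosine(proto, bigram_profile(text)), name)
              for name, text in documents.items()]
    scored.sort(reverse=True)
    return [name for _score, name in scored[:k]]


# Invented snippets, used only to exercise the functions.
documents = {
    "a1": "the raven sat upon the door and the raven spoke",
    "a2": "upon the door the raven sat and spoke no more",
    "b1": "prices rose sharply as markets opened higher today",
}
print(nearest_matches("the raven sat upon the chamber door", documents))
```

    A test in the spirit of the paper would count a prototype as correctly attributed when both of its nearest matches come from the same author as the prototype.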

  8. VLSI physical design analyzer: A profiling and data mining tool

    Science.gov (United States)

    Somani, Shikha; Verma, Piyush; Madhavan, Sriram; Batarseh, Fadi; Pack, Robert C.; Capodieci, Luigi

    2015-03-01

    Traditional physical design verification tools employ a deck of known design rules, each of which has a pre-defined pass/fail criterion associated with it. While passing a design rule deck is a necessary condition for a VLSI design to be manufacturable, it is not sufficient. Other physical design profiling decks that attempt to obtain statistical information about the various critical dimensions in the VLSI design lack a systematic methodology for rule enumeration. These decks are often inadequate, unable to extract all the interlayer and intralayer dimensions in a design that have a correlation with process yield. The Physical Design Analyzer is a comprehensive design analysis tool built with the objective of exhaustively exploring design-process correlations to increase the wafer yield.

  9. Review of Text Mining Tools

    Institute of Scientific and Technical Information of China (English)

    张雯雯; 许鑫

    2012-01-01

    The authors first briefly describe some commercial and open source text mining tools, with a detailed comparison of four typical open source tools in terms of data format, functional modules and user experience. They then test the text classification function of three tools, each with distinctive features. Finally, they offer some suggestions regarding the current state of open source text mining tools.

  10. Affordable non-traditional source data mining for context assessment to improve distributed fusion system robustness

    Science.gov (United States)

    Bowman, Christopher; Haith, Gary; Steinberg, Alan; Morefield, Charles; Morefield, Michael

    2013-05-01

    This paper describes methods to affordably improve the robustness of distributed fusion systems by opportunistically leveraging non-traditional data sources. Adaptive methods help find relevant data, create models, and characterize the model quality. These methods also can measure the conformity of this non-traditional data with fusion system products including situation modeling and mission impact prediction. Non-traditional data can improve the quantity, quality, availability, timeliness, and diversity of the baseline fusion system sources and therefore can improve prediction and estimation accuracy and robustness at all levels of fusion. Techniques are described that automatically learn to characterize and search non-traditional contextual data to enable operators to integrate the data with the high-level fusion systems and ontologies. These techniques apply the extension of the Data Fusion & Resource Management Dual Node Network (DNN) technical architecture at Level 4. The DNN architecture supports effective assessment and management of the expanded portfolio of data sources, entities of interest, models, and algorithms including data pattern discovery and context conformity. Affordable model-driven and data-driven data mining methods to discover unknown models from non-traditional and 'big data' sources are used to automatically learn entity behaviors and correlations with fusion products [14, 15]. This paper describes our context assessment software development, and the demonstration of context assessment of non-traditional data to compare to an intelligence surveillance and reconnaissance fusion product based upon an IED POIs workflow.

  11. [Gender inequity in health in contexts of environmental risk from mining and industrial activity in Mexico].

    Science.gov (United States)

    Catalán-Vázquez, Minerva; Riojas-Rodríguez, Horacio

    2015-06-01

    Analyze how gender inequity manifests in contexts of poverty in different environmental risk scenarios in Mexico. Qualitative design based on six discussion groups and 54 in-depth interviews with women from six exposed communities: two to environmental manganese in a mining district, two in an industrial corridor, and two bordering a sanitary landfill. A document review of environmental and health studies in each area was done to relate them to the women's perspective on the problem. In the three case studies, by gender roles, women stay at home and do housework and, therefore, are subject to intense environmental exposure when carrying out their daily tasks, such as house cleaning. Interview and discussion group results were found to be related to epidemiological study results. In the case of the mining district, women's perceptions are consistent with study comments on adverse cognitive effects of manganese exposure. In all three cases, there are serious limitations on women's political participation in environmental risk management. Due to conditions of inequity, women are highly exposed to environmental health risks and their social participation in solving environmental problems is quite limited. These results have social and environmental policy implications in the areas studied, especially with regard to risk assessment, management, and communication.

  12. RSECM: Robust Search Engine using Context-based Mining for Educational Big Data

    Directory of Open Access Journals (Sweden)

    D. Pratiba

    2016-12-01

    Full Text Available With accelerating growth in the educational sector, aided by ICT and cloud-based services, there is a consistent rise in educational big data, for which storage and processing become the prime challenges. Although many recent attempts have used open source frameworks, e.g. Hadoop, for storage, there are still reported issues with insufficient security management and with data analysis. Hence, mining techniques have limited applicability for upcoming search engines because educational data are unstructured. The proposed system introduces a technique called RSECM, i.e. Robust Search Engine using Context-based Modeling, that presents a novel archival and search engine. RSECM generates its own massive stream of educational big data and performs efficient search of the data. The outcome shows that RSECM outperforms SQL based approaches in terms of faster retrieval for dynamic user-defined queries.

  13. Vetiver Grass: a potential tool for phytoremediation of iron ore mine site spoil dump

    Directory of Open Access Journals (Sweden)

    Anita Mukherjee

    2015-06-01

    Full Text Available The impact of mining has led to the generation of a large amount of spoil dumps that have become dangerous to human health, wildlife and biodiversity. Thus it is essential that post mining areas and the waste land generated be rapidly vegetated. Vetiver grass (Chrysopogon zizanioides (L.) Roberty) is a tropical plant which grows naturally in various soil conditions and is well known for its ability to resist DNA damage while growing on typically polluted soil conditions. The spoil dumps from the iron mine site are unstable and inhospitable for plant growth due to the presence of various toxic heavy metals such as Fe, Mn, Zn, Cu, Pb, Ni, Cr and Cd. The Vetiver system is an efficient bio-engineering tool for reclaiming such spoil dumps. There are 12 known species of Vetiver grass, and many hundreds of different cultivars that are exploited by users depending on need. In the present study we selected the polyploid infertile variety of vetiver and carried out pot experiments. Vetiver plants grown on the iron ore mine spoil dump show distinct differences in their growth, with fewer tillers, reduced chlorophyll content, upregulation of antioxidant enzymes and increased proline content. To investigate the level of DNA damage incurred and the change in genetic stability, Comet assay and RAPD analysis were performed. Results confirmed that Vetiver grass can serve as a model species for phytoremediating iron ore mine spoil dumps.

  14. Using Data Mining Techniques Examination of the Middle School Students' Attitude towards Mathematics in the Context of Some Variables

    Science.gov (United States)

    Idil, Feriha Hande; Narli, Serkan; Aksoy, Esra

    2016-01-01

    The aim of this study is to examine middle school students' attitude towards mathematics in the context of their mathematics learning preferences using data mining, a data analysis methodology that has been successfully used in different areas including educational domains. "How do I actually learn?" questionnaire and attitude scale…

  15. THE INFLUENCE OF CONTEXT IN VIRTUAL TOOL USING

    Directory of Open Access Journals (Sweden)

    María Luisa Sevillano García

    2010-12-01

    Full Text Available This article is the product of an investigation that pursued the following objectives: to know the uses of virtual tools by a population representative in age, profession, gender and education; to identify the difficulties they encounter and to look for their causes; and to assess the advantages of every tool in professional life, in leisure time and in teaching work. Quota sampling was chosen, basically for its rigor. The data acquisition tools (questionnaire protocol, interview script, and discussion group) that have been developed and applied make it possible to go beyond the frontier of quantitative vs qualitative analysis and to advance into a mixed methodology. A quick, sure, selective, pertinent use of the studied virtual tools represents a principle of innovation, access to knowledge and development of professional performance, as does qualification in the basic competences for building, transferring and processing the general and specialized knowledge that can be found on the net and that is useful throughout one's life.

  16. KAIKObase: An integrated silkworm genome database and data mining tool

    Directory of Open Access Journals (Sweden)

    Nagaraju Javaregowda

    2009-10-01

    Full Text Available Background: The silkworm, Bombyx mori, is one of the most economically important insects in many developing countries owing to its large-scale cultivation for silk production. With the development of genomic and biotechnological tools, B. mori has also become an important bioreactor for production of various recombinant proteins of biomedical interest. In 2004, two genome sequencing projects for B. mori were reported independently by Chinese and Japanese teams; however, the datasets were insufficient for building long genomic scaffolds which are essential for unambiguous annotation of the genome. Now, both the datasets have been merged and assembled through a joint collaboration between the two groups. Description: Integration of the two data sets of silkworm whole-genome-shotgun sequencing by the Japanese and Chinese groups together with newly obtained fosmid- and BAC-end sequences produced the best continuity (~3.7 Mb in N50 scaffold size) among the sequenced insect genomes and provided a high degree of nucleotide coverage (88%) of all 28 chromosomes. In addition, a physical map of BAC contigs constructed by fingerprinting BAC clones and a SNP linkage map constructed using BAC-end sequences were available. In parallel, proteomic data from two-dimensional polyacrylamide gel electrophoresis in various tissues and developmental stages were compiled into a silkworm proteome database. Finally, a Bombyx trap database was constructed for documenting insertion positions and expression data of transposon insertion lines. Conclusion: For efficient usage of genome information for functional studies, genomic sequences, physical and genetic map information and EST data were compiled into KAIKObase, an integrated silkworm genome database which consists of 4 map viewers, a gene viewer, and sequence, keyword and position search systems to display results and data at the level of nucleotide sequence, gene, scaffold and chromosome. Integration of the

  17. DMT-TAFM: a data mining tool for technical analysis of futures market

    Science.gov (United States)

    Stepanov, Vladimir; Sathaye, Archana

    2002-03-01

    Technical analysis of financial markets describes many patterns of market behavior. For practical use, all these descriptions need to be adjusted for each particular trading session. In this paper, we develop a data mining tool for technical analysis of the futures markets (DMT-TAFM), which dynamically generates rules based on the notion of price pattern similarity. The tool consists of three main components. The first component provides visualization of data series on a chart with different ranges, scales, and chart sizes and types. The second component constructs pattern descriptions using sets of polynomials. The third component specifies the training set for mining, defines the similarity notion, and searches for a set of similar patterns. DMT-TAFM is useful for preparing the data and then revealing and systematizing statistical information about similar patterns found in any type of historical price series. We performed experiments with our tool on three decades of trading data for a hundred types of futures. Our results for this data set show that we can prove or disprove many well-known patterns based on real data, as well as reveal new ones, and use the set of relatively consistent patterns found during data mining for developing better futures trading strategies.
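
    The idea of describing price patterns with polynomials and comparing them by a similarity notion can be sketched as below. The abstract does not give the tool's actual description scheme or similarity measure, so everything here is an assumption: a single least-squares quadratic per window, min-max scaling of prices, and Euclidean distance between coefficient vectors. The price windows are invented.

```python
def polyfit(ys, degree):
    """Least-squares polynomial coefficients (lowest order first) for ys sampled evenly on [0, 1]."""
    n = len(ys)
    xs = [i / (n - 1) for i in range(n)]
    size = degree + 1
    # Normal equations A c = b with A[j][k] = sum x^(j+k), b[j] = sum y * x^j.
    a = [[sum(x ** (j + k) for x in xs) for k in range(size)] for j in range(size)]
    b = [sum(y * x ** j for x, y in zip(xs, ys)) for j in range(size)]
    # Gaussian elimination with partial pivoting.
    for col in range(size):
        pivot = max(range(col, size), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        b[col], b[pivot] = b[pivot], b[col]
        for row in range(col + 1, size):
            factor = a[row][col] / a[col][col]
            for k in range(col, size):
                a[row][k] -= factor * a[col][k]
            b[row] -= factor * b[col]
    # Back substitution.
    coeffs = [0.0] * size
    for row in range(size - 1, -1, -1):
        tail = sum(a[row][k] * coeffs[k] for k in range(row + 1, size))
        coeffs[row] = (b[row] - tail) / a[row][row]
    return coeffs


def describe(window, degree=2):
    """Describe a price window by polynomial coefficients after min-max scaling."""
    low, high = min(window), max(window)
    span = (high - low) or 1.0
    return polyfit([(price - low) / span for price in window], degree)


def distance(window_a, window_b, degree=2):
    """Euclidean distance between two pattern descriptions (smaller means more similar)."""
    ca, cb = describe(window_a, degree), describe(window_b, degree)
    return sum((x - y) ** 2 for x, y in zip(ca, cb)) ** 0.5


# Invented price windows (illustrative only).
rising = [10, 11, 12, 13, 14]
rising_scaled = [100, 110, 120, 130, 140]
falling = [14, 13, 12, 11, 10]
print(distance(rising, rising_scaled), distance(rising, falling))
```

    Scaling each window before fitting makes the description depend on the shape of the move rather than on the price level, which is one plausible reading of "price pattern similarity".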

  18. The institutional regulation of the sustainability of water resources within mining contexts: accountability and plurality

    NARCIS (Netherlands)

    Sosa, M.; Zwarteveen, M.

    2014-01-01

    This article reviews recent literature on water and mining and uses illustrations from a large gold mine, Yanacocha, operating in Peru, to assess the effectiveness of institutional mechanisms for safeguarding the sustainability of water resources (and water-based ecosystems) in mining regions. The

  19. The institutional regulation of the sustainability of water resources within mining contexts: accountability and plurality

    NARCIS (Netherlands)

    Sosa Landeo, M.; Zwarteveen, M.Z.

    2014-01-01

    This article reviews recent literature on water and mining and uses illustrations from a large gold mine, Yanacocha, operating in Peru, to assess the effectiveness of institutional mechanisms for safeguarding the sustainability of water resources (and water-based ecosystems) in mining regions. The s

  20. JCoast - a biologist-centric software tool for data mining and comparison of prokaryotic (meta)genomes.

    Science.gov (United States)

    Richter, Michael; Lombardot, Thierry; Kostadinov, Ivaylo; Kottmann, Renzo; Duhaime, Melissa Beth; Peplies, Jörg; Glöckner, Frank Oliver

    2008-04-01

    Current sequencing technologies give access to sequence information for genomes and metagenomes at a tremendous speed. Subsequent data processing is mainly performed by automatic pipelines provided by the sequencing centers. Although standardised workflows are desirable and useful in many respects, rational data mining, comparative genomics, and especially the interpretation of the sequence information in the biological context demand intuitive, flexible, and extendable solutions. The JCoast software tool was primarily designed to analyse and compare (meta)genome sequences of prokaryotes. Based on a pre-computed GenDB database project, JCoast offers a flexible graphical user interface (GUI), as well as an application programming interface (API) that facilitates back-end data access. JCoast offers individual, cross-genome, and metagenome analysis, and assists the biologist in exploration of large and complex datasets. JCoast combines all functions required for the mining, annotation, and interpretation of (meta)genomic data. The lightweight software solution allows the user to easily take advantage of advanced back-end database structures by providing a programming and graphical user interface to answer biological questions. JCoast is available at the project homepage.

  1. The Medicine Wheel: A Versatile Tool for Promoting Positive Change in Diverse Contexts

    Science.gov (United States)

    Kemppainen, David; Kopera-Frye, Karen; Woodard, Julie

    2008-01-01

    This essay describes the utility of employing the medicine wheel with university students in both counselling and instructional contexts. A brief description of the medicine wheel, its history, symbolic significance, and use in diverse contexts is discussed. The preliminary data suggest this to be a valuable tool in addressing both the academic…

  2. U-Compare: share and compare text mining tools with UIMA

    Science.gov (United States)

    Kano, Yoshinobu; Baumgartner, William A.; McCrohon, Luke; Ananiadou, Sophia; Cohen, K. Bretonnel; Hunter, Lawrence; Tsujii, Jun'ichi

    2009-01-01

    Summary: Due to the increasing number of text mining resources (tools and corpora) available to biologists, interoperability issues between these resources are becoming significant obstacles to using them effectively. UIMA, the Unstructured Information Management Architecture, is an open framework designed to aid in the construction of more interoperable tools. U-Compare is built on top of the UIMA framework, and provides both a concrete framework for out-of-the-box text mining and a sophisticated evaluation platform allowing users to run specific tools on any target text, generating both detailed statistics and instance-based visualizations of outputs. U-Compare is a joint project, providing the world's largest, and still growing, collection of UIMA-compatible resources. These resources, originally developed by different groups for a variety of domains, include many famous tools and corpora. U-Compare can be launched straight from the web, without needing to be manually installed. All U-Compare components are provided ready-to-use and can be combined easily via a drag-and-drop interface without any programming. External UIMA components can also simply be mixed with U-Compare components, without distinguishing between locally and remotely deployed resources. Availability: http://u-compare.org/ Contact: kano@is.s.u-tokyo.ac.jp PMID:19414535

  3. Data Mining and Machine Learning Tools for Combinatorial Material Science of All-Oxide Photovoltaic Cells.

    Science.gov (United States)

    Yosipof, Abraham; Nahum, Oren E; Anderson, Assaf Y; Barad, Hannah-Noa; Zaban, Arie; Senderowitz, Hanoch

    2015-06-01

    Growth in energy demands, coupled with the need for clean energy, are likely to make solar cells an important part of future energy resources. In particular, cells entirely made of metal oxides (MOs) have the potential to provide clean and affordable energy if their power conversion efficiencies are improved. Such improvements require the development of new MOs which could benefit from combining combinatorial material sciences for producing solar cells libraries with data mining tools to direct synthesis efforts. In this work we developed a data mining workflow and applied it to the analysis of two recently reported solar cell libraries based on Titanium and Copper oxides. Our results demonstrate that QSAR models with good prediction statistics for multiple solar cells properties could be developed and that these models highlight important factors affecting these properties in accord with experimental findings. The resulting models are therefore suitable for designing better solar cells.

  4. Data-Mining – A Valuable Managerial Tool for Improving Power Plants Efficiency

    Directory of Open Access Journals (Sweden)

    Danubianu Mirela

    2014-05-01

    Full Text Available Energy and environment are top priorities for the EU’s Europe 2020 Strategy. Both fields imply complex approaches and consistent investment. The paper presents an alternative to large investments to improve the efficiencies of existing (outdated) power installations, namely the use of data-mining techniques for analysing existing operational data. Data-mining is based upon exhaustive analysis of operational records, inferring high-value information by simply processing records with advanced mathematical / statistical tools. Results can be: assessment of the consistency of measurements, identification of new hardware needed for improving the quality of data, deducing the most efficient level for operation (internal benchmarking), correlation of consumptions with power/heat production and of technical parameters with environmental impact, scheduling the optimal maintenance time, fuel stock optimization, simulating scenarios for equipment operation, anticipating periods of maximal stress of equipment, identification of medium and long term trends, planning and decision support for new investment, etc. The paper presents a data mining process carried out at the TERMICA - Suceava power plant. The analysis calls for a multidisciplinary approach and a complex team (experts in power & heat production, mechanics, environmental protection, economists, and last but not least IT experts) and can be carried out with lower expenses than an investment in new equipment. Involvement of top management of the company is essential, being the driving force and motivation source for the data-mining team. The approach presented is self-learning: once established, the data-mining analytical, modelling and simulation procedures and associated parameter databases can adjust themselves by absorbing and processing new relevant information and can be used on a long term basis for monitoring the performance of the installation, certifying the soundness of managerial measures taken

  5. Mining

    Directory of Open Access Journals (Sweden)

    Khairullah Khan

    2014-09-01

    Full Text Available Opinion mining is an interesting area of research because of its applications in various fields. Collecting opinions of people about products and about social and political events and problems through the Web is becoming increasingly popular every day. The opinions of users are helpful for the public and for stakeholders when making certain decisions. Opinion mining is a way to retrieve information through search engines, Web blogs and social networks. Because of the huge number of reviews in the form of unstructured text, it is impossible to summarize the information manually. Accordingly, efficient computational methods are needed for mining and summarizing the reviews from corpuses and Web documents. This study presents a systematic literature survey regarding the computational techniques, models and algorithms for mining opinion components from unstructured reviews.

  6. THE DASHBOARD - STRATEGIC MANAGEMENT TOOL OF ACTIVITIES IN THE COAL MINING INDUSTRY

    Directory of Open Access Journals (Sweden)

    DINA IONELA-CLAUDIA

    2014-02-01

    Full Text Available To achieve objectives at all levels and across all categories of functions, a unit uses multiple information systems, including the dashboard, a management tool that is well suited to justifying management decisions and supplying supporting data for them, complementing the data provided by other media. In this respect, we calculate the indicators currently used by the working compartments of coal mining units and propose a further series of indicators whose calculation we consider necessary for a detailed presentation of the activity carried out and for the improvement of management.

  7. Data Envelopment Analysis as a tool for the exploration phase of mining

    Science.gov (United States)

    Kauppinen, Tommi

    2016-08-01

    The exploration of mining has often been limited by time-consuming methods of analysis. This paper introduces Data Envelopment Analysis (DEA) as a new tool for the exploration phase of mining. DEA is a non-parametric method for data fusion, and it is used alongside on-site Raman analysis. Ten meters of halved rock drillcore from the Kittilä mine (Suurikuusikko deposit) were pulverised and homogenised, thus ensuring that each meter had a representative sample. These 10 samples, one for each meter, were subsequently measured with a grid measurement (32×32 measurements each) using the Raman setup. All the data points were analysed using the point-count method. After identifying the frequency at which potentially valuable minerals appear in the samples, this information was analysed using DEA. The study ends by presenting an efficiency score for each meter of drillcore. These efficiency scores enable geologists to judge more rapidly which parts of the drillcore must be logged more carefully. In addition, Principal Component Analysis (PCA) is discussed as an alternative for producing similar results to DEA.

  8. Can abstract screening workload be reduced using text mining? User experiences of the tool Rayyan.

    Science.gov (United States)

    Olofsson, Hanna; Brolund, Agneta; Hellberg, Christel; Silverstein, Rebecca; Stenström, Karin; Österberg, Marie; Dagerhamn, Jessica

    2017-09-01

    One time-consuming aspect of conducting systematic reviews is the task of sifting through abstracts to identify relevant studies. One promising approach for reducing this burden uses text mining technology to identify those abstracts that are potentially most relevant for a project, allowing those abstracts to be screened first. The aim was to examine the effectiveness of the text mining functionality of the abstract screening tool Rayyan. User experiences were collected. Rayyan was used to screen abstracts for 6 reviews in 2015. After screening 25%, 50%, and 75% of the abstracts, the screeners logged the relevant references identified. A survey was sent to users. After screening half of the search results with Rayyan, 86% to 99% of the references deemed relevant to the study were identified. Of those studies included in the final reports, 96% to 100% were already identified in the first half of the screening process. Users rated Rayyan 4.5 out of 5. The text mining function in Rayyan successfully helped reviewers identify relevant studies early in the screening process. Copyright © 2017 John Wiley & Sons, Ltd.
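
    The kind of figure reported in this abstract (the share of relevant references already found after screening a given fraction of a prioritised list) can be illustrated with a short sketch. This is not Rayyan's code: the function name `recall_at_fraction` and the toy relevance labels are assumptions made purely for illustration.

```python
def recall_at_fraction(ranked_labels, fraction):
    """Share of all relevant items found in the first `fraction` of a ranked list.

    ranked_labels: booleans in screening order (True = relevant).
    fraction: portion of the list screened so far, between 0 and 1.
    """
    total_relevant = sum(ranked_labels)
    if total_relevant == 0:
        raise ValueError("no relevant items in the list")
    cutoff = int(len(ranked_labels) * fraction)
    return sum(ranked_labels[:cutoff]) / total_relevant


# Hypothetical ranking of 8 abstracts; these labels are not data from the study.
toy_ranking = [True, True, False, True, False, False, True, False]
recall_half = recall_at_fraction(toy_ranking, 0.5)
```

    A prioritised ranking is useful to the extent that this value is well above the screened fraction itself; an unordered list would give roughly 0.5 at the halfway point.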

  9. PPInterFinder--a mining tool for extracting causal relations on human proteins from literature.

    Science.gov (United States)

    Raja, Kalpana; Subramani, Suresh; Natarajan, Jeyakumar

    2013-01-01

    One of the most common and challenging problems in biomedical text mining is to mine protein-protein interactions (PPIs) from MEDLINE abstracts and full-text research articles, because PPIs play a major role in understanding the various biological processes and the impact of proteins in diseases. We implemented PPInterFinder, a web-based text mining tool to extract human PPIs from biomedical literature. PPInterFinder uses relation keyword co-occurrences with protein names to extract information on PPIs from MEDLINE abstracts and consists of three phases. First, it identifies the relation keyword using a parser with Tregex and a relation keyword dictionary. Next, it automatically identifies the candidate PPI pairs with a set of rules related to PPI recognition. Finally, it extracts the relations by matching the sentence with a set of 11 specific patterns based on the syntactic nature of the PPI pair. We find that PPInterFinder is capable of predicting PPIs with an accuracy of 66.05% on the AIMED corpus and outperforms most of the existing systems. DATABASE URL: http://www.biomining-bu.in/ppinterfinder/
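
    The general idea of relation keyword co-occurrence with protein names can be sketched as follows. This is a deliberately simplified illustration, not the PPInterFinder implementation: it uses plain token matching in place of the Tregex parser and the 11 syntactic patterns, and the dictionaries and the function name `candidate_ppis` are hypothetical.

```python
import re

# Hypothetical mini-dictionaries; the tool's actual resources are not given in the abstract.
RELATION_KEYWORDS = {"binds", "interacts", "phosphorylates"}
PROTEIN_NAMES = {"BRCA1", "RAD51", "TP53", "MDM2"}


def candidate_ppis(sentence):
    """Return (protein, keyword, protein) triples where a relation keyword
    occurs between two distinct protein names in the same sentence."""
    tokens = re.findall(r"[A-Za-z0-9]+", sentence)
    triples = []
    for i, tok in enumerate(tokens):
        if tok.lower() not in RELATION_KEYWORDS:
            continue
        # Protein names appearing before and after the keyword.
        left = [t for t in tokens[:i] if t in PROTEIN_NAMES]
        right = [t for t in tokens[i + 1:] if t in PROTEIN_NAMES]
        for a in left:
            for b in right:
                if a != b:
                    triples.append((a, tok.lower(), b))
    return triples


pairs = candidate_ppis("MDM2 binds TP53 in stressed cells.")
```

    Pure co-occurrence of this kind tends to over-generate candidates, which is presumably why the tool described adds rule-based filtering and syntactic pattern matching as later phases.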

  10. A case-based reasoning tool for breast cancer knowledge management with data mining concepts and techniques

    Science.gov (United States)

    Demigha, Souâd.

    2016-03-01

    The paper presents a Case-Based Reasoning Tool for Breast Cancer Knowledge Management to improve breast cancer screening. To develop this tool, we combine both concepts and techniques of Case-Based Reasoning (CBR) and Data Mining (DM). Physicians and radiologists ground their diagnosis on their expertise (past experience) based on clinical cases. Case-Based Reasoning is the process of solving new problems based on the solutions of similar past problems, structured as cases. CBR is suitable for medical use. On the other hand, existing traditional hospital information systems (HIS), Radiological Information Systems (RIS) and Picture Archiving Information Systems (PACS) do not allow medical information to be managed efficiently because of its complexity and heterogeneity. Data Mining is the process of mining information from a data set and transforming it into an understandable structure for further use. Combining CBR with Data Mining techniques will facilitate diagnosis and decision-making by medical experts.

  11. Experimental validation of a mine-wide continuous closure monitoring system as a decision making tool for gold mines.

    CSIR Research Space (South Africa)

    Malan, DF

    2003-03-01

    Full Text Available of a mine-wide continuous closure monitoring system. A RMT remote reading telltale system was modified to operate as a closure system and installed at Mponeng Mine. Unfortunately, this system did not succeed in collecting data from a large number...

  12. SQUAT: A web tool to mine human, murine and avian SAGE data

    Directory of Open Access Journals (Sweden)

    Besson Jérémy

    2008-09-01

    Full Text Available Background: There is an increasing need in transcriptome research for gene expression data and pattern warehouses. It is important to integrate in these warehouses both raw transcriptomic data and some properties encoded in these data, such as local patterns. Description: We have developed an application called SQUAT (SAGE Querying and Analysis Tools), which is available at http://bsmc.insa-lyon.fr/squat/. This database gives access to both raw SAGE data and patterns mined from these data, for three species (human, mouse and chicken). It supports simple queries such as "In which biological situations is my favorite gene expressed?" as well as much more complex queries such as "What are the genes that are frequently co-over-expressed with my gene of interest in given biological situations?". Connections with external web databases enrich biological interpretations and enable sophisticated queries. To illustrate the power of SQUAT, we show and analyze the results of three different queries, one of which led to a biological hypothesis that was experimentally validated. Conclusion: SQUAT is a user-friendly information retrieval platform, which aims at bringing some of the state-of-the-art mining tools to biologists.

  13. Context-dependent 'safekeeping' of foraging tools in New Caledonian crows.

    Science.gov (United States)

    Klump, Barbara C; van der Wal, Jessica E M; St Clair, James J H; Rutz, Christian

    2015-06-07

    Several animal species use tools for foraging, such as sticks to extract embedded arthropods and honey, or stones to crack open nuts and eggs. While providing access to nutritious foods, these behaviours may incur significant costs, such as the time and energy spent searching for, manufacturing and transporting tools. These costs can be reduced by re-using tools, keeping them safe when not needed. We experimentally investigated what New Caledonian crows do with their tools between successive prey extractions, and whether they express tool 'safekeeping' behaviours more often when the costs (foraging at height), or likelihood (handling of demanding prey), of tool loss are high. Birds generally took care of their tools (84% of 176 prey extractions, nine subjects), either trapping them underfoot (74%) or storing them in holes (26%)--behaviours we also observed in the wild (19 cases, four subjects). Moreover, tool-handling behaviour was context-dependent, with subjects: keeping their tools safe significantly more often when foraging at height; and storing tools significantly more often in holes when extracting more demanding prey (under these conditions, foot-trapping proved challenging). In arboreal environments, safekeeping can prevent costly tool losses, removing a potentially important constraint on the evolution of habitual and complex tool behaviour.

  14. The research of structure and mechanical properties of superhard electro-spark coatings for hardwearing mining tools

    Science.gov (United States)

    Bajin, P. A.; Chijikov, A. P.; Leybo, D. V.; Chuprunov, K. O.; Yudin, A. G.; Alymov, M. A.; Kuznetsov, D. V.

    2016-01-01

    The development of low cost and hardwearing mining tools is one of the most important areas in mining industry. It is especially important for technologies of rare and rare earth metals mining due to high hardness of related ores. Coatings for electrodes, produced by extrusion of self-propagating high temperature synthesis (SHS) products from hard-alloyed materials with nanosized structure, for further application in processes of electrospark alloying and deposition were studied in this work. The results of microstructure and properties of deposited layers, interaction of support with SHS produced electrodes, comparison of frictional properties of obtained materials as well as some industrial testing results are presented in this work.

  15. HC StratoMineR: A Web-Based Tool for the Rapid Analysis of High-Content Datasets.

    Science.gov (United States)

    Omta, Wienand A; van Heesbeen, Roy G; Pagliero, Romina J; van der Velden, Lieke M; Lelieveld, Daphne; Nellen, Mehdi; Kramer, Maik; Yeong, Marley; Saeidi, Amir M; Medema, Rene H; Spruit, Marco; Brinkkemper, Sjaak; Klumperman, Judith; Egan, David A

    2016-10-01

    High-content screening (HCS) can generate large multidimensional datasets and when aligned with the appropriate data mining tools, it can yield valuable insights into the mechanism of action of bioactive molecules. However, easy-to-use data mining tools are not widely available, with the result that these datasets are frequently underutilized. Here, we present HC StratoMineR, a web-based tool for high-content data analysis. It is a decision-supportive platform that guides even non-expert users through a high-content data analysis workflow. HC StratoMineR is built by using My Structured Query Language for storage and querying, PHP: Hypertext Preprocessor as the main programming language, and jQuery for additional user interface functionality. R is used for statistical calculations, logic and data visualizations. Furthermore, C++ and graphical processor unit power is diffusely embedded in R by using the rcpp and rpud libraries for operations that are computationally highly intensive. We show that we can use HC StratoMineR for the analysis of multivariate data from a high-content siRNA knock-down screen and a small-molecule screen. It can be used to rapidly filter out undesirable data; to select relevant data; and to perform quality control, data reduction, data exploration, morphological hit picking, and data clustering. Our results demonstrate that HC StratoMineR can be used to functionally categorize HCS hits and, thus, provide valuable information for hit prioritization.

  16. MegaMiner: A Tool for Lead Identification Through Text Mining Using Chemoinformatics Tools and Cloud Computing Environment.

    Science.gov (United States)

    Karthikeyan, Muthukumarasamy; Pandit, Yogesh; Pandit, Deepak; Vyas, Renu

    2015-01-01

    Virtual screening is an indispensable tool to cope with the massive amount of data being tossed by the high throughput omics technologies. With the objective of enhancing the automation capability of virtual screening process a robust portal termed MegaMiner has been built using the cloud computing platform wherein the user submits a text query and directly accesses the proposed lead molecules along with their drug-like, lead-like and docking scores. Textual chemical structural data representation is fraught with ambiguity in the absence of a global identifier. We have used a combination of statistical models, chemical dictionary and regular expression for building a disease specific dictionary. To demonstrate the effectiveness of this approach, a case study on malaria has been carried out in the present work. MegaMiner offered superior results compared to other text mining search engines, as established by F score analysis. A single query term 'malaria' in the portlet led to retrieval of related PubMed records, protein classes, drug classes and 8000 scaffolds which were internally processed and filtered to suggest new molecules as potential anti-malarials. The results obtained were validated by docking the virtual molecules into relevant protein targets. It is hoped that MegaMiner will serve as an indispensable tool for not only identifying hidden relationships between various biological and chemical entities but also for building better corpus and ontologies.

  17. Tools and Databases of the KOMICS Web Portal for Preprocessing, Mining, and Dissemination of Metabolomics Data

    Directory of Open Access Journals (Sweden)

    Nozomu Sakurai

    2014-01-01

    Full Text Available A metabolome--the collection of comprehensive quantitative data on metabolites in an organism--has been increasingly utilized for applications such as data-intensive systems biology, disease diagnostics, biomarker discovery, and assessment of food quality. A considerable number of tools and databases have been developed to date for the analysis of data generated by various combinations of chromatography and mass spectrometry. We report here a web portal named KOMICS (The Kazusa Metabolomics Portal), where the tools and databases that we developed are available for free to academic users. KOMICS includes the tools and databases for preprocessing, mining, visualization, and publication of metabolomics data. Improvements in the annotation of unknown metabolites and dissemination of comprehensive metabolomic data are the primary aims behind the development of this portal. For this purpose, PowerGet and FragmentAlign include a manual curation function for the results of metabolite feature alignments. A metadata-specific wiki-based database, Metabolonote, functions as a hub of web resources related to the submitters' work. This feature is expected to increase citation of the submitters' work, thereby promoting data publication. As an example of the practical use of KOMICS, a workflow for a study on Jatropha curcas is presented. The tools and databases available at KOMICS should contribute to enhanced production, interpretation, and utilization of metabolomic Big Data.

  18. Tools and databases of the KOMICS web portal for preprocessing, mining, and dissemination of metabolomics data.

    Science.gov (United States)

    Sakurai, Nozomu; Ara, Takeshi; Enomoto, Mitsuo; Motegi, Takeshi; Morishita, Yoshihiko; Kurabayashi, Atsushi; Iijima, Yoko; Ogata, Yoshiyuki; Nakajima, Daisuke; Suzuki, Hideyuki; Shibata, Daisuke

    2014-01-01

    A metabolome--the collection of comprehensive quantitative data on metabolites in an organism--has been increasingly utilized for applications such as data-intensive systems biology, disease diagnostics, biomarker discovery, and assessment of food quality. A considerable number of tools and databases have been developed to date for the analysis of data generated by various combinations of chromatography and mass spectrometry. We report here a web portal named KOMICS (The Kazusa Metabolomics Portal), where the tools and databases that we developed are available for free to academic users. KOMICS includes the tools and databases for preprocessing, mining, visualization, and publication of metabolomics data. Improvements in the annotation of unknown metabolites and dissemination of comprehensive metabolomic data are the primary aims behind the development of this portal. For this purpose, PowerGet and FragmentAlign include a manual curation function for the results of metabolite feature alignments. A metadata-specific wiki-based database, Metabolonote, functions as a hub of web resources related to the submitters' work. This feature is expected to increase citation of the submitters' work, thereby promoting data publication. As an example of the practical use of KOMICS, a workflow for a study on Jatropha curcas is presented. The tools and databases available at KOMICS should contribute to enhanced production, interpretation, and utilization of metabolomic Big Data.

  19. The antibody mining toolbox: an open source tool for the rapid analysis of antibody repertoires.

    Science.gov (United States)

    D'Angelo, Sara; Glanville, Jacob; Ferrara, Fortunato; Naranjo, Leslie; Gleasner, Cheryl D; Shen, Xiaohong; Bradbury, Andrew R M; Kiss, Csaba

    2014-01-01

    In vitro selection has been an essential tool in the development of recombinant antibodies against various antigen targets. Deep sequencing has recently been gaining ground as an alternative and valuable method to analyze such antibody selections. The analysis provides a novel and extremely detailed view of selected antibody populations, and allows the identification of specific antibodies using only sequencing data, potentially eliminating the need for expensive and laborious low-throughput screening methods such as enzyme-linked immunosorbent assay. The high cost and the need for bioinformatics experts and powerful computer clusters, however, have limited the general use of deep sequencing in antibody selections. Here, we describe the AbMining ToolBox, an open source software package for the straightforward analysis of antibody libraries sequenced by the three main next generation sequencing platforms (454, Ion Torrent, MiSeq). The ToolBox is able to identify heavy chain CDR3s as effectively as more computationally intense software, and can be easily adapted to analyze other portions of antibody variable genes, as well as the selection outputs of libraries based on different scaffolds. The software runs on all common operating systems (Microsoft Windows, Mac OS X, Linux), on standard personal computers, and sequence analysis of 1-2 million reads can be accomplished in 10-15 min, a fraction of the time of competing software. Use of the ToolBox will allow the average researcher to incorporate deep sequence analysis into routine selections from antibody display libraries.

  20. miRiaD: A Text Mining Tool for Detecting Associations of microRNAs with Diseases.

    Science.gov (United States)

    Gupta, Samir; Ross, Karen E; Tudor, Catalina O; Wu, Cathy H; Schmidt, Carl J; Vijay-Shanker, K

    2016-04-29

    MicroRNAs are increasingly being appreciated as critical players in human diseases, and questions concerning the role of microRNAs arise in many areas of biomedical research. There are several manually curated databases of microRNA-disease associations gathered from the biomedical literature; however, it is difficult for curators of these databases to keep up with the explosion of publications in the microRNA-disease field. Moreover, automated literature mining tools that assist manual curation of microRNA-disease associations currently capture only one microRNA property (expression) in the context of one disease (cancer). Thus, there is a clear need to develop more sophisticated automated literature mining tools that capture a variety of microRNA properties and relations in the context of multiple diseases to provide researchers with fast access to the most recent published information and to streamline and accelerate manual curation. We have developed miRiaD (microRNAs in association with Disease), a text-mining tool that automatically extracts associations between microRNAs and diseases from the literature. These associations are often not directly linked, and the intermediate relations are often highly informative for the biomedical researcher. Thus, miRiaD extracts the miR-disease pairs together with an explanation for their association. We also developed a procedure that assigns scores to sentences, marking their informativeness, based on the microRNA-disease relation observed within the sentence. miRiaD was applied to the entire Medline corpus, identifying 8301 PMIDs with miR-disease associations. These abstracts and the miR-disease associations are available for browsing at http://biotm.cis.udel.edu/miRiaD . We evaluated the recall and precision of miRiaD with respect to information of high interest to public microRNA-disease database curators (expression and target gene associations), obtaining a recall of 88.46-90.78. When we expanded the evaluation to

  1. Migrancy, masculine identities and AIDS: the psychosocial context of HIV transmission on the South African gold mines.

    Science.gov (United States)

    Campbell, C

    1997-07-01

    Levels of HIV infection are particularly high amongst migrant workers in sub-Saharan Africa. This paper presents a case study of one such vulnerable group of migrants (underground workers on the South African gold mines) and highlights the psychosocial context of HIV transmission in the mining setting. On the assumption that social identities serve as an important influence on people's sexual behaviour, the study examines the way in which miners construct their social identities within the parameters of their particular living and working conditions. It also identifies some of the key narratives used by miners to make sense of their experience in the realms of health, ill-health, HIV and sexuality. Masculinity emerged as a leading narrative in informants' accounts of their working life, health and sexuality, and the paper examines the way in which the construction of masculine identities renders miners particularly vulnerable to HIV. The implications of these findings for HIV educational interventions are discussed.

  2. Writing Readiness Inventory Tool in Context (WRITIC): reliability and convergent validity

    NARCIS (Netherlands)

    Hartingsveldt, M.J. van; Cup, E.H.C.; Groot, I.J.M. de; Nijhuis-Van der Sanden, M.W.

    2014-01-01

    BACKGROUND/AIM: This study examined the reliability and convergent validity of the Writing Readiness Inventory Tool in Context, a measurement evaluating writing readiness in kindergarten children (aged from five to six years). METHODS: Test-retest reliability was established with 59 children, inter-

  3. Conceptualising the Use of Facebook in Ethnographic Research: As Tool, as Data and as Context

    Science.gov (United States)

    Baker, Sally

    2013-01-01

    This article proposes a three-part conceptualisation of the use of Facebook in ethnographic research: as a tool, as data and as context. Longitudinal research with young adults at a time of significant change provides many challenges for the ethnographic researcher, such as maintaining channels of communication and high rates of participant…

  4. Detecting Diseases in Medical Prescriptions Using Data Mining Tools and Combining Techniques.

    Science.gov (United States)

    Teimouri, Mehdi; Farzadfar, Farshad; Soudi Alamdari, Mahsa; Hashemi-Meshkini, Amir; Adibi Alamdari, Parisa; Rezaei-Darzi, Ehsan; Varmaghani, Mehdi; Zeynalabedini, Aysan

    2016-01-01

    Data about the prevalence of communicable and non-communicable diseases, as one of the most important categories of epidemiological data, is used for interpreting health status of communities. This study aims to calculate the prevalence of outpatient diseases through the characterization of outpatient prescriptions. The data used in this study is collected from 1412 prescriptions for various types of diseases from which we have focused on the identification of ten diseases. In this study, data mining tools are used to identify diseases for which prescriptions are written. In order to evaluate the performances of these methods, we compare the results with Naïve method. Then, combining methods are used to improve the results. Results showed that Support Vector Machine, with an accuracy of 95.32%, shows better performance than the other methods. The result of Naive method, with an accuracy of 67.71%, is 20% worse than Nearest Neighbor method which has the lowest level of accuracy among the other classification algorithms. The results indicate that the implementation of data mining algorithms resulted in a good performance in characterization of outpatient diseases. These results can help to choose appropriate methods for the classification of prescriptions in larger scales.

  5. A Comparison Study between Data Mining Tools over some Classification Methods

    Directory of Open Access Journals (Sweden)

    Abdullah H. Wahbeh

    2011-09-01

    Full Text Available Nowadays, huge amounts of data and information are available to everyone. Data can now be stored in many different kinds of databases and information repositories, besides being available on the Internet or in printed form. With such amounts of data, there is a need for powerful techniques for better interpretation of data that exceed human ability for comprehension and decision making. In order to reveal the best tools for the classification task that helps in decision making, this paper conducts a comparative study of a number of freely available data mining and knowledge discovery tools and software packages. Results showed that the performance of the tools for the classification task is affected by the kind of dataset used and by the way the classification algorithms were implemented within the toolkits. For the applicability issue, the WEKA toolkit achieved the highest applicability, followed by Orange, Tanagra, and KNIME respectively. Finally, the WEKA toolkit achieved the highest improvement in classification performance when moving from the percentage split test mode to the cross-validation test mode, followed by Orange, KNIME, and finally Tanagra.
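
    The two test modes named in this abstract can be illustrated with a minimal sketch. It is not taken from the paper or from any of the toolkits it compares; the function names, the 66% default and the unshuffled index order are assumptions made only for illustration (real tools usually shuffle or stratify first).

```python
def percentage_split(n, train_fraction=0.66):
    """Hold out one fixed block: the first part trains, the rest tests."""
    cut = int(round(n * train_fraction))
    indices = list(range(n))
    return indices[:cut], indices[cut:]


def k_fold(n, k=10):
    """Yield (train, test) index lists; every record is tested exactly once."""
    indices = list(range(n))
    # Deal the indices into k folds round-robin (no shuffling in this sketch).
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        test = folds[i]
        train = [j for fold in folds[:i] + folds[i + 1:] for j in fold]
        yield train, test


if __name__ == "__main__":
    train, test = percentage_split(8, 0.75)
    print("percentage split:", train, test)
    for train, test in k_fold(10, k=5):
        print("fold:", train, test)
```

    With a percentage split each record is used for either training or testing, whereas cross-validation averages over k splits, which may be one reason the reported performance changes between the two modes.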

  6. Design droughts as planning tool for ecosystem establishment in post-mining landscapes

    Science.gov (United States)

    Halwatura, Devanmini; Lechner, Alex; Baumgartl, Thomas; McIntyre, Neil; Arnold, Sven

    2015-04-01

    Eastern Australia has considerable mineral and energy resources and areas of high biodiversity value co-occurring over a broad range of agro-climatic environments. Water is the primary abiotic stressor for (agro)ecosystems in many parts of Eastern Australia. In the context of mined land rehabilitation quantifying the severity-duration-frequency (SDF) of droughts is crucial for successful ecosystem rehabilitation to overcome challenges of early vegetation establishment and long-term ecosystem resilience. The objective of this study was to quantify the SDF of short-term and long-term drought events of 11 selected locations across a broad range of agro-climatic environments in Eastern Australia by using three drought indices at different time scales: the Standardized Precipitation Index (SPI), the Reconnaissance Drought Index (RDI), and the Standardized Precipitation-Evapotranspiration Index (SPEI). Based on the indices we derived bivariate distribution functions of drought severity and duration, and estimated the recurrence intervals of drought events at different time scales. The correlation between the simple SPI and the more complex SPEI or RDI was stronger for the tropical and temperate locations than for the arid locations, indicating that SPEI or RDI can be replaced by SPI if evaporation plays a minor role for plant available water. Both short-term and long-term droughts were most severe and prolonged, and occurred most frequently in arid regions, but were relatively rare in tropical and temperate regions. Our approach is similar to intensity-duration-frequency (IDF) analyses of rainfall crucial to design infrastructure. In this regard, we propose to apply SDF analyses of droughts to design ecosystem components in post-mining landscapes. Together with design rainfalls, design droughts should be used to assess rehabilitation strategies and ecological management based on drought recurrence intervals, thereby minimising the risk of failure of initial ecosystem

  7. A Survey on Educational Data Mining and Research Trends

    Directory of Open Access Journals (Sweden)

    Rajni Jindal

    2013-07-01

    Full Text Available Educational Data Mining (EDM) is an emerging field exploring data in educational contexts by applying different Data Mining (DM) techniques/tools. It provides intrinsic knowledge of the teaching and learning process for effective education planning. This survey focuses on the components and research trends (1998 to 2012) of EDM, highlighting its related tools, techniques and educational outcomes. It also highlights the challenges of EDM.

  8. A Survey on Educational Data Mining and Research Trends

    OpenAIRE

    Rajni Jindal; Malaya Dutta Borah

    2013-01-01

    Educational Data Mining (EDM) is an emerging field exploring data in educational contexts by applying different Data Mining (DM) techniques/tools. It provides intrinsic knowledge of the teaching and learning process for effective education planning. This survey focuses on the components and research trends (1998 to 2012) of EDM, highlighting its related tools, techniques and educational outcomes. It also highlights the challenges of EDM.

  9. Hydraulic Mining, Extreme Floods, and the Geomorphic Context of the Trinity River, CA

    Science.gov (United States)

    Krause, A.

    2012-12-01

    The geomorphology of the modern Trinity River is the product of remarkable human impacts on the flow and sediment regimes. Virtually nothing is known about the river prior to the first discovery of gold in 1848. The Trinity River experienced 110 years of hydraulic mining from 1860 to 1970, a time period nearly four times as long as hydraulic mining in the Sierra Nevada Mountains. Hydraulic mining produced vast amounts of debris that aggraded unconfined valleys of the Trinity River, creating sediment wedges up to 4 meters high and 20 kilometers long. Subsequent dredger gold mining dug through the valley alluvium to the bedrock interface, mixing the sediment profile and creating large tailings piles that artificially constrain the valley width. A series of large storms in the mid 20th century reworked these mining sediments, creating the modern terraces and large hydraulic controls that persist today. Subsequent flow regulation diverted up to 90 percent of the basin runoff out of the Trinity River and virtually eliminated floods, causing riparian encroachment, channel narrowing, and a largely static channel. Established in 2000, the Trinity River Restoration Program uses a process-based restoration strategy to create a dynamic channel capable of creating and maintaining sufficient salmonid habitat to meet fish population targets. The associated restoration management actions include flow releases, coarse sediment augmentation, mechanical channel rehabilitation, and watershed restoration. Full implementation of high flow releases to promote channel dynamism began in 2005. In May 2011, a release of 11,000 cfs was conducted for river restoration purposes. The 11,000 cfs peak magnitude release is the maximum authorized for restoration purposes, the largest release since 1974, and the third largest release since flow regulation began in November 1960. The release caused scour and deposition, creating gravel bars in several locations. The location and magnitude of scour and

  10. KnoE: A Web Mining Tool to Validate Previously Discovered Semantic Correspondences

    Institute of Scientific and Technical Information of China (English)

    Jorge Martinez-Gil; José F.Aldana-Montes

    2012-01-01

    The problem of matching schemas or ontologies consists of providing corresponding entities in two or more knowledge models that belong to the same domain but have been developed separately. Nowadays there are many techniques and tools for addressing this problem; however, the complex nature of the matching problem makes existing solutions not fully satisfactory for real situations. The Google Similarity Distance has appeared recently. Its purpose is to mine knowledge from the Web using the Google search engine in order to semantically compare text expressions. Our work consists of developing a software application for validating results discovered by schema and ontology matching tools using the philosophy behind this distance. Moreover, we are interested in using not only Google, but other popular search engines with this similarity distance. The results reveal three main facts. Firstly, some web search engines can help us to validate semantic correspondences satisfactorily. Secondly, there are significant differences among the web search engines. And thirdly, the best results are obtained when using combinations of the web search engines that we have studied.
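
    The similarity distance mentioned in this abstract is commonly written as the Normalized Google Distance, computed from search hit counts. The sketch below is illustrative only and is not KnoE's code: the hit counts would come from a search engine, the per-engine averaging is just one assumed way of "combining" engines, and the 0.5 acceptance threshold is hypothetical.

```python
import math


def normalized_google_distance(fx, fy, fxy, n):
    """NGD from hit counts: fx and fy for each term alone, fxy for both
    terms together, n for the (estimated) number of indexed pages."""
    if fx <= 0 or fy <= 0 or fxy <= 0:
        return float("inf")  # no co-occurrence: treat as maximally distant
    log_fx, log_fy, log_fxy = math.log(fx), math.log(fy), math.log(fxy)
    return (max(log_fx, log_fy) - log_fxy) / (math.log(n) - min(log_fx, log_fy))


def validate_correspondence(counts_by_engine, threshold=0.5):
    """Average the distance over several engines (an assumed combination
    rule) and accept the correspondence if the mean is below the threshold."""
    distances = [normalized_google_distance(*counts)
                 for counts in counts_by_engine.values()]
    mean = sum(distances) / len(distances)
    return mean, mean < threshold


if __name__ == "__main__":
    # Made-up hit counts (fx, fy, fxy, n) for two hypothetical engines.
    counts = {"engine_a": (1000, 1000, 1000, 10**6),
              "engine_b": (100, 100, 10, 10**4)}
    print(validate_correspondence(counts))
```

    Terms that always occur together get a distance of 0, and the distance grows as co-occurrence becomes rarer relative to the individual hit counts.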

  11. CTSS: A Tool for Efficient Information Extraction with Soft Matching Rules for Text Mining

    Directory of Open Access Journals (Sweden)

    A. Christy

    2008-01-01

    Full Text Available The abundance of information available digitally in modern world had made a demand for structured information. The problem of text mining which dealt with discovering useful information from unstructured text had attracted the attention of researchers. The role of Information Extraction (IE) software was to identify relevant information from texts, extracting information from a variety of sources and aggregating it to create a single view. Information extraction systems depended on particular corpora and were poor in recall values. Therefore, developing the system as domain-independent as well as improving the recall was an important challenge for IE. In this research, the authors proposed a domain-independent algorithm for information extraction, called SOFTRULEMINING for extracting the aim, methodology and conclusion from technical abstracts. The algorithm was implemented by combining trigram model with softmatching rules. A tool CTSS was constructed using SOFTRULEMINING and was tested with technical abstracts of www.computer.org and www.ansinet.org and found that the tool had improved its recall value and therefore the precision value in comparison with other search engines.

  12. EFTfitter: a tool for interpreting measurements in the context of effective field theories

    Energy Technology Data Exchange (ETDEWEB)

    Castro, Nuno [Universidade do Minho, Laboratorio de Instrumentacao e Fisica Experimental de Particulas, Departamento de Fisica, Braga (Portugal); Universidade do Porto, Departamento de Fisica e Astronomia, Faculdade de Ciencias, Porto (Portugal); Erdmann, Johannes; Grunwald, Cornelius; Kroeninger, Kevin [TU Dortmund, Lehrstuhl fuer Experimentelle Physik IV, Dortmund (Germany); Rosien, Nils-Arne [Universitaet Goettingen, II. Physikalisches Institut, Goettingen (Germany)

    2016-08-15

    Over the past years, the interpretation of measurements in the context of effective field theories has attracted much attention in the field of particle physics. We present a tool for interpreting sets of measurements in such models using a Bayesian ansatz by calculating the posterior probabilities of the corresponding free parameters numerically. An example is given, in which top-quark measurements are used to constrain anomalous couplings at the Wtb-vertex. (orig.)

  13. Strategic aspects of management organizational structure improvement in the context of ore mining companies

    OpenAIRE

    L.M. Varava; A.R. Arutyunyan; A.A. Varava

    2016-01-01

    The aim of the article. Exemplified by the large Ukrainian mining companies, the article substantiates the need of improving management organizational structures (MOSs) to enhance the implementation efficiency of strategies developed. The purpose of the research is to improve the management organizational structure of a company to enhance the latter’s strategic efficiency, taking into account the industry specifics, marketing characteristics, information support, and relevant capacity for div...

  14. Application of Data Mining Tools for Selected Scripts of Stock Market

    Directory of Open Access Journals (Sweden)

    K.S.Mahajan

    2014-07-01

    Full Text Available One of the most important problems in modern finance is finding efficient ways to summarize and visualize stock market data to give individuals or institutions useful information about market behavior for investment decisions. Investment can be considered one of the fundamental pillars of the national economy. At present, many investors look for criteria to compare stocks and select the best, and they choose strategies that maximize the earning value of the investment process. The enormous amount of valuable data generated by the stock market has therefore attracted researchers to explore this problem domain using different methodologies, and research in data mining has gained attention due to the importance of its applications and the increasing generation of information. Data mining tools such as association rules, the rule induction method and the Apriori algorithm are used to find associations between different scripts of the stock market, and much research and development has taken place regarding the reasons for fluctuations in the Indian stock exchange. Nowadays, two important factors, gold prices and US dollar prices, increasingly dominate the Indian stock market; statistical correlation is used to find the correlation between gold prices, dollar prices and the BSE index, which helps the activities of stock operators, brokers, investors and jobbers. These activities are based on forecasting the fluctuation of index share prices, gold prices, dollar prices and customer transactions. Hence the researcher has considered these problems as a topic for research.

  15. Application of Data Mining Tools for Selected Scripts of Stock Market

    Directory of Open Access Journals (Sweden)

    K. S. Mahajan

    2014-07-01

    Full Text Available One of the most important problems in modern finance is finding efficient ways to summarize and visualize stock market data to give individuals or institutions useful information about market behavior for investment decisions. Investment can be considered one of the fundamental pillars of the national economy. At present, many investors look for criteria to compare stocks and select the best, and they choose strategies that maximize the earning value of the investment process. The enormous amount of valuable data generated by the stock market has therefore attracted researchers to explore this problem domain using different methodologies, and research in data mining has gained attention due to the importance of its applications and the increasing generation of information. Data mining tools such as association rules, the rule induction method and the Apriori algorithm are used to find associations between different scripts of the stock market, and much research and development has taken place regarding the reasons for fluctuations in the Indian stock exchange. Nowadays, two important factors, gold prices and US dollar prices, increasingly dominate the Indian stock market; statistical correlation is used to find the correlation between gold prices, dollar prices and the BSE index, which helps the activities of stock operators, brokers, investors and jobbers. These activities are based on forecasting the fluctuation of index share prices, gold prices, dollar prices and customer transactions. Hence the researcher has considered these problems as a topic for research.
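
    Association rules of the kind this abstract refers to are usually scored by support and confidence, the two measures the Apriori algorithm thresholds on. The sketch below only illustrates those two measures; the script names "A", "B", "C" and the daily baskets are made up and do not come from the paper.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in itemset."""
    itemset = set(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Estimated P(consequent | antecedent) for the rule antecedent -> consequent."""
    both = set(antecedent) | set(consequent)
    return support(transactions, both) / support(transactions, antecedent)


if __name__ == "__main__":
    # Hypothetical data: the set of scripts whose price rose on each trading day.
    days = [{"A", "B"}, {"A", "B", "C"}, {"A"}, {"B", "C"}]
    print("support(A, B):", support(days, {"A", "B"}))
    print("confidence(A -> B):", confidence(days, {"A"}, {"B"}))
```

    A rule such as A -> B would be kept only if both values exceed user-chosen minimum thresholds.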

  16. The evolutionary origins and ecological context of tool use in New Caledonian crows.

    Science.gov (United States)

    Rutz, Christian; St Clair, James J H

    2012-02-01

    which prolonged juvenile development enables acquisition of complex behaviours. Intriguingly, humans may well have influenced the evolution of at least some of the species' tool-oriented behaviours, via their possible introduction of candlenut trees together with the beetle larvae that infest them. Research on NC crows' tool-use behaviour in its full ecological context is still in its infancy, and we expect that, as more evidence accumulates, some of our assumptions and predictions will be proved wrong. However, it is clear from our analysis of existing work, and the development of some original ideas, that the unusual evolutionary trajectory of NC crows is probably the consequence of an intricate constellation of interplaying factors. Copyright © 2012 Elsevier B.V. All rights reserved.

  17. Muslim American adolescents' explanations of changing religious practices: Cultural tools in cultural contexts.

    Science.gov (United States)

    Cain, Kathleen M; Schiro, Isabella N; Gregory, Wesley E; Westberg, Lindsay M; Lee, Samantha R; Boyle, Colleen D

    2017-03-01

    , especially in regard to religious development. Participants reported a wide array of changes in religious practices, and they described these changes as responses to social and cultural influences. Participants' descriptions of changing practices can be understood through a Vygotskian framework in which religious practices are cultural tools that both respond to and shape surrounding cultural contexts.

  18. SMM-system: A mining tool to identify specific markers in Salmonella enterica.

    Science.gov (United States)

    Yu, Shuijing; Liu, Weibing; Shi, Chunlei; Wang, Dapeng; Dan, Xianlong; Li, Xiao; Shi, Xianming

    2011-03-01

    This report presents SMM-system, a software package that implements various personalized pre- and post-BLASTN tasks for mining specific markers of microbial pathogens. The main functionalities of SMM-system are summarized as follows: (i) converting multi-FASTA file, (ii) cutting interesting genomic sequence, (iii) automatic high-throughput BLASTN searches, and (iv) screening target sequences. The utility of SMM-system was demonstrated by using it to identify 214 Salmonella enterica-specific protein-coding sequences (CDSs). Eighteen primer pairs were designed based on eighteen S. enterica-specific CDSs, respectively. Seven of these primer pairs were validated with PCR assay, which showed 100% inclusivity for the 101 S. enterica genomes and 100% exclusivity of 30 non-S. enterica genomes. Three specific primer pairs were chosen to develop a multiplex PCR assay, which generated specific amplicons with a size of 180bp (SC1286), 238bp (SC1598) and 405bp (SC4361), respectively. This study demonstrates that SMM-system is a high-throughput specific marker generation tool that can be used to identify genus-, species-, serogroup- and even serovar-specific DNA sequences of microbial pathogens, which has a potential to be applied in food industries, diagnostics and taxonomic studies. SMM-system is freely available and can be downloaded from http://foodsafety.sjtu.edu.cn/SMM-system.html.

  19. YeastMine--an integrated data warehouse for Saccharomyces cerevisiae data as a multipurpose tool-kit.

    Science.gov (United States)

    Balakrishnan, Rama; Park, Julie; Karra, Kalpana; Hitz, Benjamin C; Binkley, Gail; Hong, Eurie L; Sullivan, Julie; Micklem, Gos; Cherry, J Michael

    2012-01-01

    The Saccharomyces Genome Database (SGD; http://www.yeastgenome.org/) provides high-quality curated genomic, genetic, and molecular information on the genes and their products of the budding yeast Saccharomyces cerevisiae. To accommodate the increasingly complex, diverse needs of researchers for searching and comparing data, SGD has implemented InterMine (http://www.InterMine.org), an open source data warehouse system with a sophisticated querying interface, to create YeastMine (http://yeastmine.yeastgenome.org). YeastMine is a multifaceted search and retrieval environment that provides access to diverse data types. Searches can be initiated with a list of genes, a list of Gene Ontology terms, or lists of many other data types. The results from queries can be combined for further analysis and saved or downloaded in customizable file formats. Queries themselves can be customized by modifying predefined templates or by creating a new template to access a combination of specific data types. YeastMine offers multiple scenarios in which it can be used such as a powerful search interface, a discovery tool, a curation aid and also a complex database presentation format. DATABASE URL: http://yeastmine.yeastgenome.org.

  20. Smart-card-based automatic meal record system intervention tool for analysis using data mining approach.

    Science.gov (United States)

    Zenitani, Satoko; Nishiuchi, Hiromu; Kiuchi, Takahiro

    2010-04-01

    The Smart-card-based Automatic Meal Record system for company cafeterias (AutoMealRecord system) was recently developed and used to monitor employee eating habits. The system could be a unique nutrition assessment tool for automatically monitoring the meal purchases of all employees, although it only focuses on company cafeterias and has never been validated. Before starting an interventional study, we tested the reliability of the data collected by the system using the data mining approach. The AutoMealRecord data were examined to determine if it could predict current obesity. All data used in this study (n = 899) were collected by a major electric company based in Tokyo, which has been operating the AutoMealRecord system for several years. We analyzed dietary patterns by principal component analysis using data from the system and extracted 5 major dietary patterns: healthy, traditional Japanese, Chinese, Japanese noodles, and pasta. The ability to predict current body mass index (BMI) with dietary preference was assessed with multiple linear regression analyses, and in the current study, BMI was positively correlated with male gender, preference for "Japanese noodles," mean energy intake, protein content, and frequency of body measurement at a body measurement booth in the cafeteria. There was a negative correlation with age, dietary fiber, and lunchtime cafeteria use (R² = 0.22). This regression model predicted "would-be obese" participants (BMI ≥ 23) with 68.8% accuracy by leave-one-out cross validation. This shows that there was sufficient predictability of BMI based on data from the AutoMealRecord System. We conclude that the AutoMealRecord system is valuable for further consideration as a health care intervention tool.
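
    Leave-one-out cross validation, as named in this abstract, refits the model once per participant with that participant held out, then checks the held-out prediction. The sketch below is a simplification and not the study's analysis: it uses a single hypothetical predictor instead of the multiple regression described, and the numbers in the usage lines are invented.

```python
def fit_line(xs, ys):
    """Ordinary least squares for one predictor: returns (slope, intercept)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def loocv_accuracy(xs, ys, cutoff=23.0):
    """Hold out each record in turn, refit on the rest, and score whether the
    predicted BMI falls on the same side of the cutoff as the observed BMI."""
    hits = 0
    for i in range(len(xs)):
        slope, intercept = fit_line(xs[:i] + xs[i + 1:], ys[:i] + ys[i + 1:])
        predicted = slope * xs[i] + intercept
        if (predicted >= cutoff) == (ys[i] >= cutoff):
            hits += 1
    return hits / len(xs)


if __name__ == "__main__":
    # Invented values: one dietary score per person and the observed BMI.
    scores = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
    bmi = [20.5, 21.5, 22.5, 23.5, 24.5, 25.5]
    print(loocv_accuracy(scores, bmi))
```

    Because every record serves as the test case exactly once, the resulting accuracy uses all of the data without ever scoring a model on a record it was fitted to.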

  1. Analysis of new tools for equipment scheduling in underground mining and ore safety stock management

    OpenAIRE

    Karstila, Juha

    2015-01-01

    This Master’s thesis was carried out to test the efficiency of equipment scheduling optimization software for underground mining, developed by Song et al. in 2014. Another focus of this thesis is mine inventory management, more specifically the optimization of safety stock levels of ore supply using a real options model developed by Song in 2015. Both the equipment scheduling software and the safety stock optimization model were tested at Agnico Eagle Kittilä mine with real mine p...

  2. Architectures for health: between historical contexts and suburban areas. Tool to support location strategies

    Directory of Open Access Journals (Sweden)

    Marta Dell’Ovo

    2016-11-01

    Full Text Available The rapid obsolescence of healthcare facilities and the lack of suitable areas to host these services in urban contexts are forcing their location outside cities and historical centres. The complexity and variety of crucial aspects involved in the smooth functioning of healthcare facilities require, in the long term, planning decisions supported by appropriate evaluation tools. Starting from an investigation of hospital location trends in Europe and Italy, an analysis of European case studies and a deep systematic review of the literature, the paper aims to define a set of criteria focused on the design of location strategies for new healthcare facilities.

  3. NFFinder: an online bioinformatics tool for searching similar transcriptomics experiments in the context of drug repositioning.

    Science.gov (United States)

    Setoain, Javier; Franch, Mònica; Martínez, Marta; Tabas-Madrid, Daniel; Sorzano, Carlos O S; Bakker, Annette; Gonzalez-Couto, Eduardo; Elvira, Juan; Pascual-Montano, Alberto

    2015-07-01

    Drug repositioning, using known drugs for treating conditions different from those the drug was originally designed to treat, is an important drug discovery tool that allows for a faster and cheaper development process by using drugs that are already approved or in an advanced trial stage for another purpose. This is especially relevant for orphan diseases because they affect too few people to make drug research de novo economically viable. In this paper we present NFFinder, a bioinformatics tool for identifying potential useful drugs in the context of orphan diseases. NFFinder uses transcriptomic data to find relationships between drugs, diseases and a phenotype of interest, as well as identifying experts having published on that domain. The application shows in a dashboard a series of graphics and tables designed to help researchers formulate repositioning hypotheses and identify potential biological relationships between drugs and diseases. NFFinder is freely available at http://nffinder.cnb.csic.es.

  4. Development Tools for Context Aware and Secure Pervasive Computing in Embedded Systems (PECES) Middleware

    Directory of Open Access Journals (Sweden)

    Ran Zhao

    2013-01-01

    Full Text Available The main objective of the PECES project is the development of system software to enable communication among heterogeneous devices across multiple smart spaces, breaking the traditional barrier of “smart islands” where only the services offered in a nearby spatial area can be used easily. PECES development tools help the application developer to build and test PECES middleware based applications. This paper presents a set of tools, namely Peces Project, Peces Device Definition, Peces Ontology Instantiation, Peces Security Configuration, Peces Service Definition, Peces Role Specification Definition, Peces Hierarchical Role Specification Definition, Peces Event Editor, Peces Event Diagram and Peces Testing, which enable application developers to build, model and test PECES middleware based smart space applications using novel concepts such as role assignment, context ontologies and security.

  5. DynGO: a tool for visualizing and mining of Gene Ontology and its associations

    Directory of Open Access Journals (Sweden)

    Wu Cathy H

    2005-08-01

    Full Text Available Abstract Background: A large volume of data and information about genes and gene products has been stored in various molecular biology databases. A major challenge for knowledge discovery using these databases is to identify related genes and gene products in disparate databases. The development of Gene Ontology (GO) as a common vocabulary for annotation allows integrated queries across multiple databases and identification of semantically related genes and gene products (i.e., genes and gene products that have similar GO annotations). Meanwhile, dozens of tools have been developed for browsing, mining or editing GO terms, their hierarchical relationships, or their "associated" genes and gene products (i.e., genes and gene products annotated with GO terms). Tools that allow users to directly search and inspect relations among all GO terms and their associated genes and gene products from multiple databases are needed. Results: We present a standalone package called DynGO, which provides several advanced functionalities in addition to the standard browsing capability of the official GO browsing tool (AmiGO). DynGO allows users to conduct batch retrieval of GO annotations for a list of genes and gene products, and semantic retrieval of genes and gene products sharing similar GO annotations. The results are shown in an association tree organized according to GO hierarchies and supported with many dynamic display options such as sorting tree nodes or changing the orientation of the tree. For GO curators and frequent GO users, DynGO provides fast and convenient access to GO annotation data. DynGO is generally applicable to any data set where the records are annotated with GO terms, as illustrated by two examples. Conclusion: We have presented a standalone package DynGO that provides functionalities to search and browse GO and its association databases as well as several additional functions such as batch retrieval and semantic retrieval. The complete

  6. An Evaluation of Text Mining Tools as Applied to Selected Scientific and Engineering Literature.

    Science.gov (United States)

    Trybula, Walter J.; Wyllys, Ronald E.

    2000-01-01

    Addresses an approach to the discovery of scientific knowledge through an examination of data mining and text mining techniques. Presents the results of experiments that investigated knowledge acquisition from a selected set of technical documents by domain experts. (Contains 15 references.) (Author/LRW)

  7. Assessment of Spatial Data Mining Tools for Integration into an Object-Oriented GIS (GIDB)

    Science.gov (United States)

    2007-11-02

    A variety of data mining techniques are under evaluation on the spatial data of concern in our setting. We are planning to integrate a number of...these techniques into our geospatial system (GIDB). Three approaches are under special consideration and are described in the paper. A COTS data mining system

  8. An Evaluation of Text Mining Tools as Applied to Selected Scientific and Engineering Literature.

    Science.gov (United States)

    Trybula, Walter J.; Wyllys, Ronald E.

    2000-01-01

    Addresses an approach to the discovery of scientific knowledge through an examination of data mining and text mining techniques. Presents the results of experiments that investigated knowledge acquisition from a selected set of technical documents by domain experts. (Contains 15 references.) (Author/LRW)

  9. Evaluation of carcinogenic modes of action for pesticides in fruit on the Swedish market using a text-mining tool

    OpenAIRE

    Silins, Ilona; Korhonen, Anna; Stenius, Ulla

    2014-01-01

    Toxicity caused by chemical mixtures has emerged as a significant challenge for toxicologists and risk assessors. Information on individual chemicals' modes of action is an important part of the hazard identification step. In this study, an automatic text mining-based tool was employed as a method to identify the carcinogenic modes of action of pesticides frequently found in fruit on the Swedish market. The current available scientific literature on the 26 most common pesticides found in appl...

  10. Mining Affective Context in Short Films for Emotion-Aware Recommendation

    OpenAIRE

    Orellana-Rodriguez, Claudia; Díaz-Aviles, Ernesto; Nejdl, Wolfgang

    2015-01-01

    Emotion is fundamental to human experience and impacts our daily activities and decision-making processes where, e.g., the affective state of a user influences whether or not she decides to consume a recommended item (movie, book, product or service). However, information retrieval and recommendation tasks have largely ignored emotion as a source of user context, in part because emotion is difficult to measure and easy to misunderstand. In this paper we explore the role of emotions in short films and ...

  11. Hemodynamic flow modeling through an abdominal aorta aneurysm using data mining tools.

    Science.gov (United States)

    Filipovic, Nenad; Ivanovic, Milos; Krstajic, Damjan; Kojic, Milos

    2011-03-01

    Geometrical changes of blood vessels, called aneurysms, occur often in humans with possibly catastrophic outcomes. The blood flow is then enormously affected, as are the hemodynamic interaction forces acting on the arterial wall. These forces are the cause of wall rupture. A mechanical quantity characteristic of the blood-wall interaction is the wall shear stress, which also has direct physiological effects on endothelial cell behavior. Therefore, it is very important to have an insight into the blood flow and shear stress distribution when an aneurysm has developed, in order to help correlate the mechanical conditions with the pathogenesis of pathological changes in the blood vessels. This insight can further help in improving the prevention of cardiovascular disease evolution. Computational fluid dynamics (CFD) has been used in general as a tool to generate results for the mechanical conditions within blood vessels with and without aneurysms. However, aneurysms are very patient specific, and reliable results from CFD analyses can only be obtained by a cumbersome and time-consuming process of computational model generation followed by huge computations. In order to make the CFD analyses efficient and suitable for future everyday clinical practice, we have here employed data mining (DM) techniques. The focus was to combine the CFD and DM methods for the estimation of the wall shear stresses in an abdominal aorta aneurysm (AAA) under prescribed geometrical changes. Additionally, computing on the grid infrastructure was performed to improve efficiency, since thousands of CFD runs were needed for creating machine learning data. We used several DM techniques and found that our DM models provide good prediction of the shear stress at the AAA in comparison with full CFD model results on real patient data.

  12. The Sustainability Analysis Framework: An Effective Knowledge Communication Tool in a Whole of Government Context

    Directory of Open Access Journals (Sweden)

    Kendal Hodgman

    2007-06-01

    Full Text Available This paper focuses on the structural formation of the Sustainability Analysis Framework (SAF), which has proved to be an effective knowledge communication tool in the largest state administration in Australia, the Government of New South Wales. The SAF, devised by the author, has been implemented across thirty-eight state agencies and demonstrates that a whole of government Sustainability perspective is achievable. The level of the NSW Government's commitment to this process is evidenced by the fact that it was coordinated at the highest levels of the administration and engaged high-level input from a comprehensive portfolio of its agencies. The SAF's successful deployment across this significant bureaucracy shows that the generally applicable mechanism is effective in data collation, information sharing, knowledge organisation and the communication of Sustainability practice and wisdom. At the heart of the paper is the author's contention that one of the key problems which jeopardises our common future on Earth is the lack of effective tools to communicate Sustainability thinking and practice. In the course of the project, upon which this paper is based, the author identified the need for a visually and conceptually accessible mechanism to accelerate the uptake of Sustainability practice in a whole of organization context. In designing the SAF, which has successfully bridged this knowledge communication gap, the author employed two of humanity's fundamental learning tools – the diagram and the story.

  13. Vetiver Grass: a potential tool for phytoremediation of iron ore mine site spoil dump

    OpenAIRE

    Anita Mukherjee; Khanindra Pathak

    2015-01-01

    The impact of mining has led to the generation of a large amount of spoil dumps that have become dangerous to human health, wildlife and biodiversity. Thus it is essential that the post mining areas and waste land generated be rapidly vegetated. Vetiver grass (Chrysopogon zizanioides (L.) Roberty) is a tropical plant which grows naturally in various soil conditions and is well known for its ability to resist DNA damage while growing on typically polluted soil conditions. The spoil dum...

  14. Female Teachers’ Perception of Reflective Teaching as a Teacher Development Tool in the Saudi Context

    Directory of Open Access Journals (Sweden)

    Nadia Shukri

    2014-10-01

    Full Text Available Teacher development in the EFL context has been well-established in Western educational institutions. However, although there are some limited studies in the Middle East, it is still under-researched in Saudi Arabia. This study investigates the EFL teachers’ perceptions of reflective teaching as a tool for teacher development and its challenges in higher education in Saudi Arabia. It also aims to raise teachers’ awareness of the importance of teacher development and its empowering impact in meeting their students’ needs. Based on a review of previous studies, the present study hypothesized a significant relevance between reflective teaching practice and the promotion of professional development. A structured questionnaire is used as a quantitative methodology followed by qualitative analysis of the findings. The findings support the hypothesis in which the participants’ views revealed a strong relationship between reflective teaching and professionalism in teaching. Finally, further recommendations that suggest methods of reflective teaching are proposed. 

  15. ChemicalTagger: A tool for semantic text-mining in chemistry

    Directory of Open Access Journals (Sweden)

    Hawizy Lezan

    2011-05-01

    Full Text Available Abstract Background The primary method for scientific communication is in the form of published scientific articles and theses which use natural language combined with domain-specific terminology. As such, they contain free-flowing unstructured text. Given the usefulness of data extraction from unstructured literature, we aim to show how this can be achieved for the discipline of chemistry. The highly formulaic style of writing most chemists adopt makes their contributions well suited to high-throughput Natural Language Processing (NLP) approaches. Results We have developed the ChemicalTagger parser as a medium-depth, phrase-based semantic NLP tool for the language of chemical experiments. Tagging is based on a modular architecture and uses a combination of OSCAR, domain-specific regex and English taggers to identify parts-of-speech. The ANTLR grammar is used to structure this into tree-based phrases. Using a metric that allows for overlapping annotations, we achieved machine-annotator agreements of 88.9% for phrase recognition and 91.9% for phrase-type identification (Action names). Conclusions It is possible to parse chemical experimental text using rule-based techniques in conjunction with a formal grammar parser. ChemicalTagger has been deployed for over 10,000 patents and has identified solvents from their linguistic context with >99.5% precision.

  16. CAFÉ-Map: Context Aware Feature Mapping for mining high dimensional biomedical data.

    Science.gov (United States)

    Minhas, Fayyaz Ul Amir Afsar; Asif, Amina; Arif, Muhammad

    2016-12-01

    Feature selection and ranking is of great importance in the analysis of biomedical data. In addition to reducing the number of features used in classification or other machine learning tasks, it allows us to extract meaningful biological and medical information from a machine learning model. Most existing approaches in this domain do not directly model the fact that the relative importance of features can be different in different regions of the feature space. In this work, we present a context aware feature ranking algorithm called CAFÉ-Map. CAFÉ-Map is a locally linear feature ranking framework that allows recognition of important features in any given region of the feature space or for any individual example. This allows for simultaneous classification and feature ranking in an interpretable manner. We have benchmarked CAFÉ-Map on a number of toy and real world biomedical data sets. Our comparative study with a number of published methods shows that CAFÉ-Map achieves better accuracies on these data sets. The top ranking features obtained through CAFÉ-Map in a gene profiling study correlate very well with the importance of different genes reported in the literature. Furthermore, CAFÉ-Map provides a more in-depth analysis of feature ranking at the level of individual examples. CAFÉ-Map Python code is available at: http://faculty.pieas.edu.pk/fayyaz/software.html#cafemap . The CAFÉ-Map package supports parallelization and sparse data and provides example scripts for classification. This code can be used to reconstruct the results given in this paper. Copyright © 2016 Elsevier Ltd. All rights reserved.

  17. LAS HERRAMIENTAS PREHISTÓRICAS DE LAS MINAS DE VARISCITA DE PALAZUELO DE LAS CUEVAS (ZAMORA) Y PICO CENTENO (HUELVA). ANÁLISIS COMPARATIVO/Prehistoric mining tools of the variscite mines of Palazuelo de las Cuevas (Zamora) and Pico Centeno (Huelva). A comparative analysis

    National Research Council Canada - National Science Library

    Rodrigo Villalobos García; Carlos P Odriozola Lloret

    2016-01-01

    ... (Aliste district, Zamora) and Pico Centeno (Encinasola, Huelva). Knapped stone mining tools are not absent from the Iberian prehistoric mines, but they appear in low frequency and, probably for that reason, had not attracted scholars' attention...

  18. Information and diagnostic tools of objective control as means to improve performance of mining machines

    Science.gov (United States)

    Zvonarev, I. E.; Shishlyannikov, D. I.

    2017-02-01

    The paper justifies the relevance of developing and implementing automated onboard systems for operation data and maintenance recording in heading-and-winning machines. The analysis of advantages and disadvantages of existing automated onboard systems for operation data and maintenance recording in heading-and-winning machines for potassium mines are presented. The basic technical requirements for the design, operating algorithms and functions of recording systems of mining machines for potassium mines are formulated. A method of controlling operating parameters is presented; the concept of the onboard automated recording system for the Ural heading-and-winning machine is outlined. The results of experimental studies of variations in loading of the Ural-20R miner’s operating member drives, using the VATUR portable measuring complex, are given. It is proved that existing means of objective control of operating parameters of the URAL-20R heading-and-winning machine do not assure its optimal operation. The authors present a technique of analyzing the data provided by parameter recorders that allow increasing efficiency of mechanical complexes by determining numerical values characterizing the technical and technological level of potassium ore production organization. The efficiency assessment criteria for engineering and maintenance departments of mining enterprises are advanced. A technology of continuous automated monitoring of potassium mine’s outburst hazard is described.

  19. Tools and strategies for instrument monitoring, data mining and data access

    Science.gov (United States)

    van Hees, R. M.

    2009-04-01

    particular, the packet table API allows very compact storage of compound data sets and very fast read/write access. Details about this implementation and pitfalls will be given in the presentation. [Data Mining] The ability to select relevant data is a requirement that all data centers have to offer. The NL-SCIA-DC allows the users to select data using several criteria including: time, geo-location, type of observation and data quality. The results of the query are [i] location and name of relevant data products (files), or [ii] listing of meta data of the relevant measurements, or [iii] listing of the measurements (level 2 or higher). For this application, we need the power of a relational database, the SQL language, and the availability of spatial functions. PostgreSQL, extended with PostGIS support, turned out to be a good choice. Common queries on tables with millions of rows can be executed within seconds. [Data Deployment] The dissemination of scientific data is often made cumbersome by the use of many different formats to store the products. Therefore, time-consuming and inefficient conversions are needed to use data products from different origins. Within the Atmospheric Data Access for the Geospatial User Community (ADAGUC) project we provide selected space borne atmospheric and land data sets in the same data format and consistent internal structure, so that users can easily use and combine data. The common format for storage is HDF5, but the netCDF-4 API is used to create the data sets. The standard for metadata and dataset attributes follows the netCDF Climate and Forecast conventions; in addition, metadata complying with the ISO 19115:2003 INSPIRE profile are added. The advantage of netCDF-4 is that the API is essentially equal to netCDF-3 (with a few extensions), while the data format is HDF5 (recognized by many scientific tools). The added metadata ensures product traceability. Details will be given in the presentation and several posters.

  20. Dendritic Cells in the Context of Human Tumors: Biology and Experimental Tools.

    Science.gov (United States)

    Volovitz, Ilan; Melzer, Susanne; Amar, Sarah; Bocsi, József; Bloch, Merav; Efroni, Sol; Ram, Zvi; Tárnok, Attila

    2016-01-01

    Dendritic cells (DC) are the most potent and versatile antigen-presenting cells (APC) in the immune system. DC have an exceptional ability to comprehend the immune context of a captured antigen based on molecular signals identified from its vicinity. The analyzed information is then conveyed to other immune effector cells. Such capability enables DC to play a pivotal role in mediating either an immunogenic response or immune tolerance towards an acquired antigen. This review summarizes current knowledge on DC in the context of human tumors. It covers the basics of human DC biology, elaborating on the different markers, morphology and function of the different subsets of human DC. Human blood-borne DC are comprised of at least three subsets consisting of one plasmacytoid DC (pDC) and two to three myeloid DC (mDC) subsets. Some tissues have unique DC. Each subset has a different phenotype and function and may induce pro-tumoral or anti-tumoral effects. The review also discusses two methods fundamental to the research of DC on the single-cell level: multicolor flow cytometry (FCM) and image-based cytometry (IC). These methods, along with new genomics and proteomics tools, can provide high-resolution information on specific DC subsets and on immune and tumor cells with which they interact. The different layers of collected biological data may then be integrated using Immune-Cytomics modeling approaches. Such novel integrated approaches may help unravel the complex network of cellular interactions that DC carry out within tumors, and may help harness this complex immunological information into the development of more effective treatments for cancer.

  1. Evaluation of carcinogenic modes of action for pesticides in fruit on the Swedish market using a text-mining tool.

    Science.gov (United States)

    Silins, Ilona; Korhonen, Anna; Stenius, Ulla

    2014-01-01

    Toxicity caused by chemical mixtures has emerged as a significant challenge for toxicologists and risk assessors. Information on individual chemicals' modes of action is an important part of the hazard identification step. In this study, an automatic text mining-based tool was employed as a method to identify the carcinogenic modes of action of pesticides frequently found in fruit on the Swedish market. The current available scientific literature on the 26 most common pesticides found in apples and oranges was evaluated. The literature was classified according to a taxonomy that specifies the main type of scientific evidence used for determining carcinogenic properties of chemicals. The publication profiles of many pesticides were similar, containing evidence for both genotoxic and non-genotoxic modes of action, including effects such as oxidative stress, chromosomal changes and cell proliferation. We also found that 18 of the 26 pesticides studied here had previously caused tumors in at least one animal species, findings which support the mode of action data. This study shows how a text-mining tool could be used to identify carcinogenic modes of action for a group of chemicals in large quantities of text. This strategy could support the risk assessment process of chemical mixtures.

  2. Evaluation of carcinogenic modes of action for pesticides in fruit on the Swedish market using a text-mining tool

    Directory of Open Access Journals (Sweden)

    Ilona Silins

    2014-06-01

    Full Text Available Toxicity caused by chemical mixtures has emerged as a significant challenge for toxicologists and risk assessors. Information on individual chemicals’ modes of action is an important part of the hazard identification step. In this study, an automatic text mining-based tool was employed as a method to identify the carcinogenic modes of action of pesticides frequently found in fruit on the Swedish market. The current available scientific literature on the 26 most common pesticides found in apples and oranges was evaluated. The literature was classified according to a taxonomy that specifies the main type of scientific evidence used for determining carcinogenic properties of chemicals. The publication profiles of many pesticides were similar, containing evidence for both genotoxic and non-genotoxic modes of action, including effects such as oxidative stress, chromosomal changes and cell proliferation. We also found that 18 of the 26 pesticides studied here had previously caused tumors in at least one animal species, findings which support the mode of action data. This study demonstrates how a text-mining tool could be used to identify carcinogenic modes of action for a group of chemicals in large quantities of text. The strategy could support the risk assessment process of chemical mixtures.

  3. Automated tools to be used for ascertaining structural condition in South African hard rock mines

    CSIR Research Space (South Africa)

    Teleka, R

    2011-11-01

    Full Text Available imagery, sonic beacons I. INTRODUCTION The condition of hard rock mines in South Africa is known to be both laborious and hazardous. It is therefore not uncommon for accidents to occur in the industry. The majority of accidents and therefore... is using simple ultrasonic beacons [14]. These beacons will provide the position of the safety data collected, which will enable construction of a safety map. The safety map will enable inspectors visualize the condition of the rock mass prior...

  4. An Integrated Suite of Text and Data Mining Tools - Phase II

    Science.gov (United States)

    2007-11-02

    USA, July, 2003 [10] Watts, Robert J., Porter, Alan L., “R&D Cluster Quality Measures and Technology Maturity,” Technology Forecasting & Social ...science & technology documents databases, illustrated for the case of ‘knowledge discovery and data mining,’ Ciencia da Informacao 28 (1999) 1-8. [13...intelligence and forecasting, Technol. Forecast. and Social Change 69 (2002) 495-506. [23] R.J. Watts, Knowledge discovery using the Tech OASIS: Meeting

  5. Chemometrics and vibrational spectroscopy as green tools for mine phytoremediation strategies

    Science.gov (United States)

    Mokgalaka-Matlala, N. S.; Regnier, T.; Combrinck, S.; Kouekam, C. R.; Weiersbye, I. M.

    This study describes the use of near infrared (NIR) spectroscopy in combination with chemometrics to characterise Combretum erythrophyllum plant material to determine differences in the chemical profiles of samples harvested from mine contaminated areas and those of natural populations. The chemometric computation of near infrared vibrational spectra was used to generate principal component analysis and partial least squares models. These models were used to determine seasonal differences in the chemical matrices of samples harvested from the mine sites with different levels of contamination. Principal component analysis scatter plots illustrated clustering of phenolic profiles of samples depending on whether they originated from contaminated or uncontaminated soils. A partial least squares model was developed to link the variations in the chemical composition and levels of contamination in all samples collected in the same season (autumn). The levels of total soluble phenolic compounds in leaf extracts of C. erythrophyllum were measured using the Folin-Ciocalteau assay. Data analysis of the samples revealed that plants harvested from mine sites, particularly in summer, produced a higher level of phenolic compounds than those of the natural population, thereby displaying a good correlation with the chemometric models.

  6. Intrinsic evaluation of text mining tools may not predict performance on realistic tasks.

    Science.gov (United States)

    Caporaso, J Gregory; Deshpande, Nita; Fink, J Lynn; Bourne, Philip E; Cohen, K Bretonnel; Hunter, Lawrence

    2008-01-01

    Biomedical text mining and other automated techniques are beginning to achieve performance which suggests that they could be applied to aid database curators. However, few studies have evaluated how these systems might work in practice. In this article we focus on the problem of annotating mutations in Protein Data Bank (PDB) entries, and evaluate the relationship between performance of two automated techniques, a text-mining-based approach (MutationFinder) and an alignment-based approach, in intrinsic versus extrinsic evaluations. We find that high performance on gold standard data (an intrinsic evaluation) does not necessarily translate to high performance for database annotation (an extrinsic evaluation). We show that this is in part a result of lack of access to the full text of journal articles, which appears to be critical for comprehensive database annotation by text mining. Additionally, we evaluate the accuracy and completeness of manually annotated mutation data in the PDB, and find that it is far from perfect. We conclude that currently the most cost-effective and reliable approach for database annotation might incorporate manual and automatic annotation methods.

  7. Supporting Small-Group Learning Using Multiple Web 2.0 Tools: A Case Study in the Higher Education Context

    Science.gov (United States)

    Laru, Jari; Naykki, Piia; Jarvela, Sanna

    2012-01-01

    In this single-case study, small groups of learners were supported by use of multiple social software tools and face-to-face activities in the context of higher education. The aim of the study was to explore how designed learning activities contribute to students' learning outcomes by studying probabilistic dependencies between the variables.…

  8. New tools to monitor personal exposure to respirable coal mine dust

    Energy Technology Data Exchange (ETDEWEB)

    Volkwein, J.C.; Thimons, E.D. [NIOSH/Pittsburgh Research Laboratory, Pittsburgh, PA (USA)

    2001-07-01

    The compliance sampling method for coal mine dust in the United States has not changed appreciably in the last 30 years. A call for more frequent sampling with immediately available results is leading to new instruments that may supplement or replace the existing sampler. This report discusses two such instruments that provide immediate results of dust levels. The first, called the Respirable Dust Dosimeter (RDD), is designed to supplement the existing U.S. cyclone compliance sampling method. It uses the pressure drop across a filter to provide an inexpensive screening type of measurement. The RDD has been tested in the laboratory and a limited number of coal mines. Side by side testing has compared the RDD performance with personal coal mine samplers in triplicate area sample measurements. Results show that the differential pressure of specific filtration media can be an effective surrogate for respirable mass. Data show that there is a dependence on coal type and an effect of relative humidity. There also appear to be two distinct responses related to coal type. For specific coals the coefficients of determination, R^2, are better than 0.9. For general use the detector tube, using laboratory generated calibration curves, can be used to estimate respirable dust levels. However, a calibration to a specific coal type may substantially improve the accuracy. Another sampler in development is called the Personal Dust Monitor (PDM). Designed to give compliance quality accuracy, the PDM uses the frequency change of a vibrating element to measure dust and give immediate feedback of results. Results of laboratory comparison of the prototype PDM sampler to personal gravimetric samplers indicate R^2 values of better than 0.99 for three different coal types. Work is in progress to package the components into a person-wearable unit combined with a cap lamp system with a total projected weight of less than 1.4 kg (3 lbs). 4 refs., 10 figs.
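
    The coefficient of determination quoted in the abstract above summarizes how well a calibration line relates an instrument reading to a reference measurement. The following is a minimal sketch of that calculation, assuming a simple straight-line calibration; the function names and the numeric readings are hypothetical illustrations and are not taken from the report.

```python
def fit_line(x, y):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def r_squared(x, y, slope, intercept):
    """Coefficient of determination: 1 - SS_residual / SS_total."""
    mean_y = sum(y) / len(y)
    ss_res = sum((yi - (slope * xi + intercept)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return 1.0 - ss_res / ss_tot


# Hypothetical paired readings (NOT data from the report):
# filter pressure drop (arbitrary units) vs. gravimetric dust mass (mg).
pressure_drop = [1.0, 2.0, 3.0, 4.0, 5.0]
dust_mass = [0.52, 0.98, 1.55, 2.01, 2.49]

slope, intercept = fit_line(pressure_drop, dust_mass)
r2 = r_squared(pressure_drop, dust_mass, slope, intercept)
print(f"slope={slope:.3f} intercept={intercept:.3f} R^2={r2:.4f}")
```

    In this sketch a separate line would be fitted per coal type, which is one way to read the abstract's remark that a coal-specific calibration may improve accuracy.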

  9. Some Topological Properties of Rough Sets with Tools for Data Mining

    Directory of Open Access Journals (Sweden)

    A S Salama

    2011-05-01

    Full Text Available Rough set theory has a significant importance in many fields. For example, some branches of artificial intelligence, such as inductive reasoning, automatic classification, pattern recognition, learning algorithms, classification theory, cluster analysis, measurement theory and taxonomy. Also, in the domains of Medicine, Pharmacology, Banking, Market research and Engineering the rough set theory has demonstrated its usefulness. The main aim of this paper is to describe some topological properties of rough sets and open the door about more accurate topological measures of data mining.

  10. Web Usage Mining Analysis of Federated Search Tools for Egyptian Scholars

    Science.gov (United States)

    Mohamed, Khaled A.; Hassan, Ahmed

    2008-01-01

    Purpose: This paper aims to examine the behaviour of the Egyptian scholars while accessing electronic resources through two federated search tools. The main purpose of this article is to provide guidance for federated search tool technicians and support teams about user issues, including the need for training. Design/methodology/approach: Log…

  12. The Practice of Mining Companies in Building Relationships with Local Communities in the Context of CSR Formula

    Directory of Open Access Journals (Sweden)

    Marzena Majer

    2013-01-01

    Full Text Available This paper presents results regarding the analysis of activities of coal mining companies operating in Poland in terms of Corporate Social Responsibility (CSR, with a particular emphasis focused on the area of community involvement. This sphere is extremely important for entities operating in the sector due to the scale of the impact on the social environment and the potential threat of resistance from local communities. Mining companies have developed forms of action in this area and are subject to a number of legal regulations, especially in such critical points as mining damages or acquiring concessions for mining operations. Therefore, it is worth analysing these issues from the perspective of the CSR formula, and the way in which mining companies build a relationship with stakeholder groups which are undoubtedly the local communities. This represents a specific challenge for mining companies and requires a change in management approach based on the acceptance of responsibility for the impact they have and to seek solutions that will benefit both the companies themselves and their surroundings. It should also affect the change in approach for building relationships with local groups beyond just the legal requirements.

  13. Hydroponics as a valid tool to assess arsenic availability in mine soils.

    Science.gov (United States)

    Moreno-Jiménez, E; Esteban, E; Fresno, T; de Egea, C López; Peñalosa, J M

    2010-04-01

    The low solubility of As in mine soils limits its phytoavailability. This makes the extrapolation of data obtained under hydroponic conditions unrealistic because the concentration in nutrient solution frequently overexposes plants to this metalloid. This work evaluates whether As supply in hydroponics resembles, to some extent, the As phytoavailable fraction in soils and the implications for phytoremediation. Phytotoxicity of As, in terms of biomass production, chlorophyll levels, and As concentrations in plants, was estimated and compared in both soils and hydroponics. In order for hydroponic conditions to be compared to soil conditions, plant exposure levels were measured in both cultures. Hydroponic As concentrations ranging from 2-8 µM equated to the same plant organ concentrations from soils with 700-3000 mg kg(-1). Total and extractable As fractions exceeded those values, but As concentrations in pore water were below them. According to our results (i) hydroponics should include doses in the range 0-10 µM As to allow the extrapolation of the results to As-polluted soils, and (ii) phytoextraction of As in mining sites will be limited by low As phytoavailability.

  14. Constructed wetlands as green tools for management of boron mine wastewater.

    Science.gov (United States)

    Türker, Onur Can; Türe, Cengiz; Böcük, Harun; Yakar, Anil

    2014-01-01

    Constructed wetlands are of increasing interest worldwide given that they represent an eco-technological solution to many environmental problems such as wastewater treatment. Turkey possesses approximately 70% of the world's total boron (B) reserves, and B contamination occurs in both natural and cultivated sites throughout Turkey, particularly in the north-west of the country. This study analyzes B removal and plant uptake of B in pilot plots of subsurface horizontal-flow constructed wetlands. Constructed wetlands were vegetated with Typha latifolia (referred to as CW1) and Phragmites australis (referred to as CW2) to treat wastewater from a borax reserve in Turkey--the largest of its type in the world and were assessed under field conditions. The B concentrations of water inflows to the systems were determined to be 10.2, 28.2, 84.6, 232.3, 716.4, and 2019.1 mg l(-1). The T. latifolia in the CW1 treatment group absorbed a total of 1300 mg kg(-1) B, whereas P. australis absorbed 839 mg kg(-1). As a result, CW1 had an average removal efficiency of 40.7%, while that of CW2 was 27.2%. Our results suggest that constructed wetlands are an effective, economic and eco-friendly solution to treating B mine wastewater and controlling the adverse environmental effects of B mining.

  15. Application of LC-high-resolution MS with 'intelligent' data mining tools for screening reactive drug metabolites.

    Science.gov (United States)

    Ma, Shuguang; Chowdhury, Swapan K

    2012-03-01

    Biotransformation of chemically stable compounds to reactive metabolites that can bind covalently to macromolecules (such as proteins and DNA) is considered an undesirable property of drug candidates. Due to the possible link, which has not yet been conclusively demonstrated, between reactive metabolites and adverse drug reactions, screening for metabolic activation of lead compounds through in vitro chemical trapping experiments has become an integral part of the drug discovery process in many laboratories. In this review, we provide an overview of the recent advances in the application of high-resolution MS. These advances facilitated the development of accurate-mass-based data mining tools for high-throughput screening of reactive drug metabolites in drug discovery.

  16. Wild plants as tools for the remediation of abandoned mining sites with a high arsenic content

    Science.gov (United States)

    Martínez-Lopez, Salvadora; Martínez-Sanchez, MJose; Perez-Sirvent, Carmen; Martínez, Lucia B.; Bech, Jaume

    2014-05-01

    The aim of this study was to assess the environmental risk posed by arsenic when new vegetation types are introduced, analyzing the transfer of arsenic in different plant species that grow spontaneously in mining areas of SE Spain (Sierra Minera of Cartagena), and the contribution of such plants to the environmental risk represented by their ingestion by animals living in the same ecosystems. When dealing with remediation projects in zones affected by mining activities, the risk posed by the ingestion of the plants by fauna is often forgotten. To study the transfer to the trophic chain, two mammals, sheep and vole, were selected. The risk analysis was centered in the contribution of these natural plants to the ingestion calculated. For this study, 21 vegetal species naturally growing in the soils were collected from the Sierra Minera. The vegetal material studied is clearly associated with the Mediterranean Region (S.E. of Spain) and the plant species collected are endemisms and plants characteristic of the zone. Physico-chemical properties were obtained by means of the usual procedures. To determine the arsenic content, the soil samples and plant materials were digested in a microwave system and the arsenic concentration was determined using atomic fluorescence spectrometry with an automated continuous flow hydride generation system. A semiquantitative estimation of the mineralogical composition of the samples was made by X Ray Diffraction analysis. The soils were classified into three groups: Low (group 1) (7-35 mg/kg) medium (group 2) (35-327 mg/kg) and high (group 3) (> 327 mg/kg), according to their As content. The mineralogy and As content of the soils studied depends on the materials related with mining activity. The descriptive statistical analysis of the population of plants studied showed the As range in roots to be 0.31-150 mg/kg while leaf concentrations were lower (0.21-83.4 mg/kg). 
The potential risk of As entering the food chain through of the plant
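    The three-way soil grouping reported in this abstract can be sketched as a small lookup. This is only an illustration: the thresholds come from the abstract, but the abstract does not say which group owns a boundary value (35 or 327 mg/kg), so the inclusive upper bounds, the function name, and the sample values below are all assumptions.

```python
from typing import Optional

# Thresholds in mg/kg, as quoted in the abstract.
LOW_MIN, LOW_MAX, MEDIUM_MAX = 7.0, 35.0, 327.0


def classify_soil_by_arsenic(as_mg_per_kg: float) -> Optional[int]:
    """Return the soil group (1 = low, 2 = medium, 3 = high).

    Returns None below the lowest reported value. Upper bounds are treated
    as inclusive, which is an assumption, not something the abstract states.
    """
    if as_mg_per_kg < LOW_MIN:
        return None
    if as_mg_per_kg <= LOW_MAX:
        return 1
    if as_mg_per_kg <= MEDIUM_MAX:
        return 2
    return 3


# Hypothetical arsenic concentrations, for illustration only.
samples = [12.0, 35.0, 150.0, 327.0, 900.0, 3.0]
groups = [classify_soil_by_arsenic(v) for v in samples]
print(groups)
```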

  17. Field validation of specific ecotoxicological tools for aquatic systems impacted with acid mine drainage

    Energy Technology Data Exchange (ETDEWEB)

    Lopes, I.; Goncalves, F.; Nogueira, A.; Soares, A.M.V.M.; Ribeiro, R. [Instituto do Ambiente e Vida, Coimbra (Portugal). Departamento de Zoologia da Universidade de Coimbra

    2000-07-01

    Acid mine drainage (AMD) is characterised by very low pH and high heavy metal concentrations. Serious ecotoxicological effects, often leading to the complete disruption of the ecosystem, can be observed at the regions suffering this kind of contamination. Those effects can be caused either by low pH itself or by other contaminants that emerge with water acidification (mobilisation and increased solubility of heavy metals). The discrimination between the toxicity due to each of these two factors is not possible with the existing toxicity tests; the addition of chelating agents or serial dilution methods seriously alter the chemical and physical properties of the effluent. A toxicity test, based on the survival time of Ceriodaphnia dubia (Crustacea, Cladocera) neonates exposed to the unchanged effluent was developed and field validated, on an AMD contaminated site. 28 refs.

  18. Role of Data Mining in E-Payment systems

    CERN Document Server

    Pattanaik, Sabyasachi

    2010-01-01

    Data mining deals with extracting hidden knowledge, unexpected patterns and new rules from large databases. Various customized data mining tools have been developed for domain-specific applications such as biomedicine, DNA analysis and telecommunication. Trends in data mining include further efforts towards the exploration of new application areas and methods for handling complex data types, algorithm scalability, constraint-based data mining and visualization methods. In this paper we present a domain-specific secure multiparty computation technique and its applications. Data mining has matured as a field of basic and applied research in computer science in general. We survey some of the recent approaches and architectures where data mining has been applied in the field of e-payment systems, and we limit our discussion to data mining in that context. We also mention a few directions for further work in this domain, based on the survey.

  19. MONITORING OF COAL BED EXTRACTION AS AN EFFECTIVE TOOL FOR IMPROVING THE PRODUCTION RESULTS OF A MINE

    Directory of Open Access Journals (Sweden)

    Witold BIAŁY

    2015-07-01

    Full Text Available The basic source of information necessary for proper and effective management of a hard coal mine is continuous monitoring of the mining process. An increased number of machines and devices used in a mine caused a need for continuous monitoring of mining departments’ work. Monitoring of the extraction of hard coal beds is crucial for this process management, as it determines the proper course of the mining process. Hence, monitoring can be considered the most important element of the controlling process, especially in the area of mining process management in a mine. Effective monitoring and proper, quick reacting to any irregularities in this process have a significant influence on the production results of a mine.

  20. SplicerAV: a tool for mining microarray expression data for changes in RNA processing

    Directory of Open Access Journals (Sweden)

    Garcia-Blanco Mariano A

    2010-02-01

    Full Text Available Abstract Background: Over the past two decades more than fifty thousand unique clinical and biological samples have been assayed using the Affymetrix HG-U133 and HG-U95 GeneChip microarray platforms. This substantial repository has been used extensively to characterize changes in gene expression between biological samples, but has not been previously mined en masse for changes in mRNA processing. We explored the possibility of using HG-U133 microarray data to identify changes in alternative mRNA processing in several available archival datasets. Results: Data from these and other gene expression microarrays can now be mined for changes in transcript isoform abundance using a program described here, SplicerAV. Using in vivo and in vitro breast cancer microarray datasets, SplicerAV was able to perform both gene and isoform specific expression profiling within the same microarray dataset. Our reanalysis of Affymetrix U133 plus 2.0 data generated by in vitro over-expression of HRAS, E2F3, beta-catenin (CTNNB1), SRC, and MYC identified several hundred oncogene-induced mRNA isoform changes, one of which recognized a previously unknown mechanism of EGFR family activation. Using clinical data, SplicerAV predicted 241 isoform changes between low and high grade breast tumors; with changes enriched among genes coding for guanyl-nucleotide exchange factors, metalloprotease inhibitors, and mRNA processing factors. Isoform changes in 15 genes were associated with aggressive cancer across the three breast cancer datasets. Conclusions: Using SplicerAV, we identified several hundred previously uncharacterized isoform changes induced by in vitro oncogene over-expression and revealed a previously unknown mechanism of EGFR activation in human mammary epithelial cells.
We analyzed Affymetrix GeneChip data from over 400 human breast tumors in three independent studies, making this the largest clinical dataset analyzed for en masse changes in alternative mRNA processing

  1. What can management theories offer evidence-based practice? A comparative analysis of measurement tools for organisational context

    Directory of Open Access Journals (Sweden)

    Pennington Lindsay

    2009-05-01

    Full Text Available Abstract Background: Given the current emphasis on networks as vehicles for innovation and change in health service delivery, the ability to conceptualise and measure organisational enablers for the social construction of knowledge merits attention. This study aimed to develop a composite tool to measure the organisational context for evidence-based practice (EBP) in healthcare. Methods: A structured search of the major healthcare and management databases for measurement tools from four domains: research utilisation (RU), research activity (RA), knowledge management (KM), and organisational learning (OL). Included studies were reports of the development or use of measurement tools that included organisational factors. Tools were appraised for face and content validity, plus development and testing methods. Measurement tool items were extracted, merged across the four domains, and categorised within a constructed framework describing the absorptive and receptive capacities of organisations. Results: Thirty measurement tools were identified and appraised. Eighteen tools from the four domains were selected for item extraction and analysis. The constructed framework consists of seven categories relating to three core organisational attributes of vision, leadership, and a learning culture, and four stages of knowledge need, acquisition of new knowledge, knowledge sharing, and knowledge use. Measurement tools from RA or RU domains had more items relating to the categories of leadership, and acquisition of new knowledge; while tools from KM or learning organisation domains had more items relating to vision, learning culture, knowledge need, and knowledge sharing. There was equal emphasis on knowledge use in the different domains.
Conclusion If the translation of evidence into knowledge is viewed as socially mediated, tools to measure the organisational context of EBP in healthcare could be enhanced by consideration of related concepts from the organisational

  2. What can management theories offer evidence-based practice? A comparative analysis of measurement tools for organisational context.

    Science.gov (United States)

    French, Beverley; Thomas, Lois H; Baker, Paula; Burton, Christopher R; Pennington, Lindsay; Roddam, Hazel

    2009-05-19

    Given the current emphasis on networks as vehicles for innovation and change in health service delivery, the ability to conceptualize and measure organisational enablers for the social construction of knowledge merits attention. This study aimed to develop a composite tool to measure the organisational context for evidence-based practice (EBP) in healthcare. A structured search of the major healthcare and management databases for measurement tools from four domains: research utilisation (RU), research activity (RA), knowledge management (KM), and organisational learning (OL). Included studies were reports of the development or use of measurement tools that included organisational factors. Tools were appraised for face and content validity, plus development and testing methods. Measurement tool items were extracted, merged across the four domains, and categorised within a constructed framework describing the absorptive and receptive capacities of organisations. Thirty measurement tools were identified and appraised. Eighteen tools from the four domains were selected for item extraction and analysis. The constructed framework consists of seven categories relating to three core organisational attributes of vision, leadership, and a learning culture, and four stages of knowledge need, acquisition of new knowledge, knowledge sharing, and knowledge use. Measurement tools from RA or RU domains had more items relating to the categories of leadership, and acquisition of new knowledge; while tools from KM or learning organisation domains had more items relating to vision, learning culture, knowledge need, and knowledge sharing. There was equal emphasis on knowledge use in the different domains. If the translation of evidence into knowledge is viewed as socially mediated, tools to measure the organisational context of EBP in healthcare could be enhanced by consideration of related concepts from the organisational and management sciences. 
Comparison of measurement tools across

  3. Anni 2.0: A multipurpose text-mining tool for the life sciences

    NARCIS (Netherlands)

    R. Jelier (Rob); M.J. Schuemie (Martijn); A. Veldhoven (Antoine); L.C.J. Dorssers (Lambert); G.W. Jenster (Guido); J.A. Kors (Jan)

    2008-01-01

    Anni 2.0 is an online tool (http://biosemantics.org/anni/) to aid the biomedical researcher with a broad range of information needs. Anni provides an ontology-based interface to MEDLINE and retrieves documents and associations for several classes of biomedical concepts, including genes,

  4. NCBI GEO: mining millions of expression profiles--database and tools.

    Science.gov (United States)

    Barrett, Tanya; Suzek, Tugba O; Troup, Dennis B; Wilhite, Stephen E; Ngau, Wing-Chi; Ledoux, Pierre; Rudnev, Dmitry; Lash, Alex E; Fujibuchi, Wataru; Edgar, Ron

    2005-01-01

    The Gene Expression Omnibus (GEO) at the National Center for Biotechnology Information (NCBI) is the largest fully public repository for high-throughput molecular abundance data, primarily gene expression data. The database has a flexible and open design that allows the submission, storage and retrieval of many data types. These data include microarray-based experiments measuring the abundance of mRNA, genomic DNA and protein molecules, as well as non-array-based technologies such as serial analysis of gene expression (SAGE) and mass spectrometry proteomic technology. GEO currently holds over 30,000 submissions representing approximately half a billion individual molecular abundance measurements, for over 100 organisms. Here, we describe recent database developments that facilitate effective mining and visualization of these data. Features are provided to examine data from both experiment- and gene-centric perspectives using user-friendly Web-based interfaces accessible to those without computational or microarray-related analytical expertise. The GEO database is publicly accessible through the World Wide Web at http://www.ncbi.nlm.nih.gov/geo.

  5. Ensembl Plants: Integrating Tools for Visualizing, Mining, and Analyzing Plant Genomics Data.

    Science.gov (United States)

    Bolser, Dan; Staines, Daniel M; Pritchard, Emily; Kersey, Paul

    2016-01-01

    Ensembl Plants ( http://plants.ensembl.org ) is an integrative resource presenting genome-scale information for a growing number of sequenced plant species (currently 33). Data provided includes genome sequence, gene models, functional annotation, and polymorphic loci. Various additional information is provided for variation data, including population structure, individual genotypes, linkage, and phenotype data. In each release, comparative analyses are performed on whole genome and protein sequences, and genome alignments and gene trees are made available that show the implied evolutionary history of each gene family. Access to the data is provided through a genome browser incorporating many specialist interfaces for different data types, and through a variety of additional methods for programmatic access and data mining. These access routes are consistent with those offered through the Ensembl interface for the genomes of non-plant species, including those of plant pathogens, pests, and pollinators. Ensembl Plants is updated 4-5 times a year and is developed in collaboration with our international partners in the Gramene ( http://www.gramene.org ) and transPLANT projects ( http://www.transplantdb.org ).

  6. The environmental context of Acidithiobacillus ferrooxidans and its potential role as an ecosystem engineer in sulphidic mine waste

    Energy Technology Data Exchange (ETDEWEB)

    Ebenaa, Gustav

    2001-06-01

    Microorganisms are the causative agent of the environmental problems since they catalyse the weathering of the (sulphidic) waste. The chemical oxidation alone is not fast enough to create any severe environmental problems. Acidithiobacillus ferrooxidans is thought to be a key organism in weathering of sulphide minerals. A. ferrooxidans is affected by several more or less abiotic factors. The influence of temperature, pH and nutrient deficiency as potentially limiting factors for the activity of A. ferrooxidans has been investigated. It seems that temperature has less influence on its activity, but rather reflects the origin of the bacterial isolate. An alkaline pH seems enough to hinder growth and activity. The nutrients do not seem to be a limiting factor in the studied environment. The possible regulation of the activity of A. ferrooxidans is therefore a way to, at least partly, mitigate the environmental impact from mine waste. Waste from the mining industry is the largest waste problem in Sweden. With amounts over 600 million tonnes one could easily imagine the tremendous cost involved in the abatement. The MiMi-programme, with researchers from several relevant fields, has as its aim to evaluate present and to find alternative techniques to mitigate the environmental impact from mine waste. The understanding of A. ferrooxidans and its role as an ecosystem engineer is essential both in evaluating present techniques and even more so in finding alternative abatement techniques for sulphidic mine waste.

  7. Developing the School Physical Activity and Nutrition Environment Tool to Measure Qualities of the Obesogenic Context

    Science.gov (United States)

    John, Deborah H.; Gunter, Katherine; Jackson, Jennifer A.; Manore, Melinda

    2016-01-01

    Background: Practical tools are needed that reliably measure the complex physical activity (PA) and nutrition environments of elementary schools that influence children's health and learning behaviors for obesity prevention. The School Physical Activity and Nutrition-Environment Tool (SPAN-ET) was developed and beta tested in 6 rural Oregon…

  8. Thermal Error Modeling of a Machine Tool Using Data Mining Scheme

    Science.gov (United States)

    Wang, Kun-Chieh; Tseng, Pai-Chang

    In this paper the knowledge discovery technique is used to build an effective and transparent mathematical thermal error model for machine tools. Our proposed thermal error modeling methodology (called KRL) integrates the schemes of K-means theory (KM), rough-set theory (RS), and linear regression model (LR). First, to explore the machine tool's thermal behavior, an integrated system is designed to simultaneously measure the temperature ascents at selected characteristic points and the thermal deformations at the spindle nose under suitable real machining conditions. Second, the obtained data are classified by the KM method, further reduced by the RS scheme, and a linear thermal error model is established by the LR technique. To evaluate the performance of our proposed model, an adaptive neural fuzzy inference system (ANFIS) thermal error model is introduced for comparison. Finally, a verification experiment is carried out and results reveal that the proposed KRL model is effective in predicting thermal behavior in machine tools. Our proposed KRL model is transparent, easily understood by users, and can be easily programmed or modified for different machining conditions.
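    The cluster, reduce, regress pipeline described in this abstract can be illustrated with a small pure-Python sketch. It is not the authors' KRL implementation: the rough-set reduction is replaced here by simply keeping the first sensor of each K-means cluster, and the sensor names, temperature series, and deformation values are hypothetical.

```python
from typing import List


def kmeans(points: List[List[float]], k: int, iters: int = 20) -> List[int]:
    """Plain Lloyd's algorithm; centroids start at the first k points (deterministic)."""
    centroids = [p[:] for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iters):
        for i, p in enumerate(points):
            dists = [sum((a - b) ** 2 for a, b in zip(p, c)) for c in centroids]
            labels[i] = dists.index(min(dists))
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def solve(a: List[List[float]], b: List[float]) -> List[float]:
    """Solve a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [x - f * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def fit_linear(xs: List[List[float]], y: List[float]) -> List[float]:
    """Ordinary least squares via the normal equations; returns [intercept, b1, b2, ...]."""
    rows = [[1.0] + list(x) for x in xs]
    p = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(p)] for i in range(p)]
    xty = [sum(r[i] * t for r, t in zip(rows, y)) for i in range(p)]
    return solve(xtx, xty)


# Hypothetical temperature rises (one series per sensor, five time samples each).
names = ["spindle_front", "column", "spindle_rear", "base"]
series = [
    [1.0, 2.0, 3.0, 4.0, 5.0],
    [0.5, 0.4, 0.9, 0.6, 1.0],
    [1.1, 2.1, 3.0, 4.1, 5.1],
    [0.4, 0.5, 0.8, 0.7, 0.9],
]
# Hypothetical spindle-nose deformation, built as 1 + 2*spindle_front + 4*column.
deformation = [5.0, 6.6, 10.6, 11.4, 15.0]

labels = kmeans(series, k=2)
# Stand-in for the rough-set reduction: keep the first sensor of each cluster.
selected = [labels.index(c) for c in sorted(set(labels))]
design = [[series[s][t] for s in selected] for t in range(len(deformation))]
coeffs = fit_linear(design, deformation)
print([names[s] for s in selected], [round(c, 3) for c in coeffs])
```

    Keeping one sensor per cluster mirrors the idea of discarding redundant temperature channels before fitting, which keeps the final linear model small and easy to read.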

  9. Exploring the potential of a structural alphabet-based tool for mining multiple target conformations and target flexibility insight.

    Science.gov (United States)

    Regad, Leslie; Chéron, Jean-Baptiste; Triki, Dhoha; Senac, Caroline; Flatters, Delphine; Camproux, Anne-Claude

    2017-01-01

    Protein flexibility is often implied in binding with different partners and is essential for protein function. The growing number of macromolecular structures in the Protein Data Bank entries and their redundancy has become a major source of structural knowledge of the protein universe. The analysis of structural variability through available redundant structures of a target, called multiple target conformations (MTC), obtained using experimental or modeling methods and under different biological conditions or different sources is one way to explore protein flexibility. This analysis is essential to improve the understanding of various mechanisms associated with protein target function and flexibility. In this study, we explored structural variability of three biological targets by analyzing different MTC sets associated with these targets. To facilitate the study of these MTC sets, we have developed an efficient tool, SA-conf, dedicated to capturing and linking the amino acid and local structure variability and analyzing the target structural variability space. The advantage of SA-conf is that it could be applied to diverse sets composed of MTCs available in the PDB obtained using NMR and crystallography or homology models. This tool could also be applied to analyze MTC sets obtained by dynamics approaches. Our results showed that the SA-conf tool is effective in quantifying the structural variability of an MTC set and in localizing the structurally variable positions and regions of the target. By selecting adapted MTC subsets and comparing their variability detected by SA-conf, we highlighted different sources of target flexibility such as induced by binding partner, by mutation and intrinsic flexibility. Our results support the interest to mine available structures associated with a target to offer valuable insight into target flexibility and interaction mechanisms.
The SA-conf executable script, with a set of pre-compiled binaries, is available at http://www.mti.univ-paris-diderot.fr/recherche/plateformes/logiciels.

  10. Response process and test-retest reliability of the Context Assessment for Community Health tool in Vietnam.

    Science.gov (United States)

    Duc, Duong M; Bergström, Anna; Eriksson, Leif; Selling, Katarina; Thi Thu Ha, Bui; Wallin, Lars

    2016-01-01

    The recently developed Context Assessment for Community Health (COACH) tool aims to measure aspects of the local healthcare context perceived to influence knowledge translation in low- and middle-income countries. The tool measures eight dimensions (organizational resources, community engagement, monitoring services for action, sources of knowledge, commitment to work, work culture, leadership, and informal payment) through 49 items. The study aimed to explore the understanding and stability of the COACH tool among health providers in Vietnam. To investigate the response process, think-aloud interviews were undertaken with five community health workers, six nurses and midwives, and five physicians. Identified problems were classified according to Conrad and Blair's taxonomy and grouped according to an estimation of the magnitude of the problem's effect on the response data. Further, the stability of the tool was examined using a test-retest survey among 77 respondents. The reliability was analyzed for items (intraclass correlation coefficient (ICC) and percent agreement) and dimensions (ICC and Bland-Altman plots). In general, the think-aloud interviews revealed that the COACH tool was perceived as clear, well organized, and easy to answer. Most items were understood as intended. However, seven prominent problems in the items were identified and the content of three dimensions was perceived to be of a sensitive nature. In the test-retest survey, two-thirds of the items and seven of eight dimensions were found to have an ICC agreement ranging from moderate to substantial (0.5-0.7), demonstrating that the instrument has an acceptable level of stability. This study provides evidence that the Vietnamese translation of the COACH tool is generally perceived to be clear and easy to understand and has acceptable stability. There is, however, a need to rephrase and add generic examples to clarify some items and to further review items with low ICC.
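    The two item-level reliability measures named in this abstract, percent agreement and the intraclass correlation coefficient, can be sketched for a test-retest design as follows. The abstract does not state which ICC form was used, so the one-way random-effects ICC(1,1) below is an assumption, and the response pairs are hypothetical.

```python
from typing import List, Tuple

Pair = Tuple[float, float]  # (test score, retest score) for one respondent


def percent_agreement(pairs: List[Pair]) -> float:
    """Share of respondents giving the identical answer on both occasions, in percent."""
    same = sum(1 for a, b in pairs if a == b)
    return 100.0 * same / len(pairs)


def icc_oneway(pairs: List[Pair]) -> float:
    """One-way random-effects ICC(1,1) for k = 2 measurements per respondent."""
    n, k = len(pairs), 2
    grand = sum(a + b for a, b in pairs) / (n * k)
    means = [(a + b) / k for a, b in pairs]
    ms_between = k * sum((m - grand) ** 2 for m in means) / (n - 1)
    ms_within = sum(
        (a - m) ** 2 + (b - m) ** 2 for (a, b), m in zip(pairs, means)
    ) / (n * (k - 1))
    return (ms_between - ms_within) / (ms_between + (k - 1) * ms_within)


# Hypothetical answers to one item on a 1-5 scale, test vs. retest.
item_pairs = [(1, 1), (2, 3), (4, 4), (5, 5)]
agreement = percent_agreement(item_pairs)
icc = icc_oneway(item_pairs)
print(agreement, round(icc, 2))
```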

  11. Response process and test–retest reliability of the Context Assessment for Community Health tool in Vietnam

    Directory of Open Access Journals (Sweden)

    Duong M. Duc

    2016-06-01

    Full Text Available Background: The recently developed Context Assessment for Community Health (COACH) tool aims to measure aspects of the local healthcare context perceived to influence knowledge translation in low- and middle-income countries. The tool measures eight dimensions (organizational resources, community engagement, monitoring services for action, sources of knowledge, commitment to work, work culture, leadership, and informal payment) through 49 items. Objective: The study aimed to explore the understanding and stability of the COACH tool among health providers in Vietnam. Designs: To investigate the response process, think-aloud interviews were undertaken with five community health workers, six nurses and midwives, and five physicians. Identified problems were classified according to Conrad and Blair's taxonomy and grouped according to an estimation of the magnitude of the problem's effect on the response data. Further, the stability of the tool was examined using a test–retest survey among 77 respondents. The reliability was analyzed for items (intraclass correlation coefficient (ICC) and percent agreement) and dimensions (ICC and Bland–Altman plots). Results: In general, the think-aloud interviews revealed that the COACH tool was perceived as clear, well organized, and easy to answer. Most items were understood as intended. However, seven prominent problems in the items were identified and the content of three dimensions was perceived to be of a sensitive nature. In the test–retest survey, two-thirds of the items and seven of eight dimensions were found to have an ICC agreement ranging from moderate to substantial (0.5–0.7), demonstrating that the instrument has an acceptable level of stability. Conclusions: This study provides evidence that the Vietnamese translation of the COACH tool is generally perceived to be clear and easy to understand and has acceptable stability. There is, however, a need to rephrase and add generic examples to clarify

  12. Development Tools for Context Aware and Secure Pervasive Computing in Embedded Systems (PECES) Middleware

    OpenAIRE

    Ran Zhao; Kirusnapillai Selvarajah; Neil Speirs

    2013-01-01

    The main objective of the PECES project is the development of system software to enable the communication among heterogeneous devices across multiple smart spaces, breaking the traditional barrier of “smart islands” where only the services offered in a nearby spatial area can be used easily. PECES development tools help the application developer to build and test the PECES middleware based applications. This paper presents a set of tools, namely Peces Project, Peces Device Definit...

  13. Application and Survey of Business Intelligence (BI) Tools within the Context of Military Decision Making

    Science.gov (United States)

    2012-06-01

    are integrated in context of naming conflicts and differences time variance of historical data is preserved in order to perceive trends, deviations...GIS to locate new Taco Bell and Pizza Hut restaurants using traffic patterns [2]. Health insurance is using GIS to select affiliated physicians

  14. CS-MINER: A Tool for Association Mining in Binding-Database.

    Science.gov (United States)

    Mani-Varnosfaderani, Ahmad; Valadkhani, Abolfazl; Jalali-Heravi, Mehdi

    2015-04-01

    This paper introduces the algorithms, implementation strategies, features, and applications of CS-MINER, a tool for visualization and analysis of drug-like chemical space. The CS-MINER is the abstract abbreviation for Chemical Space Miner and correlates the medicinal target space and chemical space, in a systematic way. The database in this software consists of a large collection of drug-like molecules. To prepare this database, a large number of molecules for 110 important biological targets were collected from Binding-DB. A total of 1497 physicochemical properties were calculated for each molecule. The CS-MINER uses the discriminant analysis techniques for tracing the collected data and finally separates the molecules based on their therapeutic targets and activities. The developed multivariate classifiers can be used for ligand-based virtual screening of more than 0.5 million random molecules of PubChem and ZINC databases. In order to validate the models, selected subspaces in CS-MINER were compared with DrugBank molecules. At the end of the analysis, the software provides an interactive environment for visualization of the selected chemical subspaces in the form of 2- and 3-dimensional plots. In general, CS-MINER is a tool for comparing the relative position of active biosimilar molecules in chemical space and is freely available at www.csminer.com.

  15. PPLook: an automated data mining tool for protein-protein interaction

    Directory of Open Access Journals (Sweden)

    Xia Li

    2010-06-01

    Full Text Available Abstract Background: Extracting and visualizing protein-protein interactions (PPI) from the text literature is a meaningful topic in protein science. It assists the identification of interactions among proteins. There is a lack of tools to extract PPI and to visualize and classify the results. Results: We developed a PPI search system, termed PPLook, which automatically extracts and visualizes protein-protein interactions (PPI) from text. Given a query protein name, PPLook can search a dataset for other proteins interacting with it by using a keywords dictionary pattern-matching algorithm, and display the topological parameters, such as the number of nodes, edges, and connected components. The visualization component of PPLook enables us to view the interaction relationships among the proteins in a three-dimensional space based on the OpenGL graphics interface technology. PPLook can also select a protein semantic class, count the proteins of that semantic class which interact with the query protein, and count the articles in which an interaction involving the query protein appears. Moreover, PPLook provides heterogeneous search and a user-friendly graphical interface. Conclusions: PPLook is an effective tool for biologists and biosystem developers who need to access PPI information from the literature. PPLook is freely available for non-commercial users at http://meta.usc.edu/softs/PPLook.
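    The idea of dictionary-based pattern matching followed by topological summary statistics can be sketched roughly as below. This is not PPLook's actual algorithm: the protein dictionary, the interaction keywords, the co-occurrence rule, and the example sentences are all assumptions chosen for illustration.

```python
import re
from typing import Dict, List, Set, Tuple

# Hypothetical dictionaries; a real system would load much larger ones.
PROTEINS = {"BRCA1", "BARD1", "TP53", "MDM2", "EGFR"}
KEYWORDS = {"interacts", "binds", "phosphorylates"}


def extract_pairs(sentences: List[str]) -> Set[Tuple[str, str]]:
    """Report a pair when two dictionary proteins share a sentence with an interaction keyword."""
    pairs: Set[Tuple[str, str]] = set()
    for sentence in sentences:
        tokens = re.findall(r"[A-Za-z0-9]+", sentence)
        if not any(t.lower() in KEYWORDS for t in tokens):
            continue
        found = sorted({t for t in tokens if t in PROTEINS})
        for i in range(len(found)):
            for j in range(i + 1, len(found)):
                pairs.add((found[i], found[j]))
    return pairs


def graph_stats(pairs: Set[Tuple[str, str]]) -> Tuple[int, int, int]:
    """Return (nodes, edges, connected components) of the interaction graph."""
    adj: Dict[str, Set[str]] = {}
    for a, b in pairs:
        adj.setdefault(a, set()).add(b)
        adj.setdefault(b, set()).add(a)
    seen: Set[str] = set()
    components = 0
    for node in adj:
        if node in seen:
            continue
        components += 1
        stack = [node]
        while stack:  # depth-first walk over one component
            cur = stack.pop()
            if cur not in seen:
                seen.add(cur)
                stack.extend(adj[cur] - seen)
    return len(adj), len(pairs), components


def partners(query: str, pairs: Set[Tuple[str, str]]) -> List[str]:
    """Proteins reported as interacting with the query protein."""
    return sorted({b if a == query else a for a, b in pairs if query in (a, b)})


# Made-up example sentences standing in for literature text.
corpus = [
    "BRCA1 interacts with BARD1 in the nucleus.",
    "MDM2 binds TP53 and promotes its degradation.",
    "EGFR is a receptor tyrosine kinase.",
    "TP53 and BRCA1 were both measured.",
]
pairs = extract_pairs(corpus)
print(sorted(pairs), graph_stats(pairs), partners("TP53", pairs))
```

    The last sentence mentions two dictionary proteins but no interaction keyword, so it contributes no edge; that is the kind of false positive the keyword filter is meant to suppress.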

  16. Transforming health care the financial impact of technology, electronic tools and data mining

    CERN Document Server

    Fasano, Phil

    2013-01-01

    The future of healthcare technologies, and what they mean for investors and entrepreneurs. The healthcare technology revolution is just around the corner. And when it arrives, it will change and enrich our lives in ways we can only begin to imagine. Doctors will perform blood pressure readings via video chat and nutritionists will analyze diet based on photos taken with cellphone cameras. Transforming Health Care combines healthcare, technology, and finance in an innovative new way that explains the future of healthcare and its effects on patient care, exploring the emergence of electronic tools that will transform the medical industry. Explaining how technology, not politics, will lead the future of the healthcare revolution, author and healthcare technology expert Phil Fasano presents real-life examples that show how the next generation of medical breakthroughs will come from the instant exchange of information across the world. Explores how new technologies will radically change the future of healthcare by m...

  17. PRIME: A Mass Spectrum Data Mining Tool for De Novo Sequencing and PTMs Identification

    Institute of Scientific and Technical Information of China (English)

    Bo Yan; You-Xing Qu; Feng-Lou Mao; Victor N. Olman; Ying Xu

    2005-01-01

    De novo sequencing is one of the most promising proteomics techniques for identification of protein posttranslation modifications (PTMs) in studying protein regulations and functions. We have developed a computer tool PRIME for identification of b and y ions in tandem mass spectra, a key challenging problem in de novo sequencing. PRIME utilizes a feature that ions of the same and different types follow different mass-difference distributions to separate b from y ions correctly. We have formulated the problem as a graph partition problem. A linear integer-programming algorithm has been implemented to solve the graph partition problem rigorously and efficiently. The performance of PRIME has been demonstrated on a large amount of simulated tandem mass spectra derived from Yeast genome and its power of detecting PTMs has been tested on 216 simulated phosphopeptides.

  18. Use of genomic tools to improve cattle health in the context of infectious diseases

    Directory of Open Access Journals (Sweden)

    Mikolaj Marek Raszek

    2016-03-01

    Full Text Available Although infectious diseases impose a heavy economic burden on the cattle industry, the etiology of many disorders that affect livestock is not fully elucidated, and effective countermeasures are often lacking. The main tools available until now have been vaccines, antibiotics and antiparasitic drugs. Although these have been very successful in some cases, the appearance of parasite and microbial resistance to these treatments is a cause of concern. This review describes the rapid gains achieved to track disease progression, identify the pathogens involved, and map pathogen interactions with the host. Next-generation sequencing provides important opportunities to tackle problems associated with pathogenic illnesses. Use of novel genomic tools subsequently aids in treatment development, as well as successful creation of breeding programs aimed towards less susceptible livestock. These may be important tools for mitigating the long term effects of combating infection and helping reduce the reliance on antibiotic treatment.

  19. High School Students' Use of Digital Tools for Learning English Vocabulary in an EFL Context

    Science.gov (United States)

    Cojocnean, Diana

    2015-01-01

    This study investigated Romanian high school students' use of digital tools for learning vocabulary in English. Although students have a wide range of technological affordances at their disposal, little is known about how they make use of them or the extent to which they are aware of how to use them in their vocabulary learning. The study features…

  20. Data mining tools for Sentinel 1 and Sentinel 2 data exploitation

    Science.gov (United States)

    Espinoza Molina, Daniela; Datcu, Mihai

    2016-10-01

    With the new planned Sentinel missions, the availability of Earth Observation data is increasing every day, offering a larger number of applications that can be created using these data. Currently, three of the five missions have been launched and they are delivering a wealth of data and imagery of the Earth's surface; for example, Sentinel-1 carries an advanced radar instrument to provide an all-weather, day-and-night supply of Earth imagery. The second mission, Sentinel-2, carries an optical instrument payload that will sample 13 spectral bands at different resolutions. Even though we can count on tools for automated loading and visual exploration of the Sentinel data, we still face the problem of extracting relevant structures from the images, finding similar patterns in a scene, exploiting the data, and creating final user applications based on these processed data. In this paper, we present our approach for processing radar and multi-spectral Sentinel data. Our approach is mainly composed of three steps: 1) the generation of a data model that explains the information contained in a Sentinel product (the model is formed by primitive descriptors and metadata entries); 2) the storage of this model in a database system; 3) the semantic definition of the image content based on machine learning algorithms and relevance feedback methods.

  1. MetalS(3), a database-mining tool for the identification of structurally similar metal sites.

    Science.gov (United States)

    Valasatava, Yana; Rosato, Antonio; Cavallaro, Gabriele; Andreini, Claudia

    2014-08-01

    We have developed a database search tool to identify metal sites having structural similarity to a query metal site structure within the MetalPDB database of minimal functional sites (MFSs) contained in metal-binding biological macromolecules. MFSs describe the local environment around the metal(s) independently of the larger context of the macromolecular structure. Such a local environment has a determinant role in tuning the chemical reactivity of the metal, ultimately contributing to the functional properties of the whole system. The database search tool, which we called MetalS(3) (Metal Sites Similarity Search), can be accessed through a Web interface at http://metalweb.cerm.unifi.it/tools/metals3/ . MetalS(3) uses a suitably adapted version of an algorithm that we previously developed to systematically compare the structure of the query metal site with each MFS in MetalPDB. For each MFS, the best superposition is kept. All these superpositions are then ranked according to the MetalS(3) scoring function and are presented to the user in tabular form. The user can interact with the output Web page to visualize the structural alignment or the sequence alignment derived from it. Options to filter the results are available. Test calculations show that the MetalS(3) output correlates well with expectations from protein homology considerations. Furthermore, we describe some usage scenarios that highlight the usefulness of MetalS(3) to obtain mechanistic and functional hints regardless of homology.

  2. Modeling multiple land use changes using ANN, CART and MARS: Comparing tradeoffs in goodness of fit and explanatory power of data mining tools

    Science.gov (United States)

    Tayyebi, Amin; Pijanowski, Bryan C.

    2014-05-01

    Over half of the earth's terrestrial surface has been modified by humans. This modification is called land use change and its pattern is known to occur in a non-linear way. The land use change modeling community can advance its models using data mining tools. Here, we present three data mining land use change models, one based on Artificial Neural Network (ANN), another on Classification And Regression Trees (CART) and another on Multivariate Adaptive Regression Splines (MARS). We reconfigured the three data mining models to concurrently simulate multiple land use classes (e.g. agriculture, forest and urban) in South-Eastern Wisconsin (SEWI), USA (time interval 1990-2000) and in Muskegon River Watershed (MRW), Michigan, USA (time interval 1978-1998). We compared the results of the three data mining tools using relative operating characteristic (ROC) and percent correct match (PCM). We found that ANN provided the best accuracy in both areas for three land use classes (e.g. urban, agriculture and forest). In addition, in both regions, CART and MARS both showed that forest gain occurred in areas close to current forests and agriculture patches and away from roads. Urban increased in areas of high urban density, close to roads and in areas with few forests and wetlands. We also found that agriculture gain is more likely for the areas closer to the agriculture and forest patches. Elevation strongly influenced urbanization and forest gain in MRW while it had no effect in SEWI.

  3. Natural Language Processing and Fuzzy Tools for Business Processes in a Geolocation Context

    Directory of Open Access Journals (Sweden)

    Isis Truck

    2017-01-01

    Full Text Available In the geolocation field, where high-level programs and low-level devices coexist, it is often difficult to find a friendly user interface to configure all the parameters. The challenge addressed in this paper is to propose intuitive and simple, and thus natural-language, interfaces to interact with low-level devices. Such interfaces contain natural language processing (NLP) and fuzzy representations of words that facilitate the elicitation of business-level objectives in our context. A complete methodology is proposed, from the lexicon construction to a dialogue software agent including a fuzzy linguistic representation, based on synonymy.

  4. Scenario Planning as a Tool to Promote Innovation in Regional Development Context

    OpenAIRE

    Edgar, Boo; Abouzeedan, Adli; Hedner, Thomas

    2011-01-01

    Scenario planning is a tool that can be used to project and promote innovation activities in organizations. The more disruptive the innovation is, the more beneficial effects will be expected to come out of utilizing a scenario planning process. This planning approach may be considered when discussing innovation in relation to regional development as well as in respect to the introduction of novel and potentially radical innovations. In this paper, we discuss the utilization of scenario plann...

  5. Control banding tools for occupational exposure assessment of nanomaterials - Ready for use in a regulatory context?

    DEFF Research Database (Denmark)

    Liguori, Biase; Hansen, Steffen Foss; Baun, Anders

    2016-01-01

    area of concern. Therefore, a number of Control Banding (CB)-based tools have been developed in order to assess and manage the potential risks associated with occupational exposure to nanomaterials. In this paper we provide a comparative analysis of different nanomaterial-specific types of control-banding/risk...... developed for different purposes, with different application domains and inclusion criteria. The exposure assessments and derived risk levels are based on different concepts and assumptions and outputs in different formats. The use of requested input parameters for exposure assessment differ greatly among...

  6. Planning for integrated mine closure

    Energy Technology Data Exchange (ETDEWEB)

    Linda Starke (ed.)

    2008-07-01

    This document presents an Integrated Mine Closure Planning Toolkit for the mining and metals sector. The toolkit is intended to be used to promote a more disciplined approach to integrated closure planning and to increase the uniformity of good practices across the sector. The concepts apply equally well to both large and small companies. The document is not intended to be prescriptive; it provides a suite of tools that can be brought to bear in formulating well-considered decisions when planning for closure. It uses a risk and opportunity based process to guide the practitioner through the iterative process of preparing for planned closure. The first section, with head office and mine management personnel as the primary audience, describes the participants of effective closure planning. Sections 2 and 3 provide the frameworks for a conceptual closure plan and a detailed plan. The key audiences for these two sections are mine management and head office personnel, financial modellers and estimators, governments, academics and non-governmental organizations. The fourth section addresses decommissioning and post closure planning, and Section 5 draws together the conclusions and outlines several unresolved issues. The five narrative sections are followed by Section 6, which outlines 13 tools, some of which are already available in earlier ICMM publications, member practices and other sector-affiliated publications. Others have been developed to cover identified gaps. These tools provide the practitioner with practical work processes, examples and contexts within which to apply closure planning discipline.

  7. From intellectual craftsmanship to virtual context: methodological tools for social research

    Directory of Open Access Journals (Sweden)

    Tania Steren

    2010-01-01

    Full Text Available The present article examines distinct methodological procedures, focusing particularly on research techniques. Both traditional and new strategies used in social research are characterized in the context of the information society. The paper highlights the impact of incorporating new information and communication technologies in bibliographical and documentary research and in fieldwork. In addition, the dichotomy between qualitative and quantitative approaches is challenged, in view of their complementarity and interfaces. The paradigm of complexity represents a significant advance from an epistemological perspective, although as an operative methodology it is still in its early stages. The study therefore discusses the issue of triangulation, conceived as a promising process in the search for a qual/quan approach. There is thus room for an attempt to further explore the characteristics of technical procedures and resources that can be used by researchers who seek to develop inquiries involving mixed methods.

  8. GENDER DISPARITIES REGARDING WAGE AS A MOTIVATIONAL TOOL IN THE CURRENT ECONOMIC CONTEXT

    Directory of Open Access Journals (Sweden)

    DEMYEN SUZANA

    2014-02-01

    Full Text Available The deepening process of globalization, negative trends in demographic evolution both nationally and internationally, the emigration phenomenon and the long-term effects of the economic crisis are the main challenges in creating general support for, and encouraging, a fair and effective management of human resources, regardless of the industry in which they operate. Motivation involves a series of problems that need to be solved in order to generate both individual and team performance, and wage is seen as one of the most important motivational tools. Though we have witnessed a less serious gap between wages according to the gender criterion, certain issues can still be identified that need to be solved regardless of the most recent trends in management

  9. Investigating the efficacy of an intelligent operation planning and support tool for acute healthcare contexts.

    Science.gov (United States)

    Wickramasinghe, Nilmini; Kent, Bridie; Moghimi, Fatemeh Hoda; Nguyen, Lemai; Redley, Bernice; Taylor, Nyree; Muhammed, Imran; Botti, Mari

    2013-01-01

    Nurses are the largest group of healthcare professionals in hospitals providing 24-hour care to patients. Hence, nurses are pivotal in coordinating and communicating patient care information in the complex network of healthcare professionals, services and other care processes. Yet, despite nurses' central role in health care delivery, intelligent systems have historically rarely been designed around nurses' operational needs. This could explain the poor integration of technologies into nursing work processes and consequent rejection by nursing professionals. The complex nature of acute care delivery in hospitals and the frequently interrupted patterns of nursing work suggest that nurses require flexible intelligent systems that can support and adapt to their variable workflow patterns. This study is designed to explore nurses' initial reactions to a new intelligent operational planning and support tool (IOPST) for acute healthcare. The following reports on the first stage of a longitudinal project to use an innovative approach involving nurses in the development of the IOPST; from conceptualization to implementation.

  10. PubstractHelper: A Web-based Text-Mining Tool for Marking Sentences in Abstracts from PubMed Using Multiple User-Defined Keywords.

    Science.gov (United States)

    Chen, Chou-Cheng; Ho, Chung-Liang

    2014-01-01

    While a huge amount of information about biological literature can be obtained by searching the PubMed database, reading through all the titles and abstracts resulting from such a search for useful information is inefficient. Text mining makes it possible to increase this efficiency. Some websites use text mining to gather information from the PubMed database; however, they are database-oriented, using pre-defined search keywords while lacking a query interface for user-defined search inputs. We present the PubMed Abstract Reading Helper (PubstractHelper) website, which combines text mining and reading assistance for an efficient PubMed search. PubstractHelper can accept a maximum of ten groups of keywords, with each group containing up to ten keywords. The principle behind the text-mining function of PubstractHelper is that keywords contained in the same sentence are likely to be related. PubstractHelper highlights sentences with co-occurring keywords in different colors. The user can download the PMID and the abstracts with color markings to be reviewed later. The PubstractHelper website can help users to identify relevant publications based on the presence of related keywords, which should be a handy tool for their research. http://bio.yungyun.com.tw/ATM/PubstractHelper.aspx and http://holab.med.ncku.edu.tw/ATM/PubstractHelper.aspx.
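    The sentence-level co-occurrence principle described in this record can be illustrated with a minimal Python sketch. It is not the PubstractHelper implementation: the function names, the naive sentence splitter, the sample keyword groups and the rule that a sentence is marked when it matches at least two groups are all assumptions made for illustration.

```python
import re


def split_sentences(abstract):
    # Naive splitter (an assumption): break after ., ! or ? followed by whitespace.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", abstract) if s.strip()]


def mark_sentences(abstract, keyword_groups):
    """Return (sentence, matched_group_names) for sentences that hit >= 2 groups."""
    marked = []
    for sentence in split_sentences(abstract):
        lowered = sentence.lower()
        # A group matches when any of its keywords occurs in the sentence.
        hits = [name for name, words in keyword_groups.items()
                if any(word.lower() in lowered for word in words)]
        if len(hits) >= 2:
            marked.append((sentence, hits))
    return marked


# Hypothetical keyword groups and abstract text, for illustration only.
groups = {"gene": ["TP53", "p53"], "disease": ["carcinoma", "tumor"]}
text = ("TP53 is frequently mutated in carcinoma. "
        "The cohort had 120 patients. Tumor size varied.")
result = mark_sentences(text, groups)
```

    Here only the first sentence is marked, because it is the only one containing keywords from both groups; a real tool would then assign each group its own highlight color.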

  11. Context-Dependent Lexical Paraphrasing Based on Web Mining

    Institute of Scientific and Technical Information of China (English)

    赵世奇; 张宇; 赵琳; 刘挺; 李生

    2009-01-01

    Lexical paraphrasing is the task of extracting word-level paraphrases. Lexical paraphrases should be context dependent since a word may have different paraphrases in distinct contexts. This paper investigates a framework for acquiring context-dependent lexical paraphrases, in which a web mining method is developed for extracting candidate paraphrases and a classification method is introduced in paraphrase validation. Evaluations are carried out on the People's Daily corpus and the results show that: (1) the web mining method performs well in candidate paraphrase extraction, extracting 2.3 correct paraphrases on average for each test word in each given context sentence; (2) the classifier for paraphrase validation is effective, achieving an F-measure of 0.6023; (3) 75.11% and 98.31% of the paraphrases extracted by our method cannot be recognized by the two widely used context-independent methods, i.e., the thesaurus-based and clustering-based methods respectively. This indicates that the presented context-dependent method is a considerable supplement to the context-independent ones.
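    For readers unfamiliar with the F-measure reported in this record, the following Python sketch shows the standard definition as the weighted harmonic mean of precision and recall. The function name and the input values are illustrative assumptions; the record does not report the precision and recall behind its 0.6023 score.

```python
def f_measure(precision, recall, beta=1.0):
    """Weighted harmonic mean of precision and recall (F1 when beta == 1)."""
    if precision == 0 and recall == 0:
        return 0.0  # convention: avoid division by zero
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Illustrative values only, not taken from the paper.
example = f_measure(0.75, 0.5)
```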

  12. PRESY: A Context Based Query Reformulation Tool for Information Retrieval on the Web

    Directory of Open Access Journals (Sweden)

    Abdelkrim Bouramoul

    2010-01-01

    Full Text Available Problem statement: The huge amount of information on the web, together with the growing number of inexperienced users, creates new challenges for information retrieval. It has become increasingly difficult for these users to find relevant documents that satisfy their individual needs. Certainly, current search engines (such as Google, Bing and Yahoo) offer an efficient way to browse web content. However, result quality depends heavily on users' queries, which need to be precise to find relevant documents. This task remains complicated for the majority of inexperienced users, who cannot express their needs with significant words in the query. For that reason, we believe that a reformulation of the initial user's query can be a good alternative to improve the information selectivity. This study proposes a novel approach and presents a prototype system called Profile-based Reformulation System (PRESY) for information retrieval on the web. Approach: It used an incremental approach to categorize users by constructing a contextual base. The latter was composed of two types of context (static and dynamic) obtained using the users' profiles. The proposed architecture was implemented using the .Net environment to perform query reformulation tests. Results: The experiments given at the end of this article show that the precision of the returned content is effectively improved. The tests were performed with the most popular search engines (i.e., Google, Bing and Yahoo), selected in particular for their high selectivity. Among the given results, we found that query reformulation improves the first three results by 10.7% and the next seven returned elements by 11.7%. The reformulation of users' initial queries thus improves the pertinence of returned content. Conclusion/Recommendations: Therefore, we believe that the exploitation of contextual data based on users' profiles could be a very good way to reformulate user queries.
This complementary mechanism would

  13. LimTox: a web tool for applied text mining of adverse event and toxicity associations of compounds, drugs and genes.

    Science.gov (United States)

    Cañada, Andres; Capella-Gutierrez, Salvador; Rabal, Obdulia; Oyarzabal, Julen; Valencia, Alfonso; Krallinger, Martin

    2017-05-22

    Considerable effort has been devoted to systematically retrieving information for genes and proteins as well as relationships between them. Despite the importance of chemical compounds and drugs as a central bio-entity in pharmacological and biological research, only a limited number of freely available chemical text-mining/search engine technologies are currently accessible. Here we present LimTox (Literature Mining for Toxicology), a web-based online biomedical search tool with special focus on adverse hepatobiliary reactions. It integrates a range of text mining, named entity recognition and information extraction components. LimTox relies on machine-learning, rule-based, pattern-based and term lookup strategies. This system processes scientific abstracts, a set of full text articles and medical agency assessment reports. Although the main focus of LimTox is on adverse liver events, it also enables basic searches for other organ level toxicity associations (nephrotoxicity, cardiotoxicity, thyrotoxicity and phospholipidosis). This tool supports specialized search queries for: chemical compounds/drugs, genes (with additional emphasis on key enzymes in drug metabolism, namely P450 cytochromes-CYPs) and biochemical liver markers. The LimTox website is free and open to all users and there is no login requirement. LimTox can be accessed at: http://limtox.bioinfo.cnio.es. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  14. AssociationViewer: a scalable and integrated software tool for visualization of large-scale variation data in genomic context.

    Science.gov (United States)

    Martin, Olivier; Valsesia, Armand; Telenti, Amalio; Xenarios, Ioannis; Stevenson, Brian J

    2009-03-01

    We present a tool designed for visualization of large-scale genetic and genomic data exemplified by results from genome-wide association studies. This software provides an integrated framework to facilitate the interpretation of SNP association studies in genomic context. Gene annotations can be retrieved from Ensembl, linkage disequilibrium data downloaded from HapMap and custom data imported in BED or WIG format. AssociationViewer integrates functionalities that enable the aggregation or intersection of data tracks. It implements an efficient cache system and allows the display of several, very large-scale genomic datasets. The Java code for AssociationViewer is distributed under the GNU General Public Licence and has been tested on Microsoft Windows XP, MacOSX and GNU/Linux operating systems. It is available from the SourceForge repository. This also includes Java webstart, documentation and example datafiles.
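    The intersection of data tracks mentioned in this record can be sketched in a few lines of Python. This is not AssociationViewer code (the tool itself is written in Java); the function names and sample intervals are hypothetical, and the sketch relies only on the BED convention of 0-based, half-open `chrom start end` intervals.

```python
def parse_bed(lines):
    """Parse the first three BED columns into sorted (chrom, start, end) tuples."""
    records = []
    for line in lines:
        # Skip blank lines, comments and track/browser header lines.
        if not line.strip() or line.startswith(("#", "track", "browser")):
            continue
        chrom, start, end = line.split()[:3]
        records.append((chrom, int(start), int(end)))
    return sorted(records)


def intersect(track_a, track_b):
    """Return the overlapping portions of two tracks (simple quadratic scan)."""
    overlaps = []
    for chrom_a, start_a, end_a in track_a:
        for chrom_b, start_b, end_b in track_b:
            if chrom_a != chrom_b:
                continue
            start, end = max(start_a, start_b), min(end_a, end_b)
            if start < end:  # half-open intervals: touching ends do not overlap
                overlaps.append((chrom_a, start, end))
    return sorted(overlaps)


# Hypothetical tracks, for illustration only.
track_a = parse_bed(["chr1\t100\t200", "chr1\t300\t400"])
track_b = parse_bed(["track name=demo", "chr1\t150\t350", "chr2\t100\t200"])
common = intersect(track_a, track_b)
```

    The quadratic scan keeps the sketch short; for the very large datasets the record describes, a sorted sweep or an interval tree would be the more realistic choice.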

  15. Soil quality index as a tool for Scots pine (Pinus sylvestris) monoculture conversion planning on afforested, reclaimed mine land

    Institute of Scientific and Technical Information of China (English)

    Marcin Pietrzykowski

    2014-01-01

    In Central Europe, a large portion of post-mining sites were afforested with Scots pine, which is characterized by good adaptability and a tolerance for poor habitat at the beginning of forest ecosystem development. Conversion of monoculture on mine sites into more biodiverse mixed hardwood forests, especially on more fertile deposits, can be an emerging need in this part of Europe in next decades. The ability to classify the forests at these post-mining sites will facilitate proper species selection as well as the management and formation of the developed ecosystem’s stability. This work describes the guidelines that can be followed to assess reclaimed mine soil (RMS) quality, using the mine soil quality index (MSQI) and a classification of developed forest sites as a basis of tree-stand species selection and conversion of pine monocultures. The research was conducted on four post-mining facilities (lignite, hard coal, sulphur, and sand pit mining areas) on different RMS substrates dominant in Central Europe. Soil quality assessment takes into account the following features of the soil: texture; soil nutrients (Ca, Mg, K, Na, P); acidity (pH KCl); and Corg-to-Nt ratio in the initial organic horizon. An analysis was conducted of classification systems using the MSQI validation correlation (at p = 0.05) with vegetation features affected by succession: aboveground biomass of forest floor and ecological indicators of vascular plants (calculated on the basis of Ellenberg’s (2009) system). Eventually, in the analysed data set, the MSQI ranged from 0.270 for soils on quaternary sands to 0.720 for a mix of quaternary loamy sands with neogene clays. Potential forest habitat types and the role of the pine in the next generation of tree stands on different RMS parent rock substrate were proposed.

  16. Literature mining in support of drug discovery.

    Science.gov (United States)

    Agarwal, Pankaj; Searls, David B

    2008-11-01

    The drug discovery enterprise provides strong drivers for data integration. While attention in this arena has tended to focus on integration of primary data from omics and other large platform technologies contributing to drug discovery and development, the scientific literature remains a major source of information valuable to pharmaceutical enterprises, and therefore tools for mining such data and integrating it with other sources are of vital interest and economic impact. This review provides a brief overview of approaches to literature mining as they relate to drug discovery, and offers an illustrative case study of a 'lightweight' approach we have implemented within an industrial context.

  17. PROFESSIONAL TRAINING IN MINING-METALLURGICAL CONTEXT / FORMACIÓN PROFESIONAL EN CONTEXTO MINERO-METALÚRGICO

    Directory of Open Access Journals (Sweden)

    María Magdalena Romero Ramírez

    2012-10-01

    Full Text Available The process of professional training in higher education is intentionally oriented towards forming citizens who meet the demands of current society: people committed to the history and traditions of their environment, deeply reflective, and prepared to take on the work and professional performance that their professions require, able to grow both in technical and professional preparation and in their personal and spiritual condition. It is therefore necessary to train professionals capable of responding to the social needs of their context, recognizing the formative process as a complex process of social, cultural and essentially human character. The paper analyses the didactic-methodological treatment of training in engineering degree programmes from technological, social and humanistic perspectives integrated into the context, and offers some alternatives, through concrete actions, as part of the continuous improvement of the formative process.

  18. Data mining in radiology.

    Science.gov (United States)

    Kharat, Amit T; Singh, Amarjit; Kulkarni, Vilas M; Shah, Digish

    2014-04-01

    Data mining facilitates the study of radiology data in various dimensions. It converts large patient image and text datasets into useful information that helps in improving patient care and provides informative reports. Data mining technology analyzes data within the Radiology Information System and Hospital Information System using specialized software which assesses relationships and agreement in available information. By using similar data analysis tools, radiologists can make informed decisions and predict the future outcome of a particular imaging finding. Data, information and knowledge are the components of data mining. Classes, Clusters, Associations, Sequential patterns, Classification, Prediction and Decision tree are the various types of data mining. Data mining has the potential to make delivery of health care affordable and ensure that the best imaging practices are followed. It is a tool for academic research. Data mining is considered to be ethically neutral; however, concerns regarding privacy and legality exist and need to be addressed to ensure the success of data mining.

  19. Data mining in radiology

    Directory of Open Access Journals (Sweden)

    Amit T Kharat

    2014-01-01

    Full Text Available Data mining facilitates the study of radiology data in various dimensions. It converts large patient image and text datasets into useful information that helps in improving patient care and provides informative reports. Data mining technology analyzes data within the Radiology Information System and Hospital Information System using specialized software which assesses relationships and agreement in available information. By using similar data analysis tools, radiologists can make informed decisions and predict the future outcome of a particular imaging finding. Data, information and knowledge are the components of data mining. Classes, Clusters, Associations, Sequential patterns, Classification, Prediction and Decision tree are the various types of data mining. Data mining has the potential to make delivery of health care affordable and ensure that the best imaging practices are followed. It is a tool for academic research. Data mining is considered to be ethically neutral; however, concerns regarding privacy and legality exist and need to be addressed to ensure the success of data mining.

  20. Data mining in radiology

    Science.gov (United States)

    Kharat, Amit T; Singh, Amarjit; Kulkarni, Vilas M; Shah, Digish

    2014-01-01

    Data mining facilitates the study of radiology data in various dimensions. It converts large patient image and text datasets into useful information that helps in improving patient care and provides informative reports. Data mining technology analyzes data within the Radiology Information System and Hospital Information System using specialized software which assesses relationships and agreement in available information. By using similar data analysis tools, radiologists can make informed decisions and predict the future outcome of a particular imaging finding. Data, information and knowledge are the components of data mining. Classes, Clusters, Associations, Sequential patterns, Classification, Prediction and Decision tree are the various types of data mining. Data mining has the potential to make delivery of health care affordable and ensure that the best imaging practices are followed. It is a tool for academic research. Data mining is considered to be ethically neutral; however, concerns regarding privacy and legality exist and need to be addressed to ensure the success of data mining. PMID:25024513

  1. Advancing the argument for validity of the Alberta Context Tool with healthcare aides in residential long-term care

    Directory of Open Access Journals (Sweden)

    Cummings Greta G

    2011-07-01

    Full Text Available Abstract Background Organizational context has the potential to influence the use of new knowledge. However, despite advances in understanding the theoretical base of organizational context, its measurement has not been adequately addressed, limiting our ability to quantify and assess context in healthcare settings and thus, advance development of contextual interventions to improve patient care. We developed the Alberta Context Tool (the ACT) to address this concern. It consists of 58 items representing 10 modifiable contextual concepts. We reported the initial validation of the ACT in 2009. This paper presents the second stage of the psychometric validation of the ACT. Methods We used the Standards for Educational and Psychological Testing to frame our validity assessment. Data from 645 English speaking healthcare aides from 25 urban residential long-term care facilities (nursing homes) in the three Canadian Prairie Provinces were used for this stage of validation. In this stage we focused on: (1) advanced aspects of internal structure (e.g., confirmatory factor analysis) and (2) relations with other variables validity evidence. To assess reliability and validity of scores obtained using the ACT we conducted: Cronbach's alpha, confirmatory factor analysis, analysis of variance, and tests of association. We also assessed the performance of the ACT when individual responses were aggregated to the care unit level, because the instrument was developed to obtain unit-level scores of context. Results Item-total correlations exceeded acceptable standards (> 0.3) for the majority of items (51 of 58). We ran three confirmatory factor models. Model 1 (all ACT items) displayed unacceptable fit overall and for five specific items (1 item on adequate space for resident care in the Organizational Slack-Space ACT concept and 4 items on use of electronic resources in the Structural and Electronic Resources ACT concept). This prompted specification of two additional
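    As a reminder of what the reliability statistic named in this record measures, here is a minimal Python sketch of Cronbach's alpha using the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The function name and the toy response matrix are assumptions for illustration and are unrelated to the ACT data.

```python
from statistics import variance


def cronbach_alpha(rows):
    """rows: one list of k item scores per respondent (k >= 2, >= 2 respondents)."""
    k = len(rows[0])
    # Sample variance of each item across respondents.
    item_vars = [variance([row[i] for row in rows]) for i in range(k)]
    # Sample variance of each respondent's total score.
    total_var = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Toy responses (hypothetical): three respondents, two items.
responses = [[1, 2], [2, 1], [3, 3]]
alpha = cronbach_alpha(responses)
```

    With these toy responses each item has variance 1 and the totals (3, 3, 6) have variance 3, so alpha = 2 * (1 - 2/3), roughly 0.67. Perfectly consistent items, where every respondent gives the same score on each item, yield an alpha of 1.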

  2. Tools for organisational risk evaluation for occupational health stressors (OREOHS) for the small-scale mining industry

    CSIR Research Space (South Africa)

    Edwards, AL

    2009-07-01

    Full Text Available Some of the main reasons for occupational health and safety deficiencies in small-scale mining are unawareness of the risks of chronic occupational diseases and inadequately implemented education and training. The key need of the sector is to provide...

  3. Monte Carlo simulations as a feasibility tool for non-metallic land-mine detection by thermal-neutron backscattering

    NARCIS (Netherlands)

    Maucec, M; de Meijer, RJ

    2002-01-01

    The use of Monte Carlo simulations is presented for modelling a simplified land-mine detector system with thermal neutron backscattering (TNB) analysis based on a Cf-252-neutron source. Different aspects and a variety of external conditions, related to localisation and identification of a buried obj

  4. EST2uni: an open, parallel tool for automated EST analysis and database creation, with a data mining web interface and microarray expression data integration

    Directory of Open Access Journals (Sweden)

    Nuez Fernando

    2008-01-01

    Full Text Available Abstract Background Expressed sequence tag (EST) collections are composed of a high number of single-pass, redundant, partial sequences, which need to be processed, clustered, and annotated to remove low-quality and vector regions, eliminate redundancy and sequencing errors, and provide biologically relevant information. In order to provide a suitable way of performing the different steps in the analysis of the ESTs, flexible computation pipelines adapted to the local needs of specific EST projects have to be developed. Furthermore, EST collections must be stored in highly structured relational databases available to researchers through user-friendly interfaces which allow efficient and complex data mining, thus offering maximum capabilities for their full exploitation. Results We have created EST2uni, an integrated, highly-configurable EST analysis pipeline and data mining software package that automates the pre-processing, clustering, annotation, database creation, and data mining of EST collections. The pipeline uses standard EST analysis tools and the software has a modular design to facilitate the addition of new analytical methods and their configuration. Currently implemented analyses include functional and structural annotation, SNP and microsatellite discovery, integration of previously known genetic marker data and gene expression results, and assistance in cDNA microarray design. It can be run in parallel on a PC cluster in order to reduce the time necessary for the analysis. It also creates a web site linked to the database, showing collection statistics, with complex query capabilities and tools for data mining and retrieval. Conclusion The software package presented here provides an efficient and complete bioinformatics tool for the management of EST collections which is very easy to adapt to the local needs of different EST projects. The code is freely available under the GPL license and can be obtained at http

  5. Minería de datos sobre streams de redes sociales, una herramienta al servicio de la Bibliotecología = Data Mining Streams of Social Networks, A Tool to Improve The Library Services

    Directory of Open Access Journals (Sweden)

    Sonia Jaramillo Valbuena

    2015-12-01

    Full Text Available Collaborative work support systems are valuable tools in contexts that require the participation of a group of people to carry out a given task. One such context is Library Science, Archival Science and Documentation. The interactions between users and professionals in this area, through tools such as Twitter, Facebook, RSS feeds and blogs, generate large flows of unstructured data (streams). These streams can be studied to analyze aspects such as influence, closeness relationships, opinion and the generation of recommendations, so that libraries obtain further benefits from the use of information and communication technologies. From the data mining perspective, processing these streams poses significant challenges. Mining algorithms used in this context must account for the high speed at which data arrive, the diversity of the data sources and their structure, the variability of the data over time, and working without memory restrictions. This article reviews the state of the art in data mining algorithms for streams originating in social networks, specifically Facebook and Twitter. It reviews the most representative classification and clustering techniques and how each of them contributes to knowledge discovery in the field of Library Science. It concludes by presenting some of the problems that are the subject of active research.

  6. MONITORING OF MINING

    Directory of Open Access Journals (Sweden)

    Berislav Šebečić

    1996-12-01

    Full Text Available The way mining was monitored in the past depended on knowledge, interest and the existing legal regulations. Documentary evidence about this work can be found in archives, libraries and museums. In particular, there is the rich archival material (papers and books) concerning the work of the one-time Imperial and Royal Mining Captaincies in Zagreb, Zadar, Klagenfurt and Split. A minor part of the documentation has not yet been transferred to Croatia. From mining handbooks and books we can also find out about mining in Croatia in the context of Austro-Hungary. For example, we can find out that the first governorships in Zagreb and Zadar, headed by the Ban, Count Jelačić, and by Baron Mamula, were also the top mining authorities, though this, probably from political motives, was suppressed in the guides and inventories of the Mining Captaincies. At the end of the 1850s, Croatia produced 92-94% of sea salt, up to 8.5% of sulphur, 19.5% of asphalt and 100% of oil for the Austro-Hungarian empire. From data about mining in the Split Mining Captaincy, prepared for the Philadelphia Exhibition, it can be seen that in the exploratory mining operations, in which there were 33,372 independent mines declared in 1925, they were looking mainly for bauxite (60.0%), then dark coal (19.0%), asphalts (10.3%) and lignites (62%). In 1931, within the area covered by the same captaincy, of 74 declared mines, only 9 were working. There were five coal mines, three bauxite mines and one for asphalt. I suggest that, within state institutions, the Mining Captaincy or Authority be renewed, or that a Mining and Geological Authority be set up, which would lead to the more complete affirmation of Croatian mining (the paper is published in Croatian).

  7. The Use of Web 2.0 Tools by Students in Learning and Leisure Contexts: A Study in a Portuguese Institution of Higher Education

    Science.gov (United States)

    Costa, Carolina; Alvelos, Helena; Teixeira, Leonor

    2016-01-01

    This study analyses and compares the use of Web 2.0 tools by students in both learning and leisure contexts. Data were collected based on a questionnaire applied to 234 students from the University of Aveiro (Portugal) and the results were analysed by using descriptive analysis, paired samples t-tests, cluster analyses and Kruskal-Wallis tests.…

  8. Data mining techniques for e-learning

    Directory of Open Access Journals (Sweden)

    Irina IONIȚĂ

    2016-10-01

    Full Text Available Data Mining (DM), sometimes called Knowledge Discovery in Databases (KDD), is a powerful new technology with great potential to help companies focus on the most important information in the data they have collected via transactions. In the education field, the prediction of students' learning performance, the detection of inappropriate learning behaviours, and the development of student profiles may be considered e-learning problems that data mining can successfully solve. In this paper, the author analyses the possibilities of applying data mining techniques in an e-learning context to predict students' status with reference to their activities and their interest in using advanced tutoring tools. The experiments were performed on data provided by an e-learning platform (Moodle) regarding the logging parameters of students enrolled in the Interactive Tutoring Systems discipline during the second semester of the current year.

  9. Main development trends and some technical decisions on mining tools equipped with super-hard composite materials inserts

    Science.gov (United States)

    Dvornikov, LT; Krestovozdvizhensky, PD; Nikitenko, SM; Korneyev, VA; Korneyev, PA

    2017-02-01

    Directions of a significant increase in effectiveness of rock destruction with tools equipped with super-hard composite material inserts are reviewed and justified. Drilling tool designs with the cutting insert in the form of elliptical Cassinian oval and the asymmetric ring cleaves are suggested. Versions of laboratory stand designs in order to determine the power consumption of rock destruction are developed.

  10. A Mapping of Tools for Informing Water Sensitive Urban Design Planning Decisions—Questions, Aspects and Context Sensitivity

    DEFF Research Database (Denmark)

    Lerer, Sara Maria; Arnbjerg-Nielsen, Karsten; Mikkelsen, Peter Steen

    2015-01-01

    Water Sensitive Urban Design (WSUD) poses new challenges for decision makers compared with traditional stormwater management, e.g., because WSUD offers a larger selection of measures and because many measures are multifunctional. These challenges have motivated the development of many decision...... support tools. This review shows that the tools differ in terms of the types of questions they can assist in answering. We identified three main groups: “How Much”-tools, “Where”-tools and “Which”-tools. The “How Much”-tools can further be grouped into tools quantifying hydraulic impacts, hydrologic...... impacts, water quality impacts, non-flow-related impacts and economic impacts. Additionally, the tools differ in terms of how many aspects of water they address, from those focused only on bio-physical aspects to those attempting to find the best WSUD based on multiple criteria. Finally, we suggest...

  11. Data Mining as a Powerful Tool for Creating Novel Drugs in Cardiovascular Medicine: The Importance of a "Back-and-Forth Loop" Between Clinical Data and Basic Research.

    Science.gov (United States)

    Kitakaze, Masafumi; Asakura, Masanori; Nakano, Atsushi; Takashima, Seiji; Washio, Takashi

    2015-06-01

    Cardiovascular diseases, which lead to cardiovascular events including death, progress with many deleterious pathophysiological sequels. If a cause-and-effect relationship follows a one-to-one relation, we can focus on a cause to treat an effect, but such a relation cannot be applied in cardiovascular diseases. To identify novel drugs in the cardiovascular field, we generally adopt two different strategies: induction and deduction. In the cardiovascular field, it is difficult to use deduction because cardiovascular diseases are caused by many factors, leading us to use induction. In this method, we consider all clinical data, such as medical records or genetic data, and identify a few candidates. Recent computational and mathematical advances enable us to use data-mining methods to uncover hidden relationships between many parameters and clinical outcomes. However, because these candidates are not identified as promoting or inhibiting factors, or as causal or consequent factors of cardiovascular diseases, we need to test them in basic research, and bring them back to the clinical field to test their efficacy in clinical trials. With such a "back-and-forth loop" between clinical observation and basic research, data-mining methods may provide novel strategies leading to new tools for clinicians, basic findings for researchers, and better outcomes for patients.

  12. Chimpanzees' Context-Dependent Tool Use Provides Evidence for Separable Representations of Hand and Tool Even during Active Use within Peripersonal Space

    Science.gov (United States)

    Povinelli, Daniel J.; Reaux, James E.; Frey, Scott H.

    2010-01-01

    Considerable attention has been devoted to behaviors in which tools are used to perform actions in extrapersonal space by extending the reach. Evidence suggests that these behaviors result in an expansion of the body schema and peripersonal space. However, humans often use tools to perform tasks within peripersonal space that cannot be…

  14. Performance Analysis of Anti-Phishing Tools and Study of Classification Data Mining Algorithms for a Novel Anti-Phishing System

    Directory of Open Access Journals (Sweden)

    Rajendra Gupta

    2015-11-01

    Full Text Available Phishing is a form of website spoofing used to steal sensitive information from web users, such as online banking passwords, credit card information and account passwords. In a phishing attack, the attacker sends the user warning messages about security issues, asks for confidential information through phishing emails, asks the user to update account information, and so on. Several experimental designs have been proposed earlier to counter phishing attacks. The earlier systems do not achieve success rates above 90 percent, and in some cases a tool achieves only 50-60 percent. In this paper, a novel algorithm is developed to check the performance of the anti-phishing system, and the resulting data set is compared with the data sets of existing anti-phishing tools. The performance of the novel anti-phishing system is evaluated with four different classification data mining algorithms, namely Class Imbalance Problem (CIP), Rule based Classifier (Sequential Covering Algorithm (SCA)), Nearest Neighbour Classification (NNC) and Bayesian Classifier (BC), on a data set of phishing and legitimate websites. The proposed system shows a lower error rate and better performance than other existing tools.

  15. A New Tool for Assessing Context Conditioning Induced by US-Unpredictability in Humans: The Martians Task Restyled

    Science.gov (United States)

    Meulders, Ann; Vervliet, Bram; Vansteenwegen, Debora; Hermans, Dirk; Baeyens, Frank

    2011-01-01

    Unpredictability of an unconditioned stimulus (US) typically produces context conditioning in animals and humans. We modified the Martians task--a computer game measuring learning of Pavlovian associations through conditioned suppression--for assessing context conditioning in humans. One between-subjects and one within-subjects study are reported.…

  17. Environmental accounting as a management tool in the Mediterranean context: the Spanish economy during the last 20 years.

    Science.gov (United States)

    Lomas, Pedro L; Alvarez, Sergio; Rodríguez, Marta; Montes, Carlos

    2008-07-01

    Although human presence is one of the main characteristics of the Mediterranean identity since ancient times, a false dialectic between conservation and social-economic development has emerged in recent decades. On the one hand, an economic growth policy is taken as the paradigm of social-economic development; on the other hand, there is a multi-scale conservation policy, in which natural protected areas, as patches of preserved nature, are used as one of the main tools to deal with the challenge of sustainability. The Mediterranean Basin is the habitat of many unique species and one of the 25 main biodiversity hotspots in the world, and as a consequence a strong conservation policy has been used to protect environmental values. At the same time, Mediterranean countries are deeply involved in promoting strong economic growth policies, which are not always compatible with environmental ones. In this paper, Spain has been studied as one model of this situation. Due to political reasons, Spanish economic growth and conservationist policies were pursued together during the last 20 years. As a result, Spain owns one of the largest networks of natural protected areas in Western Europe, and at the same time it has experienced one of the strongest periods of economic growth in the European and Mediterranean context during the 1980s and 1990s. A historical series of resource use in five annual periods in the last 20 years of conservation policy, and the effects on the preservation of natural capital, have been investigated by means of the eMergy (spelled with an 'm') synthesis approach, which was used to characterize the flow of environmental services supplied by ecosystems, but not in monetary terms. This study shows that Spain is becoming less self-sufficient and more inefficient in resource use, comprehensively measured in eMergy terms. A large part of Spain's economy depends on imported goods and services, and most economic activities are based on tourist services and

  18. Deep neural networks: A promising tool for fault characteristic mining and intelligent diagnosis of rotating machinery with massive data

    Science.gov (United States)

    Jia, Feng; Lei, Yaguo; Lin, Jing; Zhou, Xin; Lu, Na

    2016-05-01

    Aiming to promptly process the massive fault data and automatically provide accurate diagnosis results, numerous studies have been conducted on intelligent fault diagnosis of rotating machinery. Among these studies, the methods based on artificial neural networks (ANNs) are commonly used, which employ signal processing techniques for extracting features and further input the features to ANNs for classifying faults. Though these methods did work in intelligent fault diagnosis of rotating machinery, they still have two deficiencies. (1) The features are manually extracted depending on much prior knowledge about signal processing techniques and diagnostic expertise. In addition, these manual features are extracted according to a specific diagnosis issue and probably unsuitable for other issues. (2) The ANNs adopted in these methods have shallow architectures, which limits the capacity of ANNs to learn the complex non-linear relationships in fault diagnosis issues. As a breakthrough in artificial intelligence, deep learning holds the potential to overcome the aforementioned deficiencies. Through deep learning, deep neural networks (DNNs) with deep architectures, instead of shallow ones, could be established to mine the useful information from raw data and approximate complex non-linear functions. Based on DNNs, a novel intelligent method is proposed in this paper to overcome the deficiencies of the aforementioned intelligent diagnosis methods. The effectiveness of the proposed method is validated using datasets from rolling element bearings and planetary gearboxes. These datasets contain massive measured signals involving different health conditions under various operating conditions. The diagnosis results show that the proposed method is able to not only adaptively mine available fault characteristics from the measured signals, but also obtain superior diagnosis accuracy compared with the existing methods.

  19. The power of data--from data mining to consumer pricing and quality-of-care tools.

    Science.gov (United States)

    Malof, Leah C

    2013-01-01

    Transparency tools, whether offered by carriers or third-party administrators, rely on adequate experience, by market and by service, to provide information to consumers about health care costs and quality of care. The opportunities for savings to individual consumers and to employer-sponsored health plans are clearly significant and possible if people will use the tools and act. This article reviews two studies showing a shift in consumer claims experience to less costly services after the implementation of a transparency tool and when combined with a consumer-driven health plan. It also outlines best practices employers can implement to carefully craft interventions to engage and create value in the minds of health care consumers.

  20. Variations of Pb in a mine-impacted tropical river, Taxco, Mexico: Use of geochemical, isotopic and statistical tools

    Energy Technology Data Exchange (ETDEWEB)

    Arcega-Cabrera, F. [Unidad de Quimica en Sisal, Facultad de Quimica, UNAM, Sisal 97355 (Mexico)], E-mail: arcega@icmyl.unam.mx; Armienta, M.A. [Instituto de Geofisica, UNAM, Mexico 04510 (Mexico); Daessle, L.W. [Instituto de Investigaciones Oceanologicas, UABC, Ensenada 22870 (Mexico); Castillo-Blum, S.E. [Facultad de Quimica, UNAM, Mexico 04510 (Mexico); Talavera, O. [Escuela de Ciencias de la Tierra, UAG, Taxco Viejo 40201 (Mexico); Dotor, A. [Instituto de Geofisica, UNAM, Mexico 04510 (Mexico)

    2009-01-15

    The potential environmental threat from Pb in Mexican rivers impacted by historic mining activities was studied using geochemical, isotopic and statistical methods. Lead geochemical fractionation and factor analysis of fractionated and total Pb indicate that anthropogenic sources have contributed significantly to Pb concentrations, while natural sources have contributed only small amounts. The analyses also indicate that two main processes are controlling the total Pb variation throughout the year in both rivers: erosion with discharge processes, and proportional dilution related to differences in grain-size distribution processes. Bio-available Pb in riverbed sediments was greater than 50% in 80% of the sampling stations, indicating a high potential environmental risk according to the risk assessment criteria (RAC). Nevertheless, based on the environmental chemistry of Pb and on multivariate statistical analysis, these criteria did not apply in this particular case. Significant differences (p < 0.05) in total Pb concentrations (from 50 to 5820 mg kg⁻¹) and in the geochemical fractionation were observed as a function of seasonality and location along the river flow path. In the Cacalotenango and Taxco rivers, the highest concentrations of total Pb were found at stations close to tailings during the rainy and post-rainy seasons. The geochemistry of Pb was mainly controlled during the dry and post-rainy seasons by the organic matter and carbonate content, and in the rainy season by hydrological conditions (e.g., the increase in river flux), hydrological basin erosion, and the suspended solids concentration. Isotopic analyses of the ²¹⁰Pb/²¹⁴Pb ratio showed three processes in the Cacalotenango and Taxco rivers. First, the accumulation of atmospheric excess ²¹⁰Pb, favoured during calmer hydrodynamic conditions in the river basin commonly during dry periods, is recorded by a ²¹⁰Pb/²¹⁴Pb ratio of >1. In the case of the Cacalotenango

  1. Scientific Data Mining in Astronomy

    OpenAIRE

    Borne, Kirk

    2009-01-01

    We describe the application of data mining algorithms to research problems in astronomy. We posit that data mining has always been fundamental to astronomical research, since data mining is the basis of evidence-based discovery, including classification, clustering, and novelty discovery. These algorithms represent a major set of computational tools for discovery in large databases, which will be increasingly essential in the era of data-intensive astronomy. Historical examples of data mining...

  2. Drought severity-duration-frequency curves: a foundation for risk assessment and planning tool for ecosystem establishment in post-mining landscapes

    Science.gov (United States)

    Halwatura, D.; Lechner, A. M.; Arnold, S.

    2015-02-01

    Eastern Australia has considerable mineral and energy resources, with areas of high biodiversity value co-occurring over a broad range of agro-climatic environments. Lack of water is the primary abiotic stressor for (agro)ecosystems in many parts of eastern Australia. In the context of mined land rehabilitation, quantifying the severity-duration-frequency (SDF) of droughts is crucial for successful ecosystem rehabilitation to overcome challenges of early vegetation establishment and long-term ecosystem resilience. The objective of this study was to quantify the SDF of short-term and long-term drought events of 11 selected locations across a broad range of agro-climatic environments in eastern Australia by using three drought indices at different timescales: the Standardized Precipitation Index (SPI), the Reconnaissance Drought Index (RDI), and the Standardized Precipitation-Evapotranspiration Index (SPEI). Based on the indices we derived bivariate distribution functions of drought severity and duration, and estimated the recurrence intervals of drought events at different timescales. The correlation between the simple SPI and the more complex SPEI or RDI was stronger for the tropical and temperate locations than for the arid locations, indicating that SPEI or RDI can be replaced by SPI if evaporation plays a minor role for plant available water (tropics). Both short-term and long-term droughts were most severe and prolonged, and recurred most frequently in arid regions, but were relatively rare in tropical and temperate regions. Our approach is similar to intensity-duration-frequency (IDF) analyses of rainfall, which are crucial for the design of hydraulic infrastructure. In this regard, we propose to apply SDF analyses of droughts to design ecosystem components in post-mining landscapes. Together with design rainfalls, design droughts should be used to assess rehabilitation strategies and ecological management using drought recurrence intervals, thereby minimising

  3. De-MetaST-BLAST: a tool for the validation of degenerate primer sets and data mining of publicly available metagenomes.

    Directory of Open Access Journals (Sweden)

    Christopher A Gulvik

    Full Text Available Development and use of primer sets to amplify nucleic acid sequences of interest is fundamental to studies spanning many life science disciplines. As such, the validation of primer sets is essential. Several computer programs have been created to aid in the initial selection of primer sequences that may or may not require multiple nucleotide combinations (i.e., degeneracies). Conversely, validation of primer specificity has remained largely unchanged for several decades, and there are currently few available programs that allow for an evaluation of primers containing degenerate nucleotide bases. To fill this gap, we developed the program De-MetaST, which performs an in silico amplification using user-defined nucleotide sequence dataset(s) and primer sequences that may contain degenerate bases. The program returns an output file that contains the in silico amplicons. When De-MetaST is paired with NCBI's BLAST (De-MetaST-BLAST), the program also returns the top 10 nr NCBI database hits for each recovered in silico amplicon. While the original motivation for development of this search tool was degenerate primer validation using the wealth of nucleotide sequences available in environmental metagenome and metatranscriptome databases, this search tool has potential utility in many data mining applications.
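
    As an illustration of the kind of in silico amplification this record describes, the following is a minimal Python sketch; it is not De-MetaST's actual code. It assumes that IUPAC degenerate bases are expanded into regular-expression character classes and that an amplicon is any stretch running from a forward-primer match to a downstream match of the reverse primer's reverse complement. The function names and the `max_len` cutoff are hypothetical, and only one strand of the template is searched.

```python
import re

# IUPAC nucleotide codes mapped to regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}
# Complement table that also covers the degenerate codes.
COMPLEMENT = str.maketrans("ACGTRYSWKMBDHVN", "TGCAYRSWMKVHDBN")


def primer_to_regex(primer):
    """Expand a (possibly degenerate) primer into a regex pattern."""
    return "".join(IUPAC[base] for base in primer.upper())


def reverse_complement(seq):
    """Reverse-complement a sequence, keeping degenerate codes consistent."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def in_silico_amplicons(template, forward, reverse, max_len=2000):
    """Return amplicons found on the given strand of `template`.

    Hypothetical helper: overlapping primer sites are not enumerated,
    and mismatches are not tolerated.
    """
    template = template.upper()
    fwd = re.compile(primer_to_regex(forward))
    rev = re.compile(primer_to_regex(reverse_complement(reverse)))
    amplicons = []
    for f in fwd.finditer(template):
        # Look for the reverse-primer site downstream of the forward site.
        for r in rev.finditer(template, f.end()):
            if r.end() - f.start() <= max_len:
                amplicons.append(template[f.start():r.end()])
    return amplicons


if __name__ == "__main__":
    # Made-up template and primers, for illustration only.
    print(in_silico_amplicons("TTACGTAGGGCCCTTTAAACGATCC", "ACGTR", "GATYG"))
```

    A fuller validation tool would also search the opposite strand and pass the recovered amplicons to a similarity search; both are left out of this sketch.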

  4. A Novel Computational Tool for Mining Real-Life Data: Application in the Metastatic Colorectal Cancer Care Setting

    Science.gov (United States)

    Siegelmann-Danieli, Nava; Farkash, Ariel; Katzir, Itzhak; Vesterman Landes, Janet; Rotem Rabinovich, Hadas; Lomnicky, Yossef; Carmeli, Boaz; Parush-Shear-Yashuv, Naama

    2016-01-01

    Background Randomized clinical trials constitute the gold-standard for evaluating new anti-cancer therapies; however, real-life data are key in complementing clinically useful information. We developed a computational tool for real-life data analysis and applied it to the metastatic colorectal cancer (mCRC) setting. This tool addressed the impact of oncology/non-oncology parameters on treatment patterns and clinical outcomes. Methods The developed tool enables extraction of any computerized information including comorbidities and use of drugs (oncological/non-oncological) per individual HMO member. The study in which we evaluated this tool was a retrospective cohort study that included Maccabi Healthcare Services members with mCRC receiving bevacizumab with fluoropyrimidines (FP), FP plus oxaliplatin (FP-O), or FP plus irinotecan (FP-I) in the first-line between 9/2006 and 12/2013. Results The analysis included 753 patients of whom 15.4% underwent subsequent metastasectomy (the Surgery group). For the entire cohort, median overall survival (OS) was 20.5 months; in the Surgery group, median duration of bevacizumab-containing therapy (DOT) pre-surgery was 6.1 months; median OS was not reached. In the Non-surgery group, median OS and DOT were 18.7 and 11.4 months, respectively; no significant OS differences were noted between FP-O and FP-I, whereas FP use was associated with shorter OS (12.3 month; p <0.002; notably, these patients were older). Patients who received both FP-O- and FP-I-based regimens achieved numerically longer OS vs. those who received only one of these regimens (22.1 [19.9–24.0] vs. 18.9 [15.5–21.9] months). Among patients assessed for wild-type KRAS and treated with subsequent anti-EGFR agent, OS was 25.4 months and 18.7 months for 124 treated vs. 37 non-treated patients (non-significant). Cox analysis (controlling for age and gender) identified several non-oncology parameters associated with poorer clinical outcomes including concurrent use of

  5. A Novel Computational Tool for Mining Real-Life Data: Application in the Metastatic Colorectal Cancer Care Setting.

    Directory of Open Access Journals (Sweden)

    Nava Siegelmann-Danieli

    Full Text Available Randomized clinical trials constitute the gold-standard for evaluating new anti-cancer therapies; however, real-life data are key in complementing clinically useful information. We developed a computational tool for real-life data analysis and applied it to the metastatic colorectal cancer (mCRC) setting. This tool addressed the impact of oncology/non-oncology parameters on treatment patterns and clinical outcomes. The developed tool enables extraction of any computerized information including comorbidities and use of drugs (oncological/non-oncological) per individual HMO member. The study in which we evaluated this tool was a retrospective cohort study that included Maccabi Healthcare Services members with mCRC receiving bevacizumab with fluoropyrimidines (FP), FP plus oxaliplatin (FP-O), or FP plus irinotecan (FP-I) in the first-line between 9/2006 and 12/2013. The analysis included 753 patients of whom 15.4% underwent subsequent metastasectomy (the Surgery group). For the entire cohort, median overall survival (OS) was 20.5 months; in the Surgery group, median duration of bevacizumab-containing therapy (DOT) pre-surgery was 6.1 months; median OS was not reached. In the Non-surgery group, median OS and DOT were 18.7 and 11.4 months, respectively; no significant OS differences were noted between FP-O and FP-I, whereas FP use was associated with shorter OS (12.3 months; p <0.002; notably, these patients were older). Patients who received both FP-O- and FP-I-based regimens achieved numerically longer OS vs. those who received only one of these regimens (22.1 [19.9-24.0] vs. 18.9 [15.5-21.9] months). Among patients assessed for wild-type KRAS and treated with subsequent anti-EGFR agent, OS was 25.4 months and 18.7 months for 124 treated vs. 37 non-treated patients (non-significant). Cox analysis (controlling for age and gender) identified several non-oncology parameters associated with poorer clinical outcomes including concurrent use of diuretics and proton

  6. Simultaneous Screening of Glutathione and Cyanide Adducts Using Precursor Ion and Neutral Loss Scans-Dependent Product Ion Spectral Acquisition and Data Mining Tools

    Science.gov (United States)

    Jian, Wenying; Liu, Hua-Fen; Zhao, Weiping; Jones, Elliott; Zhu, Mingshe

    2012-05-01

    Drugs can be metabolically activated to soft and hard electrophiles, which are readily trapped by glutathione (GSH) and cyanide (CN), respectively. These adducts are often detected and structurally characterized using separate tandem mass spectrometry methods. We describe a new method for simultaneous screening of GSH and CN adducts using precursor ion (PI) and neutral loss (NL) scans-dependent product ion spectral acquisition and data mining tools on a triple quadrupole linear ion trap mass spectrometer. GSH, potassium cyanide, and their stable isotope labeled analogues were incubated with liver microsomes and a test compound. Negative PI scan of m/z 272 for detection of GSH adducts and positive NL scans of 27 and 29 Da for detection of CN adducts were conducted as survey scans to trigger acquisition of enhanced resolution (ER) spectrum and subsequent enhanced product ion (EPI) spectrum. Post-acquisition data mining of the EPI data set using NL filters of 129 and 27 Da was then performed to reveal the GSH adducts and CN adducts, respectively. Isotope patterns and EPI spectra of the detected adducts were utilized for identification of their molecular weights and structures. The effectiveness of this method was evaluated by analyzing reactive metabolites of nefazodone formed from rat liver microsomes. In addition to known GSH- and CN-trapped reactive metabolites, several new CN adducts of nefazodone were identified. The results suggested that the current approach is highly effective in the analysis of both soft and hard reactive metabolites and can be used as a high-throughput method in drug discovery.
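
    The post-acquisition neutral loss filtering described in this abstract can be illustrated with a minimal sketch. It is not the authors' software: the function name `neutral_loss_hits`, the tolerance, and all m/z values are hypothetical, and it assumes singly charged ions so that an m/z difference equals the mass lost.

```python
def neutral_loss_hits(spectra, loss_da, tol=0.5):
    """Return ids of spectra that contain a fragment at (precursor - loss_da).

    spectra: iterable of (spectrum_id, precursor_mz, fragment_mzs).
    Assumes singly charged ions, so m/z differences equal mass losses.
    """
    hits = []
    for spec_id, precursor_mz, fragments in spectra:
        target = precursor_mz - loss_da
        if any(abs(mz - target) <= tol for mz in fragments):
            hits.append(spec_id)
    return hits


# Hypothetical product ion spectra; the m/z values are made up for illustration.
spectra = [
    ("s1", 500.0, [371.0, 200.1]),  # fragment at precursor - 129
    ("s2", 400.0, [373.1, 150.0]),  # fragment at about precursor - 27
    ("s3", 450.0, [300.0, 120.0]),  # neither loss present
]

gsh_candidates = neutral_loss_hits(spectra, 129.0)  # 129 Da filter (GSH adducts)
cn_candidates = neutral_loss_hits(spectra, 27.0)    # 27 Da filter (CN adducts)
```

    Spectra flagged this way would only be candidates; the abstract notes that isotope patterns and the spectra themselves were then used to identify molecular weights and structures.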

  7. Suite of tools for statistical N-gram language modeling for pattern mining in whole genome sequences.

    Science.gov (United States)

    Ganapathiraju, Madhavi K; Mitchell, Asia D; Thahir, Mohamed; Motwani, Kamiya; Ananthasubramanian, Seshan

    2012-12-01

    Genome sequences contain a number of patterns that have biomedical significance. Repetitive sequences of various kinds are a primary component of most of the genomic sequence patterns. We extended the suffix-array based Biological Language Modeling Toolkit to compute n-gram frequencies as well as n-gram language-model based perplexity in windows over the whole genome sequence to find biologically relevant patterns. We present the suite of tools and their application for analysis on whole human genome sequence.
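
    As a rough illustration of the two quantities named in this abstract, the sketch below counts n-grams and computes a windowed perplexity over a sequence. It is an assumption-laden toy: it uses an in-memory dictionary rather than the suffix-array toolkit the authors extended, add-one smoothing chosen here for simplicity, and hypothetical function names and window sizes.

```python
import math
from collections import Counter


def ngram_counts(seq, n):
    """Count every overlapping n-gram in seq."""
    return Counter(seq[i:i + n] for i in range(len(seq) - n + 1))


def window_perplexity(seq, n=3, window=20, step=10, alphabet="ACGT"):
    """Perplexity of each window under an add-one-smoothed n-gram model
    estimated from the whole sequence (n >= 2).

    Returns a list of (window_start, perplexity) pairs.
    """
    grams = ngram_counts(seq, n)
    contexts = ngram_counts(seq, n - 1)
    v = len(alphabet)
    results = []
    for start in range(0, len(seq) - window + 1, step):
        chunk = seq[start:start + window]
        log_prob = 0.0
        terms = 0
        for i in range(len(chunk) - n + 1):
            gram = chunk[i:i + n]
            # P(last symbol | previous n-1 symbols), add-one smoothed
            p = (grams[gram] + 1) / (contexts[gram[:-1]] + v)
            log_prob += math.log2(p)
            terms += 1
        results.append((start, 2 ** (-log_prob / terms)))
    return results


# Toy sequence: a long homopolymer run followed by a more varied stretch.
toy = "A" * 40 + "ACGTTGCAAGCTTCGAGTCA"
profile = window_perplexity(toy)
```

    In this toy profile the repetitive windows score a lower perplexity than the varied tail, which is the kind of contrast a windowed scan over a genome would look for.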

  8. Using workload measurement tools in diverse care contexts: the experience of staff in mental health and learning disability inpatient settings.

    Science.gov (United States)

    Fanneran, T; Brimblecombe, N; Bradley, E; Gregory, S

    2015-12-01

    What is known on the subject? Difficulties with the recruitment and retention of qualified nursing staff have resulted in nursing shortages worldwide with a consequential impact on the quality of care. It is increasingly recommended that evidence-based staffing levels are central to the development of workforce plans. Due to a paucity of empirical research in mental health and learning disability services, the staffing needs and requirements for these settings are undefined and the availability of tools to aid staffing decisions is limited. What this paper adds to existing knowledge? This paper provides a valuable insight into the practical uses of these tools as perceived by staff members with day-to-day experience of the requirements of mental health and learning disability wards. It reveals that while workload measurement tools are considered a valuable aid for the development of workforce plans, they are limited in their ability to capture all aspects of care provision in these settings. It further emphasizes the inapplicability of a one-shoe-fits-all approach for determining nurse staffing levels and the need for individual and customized workforce plans. What are the implications for practice? This study demonstrates that the development of tools for use in mental health and learning disability services is in its infancy, and that no tool has yet been validated as such. It highlights the potential for workload measurement tools to aid staffing decisions; however, a more holistic approach that considers additional factors is needed to ensure robust workforce planning models are developed for these services. The critical challenge of determining the correct level and skill mix of nursing staff required to deliver safe and effective health care has become an international concern. It is recommended that evidence-based staffing decisions are central to the development of future workforce plans. Workforce planning in mental health and learning disability nursing is

  9. METSP: a maximum-entropy classifier based text mining tool for transporter-substrate identification with semistructured text.

    Science.gov (United States)

    Zhao, Min; Chen, Yanming; Qu, Dacheng; Qu, Hong

    2015-01-01

    The substrates of a transporter are not only useful for inferring the function of the transporter, but also important for discovering compound-compound interactions and reconstructing metabolic pathways. Though plenty of data has been accumulated with the development of new technologies such as in vitro transporter assays, the search for substrates of transporters is far from complete. In this article, we introduce METSP, a maximum-entropy classifier devoted to retrieving transporter-substrate pairs (TSPs) from semistructured text. Based on the high quality annotation from UniProt, METSP achieves high precision and recall in cross-validation experiments. When METSP is applied to 182,829 human transporter annotation sentences in UniProt, it identifies 3942 sentences with transporter and compound information. Finally, 1547 confident human TSPs are identified for further manual curation, of which 58.37% are pairs with novel substrates not annotated in public transporter databases. METSP is the first efficient tool to extract TSPs from semistructured annotation text in UniProt. This tool can help to determine the precise substrates and drugs of transporters, thus facilitating drug-target prediction, metabolic network reconstruction, and literature classification.

  10. Data Mining – Innovative Method for Obtaining Information in Marketing and Business Management

    Directory of Open Access Journals (Sweden)

    Mirela-Cristina Voicu

    2011-05-01

    Full Text Available The existence of massive amounts of data raised the question of reorienting their use from a retrospective to a prospective operation. Data mining offers the promise of an important aid for discovering hidden patterns in data that can be used to predict the behavior of customers, products and processes. Data mining tools must be guided by users who understand the business, the general nature of the data and the analytical methods involved. Data mining discovers information within the data that queries and reports can’t effectively reveal. It is vital to collect and prepare the data properly and to test models against reality. Choosing the most appropriate data mining product means finding a tool with the required capabilities and an interface that matches the skills of its users, and that can be applied to a specific business problem. In this context, the purpose of this paper is to illustrate some of the problems of company activity which can be solved by using data mining techniques.

  11. Benefits of a clinical data warehouse with data mining tools to collect data for a radiotherapy trial.

    Science.gov (United States)

    Roelofs, Erik; Persoon, Lucas; Nijsten, Sebastiaan; Wiessler, Wolfgang; Dekker, André; Lambin, Philippe

    2013-07-01

    Collecting trial data in a medical environment is at present mostly performed manually and therefore time-consuming, prone to errors and often incomplete with the complex data considered. Faster and more accurate methods are needed to improve the data quality and to shorten data collection times where information is often scattered over multiple data sources. The purpose of this study is to investigate the possible benefit of modern data warehouse technology in the radiation oncology field. In this study, a Computer Aided Theragnostics (CAT) data warehouse combined with automated tools for feature extraction was benchmarked against the regular manual data-collection processes. Two sets of clinical parameters were compiled for non-small cell lung cancer (NSCLC) and rectal cancer, using 27 patients per disease. Data collection times and inconsistencies were compared between the manual and the automated extraction method. The average time per case to collect the NSCLC data manually was 10.4 ± 2.1 min and 4.3 ± 1.1 min when using the automated method (pdata collected for NSCLC and 5.3% for rectal cancer, there was a discrepancy between the manual and automated method. Aggregating multiple data sources in a data warehouse combined with tools for extraction of relevant parameters is beneficial for data collection times and offers the ability to improve data quality. The initial investments in digitizing the data are expected to be compensated due to the flexibility of the data analysis. Furthermore, successive investigations can easily select trial candidates and extract new parameters from the existing databases. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.

  12. Qlikview Audit Tool (QLIKVIEW)

    Data.gov (United States)

    Department of Transportation — This tool supports the cyclical financial audit process. Qlikview supports large volumes of financial transaction data that can be mined, summarized and presented to...

  13. Mining Deployment Optimization

    Science.gov (United States)

    Čech, Jozef

    2016-09-01

    The deployment problem, researched primarily in the military sector, is emerging in some other industries, mining included. The principal decision is how to deploy some activities in space and time to achieve a desired outcome while complying with certain requirements or limits. Requirements and limits are on the side of constraints, while minimizing costs or maximizing some benefits are on the side of objectives. A model with application to mining of a polymetallic deposit is presented. The main intention of the computer-based tool is to give a mining engineer quick decision solutions with possibilities for experimentation. The task is to determine the strategic deployment of mining activities on a deposit, meeting planned output from the mine and at the same time complying with limited reserves and haulage capacities. Priorities and benefits can be formulated by the planner.

  14. PMLAB: An scripting environment for process mining

    OpenAIRE

    Carmona Vargas, Josep; Solé, Marc

    2014-01-01

    In a decade of process mining research, several algorithms have been proposed to solve particular process mining tasks. At the same pace, tools have appeared both in the academic and the commercial domains. These tools have enabled the use of process mining practices to a rather limited extent. In this paper we advocate for a change in the mentality: process mining may be an exploratory discipline, and PMLAB - a Python-based scripting environment supporting this - is proposed. This demo prese...

  15. "easyMine" – realistic and systematic mine detection simulation tool

    Directory of Open Access Journals (Sweden)

    U. Böttger

    2004-01-01

    Full Text Available Mine detection is to date mainly performed with metal detectors, although new methods for UXO detection are being explored worldwide. The main problem for mine detection to date is that, although there are some ideas about which sensor combinations could yield a high score, there has so far been no systematic analysis of mine detection methods together with realistic environmental conditions from which to conclude on a physically and technically optimized sensor combination. This gap will be closed by the project "easyMine" (Realistic and systematic Mine Detection Simulation Tool), which will result in a simulation tool for optimizing land mine detection in a realistic mine field. The project idea for this software tool is presented; it will simulate the closed chain of mine detection, including the mine in its natural environment, the sensor, and the evaluation and application of the measurements by a user. The tool will be modularly designed. Each chain link will be an independent, exchangeable sub-module and will describe a stand-alone part of the whole mine detection procedure. The advantage of the tool will be the evaluation of very different kinds of sensor combinations in relation to their real potential for mine detection. Three detection methods (metal detector, GPR and imaging IR-radiometry) will be explained and introduced into the easyMine software tool in a first step. An actual example of a land mine detection problem will be presented and approaches for solutions with easyMine will be shown.

  16. Longwall mining

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    1995-03-14

    As part of EIA's program to provide information on coal, this report, Longwall-Mining, describes longwall mining and compares it with other underground mining methods. Using data from EIA and private sector surveys, the report describes major changes in the geologic, technological, and operating characteristics of longwall mining over the past decade. Most important, the report shows how these changes led to dramatic improvements in longwall mining productivity. For readers interested in the history of longwall mining and greater detail on recent developments affecting longwall mining, the report includes a bibliography.

  17. THE INFLUENCE OF CONTEXT IN VIRTUAL TOOL USING LA INFLUENCIA DEL CONTEXTO EN LOS USOS DE LAS HERRAMIENTAS VIRTUALES

    Directory of Open Access Journals (Sweden)

    María Luisa Sevillano García

    2010-12-01

    Full Text Available This article is the product of an investigation that pursued the following objectives: to learn how virtual tools are used by a population representative in age, profession, gender and education; to identify the difficulties users encounter and look for their causes; and to assess the advantages of each tool in professional life, in leisure time and in teaching work. Quota sampling was chosen, basically for its rigor. The data acquisition tools that were developed and applied (questionnaire protocol, interview script, and discussion group) make it possible to go beyond the frontier of quantitative vs qualitative analysis and to advance into a mixed methodology. A quick, sure, selective, pertinent use of the studied virtual tools represents a principle of innovation, access to knowledge and development of professional performance. The qualifying in basic competences for building, transferring and processing general and specialized knowledge that can be found on the net and that is useful throughout one's life.

  18. Analysis of biological processes and diseases using text mining approaches.

    Science.gov (United States)

    Krallinger, Martin; Leitner, Florian; Valencia, Alfonso

    2010-01-01

    A number of biomedical text mining systems have been developed to extract biologically relevant information directly from the literature, complementing bioinformatics methods in the analysis of experimentally generated data. We provide a short overview of the general characteristics of natural language data, existing biomedical literature databases, and lexical resources relevant in the context of biomedical text mining. A selected number of practically useful systems are introduced together with the type of user queries supported and the results they generate. The extraction of biological relationships, such as protein-protein interactions as well as metabolic and signaling pathways using information extraction systems, will be discussed through example cases of cancer-relevant proteins. Basic strategies for detecting associations of genes to diseases together with literature mining of mutations, SNPs, and epigenetic information (methylation) are described. We provide an overview of disease-centric and gene-centric literature mining methods for linking genes to phenotypic and genotypic aspects. Moreover, we discuss recent efforts for finding biomarkers through text mining and for gene list analysis and prioritization. Some relevant issues for implementing a customized biomedical text mining system will be pointed out. To demonstrate the usefulness of literature mining for the molecular oncology domain, we implemented two cancer-related applications. The first tool consists of a literature mining system for retrieving human mutations together with supporting articles. Specific gene mutations are linked to a set of predefined cancer types. The second application consists of a text categorization system supporting breast cancer-specific literature search and document-based breast cancer gene ranking. 
    Future trends in text mining emphasize the importance of community efforts such as the BioCreative challenge for the development and integration of multiple systems into

  19. Added value of nuclear magnetic resonance logging as delineation tool in EHO context : an example from Orinoco Heavy oil belt

    Energy Technology Data Exchange (ETDEWEB)

    Cochet, F.; Mus, E. [PDVSA Petroleos de Venezuela SA, Caracas (Venezuela, Bolivarian Republic of). Petrocedeno]; Marcos, J. [Baker Hughes, Houston, TX (United States)]

    2009-07-01

    In preparation for the upcoming implementation of a thermal oil recovery project, an intensive delineation campaign was recently conducted in a specific area of the Junin block main field of Petrocedeno, operated by Petroleos de Venezuela SA (PDVSA) Petrocedeno joint venture. A complete acquisition program was defined in terms of conventional logging (GR, resistivity, neutron/density, sonic, formation pressure), coring, and unconventional logging with imagery logs and nuclear magnetic resonance (NMR) with the objective of delineating the field. The purpose of this paper was to show the significant added value and step forward the NMR acquisition brought in fluid and reservoir characterization. The paper described the input of standard NMR logging, including fluid characterization; petrophysical support; tool limitation; NMR gamma inversion reprocessing; methodology; and application to EOR project area. Applications for field development were also presented including application for cold and hot production development. It was concluded that the NMR is a suitable tool for fluid characterization and for improving future development of the Petrocedeno extra heavy oil field. It has the capability of identifying fluids that are poorly characterized, or not characterized, by conventional logs. 10 refs., 18 figs.

  20. E-Learning Lifecycles: How Communities and Context can affect E-learning Specifications and Tool Design

    Directory of Open Access Journals (Sweden)

    Michael Magee

    2004-10-01

    Full Text Available The development of a large body of e-learning specifications, such as IMS and SCORM, has led to the proposal for a new way to facilitate content workflow. This involves the movement of educational digital content and the knowledge of pedagogical communities into an online space. Several projects have looked at the theoretical structure of these specifications. They implemented a series of tools in order to examine and research the issues around the actual usage of these specifications. The CAREO, ALOHA and ALOHA 2 projects were designed to expose both individual users and whole institutions to these ideas. Initial research into the result of those interactions indicates that there is some utility in the adoption of e-learning specifications. The future success of them will depend on their ability to adapt and meet the needs of the educational community as they begin to adopt, use and evolve the way they use the specifications and the tools created around them.

  1. Human health risk assessment of lead from mining activities at semi-arid locations in the context of total lead exposure.

    Science.gov (United States)

    Zheng, Jiajia; Huynh, Trang; Gasparon, Massimo; Ng, Jack; Noller, Barry

    2013-12-01

    Lead from historical mining and mineral processing activities may pose potential human health risks if materials with high concentrations of bioavailable lead minerals are released to the environment. Since the Joint Expert Committee on Food Additives of Food and Agriculture Organization/World Health Organization withdrew the Provisional Tolerable Weekly Intake of lead in 2011, an alternative method was required for lead exposure assessment. This study evaluated the potential lead hazard to young children (0-7 years) from a historical mining location at a semi-arid area using the U.S. EPA Integrated Exposure Uptake Biokinetic (IEUBK) Model, with selected site-specific input data. This study assessed lead exposure via the inhalation pathway for children living in a location affected by lead mining activities and with specific reference to semi-arid conditions and made comparison with the ingestion pathway by using the physiologically based extraction test for gastro-intestinal simulation. Sensitivity analysis for major IEUBK input parameters was conducted. Three groups of input parameters were classified according to the results of predicted blood concentrations. The modelled lead absorption attributed to the inhalation route was lower than 2 % (mean ± SE, 0.9 % ± 0.1 %) of all lead intake routes and was demonstrated as a less significant exposure pathway to children's blood, compared with ingestion. Whilst dermal exposure was negligible, diet and ingestion of soil and dust were the dominant parameters in terms of children's blood lead prediction. The exposure assessment identified the changing role of dietary intake when house lead loadings varied. Recommendations were also made to conduct comprehensive site-specific human health risk assessment in future studies of lead exposure under a semi-arid climate.

  2. Assessing and Developing the Application of LEED Green Building Rating System as a Sustainable Project Management and Market Tool in the Italian Context

    Directory of Open Access Journals (Sweden)

    Walaa S. E. Ismaee

    2016-07-01

    Full Text Available The paper discusses the recent introduction of the LEED system to the Italian context in order to assess its role in promoting a sustainable building process there, pointing out its potential on the one hand and its gaps and limitations on the other, and suggests means for its future development. The study discusses the application of LEED as a ‘Sustainable Project management tool’ to guide sustainable building performance. This requires investigating its structure, tools and assessment criteria along with its benchmarks and references. It also discusses the application of LEED as a ‘Sustainable building Certification and market tool’. This investigates the role and value of the LEED certification in the Italian Green market. The research method is comprised of three parts. The first part is a comparative analysis of LEED categories against Italian national initiatives for sustainability. The comparison showed that most LEED categories are already mandated by national norms and directives, but they may differ in their stringency, creating some areas of precedence for the LEED system as well as some drawbacks. This streamlines the adaptation process of the LEED system to the Italian context. The second part investigates LEED projects’ market analysis. The result showed that the shift towards a sustainable building process is occurring slowly and on a vertical scale, focusing on some building sectors rather than others. Its market diffusion in the Italian context faces challenges regarding the insufficient availability of green materials and products satisfying its requirements, as well as the high soft cost of the sustainability tests and expertise required. The third part presents a practical review, citing the methodology and results of a survey conducted by the researchers in mid-2012. It is composed of a web-based questionnaire and interviews among a sample of LEED professionals in Italy. The result shows that LEED systems needs

  3. Data Mining Application in Higher Learning Institutions

    Directory of Open Access Journals (Sweden)

    Naeimeh DELAVARI

    2008-04-01

    Full Text Available One of the biggest challenges that higher learning institutions face today is to improve the quality of managerial decisions. The managerial decision making process becomes more complex as the complexity of educational entities increases. Educational institutes seek more efficient technology to better manage and support decision making procedures or to assist them in setting new strategies and planning for better management of the current processes. One way to effectively address the challenges of improving quality is to provide new knowledge related to the educational processes and entities to the managerial system. This knowledge can be extracted from historical and operational data that reside in the educational organization's databases using the techniques of data mining technology. Data mining techniques are analytical tools that can be used to extract meaningful knowledge from large data sets. This paper presents the capabilities of data mining in the context of the higher educational system by (i) proposing an analytical guideline for higher education institutions to enhance their current decision processes, and (ii) applying data mining techniques to discover new explicit knowledge which could be useful for the decision making processes.

  4. Study of Spanish mining accidents using data mining techniques

    OpenAIRE

    Sanmiquel Pera, Lluís; Rossell Garriga, Josep Maria; Vintró Sánchez, Carla

    2015-01-01

    Mining is an economic sector with a high number of accidents. Mines are hazardous places and workers can suffer a wide variety of injuries. Utilizing a database composed of almost 70,000 occupational accidents and fatality reports corresponding to the decade 2003–2012 in the Spanish mining sector, the paper analyzes the main causes of those accidents. To carry out the study, powerful statistical tools have been applied, such as Bayesian classifiers, decision trees or contingency t...

  5. Uranium from Africa - An overview on past and current mining activities: Re-appraising associated risks and chances in a global context

    Science.gov (United States)

    Winde, Frank; Brugge, Doug; Nidecker, Andreas; Ruegg, Urs

    2017-05-01

    In 2003, nuclear power received renewed interest as a perceived climate-neutral way to meet high energy demands of large industrialized countries, such as China, India, Russia and the USA. It triggered a growing demand for uranium (U) as nuclear fuel. Dubbed the 'nuclear renaissance', the U-price rose over tenfold before the global credit crisis dampened the rush. Many efforts to capitalise on the renewed demand focused on Africa. This paper provides an overview on the type and extent of uranium mining, production and exploration on the African continent and discusses the economic benefits as well as the potential environmental and health risks and the long-term needs for remediation of legacy sites. The actual historical results of uranium mining activities in more than thirty African countries provide data against which to assess the existing risks of uranium development. The already existing uraniferous waste in several African countries threatens scarce water resources and the health of adjacent residents. Responsibility should rest with the governments and the companies to ensure that these threats are not realized.

  6. Data mining in agriculture

    CERN Document Server

    Mucherino, Antonio; Pardalos, Panos M

    2009-01-01

    Data Mining in Agriculture represents a comprehensive effort to provide graduate students and researchers with an analytical text on data mining techniques applied to agriculture and environmental related fields. This book presents both theoretical and practical insights with a focus on presenting the context of each data mining technique rather intuitively with ample concrete examples represented graphically and with algorithms written in MATLAB®. Examples and exercises with solutions are provided at the end of each chapter to facilitate the comprehension of the material. For each data mining technique described in the book variants and improvements of the basic algorithm are also given. Also by P.J. Papajorgji and P.M. Pardalos: Advances in Modeling Agricultural Systems, 'Springer Optimization and its Applications' vol. 25, ©2009.

  7. Data Mining in Social Media

    Science.gov (United States)

    Barbier, Geoffrey; Liu, Huan

    The rise of online social media is providing a wealth of social network data. Data mining techniques provide researchers and practitioners the tools needed to analyze large, complex, and frequently changing social media data. This chapter introduces the basics of data mining, reviews social media, discusses how to mine social media data, and highlights some illustrative examples with an emphasis on social networking sites and blogs.

  8. Documentary analysis of risk-assessment and safety-planning policies and tools in a mental health context.

    Science.gov (United States)

    Higgins, Agnes; Doyle, Louise; Morrissey, Jean; Downes, Carmel; Gill, Ailish; Bailey, Sive

    2016-08-01

    Despite the articulated need for policies and processes to guide risk assessment and safety planning, limited guidance exists on the processes or procedures to be used to develop such policies, and there is no body of research that examines the quality or content of the risk-management policies developed. The aim of the present study was to analyse the policies of risk and safety management used to guide mental health nursing practice in Ireland. A documentary analysis was performed on 123 documents received from 22 of the 23 directors of nursing contacted. Findings from the analysis revealed a wide variation in how risk, risk assessment, and risk management were defined. Emphasis within the risk documentation submitted was on risk related to self and others, with minimal attention paid to other types of risks. In addition, there was limited evidence of recovery-focused approaches to positive risk taking that involved service users and their families within the risk-related documentation. Many of the risk-assessment tools had not been validated, and lacked consistency or guidance in relation to how they were to be used or applied. The tick-box approach and absence of space for commentary within documentation have the potential to impact severely on the quality of information collected and documented, and subsequent clinical decision-making. Managers, and those tasked with ensuring safety and quality, need to ensure that policies and processes are, where possible, informed by best evidence and are in line with national mental health policy on recovery.

  9. Contextual Text Mining

    Science.gov (United States)

    Mei, Qiaozhu

    2009-01-01

    With the dramatic growth of text information, there is an increasing need for powerful text mining systems that can automatically discover useful knowledge from text. Text is generally associated with all kinds of contextual information. Those contexts can be explicit, such as the time and the location where a blog article is written, and the…

  11. SPELEOTHERAPY DEVELOPMENT IN ROMANIA ON THE WORLD CONTEXT AND PERSPECTIVES FOR USE OF SOME SALT MINES AND KARST CAVES FOR SPELEOTHERAPEUTIC AND BALNEOCLIMATIC TOURISM PURPOSES

    Directory of Open Access Journals (Sweden)

    Simionca Iuri

    2013-09-01

    Full Text Available Speleotherapy (ST) is a relatively new method of complementary medicine, although, as is well known, ancient tribes appreciated isolation in underwater caves, in salt or karst caves, or in caves possessing various ores. It was officially recognized in the years 1950-1960 in Germany (K. Spannagel, 1961) and Poland (M. Skulimowski, 1965). Professor Mieczyslaw Skulimowski gave speleotherapy in the "Wieliczka" Salt Mine the name subterraneotherapy, soon also to be called the Skulimowski method. The number of speleotherapeutic centers and speleotherapy symposiums has increased considerably (H. Trimmel, 1994). Underground speleotherapy is now an effective therapeutic method for the treatment of obstructive respiratory tract diseases, especially in patients with bronchial asthma (BA). It is a growing and effective practice in Central and Eastern Europe, but also in the West: Austria, Czech Republic, Germany, Poland, Russia, Slovakia, Ukraine, Belarus, Hungary etc.

  12. Tools for Large Graph Mining

    Science.gov (United States)

    2005-06-01

    intuitive results on a variety of synthetic and real-world datasets. Here, we will verify their scalability. Figure 5.9 shows results on a “caveman” ... show timing results on a “caveman” graph with 3 caves. The plot shows wall-clock time vs. the number of edges E in the graph, for both SPLIT (dashed

  13. Mobile User Preferences Mining System Based on Context History

    Institute of Scientific and Technical Information of China (English)

    刘启华

    2012-01-01

    This paper proposes a context-aware system framework for proactive, personalized mobile networking applications, based on user models that utilize the context history. Based on the proposed framework, the paper presents CAMTRS, a context-aware mobile tourism recommender system that serves a tourist with the information needed in his or her specific context and of interest given the goal at that moment. Experiment and evaluation suggest that the proposed framework is a promising approach to providing proactive personalized services to mobile users. The experimental results show that the method captures users' preferences in the mobile Internet environment well and helps improve the prediction performance of personalized recommender systems.

  14. CMIN: a CRISP-DM-based CASE tool for supporting data mining projects

    Directory of Open Access Journals (Sweden)

    Mendoza Martha

    2010-12-01

    Full Text Available This paper introduces CMIN, an integrated computer aided software engineering (CASE) tool based on the cross-industry standard process for data mining (CRISP-DM 1.0), designed to support carrying out data mining projects. It is “integrated” in the sense that it supports all phases of a process. A general overview of how CMIN works is presented first, including the management of processes, templates and projects. CMIN's capacity for easily and intuitively monitoring projects is highlighted, as is the manner in which CMIN allows a user to increase knowledge of CRISP-DM, or of any other process defined in the CASE tool, through the help and information presented in each step. Next, it is shown how CMIN can bind new data mining algorithms at runtime (without the need to recompile the tool) to support modelling tasks (based on a workflow) in a data mining project. Finally, the results of two evaluations of the tool, the conclusions and future work are presented.

  15. DATA, TEXT, AND WEB MINING FOR BUSINESS INTELLIGENCE: A SURVEY

    Directory of Open Access Journals (Sweden)

    Abdul-Aziz Rashid

    2013-03-01

    Full Text Available The Information and Communication Technologies revolution brought a digital world with huge amounts of data available. Enterprises use mining technologies to search vast amounts of data for vital insight and knowledge. Mining tools such as data mining, text mining, and web mining are used to find hidden knowledge in large databases or the Internet. Mining tools are automated software tools used to achieve business intelligence by finding hidden relations and predicting future events from vast amounts of data. This uncovered knowledge helps in gaining competitive advantages, better customer relationships, and even fraud detection. In this survey, we describe how these techniques work and how they are implemented. Furthermore, we discuss how business intelligence is achieved using these mining tools. We then look into some case studies of success stories using mining tools. Finally, we present some of the main challenges to the mining technologies that limit their potential.

  16. Data, Text and Web Mining for Business Intelligence: A Survey

    Directory of Open Access Journals (Sweden)

    Abdul-Aziz Rashid Al-Azmi

    2013-04-01

    Full Text Available The Information and Communication Technologies revolution brought a digital world with huge amounts of data available. Enterprises use mining technologies to search vast amounts of data for vital insight and knowledge. Mining tools such as data mining, text mining, and web mining are used to find hidden knowledge in large databases or the Internet. Mining tools are automated software tools used to achieve business intelligence by finding hidden relations and predicting future events from vast amounts of data. This uncovered knowledge helps in gaining competitive advantages, better customer relationships, and even fraud detection. In this survey, we describe how these techniques work and how they are implemented. Furthermore, we discuss how business intelligence is achieved using these mining tools. We then look into some case studies of success stories using mining tools. Finally, we present some of the main challenges to the mining technologies that limit their potential.

  17. Text Mining.

    Science.gov (United States)

    Trybula, Walter J.

    1999-01-01

    Reviews the state of research in text mining, focusing on newer developments. The intent is to describe the disparate investigations currently included under the term text mining and provide a cohesive structure for these efforts. A summary of research identifies key organizations responsible for pushing the development of text mining. A section…

  19. Web Mining Technology and Designing of the Tools

    Institute of Scientific and Technical Information of China (English)

    谢丹夏

    2001-01-01

    More and more commerce-related transactions are becoming digital. The more you know about your customers, the better you can serve them. Every customer action on a Web site generates data, not just high-level interactions such as buying something, but also something as simple as using a search engine or navigating through a site. All these interactions between digital service providers and the consumer can be recorded and stored in digital databases. These large data sets contain information helpful to business marketing strategies, both for retrospective analysis and for data-driven forecasting. Web mining tools will provide companies with previously unknown statistics and useful insights into the behavior of their online customers by analyzing the data on the Web. With mature database technology, companies can now collect large amounts of customer information fairly easily; by applying data mining to the Web, that is, Web mining technology, they can use this customer information to uncover potential markets and improve their competitiveness.

  20. Data mining mobile devices

    CERN Document Server

    Mena, Jesus

    2013-01-01

    With today's consumers spending more time on their mobiles than on their PCs, new methods of empirical stochastic modeling have emerged that can provide marketers with detailed information about the products, content, and services their customers desire. Data Mining Mobile Devices defines the collection of machine-sensed environmental data pertaining to human social behavior. It explains how the integration of data mining and machine learning can enable the modeling of conversation context, proximity sensing, and geospatial location throughout large communities of mobile users.

  1. Quantification of Operational Risk Using A Data Mining

    Science.gov (United States)

    Perera, J. Sebastian

    1999-01-01

    What is Data Mining? - Data Mining is the process of finding actionable information hidden in raw data. - Data Mining helps find hidden patterns, trends, and important relationships often buried in a sea of data - Typically, automated software tools based on advanced statistical analysis and data modeling technology can be utilized to automate the data mining process

  2. Factor Structure, Reliability and Measurement Invariance of the Alberta Context Tool and the Conceptual Research Utilization Scale, for German Residential Long Term Care

    Science.gov (United States)

    Hoben, Matthias; Estabrooks, Carole A.; Squires, Janet E.; Behrens, Johann

    2016-01-01

    We translated the Canadian residential long term care versions of the Alberta Context Tool (ACT) and the Conceptual Research Utilization (CRU) Scale into German, to study the association between organizational context factors and research utilization in German nursing homes. The rigorous translation process was based on best practice guidelines for tool translation, and we previously published methods and results of this process in two papers. Both instruments are self-report questionnaires used with care providers working in nursing homes. The aim of this study was to assess the factor structure, reliability, and measurement invariance (MI) between care provider groups responding to these instruments. In a stratified random sample of 38 nursing homes in one German region (Metropolregion Rhein-Neckar), we collected questionnaires from 273 care aides, 196 regulated nurses, 152 allied health providers, 6 quality improvement specialists, 129 clinical leaders, and 65 nursing students. The factor structure was assessed using confirmatory factor models. The first model included all 10 ACT concepts. We also decided a priori to run two separate models for the scale-based and the count-based ACT concepts as suggested by the instrument developers. The fourth model included the five CRU Scale items. Reliability scores were calculated based on the parameters of the best-fitting factor models. Multiple-group confirmatory factor models were used to assess MI between provider groups. Rather than the hypothesized ten-factor structure of the ACT, confirmatory factor models suggested 13 factors. The one-factor solution of the CRU Scale was confirmed. The reliability was acceptable (>0.7 in the entire sample and in all provider groups) for 10 of 13 ACT concepts, and high (0.90–0.96) for the CRU Scale. We could demonstrate partial strong MI for both ACT models and partial strict MI for the CRU Scale. 
Our results suggest that the scores of the German ACT and the CRU Scale for nursing

  3. Development of tools for managing the impacts on surface due to changing hydrological regimes surrounding closed underground coal mines (ECSC Coal RTD programme, contract 7220-PR-136)

    Energy Technology Data Exchange (ETDEWEB)

    Veschkens, M. [ISSeP, Liege (Belgium)]; Unland, W.; Kories, H. [DMT, Am Technologiepark, Essen (Germany)]

    2005-07-01

    This paper demonstrates how the box model approach and the mixed FE and box model approach help to better understand and model water flows in complex mined coal measures, as well as the interactions between shallow aquifers and flooded coal measures. The benefits of these approaches are illustrated by case studies in the Liege and Ruhr coal basins. (authors)

  4. Assessment of variation in the Alberta Context Tool: the contribution of unit level contextual factors and specialty in Canadian pediatric acute care settings

    Directory of Open Access Journals (Sweden)

    Cummings Greta G

    2011-10-01

    Full Text Available Abstract Background There are few validated measures of organizational context and none that we located are parsimonious and address modifiable characteristics of context. The Alberta Context Tool (ACT) was developed to meet this need. The instrument assesses 8 dimensions of context, which comprise 10 concepts. The purpose of this paper is to report evidence to further the validity argument for ACT. The specific objectives of this paper are to: (1) examine the extent to which the 10 ACT concepts discriminate between patient care units and (2) identify variables that significantly contribute to between-unit variation for each of the 10 concepts. Methods 859 professional nurses (844 valid responses) working in medical, surgical and critical care units of 8 Canadian pediatric hospitals completed the ACT. A random intercept, fixed effects hierarchical linear modeling (HLM) strategy was used to quantify and explain variance in the 10 ACT concepts to establish the ACT's ability to discriminate between units. We ran 40 models (a series of 4 models for each of the 10 concepts) in which we systematically assessed the unique contribution (i.e., error variance reduction) of different variables to between-unit variation. First, we constructed a null model in which we quantified the variance overall, in each of the concepts. Then we controlled for the contribution of individual level variables (Model 1). In Model 2, we assessed the contribution of practice specialty (medical, surgical, critical care) to variation since it was central to construction of the sampling frame for the study. Finally, we assessed the contribution of additional unit level variables (Model 3). Results The null model (unadjusted baseline HLM model) established that there was significant variation between units in each of the 10 ACT concepts (i.e., discrimination between units). When we controlled for individual characteristics, significant variation in the 10 concepts remained.
Assessment of the

  5. System Runs Analysis with Process Mining

    Directory of Open Access Journals (Sweden)

    S. A. Shershakov

    2015-01-01

    Full Text Available Information systems (IS) produce numerous traces and logs at runtime. In the context of SOA-based (service-oriented architecture) IS, these logs contain details about sequences of process and service calls. Modern application monitoring and error tracking tools provide only rather straightforward log search and filtering functionality. However, “clever” analysis of the logs is highly useful, since it can provide valuable insights into the system architecture and the interaction of business domains and services. Here we took run event logs (trace data) of a big booking system and discovered violations of architectural guidelines and common anti-patterns. We applied mature process mining techniques for discovery and analysis of these logs. The aims of process mining are to discover, analyze, and improve processes on the basis of IS behavior recorded as event logs. In several specific examples, we show successful applications of process mining to system runtime analysis and motivate further research in this area. The article is published in the authors’ wording.

  6. Data mining for dummies

    CERN Document Server

    Brown, Meta S

    2014-01-01

    Delve into your data for the key to success. Data mining is quickly becoming integral to creating value and business momentum. The ability to detect unseen patterns hidden in the numbers exhaustively generated by day-to-day operations allows savvy decision-makers to exploit every tool at their disposal in the pursuit of better business. By creating models and testing whether patterns hold up, it is possible to discover new intelligence that could change your business's entire paradigm for a more successful outcome. Data Mining for Dummies shows you why it doesn't take a data scientist to gain

  7. Mining Recurrent Pattern Identification on Large Database

    Directory of Open Access Journals (Sweden)

    Shivangi Srivastava

    2014-04-01

    Full Text Available Recurrent pattern mining is an important problem in the context of data mining. In this paper, data mining algorithms are discussed and compared. Recurrent pattern mining has been an important area of data mining research, and it is the first step in the analysis of data arising in a broad range of applications. The algorithms are compared with respect to items such as methodology and basic principles, in terms of user-specified elements such as support, and in terms of how the database is scanned (fully or partially).
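
The role of a support threshold can be illustrated with a minimal Python sketch. It is not taken from the paper: the function name `frequent_itemsets`, the `max_size` parameter and the toy transactions are hypothetical, and the brute-force enumeration over one full database scan stands in for whichever algorithms the paper actually compares.

```python
from collections import Counter
from itertools import combinations


def frequent_itemsets(transactions, min_support, max_size=2):
    """Return itemsets whose support count is at least min_support.

    Brute-force illustration: one full scan of the database, counting
    every itemset of up to max_size items in each transaction.
    """
    counts = Counter()
    for transaction in transactions:
        items = sorted(set(transaction))  # ignore duplicates, fix item order
        for size in range(1, max_size + 1):
            for itemset in combinations(items, size):
                counts[itemset] += 1
    return {itemset: n for itemset, n in counts.items() if n >= min_support}


# Hypothetical toy database of four transactions.
transactions = [
    ["bread", "milk"],
    ["bread", "butter", "milk"],
    ["butter", "milk"],
    ["bread", "butter"],
]
frequent = frequent_itemsets(transactions, min_support=3)
```

With a threshold of 3, only the three single items survive; lowering `min_support` to 2 also admits every pair. Practical algorithms avoid enumerating all candidates, which is where the full versus partial scan comparison becomes relevant.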

  8. Using airborne LiDAR in geoarchaeological contexts: Assessment of an automatic tool for the detection and the morphometric analysis of grazing archaeological structures (French Massif Central).

    Science.gov (United States)

    Roussel, Erwan; Toumazet, Jean-Pierre; Florez, Marta; Vautier, Franck; Dousteyssier, Bertrand

    2014-05-01

    Airborne laser scanning (ALS) of archaeological regions of interest is nowadays a widely used and established method for accurate topographic and microtopographic survey. The penetration of the vegetation cover by the laser beam allows the reconstruction of reliable digital terrain models (DTM) of forested areas where traditional prospection methods are inefficient, time-consuming and non-exhaustive. The ALS technology provides the opportunity to discover new archaeological features hidden by vegetation and provides a comprehensive survey of cultural heritage sites within their environmental context. However, the post-processing of LiDAR point clouds produces a huge quantity of data in which relevant archaeological features are not easily detectable with common visualizing and analysing tools. Undoubtedly, there is an urgent need to automate structure detection and morphometric extraction techniques, especially for the "archaeological desert" in densely forested areas. This presentation deals with the development of automatic detection procedures applied to archaeological structures located in the French Massif Central, in the western forested part of the Puy-de-Dôme volcano between 950 and 1100 m a.s.l. These unknown archaeological sites were discovered by the March 2011 ALS mission and display a high density of subcircular depressions with a corridor access. The spatial organization of these depressions varies from isolated to aggregated or aligned features. Functionally, they appear to be former grazing constructions built from the medieval to the modern period. Similar grazing structures are known in other locations of the French Massif Central (Sancy, Artense, Cézallier) where the ground is vegetation-free. In order to develop a reliable process of automatic detection and mapping of these archaeological structures, a learning zone has been delineated within the ALS surveyed area. The grazing features were mapped and typical morphometric attributes

  9. Web Crime Mining by Means of Data Mining Techniques

    Directory of Open Access Journals (Sweden)

    Javad Hosseinkhani

    2014-03-01

    Full Text Available The purpose of this study is to review how useful information can be mined by means of data mining. Data mining is the procedure of extracting knowledge and information from large sets of data, applying artificial intelligence methods to find unseen relationships in the data. Data mining applications have attracted growing research attention, and one crucial field of application is criminology, where data mining is used to identify crime characteristics. The crime analysis process involves detecting and exploring crimes and investigating their relationships with criminals. Criminology is a suitable field for data mining techniques because of the high volume of crime datasets and the complexity of the relationships between them. Identifying crime characteristics is therefore the first step for further analysis, and the knowledge obtained from data mining approaches is a very useful tool to help and support police forces. This research aims to review the extraction of useful information by means of data mining, in order to find crime hot spots and predict crime trends for them using crime data mining techniques.

  10. Real world data mining applications

    CERN Document Server

    Abou-Nasr, Mahmoud; Stahlbock, Robert; Weiss, Gary M

    2014-01-01

    Data mining applications range from commercial to social domains, with novel applications appearing swiftly; for example, within the context of social networks. The expanding application sphere and social reach of advanced data mining raise pertinent issues of privacy and security. Present-day data mining is a progressive multidisciplinary endeavor. This inter- and multidisciplinary approach is well reflected within the field of information systems. The information systems research addresses software and hardware requirements for supporting computationally and data-intensive applications. Furthermore, it encompasses analyzing system and data aspects, and all manual or automated activities. In that respect, research at the interface of information systems and data mining has significant potential to produce actionable knowledge vital for corporate decision-making. The aim of the proposed volume is to provide a balanced treatment of the latest advances and developments in data mining; in particular, exploring s...

  11. Mining explosion identification as an application to treaty verification

    Science.gov (United States)

    Arrowsmith, Marie

    from earthquakes that are <250 km from the mine; as the earthquake dataset expands spatially, discrimination performance degrades. One-dimensional path corrections provide improvement, but additional calibrations are necessary to optimize this discriminant. The second discriminant, time-frequency, capitalizes on the unique spectral signature of delay-fired mining events as a function of time. This discriminant separates the larger types of blasts with the longest source duration at all stations. Smaller blasts do not discriminate because of the shorter shot durations. The third discriminant, time-of-day, assesses the event origin time. Strictly speaking, this type of analysis may have a secondary role in the discrimination of an individual event but may be quite useful in assessing man-made seismic activity in a regional context. Mining events occur between 9 am and 6 pm, while earthquakes are randomly distributed in time. We have utilized waveform correlation techniques to better understand how factors such as mining blast type and location within the mine are manifested in the waveforms. Initial results show good correlation between blast types within two main pits; as the correlation threshold is increased, we are able to resolve spatial location within individual pits for the simplest types of mining blasts. In the Altai-Sayan region, we calculated these same discriminants for ~260 earthquakes and ~850 mining events. The amplitude ratio discriminant shows significant overlap of the earthquake and mining populations. Certain events do separate, but the lack of ground truth is what makes these events difficult to identify. Similar results are seen for the time-frequency discriminant. We do not know if the discriminant itself fails, or if the majority of our data points are from smaller shots that have shorter time durations.
Time-of-day results are similar to the WUS in that presumed mining events fall within working hours and indicate the assessment utility
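
As an illustration only, the time-of-day discriminant can be sketched in a few lines of Python. The 9 am to 6 pm window comes from the abstract; the function names, the half-open treatment of the window and the assumption that origin times are already expressed in local time are hypothetical choices, not the author's procedure.

```python
from datetime import datetime


def in_working_hours(origin_time, start_hour=9, end_hour=18):
    """True if the event's local origin time is in [start_hour, end_hour)."""
    return start_hour <= origin_time.hour < end_hour


def working_hours_fraction(origin_times):
    """Fraction of events in a catalogue that fall within working hours."""
    if not origin_times:
        return 0.0
    hits = sum(in_working_hours(t) for t in origin_times)
    return hits / len(origin_times)


# Hypothetical local origin times for a small cluster of events.
cluster = [
    datetime(2020, 1, 1, 10, 30),
    datetime(2020, 1, 2, 14, 0),
    datetime(2020, 1, 3, 2, 0),
    datetime(2020, 1, 4, 17, 59),
]
fraction = working_hours_fraction(cluster)
```

A high fraction for a cluster hints at man-made activity in a region, while, as the abstract notes, a single event's origin time is weak evidence on its own.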

  12. Applied data mining for business and industry

    CERN Document Server

    Giudici, Paolo

    2009-01-01

    The increasing availability of data in our current, information overloaded society has led to the need for valid tools for its modelling and analysis. Data mining and applied statistical methods are the appropriate tools to extract knowledge from such data. This book provides an accessible introduction to data mining methods in a consistent and application oriented statistical framework, using case studies drawn from real industry projects and highlighting the use of data mining methods in a variety of business applications. Introduces data mining methods and applications. Covers classical and Bayesian multivariate statistical methodology as well as machine learning and computational data mining methods. Includes many recent developments such as association and sequence rules, graphical Markov models, lifetime value modelling, credit risk, operational risk and web mining. Features detailed case studies based on applied projects within industry. Incorporates discussion of data mining software, with case studies a...

  13. Translating global recommendations on HIV and infant feeding to the local context: the development of culturally sensitive counselling tools in the Kilimanjaro Region, Tanzania

    Directory of Open Access Journals (Sweden)

    Åstrøm Anne N

    2006-10-01

    Full Text Available Abstract Background This paper describes the process used to develop an integrated set of culturally sensitive, evidence-based counselling tools (job aids) by using qualitative participatory research. The aim of the intervention was to contribute to improving infant feeding counselling services for HIV positive women in the Kilimanjaro Region of Tanzania. Methods Formative research using a combination of qualitative methods preceded the development of the intervention and mapped existing practices, perceptions and attitudes towards HIV and infant feeding (HIV/IF) among mothers, counsellors and community members. The Intervention Mapping (IM) protocol guided the development of the overall intervention strategy. Theories of behaviour change, a review of the international HIV/IF guidelines and formative research findings contributed to the definition of performance and learning objectives. Key communication messages and colourful graphic illustrations related to infant feeding in the context of HIV were then developed and/or adapted from existing generic materials. Draft materials were field tested with intended audiences and subjected to stakeholder technical review. Results An integrated set of infant feeding counselling tools, referred to as 'job aids', was developed and included brochures on feeding methods that were found to be socially and culturally acceptable, a Question and Answer Guide for counsellors, a counselling card on the risk of transmission of HIV, and an infant feeding toolbox for demonstration. Each brochure describes the steps to ensure safer infant feeding using simple language and images based on local ideas and resources. The brochures are meant to serve both as reference material during infant feeding counselling in the ongoing prevention of mother to child transmission (pMTCT) of HIV programme and as take home material for the mother.
Conclusion The study underscores the importance of formative research and a systematic theory

  14. Typesafe Modeling in Text Mining

    CERN Document Server

    Steeg, Fabian

    2011-01-01

    Based on the concept of annotation-based agents, this report introduces tools and a formal notation for defining and running text mining experiments using a statically typed domain-specific language embedded in Scala. Using machine learning for classification as an example, the framework is used to develop and document text mining experiments, and to show how the concept of generic, typesafe annotation corresponds to a general information model that goes beyond text processing.

  15. Process mining

    DEFF Research Database (Denmark)

    van der Aalst, W.M.P.; Rubin, V.; Verbeek, H.M.W.

    2010-01-01

    Process mining includes the automated discovery of processes from event logs. Based on observed events (e.g., activities being executed or messages being exchanged) a process model is constructed. One of the essential problems in process mining is that one cannot assume to have seen all possible...... behavior. At best, one has seen a representative subset. Therefore, classical synthesis techniques are not suitable as they aim at finding a model that is able to exactly reproduce the log. Existing process mining techniques try to avoid such “overfitting” by generalizing the model to allow for more...
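
A common first step when discovering a process model from an event log is to count which activity directly follows which. The Python sketch below is an illustration under stated assumptions, not the authors' technique: the function name `directly_follows` and the toy log are hypothetical, and the log is assumed to be already grouped into one activity sequence per case.

```python
from collections import Counter


def directly_follows(event_log):
    """Count how often activity a is immediately followed by activity b.

    event_log is a list of traces; each trace is the ordered list of
    activity names observed for one case.
    """
    counts = Counter()
    for trace in event_log:
        for a, b in zip(trace, trace[1:]):
            counts[(a, b)] += 1
    return counts


# Hypothetical event log with three cases.
event_log = [
    ["register", "check", "approve"],
    ["register", "check", "reject"],
    ["register", "check", "check", "approve"],
]
dfg = directly_follows(event_log)
```

The counts describe only behavior that was observed. A model that allows exactly these transitions and nothing else would reproduce the log but might overfit it, which is the generalization problem the abstract describes.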

  16. DATA MINING POTENCY ESTIMATORS FROM TOXICOLOGICAL DATABASES

    OpenAIRE

    Piegorsch, Walter W.; Simmons, Susan J.; Zeiger, Errol

    2004-01-01

    We discuss use of data mining techniques to study estimators of toxic potency (activity) in toxicological databases. The methods are slight variations on the standard data mining motif, but fit fully within the larger context of knowledge discovery in databases. An example illustrates the general theme of the approach, using results from a U.S. National Toxicology Program Salmonella mutagenicity database.

  17. A New Generation Mining Head with Disc Tool of Complex Trajectory

    Science.gov (United States)

    Gospodarczyk, Piotr; Kotwica, Krzysztof; Stopka, Grzegorz

    2013-12-01

    In Polish underground mining, a great many dog headings are drilled by mechanical methods using arm roadheaders equipped with milling units. The cutting tools applied on these units, rotary tangent bits, are subject to accelerated wear in unfavourable mining and geological conditions or under improper working conditions. This affects the speed and cost of drilling such excavations. The article presents a new and innovative solution: a mining head with asymmetric disc tools of complex trajectory, developed at the Department of Mining, Dressing and Transport Machines, AGH University of Science and Technology, Krakow, as an alternative to standard milling units. The advantages of the applied mining method, which uses so-called back incision, are described, as are the principles of construction and operation of the proposed head. To design the head prototype, it was necessary to determine principles and guidelines based on laboratory tests. The construction of a specially prepared laboratory stand for examining disc tools of complex trajectory, the planned research methodology, the course of the tests and the results obtained are presented. Analysis of the results allowed these principles and guidelines for the construction of a prototype head to be determined. They formed the basis for a technical design of the new head, worked out in cooperation with the REMAG Ltd Company in Katowice and adapted for mounting on the arm of a medium arm roadheader, the KR 150. The constructed head was tested on a research stand in the REMAG testing area and received positive preliminary test results.

  18. Application of 57Fe Mössbauer spectroscopy as a tool for mining exploration of bornite (Cu5FeS4) copper ore

    Science.gov (United States)

    Gainov, R. R.; Vagizov, F. G.; Golovanevskiy, V. A.; Ksenofontov, V. A.; Klingelhöfer, G.; Klekovkina, V. V.; Shumilova, T. G.; Pen'kov, I. N.

    2014-04-01

    Nuclear resonance methods, including Mössbauer spectroscopy, are considered unique techniques suitable for remote on-line mineralogical analysis. Their use offers potentially significant commercial benefits for the mining industry. Mössbauer spectroscopy is suitable for such analysis of copper sulfide ores. Bornite (formally Cu5FeS4) is a significant component of copper ore, and identifying its properties is important for the economic exploitation of commercial copper ore deposits. A series of natural bornite samples was studied by 57Fe Mössbauer spectroscopy. Two aspects were considered: a reexamination of the 57Fe Mössbauer properties of natural bornite samples and their stability irrespective of origin, and the potential use of the miniaturized Mössbauer spectrometer MIMOS II for in-situ bornite identification. The results show a number of potential benefits of introducing the available portable Mössbauer equipment into the mining industry for express mineralogical analysis. In addition, results of some preliminary 63,65Cu nuclear quadrupole resonance (NQR) studies of bornite are reported, and their merits alongside Mössbauer techniques for bornite detection are discussed.

  19. Social big data mining

    CERN Document Server

    Ishikawa, Hiroshi

    2015-01-01

    Social Media. Big Data and Social Data. Hypotheses in the Era of Big Data. Social Big Data Applications. Basic Concepts in Data Mining. Association Rule Mining. Clustering. Classification. Prediction. Web Structure Mining. Web Content Mining. Web Access Log Mining, Information Extraction and Deep Web Mining. Media Mining. Scalability and Outlier Detection.

  20. Generative Contexts

    Science.gov (United States)

    Lyles, Dan Allen

    Educational research has identified how science, technology, engineering, and mathematics (STEM) practice and education have underperforming metrics in racial and gender diversity, despite decades of intervention. These disparities are part of the construction of a culture of science that is alienating to these populations. Recent studies in a social science framework described as "Generative Justice" have suggested that the context of social and scientific practice might be modified to bring about more just and equitable relations among the disenfranchised by circulating the value they and their non-human allies create back to them in unalienated forms. What is not known are the underlying principles of social and material space that make a system more or less generative. I employ an autoethnographic method at four sites: a high school science class; a farm committed to "Black and Brown liberation"; a summer program geared towards youth environmental mapping; and a summer workshop for Harlem middle school students. My findings suggest that by identifying instances where material affinity, participatory voice, and creative solidarity are mutually reinforcing, it is possible to create educational contexts that generate unalienated value, and circulate it back to the producers themselves. This cycle of generation may help explain how to create systems of justice that strengthen and grow themselves through successive iterations. The problem of lack of diversity in STEM may be addressed not merely by recruiting the best and the brightest from underrepresented populations, but by changing the context of STEM education to provide tools for its own systematic restructuring.

  1. Mining Web-based Educational Systems to Predict Student Learning Achievements

    Directory of Open Access Journals (Sweden)

    José del Campo-Ávila

    2015-03-01

    Educational Data Mining (EDM) is gaining importance as a new interdisciplinary research field related to several other areas. It is directly connected with Web-based Educational Systems (WBES) and Data Mining (DM), a fundamental part of Knowledge Discovery in Databases. The former defines the context: WBES store and manage huge amounts of data. Such data keep growing, and they contain hidden knowledge that could be very useful to the users (both teachers and students). It is desirable to identify such knowledge in the form of models, patterns or any other representation schema that allows better exploitation of the system. The latter is the tool for achieving such discovery. Data mining must cope with very complex and varied situations to reach quality solutions. It is therefore a research field in which many advances are being made to accommodate and solve emerging problems, and many techniques are usually considered for this purpose. In this paper we study how data mining can be used to induce student models from the data acquired by a specific Web-based tool for adaptive testing, called SIETTE. Specifically, we have used top-down induction of decision trees algorithms to extract the patterns, because these models, decision trees, are easily understandable. In addition, the validation processes conducted have assured high-quality models.

  2. Perspectives on the use of data mining in pharmaco-vigilance.

    Science.gov (United States)

    Almenoff, June; Tonning, Joseph M; Gould, A Lawrence; Szarfman, Ana; Hauben, Manfred; Ouellet-Hellstrom, Rita; Ball, Robert; Hornbuckle, Ken; Walsh, Louisa; Yee, Chuen; Sacks, Susan T; Yuen, Nancy; Patadia, Vaishali; Blum, Michael; Johnston, Mike; Gerrits, Charles; Seifert, Harry; Lacroix, Karol

    2005-01-01

    In the last 5 years, regulatory agencies and drug monitoring centres have been developing computerised data-mining methods to better identify reporting relationships in spontaneous reporting databases that could signal possible adverse drug reactions. At present, there are no guidelines or standards for the use of these methods in routine pharmaco-vigilance. In 2003, a group of statisticians, pharmaco-epidemiologists and pharmaco-vigilance professionals from the pharmaceutical industry and the US FDA formed the Pharmaceutical Research and Manufacturers of America-FDA Collaborative Working Group on Safety Evaluation Tools to review best practices for the use of these methods. In this paper, we provide an overview of: (i) the statistical and operational attributes of several currently used methods and their strengths and limitations; (ii) information about the characteristics of various postmarketing safety databases with which these tools can be deployed; (iii) analytical considerations for using safety data-mining methods and interpreting the results; and (iv) points to consider in integration of safety data mining with traditional pharmaco-vigilance methods. Perspectives from both the FDA and the industry are provided. Data mining is a potentially useful adjunct to traditional pharmaco-vigilance methods. The results of data mining should be viewed as hypothesis generating and should be evaluated in the context of other relevant data. The availability of a publicly accessible global safety database, which is updated on a frequent basis, would further enhance detection and communication about safety issues.

  3. Domain-oriented data-driven data mining: a new understanding for data mining

    Institute of Scientific and Technical Information of China (English)

    WANG Guo-yin; WANG Yan

    2008-01-01

    Recent advances in computing, communications, digital storage technologies, and high-throughput data-acquisition technologies make it possible to gather and store incredible volumes of data. This creates unprecedented opportunities for large-scale knowledge discovery from databases. Data mining is an emerging area of computational intelligence that offers new theories, techniques, and tools for processing large volumes of data, such as data analysis, decision making, etc. Many researchers are working on designing efficient data mining techniques, methods, and algorithms. Unfortunately, most data mining researchers pay much attention to technical problems in developing data mining models and methods, while paying little attention to the basic issues of data mining. In this paper, we propose a new understanding of data mining, the domain-oriented data-driven data mining (3DM) model. Some data-driven data mining algorithms developed in our Lab are also presented to show its validity.

  4. [WEB-based medical data mining integration].

    Science.gov (United States)

    Yao, Gang; Zhang, Xiaoxiang; Wang, Huoming

    2014-06-01

    This paper reports the integration of a web-based medical data management system with data mining tools. In this system, the user sends requests to the server from a client browser over HTTP; the server receives the commands and calls the data mining tool's remote object to process the data, and the results are sent back to the client browser over HTTP and presented to the user. To demonstrate the feasibility of the proposed solution, a test was carried out on the .NET platform using SAS and SPSS, and the detailed steps are given. The practical test showed that the proposed web-based data mining tool integration has broad prospects for development and could open up a new route for the development of medical data mining.

  5. Environmental Decision Making on Acid Mine Drainage Issues in South Africa: An Argument for the Precautionary Principle.

    Science.gov (United States)

    Morodi, T J; Mpofu, Charles

    2017-06-28

    This paper examines the issue of acid mine drainage in South Africa and environmental decision making processes that could be taken to mitigate the problem in the context of both conventional risk assessment and the precautionary principle. It is argued that conventional risk assessment protects the status quo and hence cannot be entirely relied upon as an effective tool to resolve environmental problems in the context of South Africa, a developing country with complex environmental health concerns. The complexity of the environmental issues is discussed from historical and political perspectives. An argument is subsequently made that the precautionary principle is an alternative tool, and its adoption can be used to empower local communities. This work, therefore, adds new knowledge by problematising conventional risk assessment and proposing the framing of the acid mine drainage issues in the complex and contextual scenario of a developing country, South Africa.

  6. Data Mining Solutions for the Business Environment

    Directory of Open Access Journals (Sweden)

    Ruxandra-Stefania PETRE

    2014-02-01

    Over the past years, data mining has become a matter of considerable importance due to the large amounts of data available in applications belonging to various domains. Data mining is a dynamic and fast-expanding field that applies advanced data analysis techniques from statistics, machine learning, database systems and artificial intelligence in order to discover relevant patterns, trends and relations contained within the data, information that is impossible to observe using other techniques. The paper focuses on presenting the applications of data mining in the business environment. It contains a general overview of data mining, providing a definition of the concept, enumerating six primary data mining techniques and mentioning the main fields in which data mining can be applied. The paper also presents the main business areas that can benefit from the use of data mining tools, along with their use cases: retail, banking and insurance. The main commercially available data mining tools and their key features are also presented. Besides the analysis of data mining and the business areas that can successfully apply it, the paper presents the main features of a data mining solution that can be applied in the business environment, together with the architecture of the solution and its main components, which would help improve customer experiences and decision-making.

  7. Surface Mines, Other - Longwall Mining Panels

    Data.gov (United States)

    NSGIC Education | GIS Inventory — Coal mining has occurred in Pennsylvania for over a century. A method of coal mining known as Longwall Mining has become more prevalent in recent decades. Longwall...

  8. Coal Mines, Active - Longwall Mining Panels

    Data.gov (United States)

    NSGIC GIS Inventory (aka Ramona) — Coal mining has occurred in Pennsylvania for over a century. A method of coal mining known as Longwall Mining has become more prevalent in recent decades. Longwall...

  9. Bring Your Own Toy: Socialisation of Two-Year-Olds through Tool-Mediated Activities in an Australian Early Childhood Education Context

    Science.gov (United States)

    Kultti, Anne; Pramling, Niklas

    2015-01-01

    The study focuses on how young children are socialised in early childhood education practice in activities with and around toys. A premise of this study is the theoretical notion of sociocultural theory that people do things with artefacts and other cultural tools, and tools do things with people. This is captured in the unit of analysis,…

  10. Radioecological impacts of tin mining.

    Science.gov (United States)

    Aliyu, Abubakar Sadiq; Mousseau, Timothy Alexander; Ramli, Ahmad Termizi; Bununu, Yakubu Aliyu

    2015-12-01

    The tin mining activities in the suburbs of Jos, Plateau State, Nigeria, have resulted in technical enhancement of the natural background radiation as well as higher activity concentrations of primordial radionuclides in the topsoil of mining sites and their environs. Several studies have considered the radiological human health risks of the mining activity; however, to our knowledge no documented study has investigated the radiological impacts on biota. Hence, an attempt is made to assess potential hazards using published data from the literature and the ERICA Tool. This paper considers the effects of mining and milling on terrestrial organisms like shrubs, large mammals, small burrowing mammals, birds (duck), arthropods (earth worm), grasses, and herbs. The dose rates and risk quotients to these organisms are computed using conservative values for activity concentrations of natural radionuclides reported in Bitsichi and Bukuru mining areas. The results suggest that grasses, herbs, lichens, bryophytes and shrubs receive total dose rates that are of potential concern. The effects of dose rates to specific indicator species of interest are highlighted and discussed. We conclude that further investigation and proper regulations should be set in place in order to reduce the risk posed by the tin mining activity on biota. This paper also presents a brief overview of the impact of mineral mining on biota based on documented literature for other countries.
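    The screening arithmetic described in this abstract can be sketched as follows. This is a minimal illustration, assuming the usual convention that a risk quotient is the estimated dose rate to an organism divided by a screening dose rate; the 10 µGy/h default and the organism values in the usage note are illustrative assumptions, not figures from the paper, and the functions are hypothetical helpers rather than part of the ERICA Tool.

```python
def risk_quotient(dose_rate_uGy_h, screening_rate_uGy_h=10.0):
    """Return dose rate divided by the screening dose rate (both in uGy/h).

    The 10 uGy/h default is an assumed screening value for illustration.
    """
    if screening_rate_uGy_h <= 0:
        raise ValueError("screening dose rate must be positive")
    return dose_rate_uGy_h / screening_rate_uGy_h


def flag_organisms(dose_rates, screening_rate_uGy_h=10.0):
    """Return, sorted by name, the organisms whose risk quotient is >= 1."""
    return sorted(
        name
        for name, rate in dose_rates.items()
        if risk_quotient(rate, screening_rate_uGy_h) >= 1.0
    )
```

    For example, with hypothetical dose rates of 25 µGy/h for grasses and 2 µGy/h for birds, only grasses would be flagged as being of potential concern under this assumed threshold.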

  11. Data preprocessing in data mining

    CERN Document Server

    García, Salvador; Herrera, Francisco

    2015-01-01

    Data Preprocessing for Data Mining addresses one of the most important issues within the well-known Knowledge Discovery from Data process. Data taken directly from the source will likely contain inconsistencies and errors or, most importantly, will not be ready to be considered for a data mining process. Furthermore, the increasing amount of data in recent science, industry and business applications calls for more complex tools to analyze it. Thanks to data preprocessing, it is possible to convert the impossible into possible, adapting the data to fulfill the input demands of each data mining algorithm. Data preprocessing includes data reduction techniques, which aim at reducing the complexity of the data by detecting or removing irrelevant and noisy elements. This book is intended to review the tasks that fill the gap between the data acquisition from the source and the data mining process. A comprehensive look from a practical point of view, including basic concepts and surveying t...

  12. Asteroid mining

    Science.gov (United States)

    Gertsch, Richard E.

    1992-01-01

    The earliest studies of asteroid mining proposed retrieving a main belt asteroid. Because of the very long travel times to the main asteroid belt, attention has shifted to the asteroids whose orbits bring them fairly close to the Earth. In these schemes, the asteroids would be bagged and then processed during the return trip, with the asteroid itself providing the reaction mass to propel the mission homeward. A mission to one of these near-Earth asteroids would be shorter, involve less weight, and require a somewhat lower change in velocity. Since these asteroids apparently contain a wide range of potentially useful materials, our study group considered only them. The topics covered include asteroid materials and properties, asteroid mission selection, manned versus automated missions, mining in zero gravity, and a conceptual mining method.

  13. Process mining

    DEFF Research Database (Denmark)

    van der Aalst, W.M.P.; Rubin, V.; Verbeek, H.M.W.

    2010-01-01

    Process mining includes the automated discovery of processes from event logs. Based on observed events (e.g., activities being executed or messages being exchanged) a process model is constructed. One of the essential problems in process mining is that one cannot assume to have seen all possible behavior. At best, one has seen a representative subset. Therefore, classical synthesis techniques are not suitable as they aim at finding a model that is able to exactly reproduce the log. Existing process mining techniques try to avoid such “overfitting” by generalizing the model to allow for more behavior. This generalization is often driven by the representation language and very crude assumptions about completeness. As a result, parts of the model are “overfitting” (allow only for what has actually been observed) while other parts may be “underfitting” (allow for much more behavior without strong...

  14. Below the Surface: New Tools--And Savvy Librarians--Are Turning the ILS into a Gold Mine for Making More Informed Decisions

    Science.gov (United States)

    Cummins, Caroline

    2006-01-01

    In this article, the author discusses the benefits offered by integrated library systems (ILS) for making more informed decisions. Library software vendors, realizing ILS products can reveal business intelligence, have begun to offer tools like Director's Station to help library managers get more out of their data, and librarians are taking…

  15. Evaluation of a web based informatics system with data mining tools for predicting outcomes with quantitative imaging features in stroke rehabilitation clinical trials

    Science.gov (United States)

    Wang, Ximing; Kim, Bokkyu; Park, Ji Hoon; Wang, Erik; Forsyth, Sydney; Lim, Cody; Ravi, Ragini; Karibyan, Sarkis; Sanchez, Alexander; Liu, Brent

    2017-03-01

    Quantitative imaging biomarkers are used widely in clinical trials for tracking and evaluation of medical interventions. Previously, we have presented a web based informatics system utilizing quantitative imaging features for predicting outcomes in stroke rehabilitation clinical trials. The system integrates imaging features extraction tools and a web-based statistical analysis tool. The tools include a generalized linear mixed model (GLMM) that can investigate potential significance and correlation based on features extracted from clinical data and quantitative biomarkers. The imaging features extraction tools allow the user to collect imaging features and the GLMM module allows the user to select clinical data and imaging features such as stroke lesion characteristics from the database as regressors and regressands. This paper discusses the application scenario and evaluation results of the system in a stroke rehabilitation clinical trial. The system was utilized to manage clinical data and extract imaging biomarkers including stroke lesion volume, location and ventricle/brain ratio. The GLMM module was validated and the efficiency of data analysis was also evaluated.

  16. Focused Decision Support: a Data Mining Tool to Query the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial Dataset and Guide Screening Management for the Individual Patient.

    Science.gov (United States)

    Sharma, Arjun; Hostetter, Jason; Morrison, James; Wang, Kenneth; Siegel, Eliot

    2016-04-01

    The Prostate, Lung, Colorectal, and Ovarian Cancer (PLCO) Screening Trial enrolled ~155,000 participants to determine whether certain screening exams reduced mortality from prostate, lung, colorectal, and ovarian cancer. Repurposing the data provides an unparalleled resource for matching patients with the outcomes of demographically or diagnostically comparable patients. A web-based application was developed to query this subset of patient information against a given patient's demographics and risk factors. Analysis of the matched data yields outcome information which can then be used to guide management decisions and imaging software. Prognostic information is also estimated via the proportion of matched patients that progress to cancer. The US Preventive Services Task Force provides screening recommendations for cancers of the breast, colorectal tract, and lungs. There is wide variability in adherence of clinicians to these guidelines and others published by the Fleischner Society and various cancer organizations. Data mining the PLCO dataset for clinical decision support can optimize the use of limited healthcare resources, focusing screening on patients for whom the benefit to risk ratio is the greatest. A data driven, personalized approach to cancer screening maximizes the economic and clinical efficacy and enables early identification of patients in whom the course of disease can be improved. Our dynamic decision support system utilizes a subset of the PLCO dataset as a reference model to determine imaging and testing appropriateness while offering prognostic information for various cancers.
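    The matching step described in this abstract, where prognosis is estimated as the proportion of comparable patients who progress to cancer, can be sketched roughly as below. This is only an illustration under stated assumptions: the record fields, the five-year age window and the exact-match rule on sex and smoking status are hypothetical choices, not the criteria used by the authors or the structure of the PLCO dataset.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Record:
    # Hypothetical fields; the real dataset has many more variables.
    age: int
    sex: str
    smoker: bool
    developed_cancer: bool


def match(records, age, sex, smoker, age_window=5):
    """Return reference records comparable to the given patient.

    Assumed rule: same sex and smoking status, age within +/- age_window years.
    """
    return [
        r for r in records
        if abs(r.age - age) <= age_window and r.sex == sex and r.smoker == smoker
    ]


def cancer_proportion(matched):
    """Proportion of matched records that progressed to cancer, or None if empty."""
    if not matched:
        return None
    return sum(r.developed_cancer for r in matched) / len(matched)
```

    Returning None for an empty match set, rather than 0, keeps "no comparable patients" distinct from "no comparable patients developed cancer".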

  17. A Survey of Educational Data-Mining Research

    Science.gov (United States)

    Huebner, Richard A.

    2013-01-01

    Educational data mining (EDM) is an emerging discipline that focuses on applying data mining tools and techniques to educationally related data. The discipline focuses on analyzing educational data to develop models for improving learning experiences and improving institutional effectiveness. A literature review on educational data mining topics…

  18. Data mining

    CERN Document Server

    Gorunescu, Florin

    2011-01-01

    The knowledge discovery process is as old as Homo sapiens. Until some time ago, this process was solely based on the 'natural personal' computer provided by Mother Nature. Fortunately, in recent decades the problem has begun to be solved based on the development of the Data mining technology, aided by the huge computational power of the 'artificial' computers. Digging intelligently in different large databases, data mining aims to extract implicit, previously unknown and potentially useful information from data, since 'knowledge is power'. The goal of this book is to provide, in a friendly way

  19. Mining Review

    Science.gov (United States)

    2013-01-01

    In 2012, the estimated value of mineral production increased in the United States for the third consecutive year. Production and prices increased for most industrial mineral commodities mined in the United States. While production for most metals remained relatively unchanged, with the notable exception of gold, the prices for most metals declined. Minerals remained fundamental to the U.S. economy, contributing to the real gross domestic product (GDP) at several levels, including mining, processing and manufacturing finished products. Minerals’ contribution to the GDP increased for the second consecutive year.

  20. Use of a Pan–Genomic DNA Microarray in Determination of the Phylogenetic Relatedness among Cronobacter spp. and Its Use as a Data Mining Tool to Understand Cronobacter Biology

    Directory of Open Access Journals (Sweden)

    Ben D. Tall

    2017-03-01

    Cronobacter (previously known as Enterobacter sakazakii) is a genus of Gram-negative, facultatively anaerobic, oxidase-negative, catalase-positive, rod-shaped bacteria of the family Enterobacteriaceae. These organisms cause a variety of illnesses such as meningitis, necrotizing enterocolitis, and septicemia in neonates and infants, and urinary tract, wound, abscesses or surgical site infections, septicemia, and pneumonia in adults. The total gene content of 379 strains of Cronobacter spp. and taxonomically-related isolates was determined using a recently reported DNA microarray. The Cronobacter microarray as a genotyping tool gives the global food safety community a rapid method to identify and capture the total genomic content of outbreak isolates for food safety, environmental, and clinical surveillance purposes. It was able to differentiate the seven Cronobacter species from one another and from non-Cronobacter species. The microarray was also able to cluster strains within each species into well-defined subgroups. These results also support previous studies on the phylogenic separation of species members of the genus and clearly highlight the evolutionary sequence divergence among each species of the genus compared to phylogenetically-related species. This review extends these studies and illustrates how the microarray can also be used as an investigational tool to mine genomic data sets from strains. Three case studies describing the use of the microarray are shown and include: (1) the determination of allelic differences among Cronobacter sakazakii strains possessing the virulence plasmid pESA3; (2) mining of malonate and myo-inositol alleles among subspecies of Cronobacter dublinensis strains to determine subspecies identity; and (3) lastly using the microarray to demonstrate sequence divergence and phylogenetic relatedness trends for 13 outer-membrane protein alleles among 240 Cronobacter and phylogenetically-related strains. The goal of

  1. DAMEWARE - Data Mining & Exploration Web Application Resource

    CERN Document Server

    Brescia, Massimo; Esposito, Francesco; Fiore, Michelangelo; Garofalo, Mauro; Guglielmo, Magda; Longo, Giuseppe; Manna, Francesco; Nocella, Alfonso; Vellucci, Civita

    2016-01-01

    Astronomy is undergoing a methodological revolution triggered by an unprecedented wealth of complex and accurate data. DAMEWARE (DAta Mining & Exploration Web Application REsource) is a general purpose, Web-based, Virtual Observatory compliant, distributed data mining framework specialized in the exploration of massive data sets with machine learning methods. We present DAMEWARE, which allows the scientific community to perform data mining and exploratory experiments on massive data sets using a simple web browser. DAMEWARE offers several tools that can be seen as working environments in which to choose data analysis functionalities such as clustering, classification, regression, feature extraction etc., together with models and algorithms.

  2. Adaptive context exploitation

    Science.gov (United States)

    Steinberg, Alan N.; Bowman, Christopher L.

    2013-05-01

    This paper presents concepts and an implementation scheme to improve information exploitation processes and products by adaptive discovery and processing of contextual information. Context is used in data fusion - and in inferencing in general - to provide expectations and to constrain processing. It also is used to infer or refine desired information ("problem variables") on the basis of other available information ("context variables"). Contextual exploitation becomes critical in several classes of inferencing problems in which traditional information sources do not provide sufficient resolution between entity states or when such states are poorly or incompletely modeled. An adaptive evidence-accrual inference method - adapted from developments in target recognition and scene understanding - is presented; whereby context variables are selected on the basis of (a) their utility in refining explicit problem variables, (b) the probability of evaluating these variables to within a given accuracy, given candidate system actions (data collection, mining or processing), and (c) the cost of such actions. The Joint Directors of Laboratories (JDL) Data Fusion Model, with its extension to dual Resource Management functions, has been adapted to accommodate adaptive information exploitation, to include adaptive context exploitation. The interplay of Data Fusion and Resource Management (DF&RM) functionality in exploiting contextual information is illustrated in terms of the dual-node DF&RM architecture. An important advance is in the integration of data mining methods for data search/discovery and for abductive model refinement.

  3. Key Issues and Solution Strategy in R&D of Web Usage Mining Tools

    Institute of Scientific and Technical Information of China (English)

    张锋; 常会友

    2003-01-01

    With the rapid development of the WWW, Web Usage Mining, like Web Mining in general, has become a hot topic in academic and industrial circles. It is generally held that Web Usage Mining comprises three tasks: preprocessing, knowledge discovery and pattern analysis. Although Web Usage Mining still falls within the application of traditional data mining techniques, changes in the application environment and in the data being processed have given rise to some new difficulties. This paper addresses these challenges in each of the three phases and introduces some proposed solutions.

  4. DGT as a useful monitoring tool for radionuclides and trace metals in environments impacted by uranium mining: Case study of the Sagnes wetland in France.

    Science.gov (United States)

    Leermakers, Martine; Phrommavanh, Vannapha; Drozdzak, Jagoda; Gao, Yue; Nos, Jérémy; Descostes, Michael

    2016-07-01

    The Diffusive Gradients in Thin films (DGT) technique was used to analyse U, 226Ra and other trace metals in stream water and soil porewater in a wetland in France impacted by uranium mining. High resolution profiles of metals in soil porewater obtained by DGT could be measured for the first time up to a depth of 75 cm through the construction of a novel DGT holder. In stream water, the DGT technique was compared to speciation carried out by filtration (0.45 μm) and ultrafiltration (UF) (500 kDa/100 kDa/10 kDa), and DGT porewater profiles were compared with piezometer data obtained in a parallel study. An increase in the trace concentrations of dissolved (0.45 μm) and particulate U, 226Ra, and elements such as Al, Fe, Mn and Ba was observed in the stream water as it passes through the bog, as a result of mobilization from the wetland. The porewater results indicate that DGT labile metal species are present in porewater and that the mobilization of uranium and other elements is linked to the presence of enriched clays. In stream water, colloids and particles govern the behavior of U, Al and Fe, whereas Mn, Ba and Ra are essentially transported as truly dissolved metal species, with DGT labile concentrations accounting for 100% of the dissolved fraction. The combined approaches of DGT and UF allow us to obtain a better understanding of the biogeochemical processes involved in the retention and mobility of U and 226Ra in the wetland.

  5. Bridging data mining and semantic web

    OpenAIRE

    Aman, Edris

    2016-01-01

    Nowadays the Semantic Web is a widely adopted standard of knowledge representation. Hence, knowledge engineers are applying sophisticated methods to capture, discover and represent knowledge in Semantic Web form. Studies show that, to represent knowledge in the Semantic Web standard, data mining techniques such as Decision Trees, Association Rules, etc., play an important role. These techniques are implemented in publicly available Data Mining tools. These tools represent knowledge discovered in human ...

  6. Poetry Writing as Expressive Pedagogy in an EFL Context: Identifying Possible Assessment Tools for Haiku Poetry in EFL Freshman College Writing

    Science.gov (United States)

    Iida, Atsushi

    2008-01-01

    Japanese poetry, haiku, has been widely accepted in western countries. While previous studies have reported on the applicability of haiku poetry to teaching practices in a variety of contexts, few researchers have discussed assessment which is one of the most important factors in language teaching. The aim of this study is to produce assessment…

  7. Useful design tools?

    DEFF Research Database (Denmark)

    Jensen, Jesper Ole

    2005-01-01

    vague and contested concept of sustainability into concrete concepts and building projects. It describes a typology of tools: process tools, impact assessment tools, multi-criteria tools and tools for monitoring. It includes a Danish paradigmatic case study of stakeholder participation in the planning of a new sustainable settlement. The use of design tools is discussed in relation to innovation and stakeholder participation, and it is stressed that the usefulness of design tools is context dependent.

  8. Mining Method

    Energy Technology Data Exchange (ETDEWEB)

    Kim, Young Shik; Lee, Kyung Woon; Kim, Oak Hwan; Kim, Dae Kyung [Korea Institute of Geology Mining and Materials, Taejon (Korea, Republic of)

    1996-12-01

    Since the end of the eighties, the shrinking coal market has forced the coal industry to make exceptional rationalization and restructuring efforts. Competition from crude oil and natural gas has been compounded by growing pressure from rising wages and rising production costs as the workings get deeper. To improve the competitive position of coal mines against oil and gas through cost reduction, studies to improve the mining system have been carried out. To find the fields most in need of improvement, the technologies used in Tae Bak Colliery, selected as one of the long-running mines, were investigated and analyzed. The mining method emerged as the field most in need of improvement to reduce production cost. The present method, the so-called inseam roadway caving method, is used to extract the steep and thick seam. However, this method has several drawbacks. To solve the problems, two mining methods are suggested, one for the long term and one for the short term. The inseam roadway caving method with long-hole blasting is a variant of the present inseam roadway caving method, modified by replacing timber sets with steel arch sets and the shovel loaders with chain conveyors; long-hole blasting is introduced to promote caving. The pillar caving method with chock supports uses chock supports set in the cross-cut from the hanging wall to the footwall. Two single chain conveyors are needed: one is installed in front of the chock supports to clear coal from the cutting face, and the other is installed behind the supports to transport caved coal. This method is superior to the previous one in terms of safety from water inrushes, production rate and productivity. The only drawback is that it needs more investment. (author). 14 tabs., 34 figs.

  9. The Mining Minds digital health and wellness framework.

    Science.gov (United States)

    Banos, Oresti; Bilal Amin, Muhammad; Ali Khan, Wajahat; Afzal, Muhammad; Hussain, Maqbool; Kang, Byeong Ho; Lee, Sungyong

    2016-07-15

    The provision of health and wellness care is undergoing an enormous transformation. A key element of this revolution consists in prioritizing prevention and proactivity based on the analysis of people's conduct and the empowerment of individuals in their self-management. Digital technologies are unquestionably destined to be the main engine of this change, with an increasing number of domain-specific applications and devices commercialized every year; however, there is an apparent lack of frameworks capable of orchestrating and intelligently leveraging all the data, information and knowledge generated through these systems. This work presents Mining Minds, a novel framework that builds on the core ideas of the digital health and wellness paradigms to enable the provision of personalized support. Mining Minds embraces some of the most prominent digital technologies, ranging from Big Data and Cloud Computing to Wearables and Internet of Things, as well as modern concepts and methods, such as context-awareness, knowledge bases or analytics, to holistically and continuously investigate people's lifestyles and provide a variety of smart coaching and support services. This paper comprehensively describes the efficient and rational combination and interoperation of these technologies and methods through Mining Minds, while meeting the essential requirements posed by a framework for personalized health and wellness support. Moreover, this work presents a realization of the key architectural components of Mining Minds, as well as various exemplary user applications and expert tools to illustrate some of the potential services supported by the proposed framework. Mining Minds constitutes an innovative holistic means to inspect human behavior and provide personalized health and wellness support. The principles behind this framework uncover new research ideas and may serve as a reference for similar initiatives.

  10. Informatics & Data-Mining Group

    Science.gov (United States)

    Aims to create software tools to support the RAS Initiative; to mine existing large datasets, such as TCGA, for RAS-centric data; and to help integrate internal and external data in ways that elucidate the biology of RAS-driven cancers.

  11. Mining Texts in Reading to Write.

    Science.gov (United States)

    Greene, Stuart

    1992-01-01

    Proposes a set of strategies for connecting reading and writing, placing the discussion in the context of other pedagogical approaches designed to exploit the relationship between reading and writing. Explores ways in which students employ the strategies involved in "mining" a text--reconstructing context, inferring or imposing structure, and…

  12. Planning the Mine and Mining the Plan

    Science.gov (United States)

    Boucher, D. S.; Chen, N.

    2016-11-01

    Overview of best practices used in the terrestrial mining industry when developing a mine site towards production. The intent is to guide planners towards an effective and well-constructed roadmap for the development of ISRU mining activities. A strawman scenario is presented as an illustration for lunar mining of water ice.

  13. Development of mining technology and equipment for seafloor massive sulfide deposits

    Science.gov (United States)

    Liu, Shaojun; Hu, Jianhua; Zhang, Ruiqiang; Dai, Yu; Yang, Hengling

    2016-09-01

    Seafloor massive sulfide (SMS) deposits, which consist of Au, Ag, Cu, and other metal elements, have been a target of commercial mining in recent decades. The demand for an established and reliable commercial mining system for SMS deposits is increasing within the marine mining industry. The current status and progress of mining technology and equipment for SMS deposits are introduced. First, the mining technology and other recent developments of SMS deposits are comprehensively explained and analyzed. The seafloor production tools manufactured by Nautilus Minerals and similar mining tools from Japan for SMS deposits are compared and discussed in turn. Second, SMS deposit mining technology research being conducted in China is described, and a new SMS deposit mining tool is designed according to the environmental requirements. Finally, some new trends in mining technology for SMS deposits are summarized and analyzed. All of these conclusions and results have reference value and guiding significance for the research of SMS deposit mining in China.

  14. An Examination of the Changes in Science Teaching Orientations and Technology-Enhanced Tools for Student Learning in the Context of Professional Development

    Science.gov (United States)

    Campbell, Todd; Zuwallack, Rebecca; Longhurst, Max; Shelton, Brett E.; Wolf, Paul G.

    2014-01-01

    This research examines how science teaching orientations and beliefs about technology-enhanced tools change over time in professional development (PD). The primary data sources for this study came from learning journals of 8 eighth grade science teachers at the beginning and conclusion of a year of PD. Based on the analysis completed, Information…

  15. Incorporating ecosystem services into environmental management of deep-seabed mining

    Science.gov (United States)

    Le, Jennifer T.; Levin, Lisa A.; Carson, Richard T.

    2017-03-01

    Accelerated exploration of minerals in the deep sea over the past decade has raised the likelihood that commercial mining of the deep seabed will commence in the near future. Environmental concerns create a growing urgency for development of environmental regulations under commercial exploitation. Here, we consider an ecosystem services approach to the environmental policy and management of deep-sea mineral resources. Ecosystem services link the environment and human well-being, and can help improve sustainability and stewardship of the deep sea by providing a quantitative basis for decision-making. This paper briefly reviews ecosystem services provided by habitats targeted for deep-seabed mining (hydrothermal vents, seamounts, nodule provinces, and phosphate-rich margins), and presents practical steps to incorporate ecosystem services into deep-seabed mining regulation. The linkages and translation between ecosystem structure, ecological function (including supporting services), and ecosystem services are highlighted as generating human benefits. We consider criteria for identifying which ecosystem services are vulnerable to potential mining impacts, the role of ecological functions in providing ecosystem services, development of ecosystem service indicators, valuation of ecosystem services, and implementation of ecosystem services concepts. The first three steps put ecosystem services into a deep-seabed mining context; the last two steps help to incorporate ecosystem services into a management and decision-making framework. Phases of environmental planning discussed in the context of ecosystem services include conducting strategic environmental assessments, collecting baseline data, monitoring, establishing marine protected areas, assessing cumulative impacts, identifying thresholds and triggers, and creating an environmental damage compensation regime. We also identify knowledge gaps that need to be addressed in order to operationalize ecosystem services

  16. Capturing the context of maternal deaths from verbal autopsies: a reliability study of the maternal data extraction tool (M-DET).

    Directory of Open Access Journals (Sweden)

    Ann L Montgomery

    Full Text Available BACKGROUND: The availability of quality data to inform policy is essential to reduce maternal deaths. To characterize maternal deaths in settings without complete vital registration systems, we designed and assessed the inter-rater reliability of a tool to systematically extract data and characterize the events that precede a nationally representative sample of maternal deaths in India. METHOD/PRINCIPAL FINDINGS: Of 1017 nationally representative pregnancy-related deaths, which occurred between 2001 and 2003, we randomly selected 105 reports. Two independent coders used the maternal data extraction tool (questions with coding guidelines) to collect information on antenatal care access, final pregnancy outcome; planned place of birth and care provider; community consultation, transport, admission, hospital referral; and verification of cause of death assignment. Kappa estimated inter-rater agreement was calculated and classified as poor (K ≤ 0.4), moderate (0.4 < K ≤ 0.6), substantial (0.6 < K ≤ 0.8) and high (K > 0.8) using the criteria from Landis & Koch. The data extraction tool had high agreement for gestational age, pregnancy outcome, transport, death en route and admission to hospital; substantial agreement for receipt of antenatal care, planned place of birth, readmission and referral to higher level hospital, and whether or not death occurred in the intrapartum period; moderate to substantial agreement for classification of deaths as direct or indirect obstetric deaths or incidental deaths; moderate agreement for classification of community healthcare consultation and total number of healthcare contacts; and poor agreement for the classification of deaths as sudden deaths and other/unknown cause of death. The ability of the tool to identify the most-responsible-person in labour varied from moderate agreement to high agreement. CONCLUSIONS: This data extraction tool achieved good inter-rater reliability and can be used to collect data on

  17. To what degree could responsible mining play a role to prevent human rights violations arising from the industry? : A human rights law perspective

    OpenAIRE

    Nguyen, Mai

    2012-01-01

    The study primarily aims to address the legal consequences, for international human rights law, of the rise of responsible mining and of responsible mining itself in regulating the mining industry. Firstly, the thesis describes the rise of responsible mining in the context of the considerable power of the industry and mining companies. Secondly, the thesis explains how international human rights law governs the mining industry and where responsible mining could fit in the system. Thirdly, the t...

  18. GIS and Urban Mining

    Directory of Open Access Journals (Sweden)

    Xuan Zhu

    2014-03-01

    Full Text Available Geographical information systems (GIS) are a kind of location intelligence technology that supports systematic collection, integration, analysis and sharing of spatial data. They provide an effective tool for characterising and visualising geographical distributions of recyclable resources or materials dispersed across urban environments in what may be described as “urban mines”. As logistics can be a key barrier to recycling, GIS are critical for capturing and analysing location intelligence about the distribution and values of recyclable resources and associated collection systems to effectively empower and inform policy makers and the broader community with comprehensive, accurate and accessible information. This paper reviews the functionality of modern GIS, discusses the potential role of GIS in urban mining studies, and describes how GIS can be used to measure, report, analyse and visualise the spatial or geographical characteristics of dispersed stocks of recyclable waste and their collection and recovery systems. Such information can then be used to model material flows and assess the social and environmental impacts of urban mining. Issues and challenges in the use of GIS for urban mining are also addressed.

  19. Organizational Data Mining

    Science.gov (United States)

    Nemati, Hamid R.; Barko, Christopher D.

    Many organizations today possess substantial quantities of business information but have very little real business knowledge. A recent survey of 450 business executives reported that managerial intuition and instinct are more prevalent than hard facts in driving organizational decisions. To reverse this trend, businesses of all sizes would be well advised to adopt Organizational Data Mining (ODM). ODM is defined as leveraging Data Mining tools and technologies to enhance the decision-making process by transforming data into valuable and actionable knowledge to gain a competitive advantage. ODM has helped many organizations optimize internal resource allocations while better understanding and responding to the needs of their customers. The fundamental aspects of ODM can be categorized into Artificial Intelligence (AI), Information Technology (IT), and Organizational Theory (OT), with OT being the key distinction between ODM and Data Mining. In this chapter, we introduce ODM, explain its unique characteristics, and report on the current status of ODM research. Next we illustrate how several leading organizations have adopted ODM and are benefiting from it. Then we examine the evolution of ODM to the present day and conclude our chapter by contemplating ODM's challenging yet opportunistic future.

  20. Data Mining and Analysis

    Science.gov (United States)

    Samms, Kevin O.

    2015-01-01

    The Data Mining project seeks to bring the capability of data visualization to NASA anomaly and problem reporting systems for the purpose of improving data trending, evaluations, and analyses. Currently NASA systems are tailored to meet the specific needs of its organizations. This tailoring has led to a variety of nomenclatures and levels of annotation for procedures, parts, and anomalies making difficult the realization of the common causes for anomalies. Making significant observations and realizing the connection between these causes without a common way to view large data sets is difficult to impossible. In the first phase of the Data Mining project a portal was created to present a common visualization of normalized sensitive data to customers with the appropriate security access. The tool of the visualization itself was also developed and fine-tuned. In the second phase of the project we took on the difficult task of searching and analyzing the target data set for common causes between anomalies. In the final part of the second phase we have learned more about how much of the analysis work will be the job of the Data Mining team, how to perform that work, and how that work may be used by different customers in different ways. In this paper I detail how our perspective has changed after gaining more insight into how the customers wish to interact with the output and how that has changed the product.

  1. Mutation extraction tools can be combined for robust recognition of genetic variants in the literature.

    Science.gov (United States)

    Jimeno Yepes, Antonio; Verspoor, Karin

    2014-01-01

    As the cost of genomic sequencing continues to fall, the amount of data being collected and studied for the purpose of understanding the genetic basis of disease is increasing dramatically. Much of the source information relevant to such efforts is available only from unstructured sources such as the scientific literature, and significant resources are expended in manually curating and structuring the information in the literature. As such, there have been a number of systems developed to target automatic extraction of mutations and other genetic variation from the literature using text mining tools. We have performed a broad survey of the existing publicly available tools for extraction of genetic variants from the scientific literature. We consider not just one tool but a number of different tools, individually and in combination, and apply the tools in two scenarios. First, they are compared in an intrinsic evaluation context, where the tools are tested for their ability to identify specific mentions of genetic variants in a corpus of manually annotated papers, the Variome corpus. Second, they are compared in an extrinsic evaluation context based on our previous study of text mining support for curation of the COSMIC and InSiGHT databases. Our results demonstrate that no single tool covers the full range of genetic variants mentioned in the literature. Rather, several tools have complementary coverage and can be used together effectively. In the intrinsic evaluation on the Variome corpus, the combined performance is above 0.95 in F-measure, while in the extrinsic evaluation the combined recall performance is above 0.71 for COSMIC and above 0.62 for InSiGHT, a substantial improvement over the performance of any individual tool. Based on the analysis of these results, we suggest several directions for the improvement of text mining tools for genetic variant extraction from the literature.

  2. Development of opencast mines

    Energy Technology Data Exchange (ETDEWEB)

    Szebenyi, F.

    1987-01-01

    The role and work of the Central Institute for Mining Development and its legal predecessors, the Mining Research Institute and Mines Design Institute, in relation to opencast lignite mining in Hungary are summarized. Investigations aimed at the determination of the heating technical properties of lignites are reviewed. Different lignite mines, their geological features, production possibilities and development conditions are outlined.

  3. Opinion Mining and Analysis for Arabic Language

    Directory of Open Access Journals (Sweden)

    Mohammed N. Al-Kabi

    2014-06-01

    Full Text Available Social media constitutes a major component of Web 2.0 and includes social networks, blogs, forum discussions, micro-blogs, etc. Users of social media generate a huge volume of reviews and comments on a daily basis. These reviews and comments reflect the opinions of users about different issues, such as products, news, entertainment, or sports. Therefore different establishments may need to analyze these reviews and comments. For example, it is essential for companies to know the pros and cons of their products or services in the eyes of customers. Governments may also want to know the attitude of people towards certain decisions, services, etc. Although the manual analysis of textual reviews and comments can be more accurate than automatic methods, it is time consuming, expensive, and can also be subjective. In addition, the huge amount of data contained in social networks can make it impractical to perform analysis manually. This paper focuses on evaluating social content in Arabic language and contexts. Currently, the Middle East is an area rich in major political and social reforms. Social media can be a rich source of information to evaluate such contexts. In this research we developed an opinion mining and analysis tool to collect different forms of Arabic language (i.e. Standard or MSA, and colloquial). The tool accepts comments or opinions as input and generates polarity-based outputs related to the comments. For example, the output can be whether the comment or review is (subjective or objective), (positive or negative), and (strong or weak). The evaluation of the performance of the developed tool showed that it yields more accurate results when it is applied to domain-based Arabic reviews than to general-based Arabic reviews.

  4. Text mining of web-based medical content

    CERN Document Server

    Neustein, Amy

    2014-01-01

    Text Mining of Web-Based Medical Content examines web mining for extracting useful information that can be used for treating and monitoring the healthcare of patients. This work provides methodological approaches to designing mapping tools that exploit data found in social media postings. Specific linguistic features of medical postings are analyzed vis-a-vis available data extraction tools for culling useful information.

  5. Data mining methods and applications

    CERN Document Server

    Lawrence, Kenneth D; Klimberg, Ronald K

    2007-01-01

    With today's information explosion, many organizations are now able to access a wealth of valuable data. Unfortunately, most of these organizations find they are ill-equipped to organize this information, let alone put it to work for them. Gain a Competitive Advantage: employ data mining in research and forecasting; build models with data management tools and methodology optimization; gain sophisticated breakdowns and complex analysis through multivariate, evolutionary, and neural net methods; learn how to classify data and maintain quality. Transform Data into Business Acumen: Data Mining Methods and

  6. Techniques, Applications and Challenging Issue in Text Mining

    Directory of Open Access Journals (Sweden)

    Shaidah Jusoh

    2012-11-01

    Full Text Available Text mining is a very exciting research area as it tries to discover knowledge from unstructured texts. These texts can be found on desktops, intranets and the internet. The aim of this paper is to give an overview of text mining in terms of its techniques, application domains and most challenging issue. The focus is on the fundamental methods of text mining, which include natural language processing and information extraction. This paper also gives a short review of domains which have employed text mining. The challenging issue in text mining, which is caused by the complexity of natural language, is also addressed in this paper.

  7. Coal Mine Permit Boundaries

    Data.gov (United States)

    Earth Data Analysis Center, University of New Mexico — ESRI ArcView shapefile depicting New Mexico coal mines permitted under the Surface Mining Control and Reclamation Act of 1977 (SMCRA), by either the NM Mining these...

  8. Exploration and Mining Roadmap

    Energy Technology Data Exchange (ETDEWEB)

    none,

    2002-09-01

    This Exploration and Mining Technology Roadmap represents the third roadmap for the Mining Industry of the Future. It is based upon the results of the Exploration and Mining Roadmap Workshop held May 10–11, 2001.

  9. Evaluating Water Management Practice for Sustainable Mining

    Directory of Open Access Journals (Sweden)

    Xiangfeng Zhang

    2014-02-01

    Full Text Available To move towards sustainable development, the mining industry needs to identify better mine water management practices for reducing raw water use, increasing water use efficiency, and eliminating environmental impacts while securing mining production. However, the selection of optimal mine water management practices is technically challenging due to the lack of scientific tools to comprehensively evaluate management options against a set of conflicting criteria. This work provides a solution to aid the identification of more sustainable mine water management practices. The solution includes a conceptual framework for forming a decision hierarchy; an evaluation method for assessing mine water management practices; and a sensitivity analysis in view of the different preferences of stakeholders or managers. The solution is applied to a case study evaluating sustainable water management practices in 16 mines located in the Bowen Basin in Queensland, Australia. The evaluation results illustrate the usefulness of the proposed solution. A sensitivity analysis is performed according to the preference weights of stakeholders or managers. Some measures are provided for assessing the sensitivity of strategy ranking outcomes if the weight of an indicator changes. Finally, some advice is given to improve mine water management in some mines.

  10. Data mining in pharma sector: benefits.

    Science.gov (United States)

    Ranjan, Jayanthi

    2009-01-01

    The amount of data generated in any sector at present is enormous. The information flow in the pharma industry is huge. Pharma firms are progressing into increasingly technology-enabled products and services. Data mining, which is knowledge discovery from large sets of data, helps pharma firms to discover patterns for improving the quality of drug discovery and delivery methods. The paper aims to present how data mining is useful in the pharma industry, how its techniques can yield good results in the pharma sector, and how data mining can enhance decision making using pharmaceutical data. This conceptual paper is based on secondary study, research and observations from magazines, reports and notes. The author lists the types of patterns that can be discovered using data mining in pharma data. Although much work could be done on discovering knowledge in pharma data using data mining, the paper is limited to conceptualizing the ideas and viewpoints at this stage; future work may include applying data mining techniques to pharma data based on primary research using the available, well-known data mining tools. Research papers and conceptual papers related to data mining in the pharma industry are rare; this is the motivation for the paper.

  11. The Math-Biology Values Instrument: Development of a Tool to Measure Life Science Majors' Task Values of Using Math in the Context of Biology.

    Science.gov (United States)

    Andrews, Sarah E; Runyon, Christopher; Aikens, Melissa L

    2017-01-01

    In response to calls to improve the quantitative training of undergraduate biology students, there have been increased efforts to better integrate math into biology curricula. One challenge of such efforts is negative student attitudes toward math, which are thought to be particularly prevalent among biology students. According to theory, students' personal values toward using math in a biological context will influence their achievement and behavioral outcomes, but a validated instrument is needed to determine this empirically. We developed the Math-Biology Values Instrument (MBVI), an 11-item college-level self-report instrument grounded in expectancy-value theory, to measure life science students' interest in using math to understand biology, the perceived usefulness of math to their life science career, and the cost of using math in biology courses. We used a process that integrates multiple forms of validity evidence to show that scores from the MBVI can be used as a valid measure of a student's value of math in the context of biology. The MBVI can be used by instructors and researchers to help identify instructional strategies that influence math-biology values and understand how math-biology values are related to students' achievement and decisions to pursue more advanced quantitative-based courses. © 2017 S. E. Andrews et al. CBE—Life Sciences Education © 2017 The American Society for Cell Biology. This article is distributed by The American Society for Cell Biology under license from the author(s). It is available to the public under an Attribution–Noncommercial–Share Alike 3.0 Unported Creative Commons License (http://creativecommons.org/licenses/by-nc-sa/3.0).

  12. QSAR Modelling of CYP3A4 Inhibition as a Screening Tool in the Context of Drug-Drug Interaction Studies.

    Science.gov (United States)

    Hamon, Véronique; Horvath, Dragos; Gaudin, Cédric; Desrivot, Julie; Junges, Céline; Arrault, Alban; Bertrand, Marc; Vayer, Philippe

    2012-09-01

    Drugdrug interaction potential (DDI), especially cytochrome P450 (CYP) 3A4 inhibition potential, is one of the most important parameters to be optimized before preclinical and clinical pharmaceutical development as regard to the number of marketed drug metabolized mainly by this CYP and potentially co-administered with the future drug. The present study aims to develop in silico models for CYP3A4 inhibition prediction to help medicinal chemists during the discovery phase and even before the synthesis of new chemical entities (NCEs), focusing on NCEs devoid of any inhibitory potential toward this CYP. In order to find a relevant relationship between CYP3A4 inhibition and chemical features of the screened compounds, we applied a genetic-algorithm-based QSAR exploratory tool SQS (Stochastic QSAR Sampler) in combination with different description approaches comprising alignment-independent Volsurf descriptors, ISIDA fragments and Topological Fuzzy Pharmacophore Triplets. The experimental data used to build models were extracted from an in-house database. We derived a model with good prediction ability that was confirmed on both newly synthesized compound and public dataset retrieved from Pubchem database. This model is a promising efficient tool for filtering out potentially problematic compounds.

  13. Mining review

    Science.gov (United States)

    McCartan, L.; Morse, D.E.; Plunkert, P.A.; Sibley, S.F.

    2004-01-01

    The average annual growth rate of real gross domestic product (GDP) from the third quarter of 2001 through the second quarter of 2003 in the United States was about 2.6 percent. GDP growth rates in the third and fourth quarters of 2003 were about 8 percent and 4 percent, respectively. The upward trends in many sectors of the U.S. economy in 2003, however, were shared by few of the mineral materials industries. Annual output declined in most nonfuel mining and mineral processing industries, although there was an upward turn toward yearend as prices began to increase.

  14. Process mining using convex polytopes

    OpenAIRE

    Alemany Puig, Lluís

    2017-01-01

    Process Mining is a relatively young field of study that highlights the difficulty to infer models of processes from which to extract enough information to make predictions about its behaviour, find bottlenecks and causality relationships so as to be able to answer as many questions as one can make about them. In this context, a process may be understood as any activity performed by humans or computers or the result between the interaction of the two. Research on this topic has...

  15. Urban Mining

    DEFF Research Database (Denmark)

    2015-01-01

    The catalogue is one of the results of a small taught course at the Aarhus School of Architecture. The course was offered to bachelor students and focused specifically on harvesting materials in an urban context and on building with waste.

  16. Numerical analysis and geotechnical assessment of mine scale model

    Institute of Scientific and Technical Information of China (English)

    Khanal Manoj; Adhikary Deepak; Balusu Rao

    2012-01-01

    Various numerical methods are available to model, simulate, analyse and interpret the results; however, a major task is to select a reliable and intended tool to perform a realistic assessment of any problem. For a model to be representative of a realistic mining scenario, a verified tool must be chosen to perform an assessment of mine roof support requirements and address the geotechnical risks associated with longwall mining. Dependable tools provide a safe working environment, increased production and efficient management of resources, and reduce the environmental impacts of mining. Although various methods, for example analytical, experimental and empirical, are being adopted in mining, numerical tools have recently become popular due to advances in computer hardware and numerical methods. Empirical rules based on past experience do provide a general guide; however, due to the heterogeneous nature of mine geology (i.e., no two mine sites are identical), numerical simulations of mine-site-specific conditions would lend better insights into some underlying issues. The paper highlights the use of a continuum mechanics based tool in coal mining with a mine scale model. Continuum modelling can provide close to accurate stress fields and deformation. The paper describes the use of existing mine data to calibrate and validate the model parameters, which are then used to assess geotechnical issues related to installing a new high capacity longwall mine at the mine site. A variety of parameters, for example chock convergences, caveability of overlying sandstones, and abutment and vertical stresses, have been estimated.

  17. Coal Mines, Abandoned - Digitized Mined Areas

    Data.gov (United States)

    NSGIC GIS Inventory (aka Ramona) — Coal mining has occurred in Pennsylvania for over a century. The maps to these coal mines are stored at many various public and private locations (if they still...

  18. THE UNIFICATION OF THE CODE LISTS PROVIDED WITHIN THE DATA MODEL ORIGINATING FROM THE INSPIRE TECHNICAL GUIDELINES AND THE ONES PROVIDED FOR GESUT DATABASES IN THE CONTEXT OF POTENTIAL EXPLOITATION IN THE MINING INDUSTRY

    Directory of Open Access Journals (Sweden)

    Andrzej ZYGMUNIAK

    2016-07-01

    Full Text Available This study aims to expose the differences between two data models with respect to the code list values they provide. The first is the obligatory model for managing Geodesic Register of Utility Networks databases in Poland [9], and the second is the model originating from the Technical Guidelines issued for the INSPIRE Directive. Since the second is the basis for managing spatial databases among European parties, correlating these two data models eases the harmonization and, in consequence, the exchange of spatial data. Therefore, the study presents the possibilities of increasing compatibility between the values of the code lists concerning attributes for objects provided in both models. In practice, this could lead to an increase in the competitiveness of entities managing or processing such databases and to greater involvement in scientific or research projects in the mining industry. Moreover, since utility networks located in mining areas are under particular protection, the ability to make them better fitted to their own needs will make it possible for mining plants to exchange spatial data more efficiently.

  19. Wikipedia Mining

    Science.gov (United States)

    Nakayama, Kotaro; Ito, Masahiro; Erdmann, Maike; Shirakawa, Masumi; Michishita, Tomoyuki; Hara, Takahiro; Nishio, Shojiro

    Wikipedia, a collaborative Wiki-based encyclopedia, has become a huge phenomenon among Internet users. It covers a huge number of concepts from various fields such as arts, geography, history, science, sports and games. As a corpus for knowledge extraction, Wikipedia's impressive characteristics are not limited to its scale, but also include its dense link structure, URL-based word sense disambiguation, and brief anchor texts. Because of these characteristics, Wikipedia has become a promising corpus and a new frontier for research. In the past few years, a considerable amount of research has been conducted in various areas such as semantic relatedness measurement, bilingual dictionary construction, and ontology construction. Extracting machine-understandable knowledge from Wikipedia to enhance the intelligence of computational systems is the main goal of "Wikipedia Mining," a project on CREP (Challenge for Realizing Early Profits) in JSAI. In this paper, we take a comprehensive, panoramic view of Wikipedia Mining research and the current status of our challenge. After that, we discuss the future vision of this challenge.

  20. On 3D Geo-visualization of a Mine Surface Plant and Mine Roadway

    Institute of Scientific and Technical Information of China (English)

    WANG Yunjia; FU Yongming; FU Erjiang

    2007-01-01

    Constructing the 3D virtual scene of a coal mine is the objective requirement for modernizing and processing information on coal mining production. It is also the key technology to establish a "digital mine". By exploring current worldwide research, software and hardware tools and application demands, combined with the case study site (the Dazhuang mine of Pingdingshan coal group), an approach for 3D geo-visualization of a mine surface plant and mine roadway is deeply discussed. In this study, the rapid modeling method for a large range virtual scene based on Arc/Info and SiteBuilder3D is studied, and automatic generation of a 3D scene from a 2D scene is realized. Such an automatic method which can convert mine roadway systems from 2D to 3D is realized for the Dazhuang mine. Some relevant application questions are studied, including attribute query, coordinate query, distance measure, collision detection and the dynamic interaction between 2D and 3D virtual scenes in the virtual scene of a mine surface plant and mine roadway. A prototype system is designed and developed.

  1. Mathematical-logical modeling of regulations on mining safety. [Boolean algebra analysis

    Energy Technology Data Exchange (ETDEWEB)

    Fajkos, A.; Suchan, L.

    1979-09-01

    Complexity of the logical structure of mine safety regulations results from the complexity of mining problems. This complexity sometimes makes it difficult to precisely formulate mining safety regulations and to monitor their observance by the miners. It is suggested that mathematical-logical modeling can be an efficient tool in analyzing mine safety regulations. A short description of the method based on Boolean algebra, and three examples of its use in the field of mine safety regulations are presented. (2 refs.) (In Czech)

  2. Data Mining Techniques: A Source for Consumer Behavior Analysis

    CERN Document Server

    Raorane, Abhijit

    2011-01-01

    Various studies on consumer purchasing behaviors have been presented and used in real problems. Data mining techniques are expected to be a more effective tool for analyzing consumer behaviors. However, the data mining method has disadvantages as well as advantages. Therefore, it is important to select appropriate techniques to mine databases. The objective of this paper is to understand consumer behavior, the consumer's psychological condition at the time of purchase, and how a suitable data mining method can be applied to improve on conventional methods. Moreover, in an experiment, association rules are employed to mine rules for trusted customers using sales data from the supermarket industry.
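
    To illustrate the association-rule idea mentioned in the abstract above, the following minimal Python sketch computes the support and confidence of one rule over a handful of toy supermarket baskets. The basket contents and the function names `support` and `confidence` are hypothetical and are not taken from the paper; the paper's own data and procedure are not reproduced here.

```python
from typing import FrozenSet, Iterable, List

# Hypothetical toy baskets for illustration only; not data from the paper.
BASKETS: List[FrozenSet[str]] = [
    frozenset({"bread", "milk"}),
    frozenset({"bread", "butter", "milk"}),
    frozenset({"bread", "butter"}),
    frozenset({"milk", "eggs"}),
]


def support(itemset: Iterable[str], baskets: List[FrozenSet[str]]) -> float:
    """Fraction of baskets that contain every item in `itemset`."""
    items = frozenset(itemset)
    return sum(1 for basket in baskets if items <= basket) / len(baskets)


def confidence(antecedent: Iterable[str], consequent: Iterable[str],
               baskets: List[FrozenSet[str]]) -> float:
    """Estimate of P(consequent | antecedent) from basket counts."""
    base = support(antecedent, baskets)
    if base == 0:
        return 0.0  # rule is undefined when the antecedent never occurs
    joint = support(frozenset(antecedent) | frozenset(consequent), baskets)
    return joint / base


if __name__ == "__main__":
    print(support({"bread", "milk"}, BASKETS))
    print(confidence({"bread"}, {"milk"}, BASKETS))
```

    A rule such as bread -> milk would typically be kept only if both values exceed user-chosen thresholds; the thresholds themselves are a modelling choice.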

  3. [Cultural detection of thermotolerant Campylobacter spp. in food--potentials and limitations of diagnostic tools in the context of official food control].

    Science.gov (United States)

    Messelhäusser, Ute; Thärigen, Diana; Fella, Christiane; Schreiner, Hermann; Busch, Ulrich; Höller, Christiane

    2015-01-01

    Thermotolerant Campylobacter spp. rank among the most important foodborne pathogens in Germany. There is therefore a need for rapid detection methods suitable for routine use in food microbiology as well. Reliable cultural detection of thermotolerant Campylobacter spp., both qualitative and quantitative, poses a challenge, at least for certain food matrices, particularly because official food control requires cultural detection. For this reason, different cultural detection methods, in addition to the standard procedure of ISO 10272:2006, were tested in combination with molecular and immunological screening methods at the Bavarian Health and Food Safety Authority (LGL) over recent years for use in routine diagnostics, using different food matrices of animal and plant origin. The results of the comparative studies showed clearly that none of the enrichment broths tested gave completely satisfactory results for purely culture-based detection; combination with a screening method is therefore recommended for rapid and reliable detection. In that case, however, the user should take into account that the sensitivity of such molecular and immunological methods is normally so high that in some cases, depending on the food matrix and processing step, the pathogen cannot be isolated from samples that were positive in the screening methods.

  4. The psychological portrait as a tool to improve the subjective well-being of the client in the context of personal sales

    Directory of Open Access Journals (Sweden)

    Kiseleva Elena S.

    2016-01-01

    Full Text Available The definition of the concept of relationship marketing is discussed. The notion of personal sale is presented, which emphasizes the importance of the seller knowing the psychology of customers and being able to demonstrate significant competence and knowledge of modern sales techniques. The concept of a "personal agent", introduced by marketing theory, most accurately reflects the activity of the seller in the context of relationship marketing. The necessity of applying psychology in marketing is demonstrated. A brief description is given of the four basic psychological concepts that form the basis of marketing strategy, as clarified by leading marketers. Three areas of application of differential psychology in marketing are suggested. The psychological techniques are based on the typical characteristics of a person. The first point of the customer portrait is shown by the theory of spirits. The second point of the psychological portrait is based on the theory of Socionics. The third point of the customer portrait refers to the theory of the types of perception of the world. Three stages of the process of building a psychological portrait of the client are proposed and described. Recommendations for dealing with clients of different psychological types are discussed. A technique for creating a psychological portrait of the client makes it possible to improve the subjective well-being of customers and promotes growth in the main indicators that characterize the effectiveness of personal sales.

  5. Specific industrial medicine in lignite mining

    Energy Technology Data Exchange (ETDEWEB)

    Zerlett, G.; Toussaint, R.; Hilfert, R.; Koehne, G.

    1986-11-01

    Lignite is almost exclusively mined open-cast in West Germany. Mining is carried out at present in deep open-cast mines (with depths up to 400 m) using modern mining conveyer techniques with large-scale equipment (rotary bucket excavators and thickeners), conveyer belt and railway links, as well as earth-moving equipment (accessory equipment). Specific occupational diseases are not known in lignite mining. Occupational diseases in the sense of degenerative processes of the lumbar spine and avulsion fractures of spinous processes of the lower cervical spine have been observed in drivers of earth-moving equipment as a result of whole-body vibration stress. In the context of occupational disease and industrial accident prevention, close collaboration and a division of labor between industrial physicians and safety engineers or ergonomics specialists have proved to be very efficient. Industrial medical preventive examinations based on statutory regulations and the lignite mining ordinance have a long tradition and are given a great deal of space. First aid at work is organized in a largely decentralized manner owing to the extensiveness of the mining area. The industrial medical care of the employees takes place on site in the various sections (open-cast mining, workshops, briquette factories, processing units etc.). All medical technical diagnostics are carried out centrally in an industrial medical center.

  6. Mining knowledge from text repositories using information extraction: A review

    Indian Academy of Sciences (India)

    Sandeep R Sirsat; Dr Vinay Chavan; Dr Shrinivas P Deshpande

    2014-02-01

    There are two approaches to mining text from online repositories. First, when the knowledge to be discovered is expressed directly in the documents to be mined, Information Extraction (IE) alone can serve as an effective tool for such text mining. Second, when the documents contain concrete data in unstructured form rather than abstract knowledge, IE can be used first to transform the unstructured data in the document corpus into a structured database, after which state-of-the-art data mining algorithms and tools can identify abstract patterns in the extracted data. This paper presents a review of several methods related to these two approaches.

  7. Bringing together hydrologic models and Earth Observation data with water users through the WebGIS tool SPIDER in the context of the SIRIUS project

    Science.gov (United States)

    Garrido, Jesús; Osann, Anna; Calera, Alfonso; Moreno-Rivera, Juan Manuel; Momblanch, Andrea; Andreu, Joaquin; Solera, Abel; Fernández, Miguel

    2013-04-01

    Scientific expertise on irrigated agriculture and hydrological modelling has produced advanced models with tested results. However, real connections between this knowledge and its applications on the one hand, and water end-users (either water managers in the field or water policy makers) on the other, need a meeting point. In line with the main aim of Global Monitoring for Environment and Security (GMES), which is to provide global, timely and easily accessible information in applications like land and water management, the EU project SIRIUS (Sustainable Irrigation water management and River-basin governance: Implementing User-driven Services, www.sirius-gmes.es) is linking hydrologic models and Earth Observation data with water users through the webGIS tool SPIDER (System of Participatory Information, Decision support and Expert knowledge for River basin water management). The models employed are AQUATOOL (http://www.upv.es/aquatool/) and HidroMORE+® (http://www.hidromore.es/). AQUATOOL is a Decision Support System (DSS) for the management of the water resources in a river basin, which integrates in a comprehensive way all relevant water elements and their interactions in order to provide different scenarios that incorporate water offers and demands. HidroMORE+®, on the other hand, computes spatially distributed, remote-sensing-driven water balance components over large areas at high spatial and temporal resolution. Mainly applied to irrigation practices, HidroMORE+® is intended to monitor crop evolution and water demands. Either AQUATOOL products, such as scenario reports, or HidroMORE+® products, such as time series of the water balance components, can be integrated in SPIDER, which has been designed to display all these types of products. However, a general feature of models is that they often provide too many parameters, which makes them very difficult for non-experts to understand. It is therefore necessary to select among the output variables those that provide maximum useful information, according

  8. A new algorithm of Web page purification for data mining tools

    Institute of Scientific and Technical Information of China (English)

    孙楠; 张华伟

    2011-01-01

    In order to better eliminate noise and efficiently extract topic content from Web pages, a Web page purification algorithm is presented. The algorithm assumes that the topic content of a Web page is mainly contained in its <table> and <p> tags, and preprocesses Web noise on that basis. Then, through content matching with relevant Web pages, the topic content of the page is obtained by calculating node importance. In a test on a set of 6 318 pages from portal sites, the algorithm extracted the topic content correctly for 98.2% of the pages. When used for data mining tools, this algorithm outperforms other similar algorithms and eliminates Web page noise efficiently.
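
    The first step described in the abstract above, treating text inside <table> and <p> elements as candidate topic content, can be sketched with Python's standard `html.parser` module. This is only a rough illustration under stated assumptions: the class name `TopicTextExtractor` and the helper `extract_topic_text` are hypothetical, the handling of <script> and <style> is an added assumption, and the paper's later steps (matching against related pages and scoring node importance) are not implemented.

```python
from html.parser import HTMLParser
from typing import List


class TopicTextExtractor(HTMLParser):
    """Collect text found inside <table> or <p> elements (hypothetical helper)."""

    KEEP = {"table", "p"}        # tags assumed to hold topic content
    SKIP = {"script", "style"}   # assumption: never treat these as content

    def __init__(self) -> None:
        super().__init__()
        self.keep_depth = 0      # how many KEEP elements are currently open
        self.skip_depth = 0      # how many SKIP elements are currently open
        self.chunks: List[str] = []

    def handle_starttag(self, tag, attrs):
        if tag in self.KEEP:
            self.keep_depth += 1
        elif tag in self.SKIP:
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.KEEP and self.keep_depth > 0:
            self.keep_depth -= 1
        elif tag in self.SKIP and self.skip_depth > 0:
            self.skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and self.keep_depth > 0 and self.skip_depth == 0:
            self.chunks.append(text)


def extract_topic_text(html: str) -> List[str]:
    """Return the text chunks that sit inside <table> or <p> elements."""
    parser = TopicTextExtractor()
    parser.feed(html)
    parser.close()
    return parser.chunks


if __name__ == "__main__":
    page = "<div>Site menu</div><p>Main story.</p>"
    print(extract_topic_text(page))
```

    The sketch assumes well-formed markup; pages that leave <p> elements unclosed would need a more tolerant parser before any real use.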

  9. Analysis and prediction of pest dynamics in an agroforestry context using Tiko'n, a generic tool to develop food web models

    Science.gov (United States)

    Rojas, Marcela; Malard, Julien; Adamowski, Jan; Carrera, Jaime Luis; Maas, Raúl

    2017-04-01

    While it is known that climate change will impact future plant-pest population dynamics, potentially affecting crop damage, agroforestry with its enhanced biodiversity is said to reduce the outbreaks of pest insects by providing natural enemies for the control of pest populations. This premise is known in the literature as the natural enemy hypothesis and has been widely studied qualitatively. However, disagreement still exists on whether biodiversity enhancement reduces pest outbreaks, showing the need of quantitatively understanding the mechanisms behind the interactions between pests and natural enemies, also known as trophic interactions. Crop pest models that study insect population dynamics in agroforestry contexts are very rare, and pest models that take trophic interactions into account are even rarer. This may be due to the difficulty of representing complex food webs in a quantifiable model. There is therefore a need for validated food web models that allow users to predict the response of these webs to changes in climate in agroforestry systems. In this study we present Tiko'n, a Python-based software whose API allows users to rapidly build and validate trophic web models; the program uses a Bayesian inference approach to calibrate the models according to field data, allowing for the reuse of literature data from various sources and reducing the need for extensive field data collection. Tiko'n was run using coffee leaf miner (Leucoptera coffeella) and associated parasitoid data from a shaded coffee plantation, showing the mechanisms of insect population dynamics within a tri-trophic food web in an agroforestry system.

  10. Identifying phonological processing deficits in Northern Sotho-speaking children: The use of non-word repetition as a language assessment tool in the South African context

    Directory of Open Access Journals (Sweden)

    Carien Wilsenach

    2016-05-01

    Full Text Available Diagnostic testing of speech/language skills in the African languages spoken in South Africa is a challenging task, as standardised language tests in the official languages of South Africa barely exist. Commercially available language tests are in English, and have been standardised in other parts of the world. Such tests are often translated into African languages, a practice that speech language therapists deem linguistically and culturally inappropriate. In response to the need for developing clinical language assessment instruments that could be used in South Africa, this article reports on data collected with a Northern Sotho non-word repetition task (NRT). Non-word repetition measures various aspects of phonological processing, including phonological working memory (PWM), and is used widely by speech language therapists, linguists, and educational psychologists in the Western world. The design of a novel Northern Sotho NRT is described, and it is argued that the task could be used successfully in the South African context to discriminate between children with weak and strong Northern Sotho phonological processing ability, regardless of the language of learning and teaching. The NRT was piloted with 120 third graders, and showed moderate to strong correlations with other measures of PWM, such as digit span and English non-word repetition. Furthermore, the task was positively associated with both word and fluent reading in Northern Sotho, and it reliably predicted reading outcomes in the tested population. Suggestions are made for improving the current version of the Northern Sotho NRT, whereafter it should be suitable to test learners from various age groups.

  11. Identifying phonological processing deficits in Northern Sotho-speaking children: The use of non-word repetition as a language assessment tool in the South African context.

    Science.gov (United States)

    Wilsenach, Carien

    2016-01-01

    Diagnostic testing of speech/language skills in the African languages spoken in South Africa is a challenging task, as standardised language tests in the official languages of South Africa barely exist. Commercially available language tests are in English, and have been standardised in other parts of the world. Such tests are often translated into African languages, a practice that speech language therapists deem linguistically and culturally inappropriate. In response to the need for developing clinical language assessment instruments that could be used in South Africa, this article reports on data collected with a Northern Sotho non-word repetition task (NRT). Non-word repetition measures various aspects of phonological processing, including phonological working memory (PWM), and is used widely by speech language therapists, linguists, and educational psychologists in the Western world. The design of a novel Northern Sotho NRT is described, and it is argued that the task could be used successfully in the South African context to discriminate between children with weak and strong Northern Sotho phonological processing ability, regardless of the language of learning and teaching. The NRT was piloted with 120 third graders, and showed moderate to strong correlations with other measures of PWM, such as digit span and English non-word repetition. Furthermore, the task was positively associated with both word and fluent reading in Northern Sotho, and it reliably predicted reading outcomes in the tested population. Suggestions are made for improving the current version of the Northern Sotho NRT, whereafter it should be suitable to test learners from various age groups.

  12. Data mining and business analytics with R

    CERN Document Server

    Ledolter, Johannes

    2013-01-01

    Collecting, analyzing, and extracting valuable information from a large amount of data requires easily accessible, robust, computational and analytical tools. Data Mining and Business Analytics with R utilizes the open source software R for the analysis, exploration, and simplification of large high-dimensional data sets. As a result, readers are provided with the needed guidance to model and interpret complicated data and become adept at building powerful models for prediction and classification. Highlighting both underlying concepts and practical computational skills, Data Mining

  13. Antimicrobials of Bacillus species: mining and engineering

    OpenAIRE

    Zhao, Xin

    2016-01-01

    Bacillus sp. have been successfully used to suppress various bacterial and fungal pathogens. Due to the wide availability of whole genome sequence data and the development of genome mining tools, novel antimicrobials are being discovered and updated; not only bacteriocins, but also NRPs and PKs. A new classification system of known and putative antimicrobial compounds of Bacillus by genome mining is presented in Chapter 2. Importantly, predicting, isolating and screening of Bacillus strains w...

  14. National Underground Mines Inventory

    Science.gov (United States)

    1983-10-01

    08 019 726 LONG PARK 15 0502379 08 095 2904 GEO a1 MINE 0502383 08 085 2904 BESSIE 0 MINE 0502387 08 667 2904 PAYSTREAK 0502397 08 113 2904 BUENO MILL...35 061QUESTA MINE 2901267 35 055 43560 ’ RUDY NO, I S 2 2901364 35 031 MT, TAYLOR 2901375 35 061 0 MARQUEZ SHAFT 2901597 35 031 6534 MARIANO LAKE MINE

  15. Data Mining and Statistics for Decision Making

    CERN Document Server

    Tufféry, Stéphane

    2011-01-01

    Data mining is the process of automatically searching large volumes of data for models and patterns using computational techniques from statistics, machine learning and information theory; it is the ideal tool for such an extraction of knowledge. Data mining is usually associated with a business or an organization's need to identify trends and profiles, allowing, for example, retailers to discover patterns on which to base marketing objectives. This book looks at both classical and recent techniques of data mining, such as clustering, discriminant analysis, logistic regression, generalized lin

  16. Mining ergonomics

    Energy Technology Data Exchange (ETDEWEB)

    McPhee, B.

    2007-02-15

    Changes in work practices and a drive for greater productivity have introduced a range of emerging issues in ergonomics in mining. Some of the practices appear to be at odds with the need to improve general occupational health and safety. Longer shift lengths and fatigue, mental overload and underload, intermittent heavy physical work, reduced task variation, sedentary work in fixed postures and whole-body vibration all have risks for health and safety. The increasing age of some of the workforce is of concern. There appears to be a need to recognise these as potential causes of health problems. The article gives a review of these problems and reports on research findings. 36 refs., 3 figs.

  17. Mining lore : Bankhead, mining for coal

    Energy Technology Data Exchange (ETDEWEB)

    Nichiporuk, A.

    2007-09-15

    Bankhead, Alberta was one of the first communities to be established because of mining. It was founded in 1903 by the Canadian Pacific Railway (CPR) on Cascade Mountain in the Bow River Valley of Banff National Park. In 1904, Mine No. 80 was opened by the Pacific Coal Company to fuel CPR's steam engines. In order to avoid flooding the mine, the decision was made to mine up the steep seams instead of down. The mine entered full production in 1905. This article described the working conditions and pay scale for the mine workers, noting that there was not much in terms of safety equipment. There were many accidents and 15 men lost their lives at the mine. During the mine's 20-year operation, miners went on strike 6 times. The last strike marked the closure of the mine in June 1922 and the end of industry in national parks. CPR was ordered to clear out and move the mining equipment as well as the houses, buildings and essentially the entire town. During its peak production, Mine No. 80 produced about a half million tons of coal. 1 ref., 1 fig.

  18. Data mining concepts and techniques

    CERN Document Server

    Han, Jiawei

    2005-01-01

    Our ability to generate and collect data has been increasing rapidly. Not only are all of our business, scientific, and government transactions now computerized, but the widespread use of digital cameras, publication tools, and bar codes also generate data. On the collection side, scanned text and image platforms, satellite remote sensing systems, and the World Wide Web have flooded us with a tremendous amount of data. This explosive growth has generated an even more urgent need for new techniques and automated tools that can help us transform this data into useful information and knowledge. Like the first edition, voted the most popular data mining book by KD Nuggets readers, this book explores concepts and techniques for the discovery of patterns hidden in large data sets, focusing on issues relating to their feasibility, usefulness, effectiveness, and scalability. However, since the publication of the first edition, great progress has been made in the development of new data mining methods, systems, and app...

  19. Philippine Environmental Impact Assessment, Mining and Genuine Development

    Directory of Open Access Journals (Sweden)

    Allan Ingelson, William Holden & Meriam Bravante

    2009-06-01

    Full Text Available Genuine development reflects sustainability. To promote genuine development in the context of mining, the environmental impact assessment process in the Philippines needs to be changed to respect ecological integrity, mitigate cumulative environmental effects, provide more information on environmental impacts to residents affected by a proposed mine and facilitate meaningful public participation in the impact assessment process.

  20. Data Mining: A Hybrid Methodology for Complex and Dynamic Research

    Science.gov (United States)

    Lang, Susan; Baehr, Craig

    2012-01-01

    This article provides an overview of the ways in which data and text mining have potential as research methodologies in composition studies. It introduces data mining in the context of the field of composition studies and discusses ways in which this methodology can complement and extend our existing research practices by blending the best of what…

  2. Software Tools Streamline Project Management

    Science.gov (United States)

    2009-01-01

    Three innovative software inventions from Ames Research Center (NETMARK, Program Management Tool, and Query-Based Document Management) are finding their way into NASA missions as well as industry applications. The first, NETMARK, is a program that enables integrated searching of data stored in a variety of databases and documents, meaning that users no longer have to look in several places for related information. NETMARK allows users to search and query information across all of these sources in one step. This cross-cutting capability in information analysis has reduced the time needed to mine data from days or weeks to mere seconds. NETMARK has been used widely throughout NASA, enabling this automatic integration of information across many documents and databases. NASA projects that use NETMARK include the internal reporting system and project performance dashboard, Erasmus, NASA's enterprise management tool, which enhances organizational collaboration and information sharing through document routing and review; the Integrated Financial Management Program; International Space Station Knowledge Management; Mishap and Anomaly Information Reporting System; and management of the Mars Exploration Rovers. Approximately $1 billion worth of NASA's projects are currently managed using Program Management Tool (PMT), which is based on NETMARK. PMT is a comprehensive, Web-enabled application tool used to assist program and project managers within NASA enterprises in monitoring, disseminating, and tracking the progress of program and project milestones and other relevant resources. The PMT consists of an integrated knowledge repository built upon advanced enterprise-wide database integration techniques and the latest Web-enabled technologies. The current system is in a pilot operational mode allowing users to automatically manage, track, define, update, and view customizable milestone objectives and goals. The third software invention, Query

  3. A survey on occupational hazards of heading drivers using hand-transmitted vibrating tools in a Northern China Coal Mine

    Institute of Scientific and Technical Information of China (English)

    徐相蓉; 宫曼漫; 袁志伟; 王如刚; 杨秋月; 何丽华; 王生

    2016-01-01

    Objective: To investigate the symptoms and signs of heading drivers exposed to hand-transmitted vibration in a large coal mine in northern China, in order to understand the health impact of hand-transmitted vibration on these workers. Methods: 167 heading drivers were evaluated by questionnaire and physical examination. The hand-transmitted vibration intensity of the pneumatic hammers used by the workers on site was measured. SPSS 16.0 was used for statistical analysis. Results: The main symptoms of workers using hand-transmitted vibrating tools were hand numbness, palmar hyperhidrosis, tinnitus and memory loss. The main signs were abnormal sensation of cold and warmth, pain, touch and vibration. Risk factor analysis of hand numbness, a typical symptom, showed that working years, drinking, duration of daily exposure to hand-transmitted vibration and glove wearing were influencing factors. Conclusions: Long-term exposure to hand-transmitted vibration harms workers' health. To prevent this harm effectively, exposed workers should strengthen protection and reduce their exposure to vibration intensity, lowering the risk of injury.

  4. Mining and environment

    Energy Technology Data Exchange (ETDEWEB)

    Kisgyorgy, S.

    1986-01-01

    The realization of new mining projects should be preceded by detailed studies on the impact of mining activities on the environment. For defining the conditions of environmental protection and for making proper financial plans the preparation of an information system is needed. The possible social effects of the mining investments have to be estimated, first of all from the points of view of waste disposal, mining hydrology, subsidence due to underground mining etc.

  5. Integration of data mining within a Strategic Knowledge Management framework

    Directory of Open Access Journals (Sweden)

    Sanaz Moayer

    2012-08-01

    Full Text Available In today’s globally interconnected economy, knowledge is recognised as a valuable intangible asset and source of competitive advantage for firms operating in both established and emerging industries. Within these contexts Knowledge Management (KM) manifests as a set of organising principles and heuristics which shape management routines, structures, technologies and cultures within organisations. When employed as an integral part of business strategy KM can blend and develop the expertise and capacity embedded in human and technological networks. This may improve processes or add value to products, services, brands and reputation. We argue that if located within a suitable strategic framework, KM can enable sustainable competitive advantage by mobilising the intangible value in networks to create products, processes or services with unique characteristics that are hard to substitute or replicate. Despite the promise of integrated knowledge strategies within high technology and professional service industries, there has been limited discussion of business strategies linked to Knowledge Management in traditional capital intensive industries such as mining and petroleum. Within these industries IT-centric Knowledge Management Systems (KMS) have dominated, with varying degrees of success as business analysis, process improvement and cost reduction tools. This paper aims to explore the opportunities and benefits arising from the application of a strategic KM and Data Mining framework within the local operations of large domestic or multinational mining companies located in Western Australia (WA). The paper presents a high level conceptual framework for integrating so-called hard (ICT) and soft (human) systems representing the explicit and tacit knowledge embedded within broader networks of mining activity. This Strategic Knowledge Management (SKM) framework is presented as a novel first step towards improving organisational performance and realisation of

  6. Trace elements and Pb isotopes in soils and sediments impacted by uranium mining

    Energy Technology Data Exchange (ETDEWEB)

    Cuvier, A., E-mail: alicia.cuvier@hotmail.fr [ECOLAB, Université de Toulouse, CNRS, INPT, UPS, Toulouse (France); IRSN/PRP-ENV/SESURE/Laboratoire d' études radioécologiques en milieu continental et marin, BP 1, 13108 Saint Paul Lez Durance Cedex (France); Pourcelot, L. [IRSN/PRP-ENV/SESURE/Laboratoire d' études radioécologiques en milieu continental et marin, BP 1, 13108 Saint Paul Lez Durance Cedex (France); Probst, A. [ECOLAB, Université de Toulouse, CNRS, INPT, UPS, Toulouse (France); Prunier, J. [Observatoire Midi-Pyrénées, laboratoire Géosciences Environnement Toulouse, CNRS/IRD/Université Paul Sabatier, 14 avenue Edouard Belin, 31400 Toulouse (France); Le Roux, G., E-mail: gael.leroux@ensat.fr [ECOLAB, Université de Toulouse, CNRS, INPT, UPS, Toulouse (France)

    2016-10-01

    The purpose of this study is to evaluate the contamination in As, Ba, Co, Cu, Mn, Ni, Sr, V, Zn and REE, in a high uranium activity (up to 21,000 Bq·kg⁻¹) area, downstream of a former uranium mine. Different geochemical proxies like enrichment factor and fractions from a sequential extraction procedure are used to evaluate the level of contamination, the mobility and the availability of the potential contaminants. Pb isotope ratios are determined in the total samples and in the sequential leachates to identify the sources of the contaminants and to determine the mobility of radiogenic Pb in the context of uranium mining. In spite of the large uranium contamination measured in the soils and the sediments (EF ≫ 40), trace element contamination is low to moderate (2 < EF < 5), except for Ba (5 < EF < 15), due to the precipitation of barium sulfate resulting from mining activities. Most of the trace elements are associated with the most mobile fractions of the sediments/soils, implying an enhanced potential availability. Even if no Pb enrichment is highlighted, the Pb isotopic signature of the contaminated soils is strongly radiogenic. Measurements performed on the sequential leachates reveal inputs of radiogenic Pb in the most mobile fractions of the contaminated soil. Inputs of low-mobile radiogenic Pb from mining activities may also contribute to the Pb signature recorded in the residual phase of the contaminated samples. We demonstrate that Pb isotopes are efficient tools to trace the origin and the mobility of the contaminants in environments affected by uranium mining. - Highlights: • Contamination of soils is evidenced by a multiproxy approach. • Enrichment factors highlight a low contamination except for U, S and Ba. • Pb isotope ratios point out inputs of radiogenic Pb from the mine. • Radiogenic Pb is mainly in the acid-soluble and the reducible fractions.

  7. Proceedings: Fourth Workshop on Mining Scientific Datasets

    Energy Technology Data Exchange (ETDEWEB)

    Kamath, C

    2001-07-24

    Commercial applications of data mining in areas such as e-commerce, market-basket analysis, text-mining, and web-mining have taken on a central focus in the JCDD community. However, there is a significant amount of innovative data mining work taking place in the context of scientific and engineering applications that is not well represented in the mainstream KDD conferences. For example, scientific data mining techniques are being developed and applied to diverse fields such as remote sensing, physics, chemistry, biology, astronomy, structural mechanics, computational fluid dynamics etc. In these areas, data mining frequently complements and enhances existing analysis methods based on statistics, exploratory data analysis, and domain-specific approaches. On the surface, it may appear that data from one scientific field, say genomics, is very different from another field, such as physics. However, despite their diversity, there is much that is common across the mining of scientific and engineering data. For example, techniques used to identify objects in images are very similar, regardless of whether the images came from a remote sensing application, a physics experiment, an astronomy observation, or a medical study. Further, with data mining being applied to new types of data, such as mesh data from scientific simulations, there is the opportunity to apply and extend data mining to new scientific domains. This one-day workshop brings together data miners analyzing science data and scientists from diverse fields to share their experiences, learn how techniques developed in one field can be applied in another, and better understand some of the newer techniques being developed in the KDD community. This is the fourth workshop on the topic of Mining Scientific Data sets; for information on earlier workshops, see http://www.ahpcrc.org/conferences/. 
    This workshop continues the tradition of addressing challenging problems in a field where the diversity of applications is

  8. Operation Context

    DEFF Research Database (Denmark)

    Stüben, Henning; Tietjen, Anne

    2006-01-01

    Abstract: This paper seeks to challenge the notion of context from an operational perspective. Can we grasp the forces that shape the complex conditions for an architectural or urban design within the notion of context? By shifting the gaze towards the agency of architecture, contextual analysis...

  9. A practical approach for content mining of Tweets.

    Science.gov (United States)

    Yoon, Sunmoo; Elhadad, Noémie; Bakken, Suzanne

    2013-07-01

    Use of data generated through social media for health studies is gradually increasing. Twitter is a short-text message system developed 6 years ago, now with more than 100 million users generating over 300 million Tweets every day. Twitter may be used to gain real-world insights to promote healthy behaviors. The purposes of this paper are to describe a practical approach to analyzing Tweet contents and to illustrate an application of the approach to the topic of physical activity. The approach includes five steps: (1) selecting keywords to gather an initial set of Tweets to analyze; (2) importing data; (3) preparing data; (4) analyzing data (topic, sentiment, and ecologic context); and (5) interpreting data. The steps are implemented using tools that are publicly available, free of charge, and designed for use by researchers with limited programming skills. Content mining of Tweets can contribute to addressing challenges in health behavior research.
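
    The keyword-selection, preparation and topic-analysis steps described in this record can be illustrated with a minimal sketch. It is not the authors' toolchain (the paper relies on existing free tools); the keyword set, stop-word list, sample Tweets and function names below are all hypothetical and chosen only for illustration.

```python
import re
from collections import Counter

# Hypothetical keyword and stop-word lists, chosen only for this illustration.
KEYWORDS = {"walking", "running", "gym"}
STOPWORDS = {"the", "to", "this", "but", "in", "a", "and"}


def tokenize(text):
    """Prepare a Tweet: lowercase it, drop URLs, keep alphabetic tokens."""
    text = re.sub(r"https?://\S+", " ", text.lower())
    return re.findall(r"[a-z']+", text)


def select_tweets(tweets, keywords):
    """Step 1: keep only Tweets that mention at least one keyword."""
    return [t for t in tweets if keywords & set(tokenize(t))]


def top_terms(tweets, k=3):
    """Step 4 (topic part only): most frequent non-stop-word terms."""
    counts = Counter(
        tok for t in tweets for tok in tokenize(t) if tok not in STOPWORDS
    )
    return counts.most_common(k)


sample = [
    "Went running this morning, feeling great http://t.co/x",
    "Stuck in traffic again",
    "Running late to the gym but running anyway",
]
selected = select_tweets(sample, KEYWORDS)
print(top_terms(selected, k=1))
```

    Sentiment and ecologic-context analysis, which the abstract also lists under step 4, are left out of this sketch.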

  10. Text Mining in Biomedical Domain with Emphasis on Document Clustering.

    Science.gov (United States)

    Renganathan, Vinaitheerthan

    2017-07-01

    With the exponential increase in the number of articles published every year in the biomedical domain, there is a need to build automated systems to extract unknown information from the articles published. Text mining techniques enable the extraction of unknown knowledge from unstructured documents. This paper reviews text mining processes in detail and the software tools available to carry out text mining. It also reviews the roles and applications of text mining in the biomedical domain. Text mining processes, such as search and retrieval of documents, pre-processing of documents, natural language processing, methods for text clustering, and methods for text classification are described in detail. Text mining techniques can facilitate the mining of vast amounts of knowledge on a given topic from published biomedical research articles and draw meaningful conclusions that are not possible otherwise.
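
    Document clustering of the kind reviewed in this record usually starts from a numeric representation of each document and a similarity measure between them. The sketch below shows one common choice, TF-IDF weighting with cosine similarity, using only the Python standard library. It is an assumption for illustration, not the method of the paper; the smoothed IDF formula, the sample documents and the function names are hypothetical.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (dict term -> weight) per document."""
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents containing each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        vectors.append({
            # Smoothed IDF (an assumption) keeps every weight positive.
            term: (count / len(tokens))
            * (math.log((1 + n_docs) / (1 + df[term])) + 1.0)
            for term, count in tf.items()
        })
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors."""
    dot = sum(weight * b[term] for term, weight in a.items() if term in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


docs = [
    "gene expression microarray analysis",
    "microarray gene expression profiles",
    "coal mine vibration exposure",
]
vecs = tfidf_vectors(docs)
print(cosine(vecs[0], vecs[1]), cosine(vecs[0], vecs[2]))
```

    A clustering algorithm such as k-means or hierarchical agglomeration would then group documents whose vectors are close under this measure.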

  11. A top-down / bottom-up approach for multi-actors and multi-criteria assessment of mining projects for sustainable development. Application on Arlit Uranium mines (Niger); Une demarche Top-Down / Bottom-Up pour l'evaluation en termes multicriteres et multi-acteurs des projets miniers dans l'optique du developpement durable. Application sur les mines d'Uranium d'Arlit (Niger)

    Energy Technology Data Exchange (ETDEWEB)

    Chamaret, A

    2007-06-15

    This thesis aims to appraise the relevance of using a hybrid top-down / bottom-up approach to evaluate mining projects from the perspective of sustainable development. With the advent of the concepts of corporate social responsibility and sustainable development, new social expectations of companies have appeared that go beyond the sole requirement of profitability. Companies that do not meet these expectations risk losing their social legitimacy. Traditionally associated with social, environmental, economic and political impacts and risks, mining activity is particularly affected by these new issues. While the need for mineral resources has never been so high, mining companies are now expected to limit their negative effects and to take into account the expectations of their different audiences in order to define, together, the terms of their social license to operate. Considering the diversity of issues, scales, actors and contexts, the challenge is real and calls for tools to better understand the issues and to structure dialogue. Based on the case study of the uranium mines of Arlit (Niger), this work shows that combining participatory approaches with structuring tools and propositions from the literature is an efficient way to organize the diversity of issues and to build a structured dialogue between mining companies and their stakeholders. The first part presents the theoretical, institutional and sectoral contexts of the thesis. The second part presents the work and results of the evaluation carried out in Niger. The third part presents the conclusions that can be drawn from this work and proposes an evaluation framework potentially applicable to other mining sites. (author)

  12. Context in a wider context

    Directory of Open Access Journals (Sweden)

    John Traxler

    2011-07-01

    Full Text Available This paper attempts to review and reconsider the role of context in mobile learning and starts by outlining definitions of context-aware mobile learning as the technologies have become more mature, more robust and more widely available and as the notion of context has become progressively richer. The future role of context-aware mobile learning is considered within the context of the future of mobile learning as it moves from the challenges and opportunities of pedagogy and technology to the challenges and opportunities of policy, scale, sustainability, equity and engagement with augmented reality, «blended learning», «learner devices», «user-generated contexts» and the «internet of things». This is essentially a perspective on mobile learning, and other forms of technology-enhanced learning (TEL), where educators and their institutions set the agenda and manage change. There are, however, other perspectives on context. The increasing availability and use of smart-phones and other personal mobile devices with similar powerful functionality means that the experience of context for many people, in the form of personalized or location-based services, is an increasingly social and informal experience, rather than a specialist or educational experience. This is part of the transformative impact of mobility and connectedness on our societies brought about by these universal, ubiquitous and pervasive technologies. This paper contributes a revised understanding of context in the wider context (sic) of the transformations taking place in our societies. These are subtle but pervasive transformations of jobs, work and the economy, of our sense of time, space and place, of knowing and learning, and of community and identity. This leads to a radical reconsideration of context as the notions of ‹self› and ‹other› are transformed.

  13. WEB STRUCTURE MINING USING PAGERANK, IMPROVED PAGERANK – AN OVERVIEW

    OpenAIRE

    V.Lakshmi Praba; T. Vasantha

    2011-01-01

    Web Mining is the extraction of interesting and potentially useful patterns and information from the Web. It covers Web documents, hyperlinks between documents, and usage logs of web sites. The significant tasks of web mining can be listed as Information Retrieval, Information Selection / Extraction, Generalization and Analysis. Web information retrieval tools consider only the text on pages and ignore information in the links. The goal of Web structure mining is to explore structural summa...
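
    To make the link-based ranking named in this record's title concrete, here is a minimal sketch of the basic PageRank power iteration. It is not taken from the paper and does not cover the "improved PageRank" variants it surveys; the damping factor of 0.85, the uniform handling of pages without out-links, the toy link graph and the function name are assumptions for illustration.

```python
def pagerank(links, damping=0.85, tol=1e-10, max_iter=200):
    """Basic PageRank by power iteration.

    links maps each page to the list of pages it links to.
    Returns a dict page -> rank; the ranks sum to 1.
    """
    pages = sorted(set(links) | {q for targets in links.values() for q in targets})
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(max_iter):
        # Rank held by pages without out-links is spread uniformly (assumption).
        dangling = sum(rank[p] for p in pages if not links.get(p))
        new_rank = {p: (1.0 - damping) / n + damping * dangling / n for p in pages}
        for p in pages:
            targets = links.get(p, [])
            for q in targets:
                # Each page shares its rank equally among its out-links.
                new_rank[q] += damping * rank[p] / len(targets)
        delta = sum(abs(new_rank[p] - rank[p]) for p in pages)
        rank = new_rank
        if delta < tol:
            break
    return rank


# Toy link graph: every page has at least one out-link.
toy_web = {"A": ["B", "C"], "B": ["C"], "C": ["A"], "D": ["C"]}
ranks = pagerank(toy_web)
print(max(ranks, key=ranks.get))
```

    Unlike the text-only retrieval tools mentioned in the abstract, the score here depends only on the hyperlink structure: page C ranks highest in the toy graph because three pages link to it.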

  14. A Comprehensive Study of CRM through Data Mining Techniques

    CERN Document Server

    Farooqi, Md Rashid

    2012-01-01

    In today's competitive corporate world, the "Customer Retention" strategy in Customer Relationship Management (CRM) is an increasingly pressing issue. Data mining techniques play a vital role in better CRM. This paper attempts to bring a new perspective by focusing on data mining applications, opportunities and challenges in CRM. It covers topics such as customer retention, customer services, risk assessment, fraud detection and some of the data mining tools which are widely used in CRM.

  15. WEKA-G: Parallel data mining on computational grids

    Directory of Open Access Journals (Sweden)

    PIMENTA, A.

    2009-12-01

    Full Text Available Data mining is a technology that can extract useful information from large amounts of data. However, mining a database often requires high computational power. To resolve this problem, this paper presents a tool (Weka-G) that runs in parallel the algorithms used in the data mining process. As the environment for doing so, we use a computational grid, adding several features within a WAN.

  16. Local Contexts

    Directory of Open Access Journals (Sweden)

    Philippe Schlenker

    2009-07-01

    Full Text Available The dynamic approach posits that a presupposition must be satisfied in its local context. But how is a local context derived from the global one? Extant dynamic analyses must specify in the lexical entry of any operator what its 'Context Change Potential' is, and for this very reason they fail to be sufficiently explanatory. To circumvent the problem, we revise two assumptions of the dynamic approach: we take the update process to be derivative from a classical, non-dynamic semantics -- which obviates the need for dynamic lexical entries; and we deny that a local context encodes what the speech act participants 'take for granted.' Instead, we take the local context of an expression E in a sentence S to be the smallest domain that one may restrict attention to when assessing E without jeopardizing the truth conditions of S. To match the results of dynamic semantics, local contexts must be computed incrementally, using only information about the expressions that precede E. This version of the theory can be shown to be nearly equivalent to the dynamic theory of Heim 1983 -- but unlike the latter, it is entirely predictive. We also suggest that local contexts can, at some cost, be computed symmetrically, taking into account information about all of S (except E); this leads to gradient predictions, whose assessment is left for future research. doi:10.3765/sp.2.3

  17. Context Awareness

    DEFF Research Database (Denmark)

    Brønsted, Jeppe

    understanding. Traditionally, this kind of information has not been present in information systems, but with the wave of Pervasive Computing and Communication things are slowly beginning to change. One of the goals of context-aware computing is to exploit this extra resource of information, which is otherwise...... reserved for humans, to improve the interaction with and utilization of IT systems. Information about context from different sources is combined, and the system suggests relevant actions or performs them automatically. This document describes the concepts of context and context awareness and why they are important...

  19. No rock unturned: revitalizing the economies of mining dependent communities

    Energy Technology Data Exchange (ETDEWEB)

    Kuyek, J.; Coumans, C. [Mining Watch Canada, Ottawa, ON (Canada)

    2003-08-20

    The purpose is to study how communities that are dependent on mining can revitalize their economies and deal with the social and environmental problems that occur when a mine closes. This first part of the project consists of a literature review and bibliography. Emphasis is on literature published since 1980. The review covers the macroeconomic context of mining towns, rural and remote communities, environmental issues, social composition of mining towns, political and economic issues, infrastructure, long distance commuting, economic alternatives, economic diversification strategies, community economic development, government programmes and policy, case studies of specific communities, and recommendations. 254 refs.

  20. Data mining in e-commerce: A survey

    Indian Academy of Sciences (India)

    N R Srinivasa Raghavan

    2005-04-01

    Data mining has matured as a field of basic and applied research in computer science in general and e-commerce in particular. In this paper, we survey some of the recent approaches and architectures where data mining has been applied in the fields of e-commerce and e-business. Our intent is not to survey the plethora of algorithms in data mining; instead, our current focus being e-commerce, we limit our discussion to data mining in the context of e-commerce. We also mention a few directions for further work in this domain, based on the survey.

  1. Text Mining applied to Molecular Biology

    NARCIS (Netherlands)

    R. Jelier (Rob)

    2008-01-01

    This thesis describes the development of text-mining algorithms for molecular biology, in particular for DNA microarray data analysis. Concept profiles were introduced, which characterize the context in which a gene is mentioned in literature, to retrieve functional associations

  2. Change Mining in Adaptive Process Management Systems

    NARCIS (Netherlands)

    Günther, Christian W.; Rinderle, S.B.; Reichert, M.U.; van der Aalst, Wil

    2006-01-01

    The wide-spread adoption of process-aware information systems has resulted in a bulk of computerized information about real-world processes. This data can be utilized for process performance analysis as well as for process improvement. In this context process mining offers promising perspectives. So

  3. Analyzing Vessel Behavior Using Process Mining

    NARCIS (Netherlands)

    Maggi, F.M.; Mooij, A.J.; Aalst, W.M.P. van der

    2013-01-01

    In the maritime domain, electronic sensors such as AIS receivers and radars collect large amounts of data about the vessels in a certain geographical area. We investigate the use of process mining techniques for analyzing the behavior of the vessels based on these data. In the context of maritime sa

  5. Context matters!

    DEFF Research Database (Denmark)

    Bojesen, Anders

    2004-01-01

    This paper explores how the context of competencies affects the way we see and value competence and how it thereby forms communication and strategies of action. The paper puts forward the view that the context of competence is often spoken of in incomprehensible terms and generally taken...... for granted and unproblematic, although it is agreed to be of great importance. By crystallising three different modes of contextualised competence thinking (prescriptive, descriptive and analytical) the paper shows that the underlying assumptions about context - the interaction between the individual...... and the social - have major consequences for the specific enactment of competence. The paper argues in favour of a second order observation strategy for the context of competence. But in doing so it also shows that prevailing second-order competence theories so far, in criticising (counter) positions (and...

  6. Mobile Context Toolbox

    DEFF Research Database (Denmark)

    Stopczynski, Arkadiusz; Larsen, Jakob Eg; Skomail, Lukasz

    2010-01-01

    In this paper we describe an open framework utilizing sensors and application data on the Maemo mobile platform enabling rapid prototyping of context-aware mobile applications. The framework has an extensible layered architecture allowing new hardware and software sensors and features to be added...... to the context framework. We present initial results from in-the-wild experiments where contextual data was acquired using the tool. In the experiments 6 participants were using a Nokia N900 mobile phone continuously with a logger application for an average of 33 days. The study has provided valuable insights...

  7. Process mining in healthcare: A literature review.

    Science.gov (United States)

    Rojas, Eric; Munoz-Gama, Jorge; Sepúlveda, Marcos; Capurro, Daniel

    2016-06-01

    Process Mining focuses on extracting knowledge from data generated and stored in corporate information systems in order to analyze executed processes. In the healthcare domain, process mining has been used in different case studies, with promising results. Accordingly, we have conducted a literature review of the usage of process mining in healthcare. The scope of this review covers 74 papers with associated case studies, all of which were analyzed according to eleven main aspects, including: process and data types; frequently posed questions; process mining techniques, perspectives and tools; methodologies; implementation and analysis strategies; geographical analysis; and medical fields. The most commonly used categories and emerging topics have been identified, as well as future trends, such as enhancing Hospital Information Systems to become process-aware. This review can: (i) provide a useful overview of the current work being undertaken in this field; (ii) help researchers to choose process mining algorithms, techniques, tools, methodologies and approaches for their own applications; and (iii) highlight the use of process mining to improve healthcare processes.

  8. Text mining resources for the life sciences.

    Science.gov (United States)

    Przybyła, Piotr; Shardlow, Matthew; Aubin, Sophie; Bossy, Robert; Eckart de Castilho, Richard; Piperidis, Stelios; McNaught, John; Ananiadou, Sophia

    2016-01-01

    Text mining is a powerful technology for quickly distilling key information from vast quantities of biomedical literature. However, to harness this power the researcher must be well versed in the availability, suitability, adaptability, interoperability and comparative accuracy of current text mining resources. In this survey, we give an overview of the text mining resources that exist in the life sciences to help researchers, especially those employed in biocuration, to engage with text mining in their own work. We categorize the various resources under three sections: Content Discovery looks at where and how to find biomedical publications for text mining; Knowledge Encoding describes the formats used to represent the different levels of information associated with content that enable text mining, including those formats used to carry such information between processes; Tools and Services gives an overview of workflow management systems that can be used to rapidly configure and compare domain- and task-specific processes, via access to a wide range of pre-built tools. We also provide links to relevant repositories in each section to enable the reader to find resources relevant to their own area of interest. Throughout this work we give a special focus to resources that are interoperable: those that have the crucial ability to share information, enabling smooth integration and reusability. © The Author(s) 2016. Published by Oxford University Press.

  9. Sustainable mining management; Gestion minera sostenible

    Energy Technology Data Exchange (ETDEWEB)

    Tejera Oliver, J. L.

    2009-07-01

    Mining is among the oldest human activities and has provided resources for development and progress since ancient times. The discovery of fire led to the first metals, which defined the civilizations of copper, bronze and iron, and before that came the Stone Age, whose tools were made from the first quarries. The industrial revolution of the nineteenth century is linked to coal and steel, and today's society could not be conceived of without oil and gas, silicon and coltan. But mines are often aggressive and, despite the need for them and their contribution to development, they are contested by the societies in which they operate. In recent years there have been growing international efforts to establish minimum requirements for sustainable exploitation (European Directives, GMI, GRI, etc.). Within AENOR's Technical Committee for Standardization 22, Mining and Explosives, chaired by AITEMIN, subcommittee 3 was established, chaired by IGME, where, with the participation of all stakeholders, standards on sustainable mining management have been developed that give mining companies a tool to demonstrate their sustainable practice to society. (Author)

  10. Visual Based Retrieval Systems and Web Mining--Introduction.

    Science.gov (United States)

    Iyengar, S. S.

    2001-01-01

    Briefly discusses Web mining and image retrieval techniques, and then presents a summary of articles in this special issue. Articles focus on Web content mining, artificial neural networks as tools for image retrieval, content-based image retrieval systems, and personalizing the Web browsing experience using media agents. (AEF)

  11. Mining RNA-seq data for infections and contaminations.

    Directory of Open Access Journals (Sweden)

    Thomas Bonfert

    Full Text Available RNA sequencing (RNA-seq) provides novel opportunities for transcriptomic studies at nucleotide resolution, including transcriptomics of viruses or microbes infecting a cell. However, standard approaches for mapping the resulting sequencing reads generally ignore alternative sources of expression other than the host cell and are poorly equipped to address the problems arising from redundancies and gaps among sequenced microbe and virus genomes. We show that screening of sequencing reads for contaminations and infections can be performed easily using ContextMap, our recently developed mapping software. Based on mapping-derived statistics, mapping confidence, similarities and misidentifications (e.g. due to missing genome sequences of species/strains) can be assessed. Performance of our approach is evaluated on three real-life sequencing data sets and compared to state-of-the-art metagenomics tools. In particular, ContextMap vastly outperformed GASiC and GRAMMy in terms of runtime. In contrast to MEGAN4, it was capable of providing individual read mappings to species and resolving non-unique mappings, thus allowing the identification of misalignments caused by sequence similarities between genomes and missing genome sequences. Our study illustrates the importance and potential of routinely mining RNA-seq experiments for infections or contaminations by microbes and viruses. By using ContextMap, gene expression of infecting agents can be analyzed and novel insights into infection processes and tumorigenesis can be obtained.

  12. First Mexican coal mine recovery after mine fire, Esmeralda Mine

    Energy Technology Data Exchange (ETDEWEB)

    Santillan, M.A. [Minerales Monclova, SA de CV, Palau Coahuila (Mexico)

    2005-07-01

    The fire started on 8 May 1998 in the development section from methane released into the mine through a roof-bolt hole. The flames spread quickly as the coal was ignited. After eight hours the Safety Department decided to seal the vertical ventilation shafts and the slopes. The coal in the Esmeralda Mine is of very high quality, and Minerales Monclova (MIMOSA) decided to recover the facilities. However, the Esmeralda Mine coals have a very high gas content of 12 m³/t. During the next 2.5 months, MIMOSA staff and specialists observed and analysed the gas behaviour with the support of a chromatograph. With the results of the observations and analyses, MIMOSA, in consultation with the specialists, developed a recovery plan based on flooding the area in which fire might have propagated and in which rekindling was highly probable. At the same time MIMOSA trained rescue teams. By 20 August 1998, the mine command centre had reopened the slope seals. Using a 'Step-by-Step' system, the rescue team began the recovery process by employing cross-cuts and using an auxiliary fan to establish the ventilation circuit. The MIMOSA team advanced into the mine as far as allowed by the water level and was able to recover the main fan. The official mine recovery date was 30 November 1998. Esmeralda Mine was back in operation in December 1998. 1 ref., 3 figs.

  13. Efficient Selection of Data Mining Method

    Directory of Open Access Journals (Sweden)

    Mirela Danubianu

    2011-10-01

    Full Text Available Data mining tools can access large amounts of data and find patterns that can solve various problems, often with surprising solutions. We have analyzed data mining methods, techniques and algorithms, together with their characteristics, advantages and weaknesses. Taking into account the tasks to be solved in order to discover different types of knowledge, the kind of databases to work on and the type of data, as well as the area for which the data mining system is to be implemented, we have tried to find a way to efficiently choose the proper methods in a given situation. The ExpertDM system aims to find the best data mining methods for solving a task and to specify the transformations needed to bring the data into a proper form for applying these methods.

  14. Mining robotics sensors

    CSIR Research Space (South Africa)

    Green, JJ

    2012-04-01

    Full Text Available causes of fatalities in underground narrow reef mining. Data are gathered and processed from multiple underground mine sources, and techniques such as surfel modeling and synthetic view generation are explored towards creating visualisations of the data...

  15. Mines and Mineral Resources

    Data.gov (United States)

    Department of Homeland Security — Mines in the United States According to the Homeland Security Infrastructure Program Tiger Team Report Table E-2.V.1 Sub-Layer Geographic Names, a mine is defined as...

  16. 30 CFR 57.7050 - Tool and drill steel racks.

    Science.gov (United States)

    2010-07-01

    ... 30 Mineral Resources 1 2010-07-01 2010-07-01 false Tool and drill steel racks. 57.7050 Section 57.7050 Mineral Resources MINE SAFETY AND HEALTH ADMINISTRATION, DEPARTMENT OF LABOR METAL AND NONMETAL MINE SAFETY AND HEALTH SAFETY AND HEALTH STANDARDS-UNDERGROUND METAL AND NONMETAL MINES Drilling...

  17. 30 CFR 56.7050 - Tool and drill steel racks.

    Science.gov (United States)

    2010-07-01

    ... 30 Mineral Resources 1 2010-07-01 2010-07-01 false Tool and drill steel racks. 56.7050 Section 56.7050 Mineral Resources MINE SAFETY AND HEALTH ADMINISTRATION, DEPARTMENT OF LABOR METAL AND NONMETAL MINE SAFETY AND HEALTH SAFETY AND HEALTH STANDARDS-SURFACE METAL AND NONMETAL MINES Drilling and...

  18. Cultural text mining: using text mining to map the emergence of transnational reference cultures in public media repositories

    NARCIS (Netherlands)

    Pieters, Toine; Verheul, Jaap

    2014-01-01

    This paper discusses the research project Translantis, which uses innovative technologies for cultural text mining to analyze large repositories of digitized public media, such as newspapers and journals. The Translantis research team uses and develops the text mining tool Texcavator, which is base

  19. Towards semantic web mining

    OpenAIRE

    Berendt, Bettina; Hotho, Andreas; Stumme, Gerd

    2002-01-01

    Semantic Web Mining aims at combining the two fast-developing research areas Semantic Web and Web Mining. The idea is to improve, on the one hand, the results of Web Mining by exploiting the new semantic structures in the Web; and to make use of Web Mining, on overview of where the two areas meet today, and sketches ways of how a closer integration could be profitable.

  20. Mining in El Salvador

    DEFF Research Database (Denmark)

    Pacheco Cueva, Vladimir

    2014-01-01

    In this guest article, Vladimir Pacheco, a social scientist who has worked on mining and human rights shares his perspectives on a current campaign against mining in El Salvador – Central America’s smallest but most densely populated country.

  1. Applying Web Usage Mining for Personalizing Hyperlinks in Web-Based Adaptive Educational Systems

    Science.gov (United States)

    Romero, Cristobal; Ventura, Sebastian; Zafra, Amelia; de Bra, Paul

    2009-01-01

    Nowadays, the application of Web mining techniques in e-learning and Web-based adaptive educational systems is increasing exponentially. In this paper, we propose an advanced architecture for a personalization system to facilitate Web mining. A specific Web mining tool is developed and a recommender engine is integrated into the AHA! system in…

  2. New perspectives on a 140-year legacy of mining and abandoned mine cleanup in the San Juan Mountains, Colorado

    Science.gov (United States)

    Yager, Douglas B.; Fey, David L.; Chapin, Thomas; Johnson, Raymond H.

    2016-01-01

    The Gold King mine water release that occurred on 5 August 2015 near the historical mining community of Silverton, Colorado, highlights the environmental legacy that abandoned mines have on the environment. During reclamation efforts, a breach of collapsed workings at the Gold King mine sent 3 million gallons of acidic and metal-rich mine water into the upper Animas River, a tributary to the Colorado River basin. The Gold King mine is located in the scenic, western San Juan Mountains, a region renowned for its volcano-tectonic and gold-silver-base metal mineralization history. Prior to mining, acidic drainage from hydrothermally altered areas was a major source of metals and acidity to streams, and it continues to be so. In addition to abandoned hard rock metal mines, uranium mine waste poses a long-term storage and immobilization challenge in this area. Uranium resources are mined in the Colorado Plateau, which borders the San Juan Mountains on the west. Uranium processing and repository sites along the Animas River near Durango, Colorado, are a prime example of how the legacy of mining must be managed for the health and well-being of future generations. The San Juan Mountains are part of a geoenvironmental nexus where geology, mining, agriculture, recreation, and community issues converge. This trip will explore the geology, mining, and mine cleanup history in which a community-driven, watershed-based stakeholder process is an integral part. Research tools and historical data useful for understanding complex watersheds impacted by natural sources of metals and acidity overprinted by mining will also be discussed.

  3. American mines, methods and men

    Energy Technology Data Exchange (ETDEWEB)

    Walker, S.C.A. (Thames Water Utilities (UK))

    1992-04-01

    The paper is based on the author's visits to a number of American mines, to see their mining machinery and to discuss with mine management their industrial relations problems. The paper gives a brief review of American mines, methods and men and is in the form of a diary. Mines visited are: Ohio Valley Coal Company; Big John Mine; Pittsburgh Research Center of the US Bureau of Mines; Martinka Mine; Robin Hood Complex No 9 Mine (Boone County, West Virginia), Green Briar Mine (Virginia); Martin County Coal (Kentucky); Wabash Mine (Keensburgh, Illinois); Galatia Mine (Harrisburgh, Illinois); and William Station Mine (Sturgis, Kentucky). Details given include mining methods, productivity and staffing levels. The mining machinery is described in detail in a separate article. 5 figs.

  4. Data Mining for CRM

    Science.gov (United States)

    Thearling, Kurt

    Data Mining technology allows marketing organizations to better understand their customers and respond to their needs. This chapter describes how Data Mining can be combined with customer relationship management to help drive improved interactions with customers. An example showing how to use Data Mining to drive customer acquisition activities is presented.

  5. The Aspergillus Mine - publishing bioinformatics

    DEFF Research Database (Denmark)

    Vesth, Tammi Camilla; Rasmussen, Jane Lind Nybo; Theobald, Sebastian

    so with no computational specialist. Here we present a setup for analysis and publication of genome data of 70 species of Aspergillus fungi. The platform is based on R, Python and uses the RShiny framework to create interactive web‐applications. It allows all participants to create interactive...... analysis which can be shared with the team and in connection with publications. We present analysis for investigation of genetic diversity, secondary and primary metabolism and general data overview. The platform, the Aspergillus Mine, is a collection of analysis tools based on data from collaboration...... with the Joint Genome Institute. The Aspergillus Mine is not intended as a genomic data sharing service but instead focuses on creating an environment where the results of bioinformatic analysis is made available for inspection. The data and code is public upon request and figures can be obtained directly from...

  6. Understanding the Social Context of the ASGM Sector in Ghana: A Qualitative Description of the Demographic, Health, and Nutritional Characteristics of a Small-Scale Gold Mining Community in Ghana.

    Science.gov (United States)

    Long, Rachel N; Renne, Elisha P; Basu, Niladri

    2015-10-12

    This descriptive paper describes factors related to demographics and health in an artisanal and small-scale gold mining (ASGM) community in Ghana's Upper East Region. Participants (n = 114) were surveyed in 2010 and 2011, adapting questions from the established national Demographic Health Survey (DHS) on factors such as population characteristics, infrastructure, amenities, education, employment, maternal and child health, and diet. In the study community, some indicators of household wealth (e.g., radios, mobile phones, refrigerators) are more common than elsewhere in Ghana, yet basic infrastructure (e.g., cement flooring, sanitation systems) and access to safe water supplies are lacking. Risk factors for poor respiratory health, such as cooking with biomass fuel smoke and smoking tobacco, are common. Certain metrics of maternal and child health are comparable to other areas of Ghana (e.g., frequency of antenatal care), whereas others (e.g., antenatal care from a skilled provider) show deficiencies. Residents surveyed do not appear to lack key micronutrients, but report lower fruit and vegetable consumption than other rural areas. The results enable a better understanding of community demographics, health, and nutrition, and underscore the need for better demographic and health surveillance and data collection across ASGM communities to inform effective policies and programs for improving miner and community health.

  7. Understanding the Social Context of the ASGM Sector in Ghana: A Qualitative Description of the Demographic, Health, and Nutritional Characteristics of a Small-Scale Gold Mining Community in Ghana

    Directory of Open Access Journals (Sweden)

    Rachel N. Long

    2015-10-01

    Full Text Available This descriptive paper describes factors related to demographics and health in an artisanal and small-scale gold mining (ASGM) community in Ghana’s Upper East Region. Participants (n = 114) were surveyed in 2010 and 2011, adapting questions from the established national Demographic Health Survey (DHS) on factors such as population characteristics, infrastructure, amenities, education, employment, maternal and child health, and diet. In the study community, some indicators of household wealth (e.g., radios, mobile phones, refrigerators) are more common than elsewhere in Ghana, yet basic infrastructure (e.g., cement flooring, sanitation systems) and access to safe water supplies are lacking. Risk factors for poor respiratory health, such as cooking with biomass fuel smoke and smoking tobacco, are common. Certain metrics of maternal and child health are comparable to other areas of Ghana (e.g., frequency of antenatal care), whereas others (e.g., antenatal care from a skilled provider) show deficiencies. Residents surveyed do not appear to lack key micronutrients, but report lower fruit and vegetable consumption than other rural areas. The results enable a better understanding of community demographics, health, and nutrition, and underscore the need for better demographic and health surveillance and data collection across ASGM communities to inform effective policies and programs for improving miner and community health.

  8. Improvement and upgrade of mine fire simulation program MFIRE

    Institute of Scientific and Technical Information of China (English)

    ZHOU Li-hong; LUO Yi

    2011-01-01

    MFIRE, an underground mine fire simulation program developed in the 1980s, is a tool that can be used to simulate the impacts of a mine fire event on a mine ventilation network. However, its inability to simulate some important mine fire phenomena realistically has hindered its wide application. This research was carried out to improve and upgrade the MFIRE 2.20 program to MFIRE 2.30. The new additions in MFIRE 2.30, including a time-dependent fire model, smoke rollback prediction, the incorporation of a moving fire source, and a rewrite of MFIRE in Microsoft Visual C++, are described.

  9. Mine hydrogeologic conditions prospected by comprehensive geophysical exploration

    Energy Technology Data Exchange (ETDEWEB)

    Xing, X.; Pu, C. [Feicheng Coal Mining Bureau (China)

    1997-12-01

    Mine water gushing is one of the difficult problems confronting mine safety. Investigating the hydrogeological conditions of the working face is a prerequisite for effectively controlling water hazards and guaranteeing safe mining. Production practice at the Feicheng Mining Bureau has shown that a comprehensive geophysical survey, in which the DC electrical survey, the Rayleigh wave method and the transient electromagnetic method are the main techniques, is a simple, fast and effective tool for detecting geological structure, aquifer water abundance and concealed water-conducting structures at the working face and ahead of the driving face. 1 ref., 3 figs.

  10. Context trees

    OpenAIRE

    Ganzinger, Harald; Nieuwenhuis, Robert; Nivela, Pilar

    2001-01-01

    Indexing data structures are well-known to be crucial for the efficiency of the current state-of-the-art theorem provers. Examples are discrimination trees, which are like tries where terms are seen as strings and common prefixes are shared, and substitution trees, where terms keep their tree structure and all common contexts can be shared. Here we describe a new indexing data structure, context trees, where, by means of a limited kind of conte...
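
    The description of discrimination trees as tries over terms "seen as strings" can be illustrated with a minimal sketch. It is not taken from the paper: the nested-tuple term representation, the `symbol/arity` labels and the class name are assumptions made for illustration, and only ground terms (no variables) are handled, whereas real theorem-prover indexes must also deal with variables.

```python
# Terms are nested tuples: ("f", ("g", "a"), "b") stands for f(g(a), b).
# Plain strings are constants. This representation is an assumption of the sketch.

def flatten(term):
    """Return the preorder symbol string of a term, e.g. f(g(a), b) -> f/2 g/1 a b."""
    if isinstance(term, str):
        return [term]
    head, *args = term
    symbols = [f"{head}/{len(args)}"]
    for arg in args:
        symbols.extend(flatten(arg))
    return symbols


class DiscriminationTree:
    """A trie over flattened ground terms; common prefixes share nodes."""

    def __init__(self):
        self.root = {}

    def insert(self, term):
        node = self.root
        for symbol in flatten(term):
            node = node.setdefault(symbol, {})
        node[None] = term  # the key None marks the end of a stored term

    def contains(self, term):
        node = self.root
        for symbol in flatten(term):
            if symbol not in node:
                return False
            node = node[symbol]
        return None in node

    def node_count(self):
        """Count trie nodes, excluding the root and the end markers."""
        def count(node):
            return sum(1 + count(child)
                       for key, child in node.items() if key is not None)
        return count(self.root)


tree = DiscriminationTree()
tree.insert(("f", ("g", "a"), "b"))
tree.insert(("f", ("g", "a"), "c"))
print(tree.contains(("f", ("g", "a"), "c")))
print(tree.node_count())
```

    The two inserted terms share the prefix `f/2 g/1 a`, so the second insertion adds only one new node; this prefix sharing is the property the abstract attributes to discrimination trees.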

  11. Cloud computing in data mining – a survey

    Directory of Open Access Journals (Sweden)

    Viktor Nekvapil

    2015-01-01

    Full Text Available Cloud computing in data mining presents a promising solution for businesses willing to analyse their data at lower cost and for companies that want to utilise their “big data”. In this survey, reasons for using cloud computing solutions in data mining are studied and respective tools corresponding to these reasons are evaluated. The emphasis is placed on the functionality of the tools and their integration with other applications. In total, 13 solutions were evaluated.

  12. Mining and integration of pathway diagrams from imaging data.

    Science.gov (United States)

    Kozhenkov, Sergey; Baitaluk, Michael

    2012-03-01

    Pathway diagrams from PubMed and World Wide Web (WWW) contain valuable highly curated information difficult to reach without tools specifically designed and customized for the biological semantics and high-content density of the images. There is currently no search engine or tool that can analyze pathway images, extract their pathway components (molecules, genes, proteins, organelles, cells, organs, etc.) and indicate their relationships. Here, we describe a resource of pathway diagrams retrieved from article and web-page images through optical character recognition, in conjunction with data mining and data integration methods. The recognized pathways are integrated into the BiologicalNetworks research environment linking them to a wealth of data available in the BiologicalNetworks' knowledgebase, which integrates data from >100 public data sources and the biomedical literature. Multiple search and analytical tools are available that allow the recognized cellular pathways, molecular networks and cell/tissue/organ diagrams to be studied in the context of integrated knowledge, experimental data and the literature. BiologicalNetworks software and the pathway repository are freely available at www.biologicalnetworks.org. Supplementary data are available at Bioinformatics online.

  13. Mining text data

    CERN Document Server

    Aggarwal, Charu C

    2012-01-01

    Text mining applications have experienced tremendous advances because of web 2.0 and social networking applications. Recent advances in hardware and software technology have led to a number of unique scenarios where text mining algorithms are learned. "Mining Text Data" introduces an important niche in the text analytics field, and is an edited volume contributed by leading international researchers and practitioners focused on social networks & data mining. This book covers a wide swath of topics across social networks & data mining. Each chapter contains a comprehensive survey including

  14. Trace elements and Pb isotopes in soils and sediments impacted by uranium mining.

    Science.gov (United States)

    Cuvier, A; Pourcelot, L; Probst, A; Prunier, J; Le Roux, G

    2016-10-01

    The purpose of this study is to evaluate the contamination in As, Ba, Co, Cu, Mn, Ni, Sr, V, Zn and REE, in a high uranium activity (up to 21,000 Bq∙kg⁻¹) area, downstream of a former uranium mine. Different geochemical proxies like enrichment factor and fractions from a sequential extraction procedure are used to evaluate the level of contamination, the mobility and the availability of the potential contaminants. Pb isotope ratios are determined in the total samples and in the sequential leachates to identify the sources of the contaminants and to determine the mobility of radiogenic Pb in the context of uranium mining. In spite of the large uranium contamination measured in the soils and the sediments (EF≫40), trace element contamination is low to moderate (2mining activities. Most of the trace elements are associated with the most mobile fractions of the sediments/soils, implying an enhanced potential availability. Even if no Pb enrichment is highlighted, the Pb isotopic signature of the contaminated soils is strongly radiogenic. Measurements performed on the sequential leachates reveal inputs of radiogenic Pb in the most mobile fractions of the contaminated soil. Inputs of low-mobile radiogenic Pb from mining activities may also contribute to the Pb signature recorded in the residual phase of the contaminated samples. We demonstrate that Pb isotopes are efficient tools to trace the origin and the mobility of the contaminants in environments affected by uranium mining.
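
    The enrichment factor (EF) named above as a geochemical proxy is commonly computed as a ratio of normalized concentrations. The sketch below uses that common definition; the abstract does not state which reference element or background values the authors used, so the choice of Al as reference and all concentrations here are hypothetical.

```python
def enrichment_factor(sample, background, element, reference):
    """EF = (element/reference in the sample) / (element/reference in the background)."""
    sample_ratio = sample[element] / sample[reference]
    background_ratio = background[element] / background[reference]
    return sample_ratio / background_ratio


# Hypothetical concentrations in mg/kg; Al is assumed as the reference element.
sample = {"U": 120.0, "Zn": 150.0, "Al": 60000.0}
background = {"U": 3.0, "Zn": 75.0, "Al": 80000.0}

for element in ("U", "Zn"):
    print(element, round(enrichment_factor(sample, background, element, "Al"), 2))
```

    Normalizing by a reference element before comparing with the background is what lets the EF separate genuine enrichment from differences in overall sample composition.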

  15. Restriction enzyme mining for SNPs in genomes.

    Science.gov (United States)

    Chuang, Li-Yeh; Yang, Cheng-Hong; Tsui, Ke-Hung; Cheng, Yu-Huei; Chang, Phei-Lang; Wen, Cheng-Hao; Chang, Hsueh-Wei

    2008-01-01

    Many different single nucleotide polymorphisms (SNPs) genotyping methods have been developed recently. However, most of them are expensive. Using restriction enzymes for SNP genotyping is a cost-effective method. However, restriction enzyme mining for SNPs in a genome sequence is still challenging for researchers who do not have a background in genomics and bioinformatics. In this review, the basic bioinformatics tools used for restriction enzyme mining for SNP genotyping are summarized and described. The objectives of this paper include: i) the introduction of SNPs, genotyping and PCR-restriction fragment length polymorphism (RFLP); ii) a review of components for genotyping software, including tools for primer design only or restriction enzyme mining only; iii) a review of software providing the flanking sequence for primer design; iv) recent advances in PCR-RFLP tools and natural and mutagenic PCR-RFLP; v) highlighting the strategy for restriction enzyme mining for SNP genotyping; vi) a discussion of potential problems for multiple PCR-RFLP. The different implications for restriction enzymes on sense and antisense strands are also discussed. Our PCR-RFLP freeware, SNP-RFLPing, is included in this review to illustrate many characteristics of PCR-RFLP software design. Future developments will include further sophistication of PCR-RFLP software in order to provide better visualization and a more interactive environment for SNP genotyping and to integrate the software with other tools used in association studies.
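
    The core idea of restriction enzyme mining for PCR-RFLP, finding an enzyme that cuts one SNP allele but not the other, can be sketched as follows. This is not the SNP-RFLPing implementation: the function names, the three-enzyme table and the example flanking sequences are assumptions for illustration. The recognition sites listed are palindromic, so scanning one strand is sufficient for them; a general tool would also have to scan the reverse complement.

```python
# Recognition sites of three well-known enzymes. A real tool would consult
# a full enzyme database instead of this hand-picked table.
ENZYMES = {"EcoRI": "GAATTC", "BamHI": "GGATCC", "HindIII": "AAGCTT"}


def enzymes_cutting(sequence):
    """Return the set of enzymes whose recognition site occurs in sequence."""
    return {name for name, site in ENZYMES.items() if site in sequence}


def discriminating_enzymes(left_flank, alleles, right_flank):
    """Map each enzyme to the alleles it cuts, keeping only enzymes that
    cut some alleles but not all (the ones usable for PCR-RFLP)."""
    cuts = {}
    for allele in alleles:
        for name in enzymes_cutting(left_flank + allele + right_flank):
            cuts.setdefault(name, set()).add(allele)
    return {name: hit for name, hit in cuts.items() if len(hit) < len(alleles)}


# Hypothetical SNP: the flanks are chosen so that allele A completes GAATTC.
print(discriminating_enzymes("ACGTGA", ["A", "G"], "TTCGTA"))
```

    An enzyme whose site lies entirely within a flank cuts every allele and is therefore filtered out, since it cannot distinguish the genotypes.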

  16. Process mining data science in action

    CERN Document Server

    van der Aalst, Wil

    2016-01-01

    The first to cover this missing link between data mining and process modeling, this book provides real-world techniques for monitoring and analyzing processes in real time. It is a powerful new tool destined to play a key role in business process management.

  17. Antimicrobials of Bacillus species: mining and engineering

    NARCIS (Netherlands)

    Zhao, Xin

    2016-01-01

    Bacillus sp. have been successfully used to suppress various bacterial and fungal pathogens. Due to the wide availability of whole genome sequence data and the development of genome mining tools, novel antimicrobials are being discovered and updated, not only bacteriocins, but also NRPs and PKs. A n

  18. Commercial Data Mining Software

    Science.gov (United States)

    Zhang, Qingyu; Segall, Richard S.

    This chapter discusses selected commercial software for data mining, supercomputing data mining, text mining, and web mining. The selected software packages are compared by their features and also applied to available data sets. The software for data mining are SAS Enterprise Miner, Megaputer PolyAnalyst 5.0, PASW (formerly SPSS Clementine), IBM Intelligent Miner, and BioDiscovery GeneSight. The software for supercomputing are Avizo by Visualization Science Group and JMP Genomics from SAS Institute. The software for text mining are SAS Text Miner and Megaputer PolyAnalyst 5.0. The software for web mining are Megaputer PolyAnalyst and SPSS Clementine. Background on related literature and software is presented. Screen shots of each of the selected software are presented, as are conclusions and future directions.

  19. New technologies for improving properties of the cemented carbide used in mining tools

    Institute of Scientific and Technical Information of China (English)

    张正富; 彭金辉

    2011-01-01

    The development over the past two decades of the main technologies for improving the properties of WC-Co cemented carbide used in mining tools is reviewed. These technologies focus on raising both wear resistance and toughness, or on increasing one property without badly harming the other. Among them, a gradient microstructure can, to a certain extent, resolve the basic conflict between hardness and toughness. Its effect is obvious, but the process parameters must be strictly controlled while the gradient microstructure forms. The heterogeneous microstructure, namely DC carbide, currently has the biggest effect on improving the toughness of the alloy, and it does not badly harm wear resistance. By using coarse-particle WC carbonized at higher temperature, properly reducing the cobalt content and strictly controlling the impurity level, the heat conduction and toughness of coarse-grain WC-Co alloys are increased and their thermal expansion is decreased. The resistance of coarse-grain WC-Co alloy to thermal fatigue and impact is thus boosted remarkably, and its wear resistance is not obviously damaged. In addition, wear resistance can be greatly enhanced by adding a small quantity of diamond particles, without obviously harming toughness. Finally, the toughness and wear resistance of the alloy can be improved remarkably when certain elements are added to the bond phase, or when a dispersed second phase is precipitated in the bond phase by controlling the alloy composition and process parameters. The above technologies suit different applications because of their different effects and costs.

  20. Web Mining in Soft Computing Relevance and Future Directions

    Directory of Open Access Journals (Sweden)

    Amandeep Kour

    2013-01-01

    Full Text Available This paper summarizes the different characteristics of web data, the basic components of web mining and its different types. Web mining combines two active research areas: Data Mining and the World Wide Web. Web mining research relates to several research communities such as Database, Knowledge Discovery, Information Retrieval and Artificial Intelligence. The limitations of some of the existing web mining and knowledge discovery methods and tools are enunciated, and the significance of soft computing (comprising fuzzy logic (FL), artificial neural networks (ANNs), genetic algorithms (GAs), and rough sets (RSs)) is highlighted. A survey of the existing literature on “soft web mining” is provided along with the commercially available systems. The prospective areas of web mining where the application of soft computing needs immediate attention are outlined with justification. Scope for future research in developing “soft web mining” systems is explained. An extensive bibliography is also provided.

  1. Mining φ-Frequent Itemset Using FP-Tree

    Institute of Scientific and Technical Information of China (English)

    2001-01-01

    The problem of association rule mining has gained considerable prominence in the data mining community for its use as an important tool for knowledge discovery from large-scale databases, and there has been a spurt of research activity around this problem. However, traditional association rule mining may often derive many rules in which people are uninterested. This paper reports a generalization of association rule mining called φ-association rule mining. It allows people to have different interests in different itemsets, as real applications require. It can also help to derive interesting rules and substantially reduce the number of rules. An algorithm based on the FP-tree for mining φ-frequent itemsets is presented. Experiments show that the proposed method is efficient and scalable over large databases.
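
    For readers unfamiliar with the FP-tree on which the algorithm is based, the sketch below builds a standard FP-tree with a single minimum-support threshold. It does not implement the φ-frequent generalization, whose definition is not given in the abstract; the class and function names and the toy transactions are assumptions for illustration.

```python
from collections import Counter


class Node:
    """One FP-tree node: an item, its count, and links to parent and children."""

    def __init__(self, item, parent):
        self.item = item
        self.count = 0
        self.parent = parent
        self.children = {}


def build_fp_tree(transactions, min_support):
    """Build an FP-tree; return (root, header), where header maps each
    frequent item to the list of tree nodes that carry it."""
    counts = Counter(item for t in transactions for item in set(t))
    frequent = {item for item, c in counts.items() if c >= min_support}
    root, header = Node(None, None), {}
    for t in transactions:
        # Keep frequent items, most frequent first (ties broken by name).
        items = sorted((i for i in set(t) if i in frequent),
                       key=lambda i: (-counts[i], i))
        node = root
        for item in items:
            child = node.children.get(item)
            if child is None:
                child = Node(item, node)
                node.children[item] = child
                header.setdefault(item, []).append(child)
            child.count += 1
            node = child
    return root, header


def item_support(header, item):
    """Support of a single item, read off the tree via the header table."""
    return sum(node.count for node in header.get(item, []))


transactions = [["a", "b"], ["b", "c", "d"], ["a", "b", "c"], ["b", "c"]]
root, header = build_fp_tree(transactions, min_support=2)
print({item: item_support(header, item) for item in sorted(header)})
```

    Ordering each transaction by descending item frequency makes transactions share prefixes, which is why the tree is usually far smaller than the database it summarizes.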

  2. A Review on Software Mining: Current Trends and Methodologies

    Directory of Open Access Journals (Sweden)

    Gurtej Singh Ubhi

    2017-04-01

    Full Text Available As it has evolved, software mining has come to play a crucial role in day-to-day activities. Empowering software with data mining methods helps software developers and managers at all managerial levels to use it as a tool, so that the relevant software engineering data (in the form of code, design documents, bug reports) can be used to visualize the project's status of evolution and progress. In addition, mining methods and algorithms help to devise models for developing any fault-prone real-time system for the real world prior to the testing phase or the evolution phase. This review paper also highlights the different methodologies for software mining, along with its extension as a tool for fault tolerance, and provides a bibliography with special emphasis on mining software engineering information.

  3. Distributed Framework for Data Mining As a Service on Private Cloud

    Directory of Open Access Journals (Sweden)

    Shraddha Masih

    2014-11-01

    Full Text Available Data mining research faces two great challenges: (i) automated mining and (ii) mining of distributed data. Conventional mining techniques are centralized, and the data needs to be accumulated at a central location. The mining tool needs to be installed on the computer before data mining can be performed. Thus, extra time is incurred in collecting the data. Mining is done by specialized analysts who have access to mining tools. This technique is not optimal when the data is distributed over the network. To perform data mining in a distributed scenario, we need to design a different framework to improve efficiency. Also, the size of accumulated data grows exponentially with time and is difficult to mine using a single computer. Personal computers have limitations in terms of computation capability and storage capacity. Cloud computing can be exploited for compute-intensive and data-intensive applications. Data mining algorithms are both compute and data intensive; therefore cloud-based tools can provide an infrastructure for distributed data mining. This paper is intended to use cloud computing to support distributed data mining. We propose a cloud-based data mining model which provides mass data storage along with a distributed data mining facility. This paper provides a solution for distributed data mining on the Hadoop framework using an interface to run the algorithm on a specified number of nodes without any user-level configuration. Hadoop is configured over private servers, and clients can process their data through a common framework from anywhere in the private network. Data to be mined can either be chosen from the cloud data server or be uploaded from private computers on the network. It is observed that the framework helps process large data in less time compared with a single system.

  4. A MINE alternative to D-optimal designs for the linear model.

    Directory of Open Access Journals (Sweden)

    Amanda M Bouffier

    Full Text Available Doing large-scale genomics experiments can be expensive, and so experimenters want to get the most information out of each experiment. To this end the Maximally Informative Next Experiment (MINE) criterion for experimental design was developed. Here we explore this idea in a simplified context, the linear model. Four variations of the MINE method for the linear model were created: MINE-like, MINE, MINE with random orthonormal basis, and MINE with random rotation. Each method varies in how it maximizes the MINE criterion. Theorem 1 establishes sufficient conditions for the maximization of the MINE criterion under the linear model. Theorem 2 establishes when the MINE criterion is equivalent to the classic design criterion of D-optimality. By simulation under the linear model, we establish that the MINE with random orthonormal basis and MINE with random rotation are faster to discover the true linear relation with p regression coefficients and n observations when p>>n. We also establish in simulations with n<100, p=1000, σ=0.01 and 1000 replicates that these two variations of MINE also display a lower false positive rate than the MINE-like method and additionally, for a majority of the experiments, for the MINE method.
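    For readers unfamiliar with the classic criterion mentioned above: a D-optimal design for the linear model maximizes det(XᵀX), where X is the design matrix. The sketch below computes that quantity in plain Python for two hypothetical four-run designs; the helper names and the example designs are illustrative assumptions, and the MINE criterion itself is not implemented here.

```python
def gram(X):
    """Return X^T X for a design matrix given as a list of rows."""
    p = len(X[0])
    return [[sum(row[i] * row[j] for row in X) for j in range(p)]
            for i in range(p)]

def det(M):
    """Determinant by Gaussian elimination with partial pivoting."""
    A = [[float(v) for v in row] for row in M]
    n = len(A)
    d = 1.0
    for c in range(n):
        pivot = max(range(c, n), key=lambda r: abs(A[r][c]))
        if abs(A[pivot][c]) < 1e-12:
            return 0.0  # singular matrix
        if pivot != c:
            A[c], A[pivot] = A[pivot], A[c]
            d = -d  # a row swap flips the sign
        d *= A[c][c]
        for r in range(c + 1, n):
            f = A[r][c] / A[c][c]
            for k in range(c, n):
                A[r][k] -= f * A[c][k]
    return d

def d_criterion(X):
    """The quantity a D-optimal design maximizes: det(X^T X)."""
    return det(gram(X))

# Hypothetical designs for y = b0 + b1*x (columns: intercept, x)
spread = [[1, -1], [1, -1], [1, 1], [1, 1]]
clustered = [[1, -0.5], [1, 0], [1, 0], [1, 0.5]]
```

    In this toy comparison the design with points at the extremes scores higher than the clustered one, which is the usual intuition behind D-optimality.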

  5. Description of basic mining legal principles.

    Science.gov (United States)

    Schmidt, Reinhard

    2014-01-01

    The Federal Mining Act manages access, via the system of mining concessions, to areas free for mining natural resources that do not belong to the surface property and deposits' owner. These cover especially important natural resources for the economy, including coal, ore, salt, crude oil and natural gas, and also terrestrial heat. For mining operations there exist, however, the same decrees for natural resources in the property of the surface owners, which are predominantly higher-value industrial minerals such as roofing slate, basalt, quartz sand, and clays for the fireproofing industry. In the case of mining laws, administrative procedures such as issuing mining concessions, approving operating plans, and issuing permits or licenses to explore according to water rights or the Federal Immission Control Act, those authorities and departments in whose remit the projects fall are dealt with by the Mining Authority. This means that the Mining Authority is the only state point of contact for the applicant, essentially an "all-in-one" service as it will itself instigate any further participation procedures required. The classic licensing procedure of mining is the operations plan procedure, whereby the operator submits an operating plan to the Mining Authority, which then examines it to ensure it fulfills mandatory legal safety objectives. If necessary these safety objectives can be met during licensing of the operating plans by stipulating additional requirements. Depending on the subject and validity period there are overall operating plans having the widest possible remit with comprehensive participation by the authorities and basic operating plans that form the basis for every mining works. There are also special operating plans, which owing to the dynamics of mining, resolve matters that suddenly become necessary or when the basic operating plans as originally conceived were not relevant. The closing-down operating plan is the designated tool for closing down

  6. Light Stable Isotopes in Aquifers Affected by Mining Activities in a Brazilian Mining Province

    Science.gov (United States)

    Moreira, R. M.; de Carvalho, J. B.

    2013-05-01

    Iron ore is presently a main item in the Brazilian commercial agenda. Large reserves have converted this utility into an important source of export earnings and, secondarily, of raw materials for the domestic industry. Parallel to a boom in mining activities in recent years, environmental impacts and stress on natural resources have soared. A region exhibiting pronouncedly intensive mining activities lies in the central part of the State of Minas Gerais, the third economy of the federation. Mines are sited right beside the capital and neighboring towns amounting to nearly five million inhabitants with a pronounced dependence on groundwater resources. Besides, this region is a water divide enclosing the sources of main contributors to the most strategic fluvial basins in the country. Iron ore is by far the main mineral but other metals (including gold and uranium), as well as non-metals such as limestone, quartz and granite, also occur. Given the significance of this commodity in the country's trade balance and the demand for water resources of acceptable quality for human consumption, the scale of ensuing water use conflicts caused by its exploration is wide ranging and has to be coped with by well grounded environmental assessment approaches. Tracer hydrology techniques might be a valuable tool in this context. The characteristics of the area being impacted have been surveyed, including climate and pluviometry, stratigraphic lithology, geological structure, use of soil, mineral resources and their exploration, surface and ground water hydrology and their sundry uses. Data to be processed have been procured at local public agencies but as regards local hydrological features, particularly isotopic compositions, ad hoc surveys and methodologies were required. One instance concerns pluviometric isotopy: due to the alpine character of the surveyed region, altitude and temperature effects might take place. Hence different sites were monitored; cumulative pluviometer samples

  7. Issues in Data Mining and Information Retrieval

    Directory of Open Access Journals (Sweden)

    Ammar Yassir

    2012-02-01

    Full Text Available Data mining, as we use the term, is the exploration and analysis of large quantities of data in order to discover meaningful patterns and rules. For the purposes of this book, we assume that the goal of data mining is to allow a corporation to improve its marketing, sales, and customer support operations through a better understanding of its customers. Keep in mind, however, that the data mining techniques and tools described here are equally applicable in fields ranging from law enforcement to radio astronomy, medicine, and industrial process control. In fact, hardly any of the data mining algorithms were first invented with commercial applications in mind. The commercial data miner employs a grab bag of techniques borrowed from statistics, computer science, and machine learning research. The choice of a particular combination of techniques to apply in a particular situation depends on the nature of the data mining task, the nature of the available data, and the skills and preferences of the data miner. Data mining is largely concerned with building models. A model is simply an algorithm or set of rules that connects a collection of inputs (often in the form of fields in a corporate database) to a particular target or outcome

  8. Substructure mining using elaborate chemical representation.

    Science.gov (United States)

    Kazius, Jeroen; Nijssen, Siegfried; Kok, Joost; Bäck, Thomas; Ijzerman, Adriaan P

    2006-01-01

    Substructure mining algorithms are important drug discovery tools since they can find substructures that affect physicochemical and biological properties. Current methods, however, only consider a part of all chemical information that is present within a data set of compounds. Therefore, the overall aim of our study was to enable more exhaustive data mining by designing methods that detect all substructures of any size, shape, and level of chemical detail. A means of chemical representation was developed that uses atomic hierarchies, thus enabling substructure mining to consider general and/or highly specific features. As a proof-of-concept, the efficient, multipurpose graph mining system Gaston learned substructures of any size and shape from a mutagenicity data set that was represented in this manner. From these substructures, we extracted a set of only six nonredundant, discriminative substructures that represent relevant biochemical knowledge. Our results demonstrate the individual and synergistic importance of elaborate chemical representation and mining for nonlinear substructures. We conclude that the combination of elaborate chemical representation and Gaston provides an excellent method for 2D substructure mining as this recipe systematically explores all substructures in different levels of chemical detail.

  9. Sustainable mining, local communities and environmental regulation

    Directory of Open Access Journals (Sweden)

    Kokko Kai

    2015-12-01

    Full Text Available Sustainable mining is an objective as well as a tool for balancing economic, social, and environmental considerations. Each of these three dimensions of mining – and sustainable development – has many components, some of which were chosen for closer study in the SUMILCERE project. While there is no single component that in itself provides a definitive argument for or against sustainable mining, the research reveals some that have proven valuable in the process of balancing the different dimensions of sustainability. In the SUMILCERE project, comparative studies enabled us to identify factors such as the following, which are essential when discussing the balancing in practice of the three dimensions of sustainable mining cited above: the framework and functionality of environmental regulation to protect the environment (environmental sustainability); competitiveness of the mining industry in light of environmental regulation and its enforcement (economic sustainability); public participation and the opportunities local communities have to influence their surroundings, as well as communities’ acceptance of projects (social sustainability) before and during operations; and the protection of Sámi cultural rights in mining projects (social and cultural sustainability). Although each of the three dimensions of sustainability leaves room for discretion in the weight assigned to it, ecological sustainability, protected by smart environmental regulation and minimum standards, sets essential boundaries that leave no room for compromises. Economic and social sustainability are possible only within these limits. Details of the analyses in the Kolarctic area and accounts of the methods used can be found in the cited SUMILCERE articles.

  10. Hardening Stratum, the Bitcoin Pool Mining Protocol

    Directory of Open Access Journals (Sweden)

    Recabarren Ruben

    2017-07-01

    Full Text Available Stratum, the de-facto mining communication protocol used by blockchain based cryptocurrency systems, enables miners to reliably and efficiently fetch jobs from mining pool servers. In this paper we exploit Stratum’s lack of encryption to develop passive and active attacks on Bitcoin’s mining protocol, with important implications on the privacy, security and even safety of mining equipment owners. We introduce StraTap and ISP Log attacks, that infer miner earnings if given access to miner communications, or even their logs. We develop BiteCoin, an active attack that hijacks shares submitted by miners, and their associated payouts. We build BiteCoin on WireGhost, a tool we developed to hijack and surreptitiously maintain Stratum connections. Our attacks reveal that securing Stratum through pervasive encryption is not only undesirable (due to large overheads), but also ineffective: an adversary can predict miner earnings even when given access to only packet timestamps. Instead, we devise Bedrock, a minimalistic Stratum extension that protects the privacy and security of mining participants. We introduce and leverage the mining cookie concept, a secret that each miner shares with the pool and includes in its puzzle computations, and that prevents attackers from reconstructing or hijacking the puzzles.

  11. Systems modelling for effective mine water management

    Energy Technology Data Exchange (ETDEWEB)

    Cote, C.M.; Moran, C.J.; Hedemann, C.J.; Koch, C. [University of Queensland, Brisbane, Qld. (Australia)

    2010-12-15

    Concerns about the difficulties in securing water have led the Australian coal mining industry to seek innovative ways to improve its water management and to adopt novel strategies that will lead to less water being used and more water being reused. Simulation tools are essential to assess current water management performance and to predict the efficiency of potential strategies. As water systems on coal mines are complex and consist of various inter-connected elements, a systems approach was selected, which views mine site water management as a system that obtains water from various sources (surface, groundwater), provides sufficient water of suitable quality to the mining tasks (coal beneficiation, dust suppression, underground operations) and maintains environmental performance. In this paper, the model is described and its calibration is illustrated. The results of applying the model for the comparison of the water balances of 7 coal mines in the northern Bowen Basin (Queensland, Australia) are presented. The model is used to assess the impact of applying specific water management strategies. Results show that a simple systems model is an appropriate tool for assessing site performance, for providing guidance to improve performance through strategic planning, and for guiding adoption of site objectives.

  12. Study of Tools Interoperability

    NARCIS (Netherlands)

    Krilavičius, T.

    2007-01-01

    Interoperability of tools usually refers to a combination of methods and techniques that address the problem of making a collection of tools to work together. In this study we survey different notions that are used in this context: interoperability, interaction and integration. We point out relation

  13. Data mining for ontology development.

    Energy Technology Data Exchange (ETDEWEB)

    Davidson, George S.; Strasburg, Jana (Pacific Northwest National Laboratory, Richland, WA); Stampf, David (Brookhaven National Laboratory, Upton, NY); Neymotin,Lev (Brookhaven National Laboratory, Upton, NY); Czajkowski, Carl (Brookhaven National Laboratory, Upton, NY); Shine, Eugene (Savannah River National Laboratory, Aiken, SC); Bollinger, James (Savannah River National Laboratory, Aiken, SC); Ghosh, Vinita (Brookhaven National Laboratory, Upton, NY); Sorokine, Alexandre (Oak Ridge National Laboratory, Oak Ridge, TN); Ferrell, Regina (Oak Ridge National Laboratory, Oak Ridge, TN); Ward, Richard (Oak Ridge National Laboratory, Oak Ridge, TN); Schoenwald, David Alan

    2010-06-01

    A multi-laboratory ontology construction effort during the summer and fall of 2009 prototyped an ontology for counterfeit semiconductor manufacturing. This effort included an ontology development team and an ontology validation methods team. Here the third team of the Ontology Project, the Data Analysis (DA) team reports on their approaches, the tools they used, and results for mining literature for terminology pertinent to counterfeit semiconductor manufacturing. A discussion of the value of ontology-based analysis is presented, with insights drawn from other ontology-based methods regularly used in the analysis of genomic experiments. Finally, suggestions for future work are offered.

  14. 21 Recipes for Mining Twitter

    CERN Document Server

    Russell, Matthew

    2011-01-01

    Millions of public Twitter streams harbor a wealth of data, and once you mine them, you can gain some valuable insights. This short and concise book offers a collection of recipes to help you extract nuggets of Twitter information using easy-to-learn Python tools. Each recipe offers a discussion of how and why the solution works, so you can quickly adapt it to fit your particular needs. The recipes include techniques to: use OAuth to access Twitter data; create and analyze graphs of retweet relationships; use the streaming API to harvest tweets in real time; harvest and analyze friends and followers

  15. Mine your own business! Mine other's news!

    OpenAIRE

    Pham, Quang-Khai; Saint-Paul, Régis; Benatallah, Boualem; Mouaddib, Noureddine; Raschia, Guillaume

    2008-01-01

    International audience; Major media companies such as The Financial Times, the Wall Street Journal or Reuters generate huge amounts of textual news data on a daily basis. Mining frequent patterns in this mass of information is critical for knowledge workers such as financial analysts, stock traders or economists. Using existing frequent pattern mining (FPM) algorithms for the analysis of news data is difficult because of the size and lack of structuring of the free text news content. In this ...

  16. Data mining, mining data : energy consumption modelling

    Energy Technology Data Exchange (ETDEWEB)

    Dessureault, S. [Arizona Univ., Tucson, AZ (United States)

    2007-09-15

    Most modern mining operations are accumulating large amounts of data on production and business processes. Data, however, provides value only if it can be translated into information that appropriate users can utilize. This paper emphasized that a new technological focus should emerge, notably how to concentrate data into information; analyze information sufficiently to become knowledge; and, act on that knowledge. Researchers at the Mining Information Systems and Operations Management (MISOM) laboratory at the University of Arizona have created a method to transform data into action. The data-to-action approach was exercised in the development of an energy consumption model (ECM), in partnership with a major US-based copper mining company, 2 software companies, and the MISOM laboratory. The approach begins by integrating several key data sources using data warehousing techniques, and increasing the existing level of integration and data cleaning. An online analytical processing (OLAP) cube was also created to investigate the data and identify a subset of several million records. Data mining algorithms were applied using the information that was isolated by the OLAP cube. The data mining results showed that traditional cost drivers of energy consumption are poor predictors. A comparison was made between traditional methods of predicting energy consumption and the prediction formed using data mining. Traditionally, in the mines for which data were available, monthly averages of tons and distance are used to predict diesel fuel consumption. However, this article showed that new information technology can be used to incorporate many more variables into the budgeting process, resulting in more accurate predictions. The ECM helped mine planners improve the prediction of energy use through more data integration, measure development, and workflow analysis. 5 refs., 11 figs.

  17. 30 CFR 56.14205 - Machinery, equipment, and tools.

    Science.gov (United States)

    2010-07-01

    ... 30 Mineral Resources 1 2010-07-01 2010-07-01 false Machinery, equipment, and tools. 56.14205... NONMETAL MINE SAFETY AND HEALTH SAFETY AND HEALTH STANDARDS-SURFACE METAL AND NONMETAL MINES Machinery and Equipment Safety Practices and Operational Procedures § 56.14205 Machinery, equipment, and tools....

  18. A Modified Coal Mine Roof Rating Classification System to Design Support Requirements in Coal Mines

    Science.gov (United States)

    Taheri, Abbas; Lee, Yongha; Medina, Mario Andres Guardado

    2017-10-01

    The coal mine roof rating (CMRR) classification system has been applied in a number of coal mines worldwide including Australia. However, the current system cannot be used directly to design support measures in underground mines. Two case studies, the Eliza Hill project in Australia and Tabas coal mine in Iran, were analyzed to assess the impact of various rock properties and gallery geometry on stability and to modify the CMRR classification system. Having considered the CMRR system as a working classification system, applicable information and related coal mine data were selected from the two case records. The CMRR value was evaluated and analysed by undertaking correlation between CMRR and factor of safety, followed by a parametric study based on various rock properties and gallery geometries. To improve the applicability of the current system, the CMRR system was then modified by adding parameters, namely, the width of roof span and the density of overburden rock. Consequently, based on the modified CMRR system (mCMRR), roof support requirements were recommended to select the suitable rock bolting system including length and spacing of rock bolts. Numerical modelling was then undertaken to verify the support requirements recommended. The support requirements recommended by the mCMRR were found to be in close agreement with the numerical analysis results. Support systems proposed by mCMRR can assist mining engineers to assess the stability of underground coal mines or verify the results of other design tools.

  19. A Modified Coal Mine Roof Rating Classification System to Design Support Requirements in Coal Mines

    Science.gov (United States)

    Taheri, Abbas; Lee, Yongha; Medina, Mario Andres Guardado

    2017-01-01

    The coal mine roof rating (CMRR) classification system has been applied in a number of coal mines worldwide including Australia. However, the current system cannot be used directly to design support measures in underground mines. Two case studies, the Eliza Hill project in Australia and Tabas coal mine in Iran, were analyzed to assess the impact of various rock properties and gallery geometry on stability and to modify the CMRR classification system. Having considered the CMRR system as a working classification system, applicable information and related coal mine data were selected from the two case records. The CMRR value was evaluated and analysed by undertaking correlation between CMRR and factor of safety, followed by a parametric study based on various rock properties and gallery geometries. To improve the applicability of the current system, the CMRR system was then modified by adding parameters, namely, the width of roof span and the density of overburden rock. Consequently, based on the modified CMRR system (mCMRR), roof support requirements were recommended to select the suitable rock bolting system including length and spacing of rock bolts. Numerical modelling was then undertaken to verify the support requirements recommended. The support requirements recommended by the mCMRR were found to be in close agreement with the numerical analysis results. Support systems proposed by mCMRR can assist mining engineers to assess the stability of underground coal mines or verify the results of other design tools.

  20. Mining Educational Data to Analyze Students' Performance

    CERN Document Server

    Baradwaj, Brijesh Kumar

    2012-01-01

    The main objective of higher education institutions is to provide quality education to their students. One way to achieve the highest level of quality in a higher education system is by discovering knowledge for prediction regarding enrolment of students in a particular course, alienation of traditional classroom teaching model, detection of unfair means used in online examination, detection of abnormal values in the result sheets of the students, prediction about students' performance and so on. The knowledge is hidden among the educational data set and it is extractable through data mining techniques. The present paper is designed to justify the capabilities of data mining techniques in the context of higher education by offering a data mining model for the higher education system in the university. In this research, the classification task is used to evaluate students' performance and, as there are many approaches that are used for data classification, the decision tree method is used here. By this task we extract knowledge th...
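    As a rough illustration of how a decision tree chooses an attribute to split on, the sketch below computes information gain, the measure used by ID3-style trees. The abstract does not say which split measure the authors used, so this choice, the attribute names, and the four student records are all assumptions made for illustration.

```python
from collections import Counter
from math import log2

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total)
                for n in Counter(labels).values())

def information_gain(rows, attribute, target):
    """Entropy reduction obtained by splitting rows on one attribute."""
    base = entropy([r[target] for r in rows])
    groups = {}
    for r in rows:
        groups.setdefault(r[attribute], []).append(r[target])
    remainder = sum(len(g) / len(rows) * entropy(g)
                    for g in groups.values())
    return base - remainder

# Hypothetical student records, not data from the paper
rows = [
    {"attendance": "good", "assignment": "yes", "result": "pass"},
    {"attendance": "good", "assignment": "no", "result": "pass"},
    {"attendance": "poor", "assignment": "yes", "result": "fail"},
    {"attendance": "poor", "assignment": "no", "result": "fail"},
]
gain_attendance = information_gain(rows, "attendance", "result")
gain_assignment = information_gain(rows, "assignment", "result")
```

    In this toy data set attendance separates the classes perfectly while assignment carries no information, so a tree built this way would split on attendance first.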

  1. Mine waste management legislation. Gold mining areas in Romania

    Science.gov (United States)

    Maftei, Raluca-Mihaela; Filipciuc, Constantina; Tudor, Elena

    2014-05-01

    as possible any adverse effects on the environment, in particular water, air, soil, fauna, flora and landscape, and any health risks to the population, arising as a result of waste management in extractive industries". Based on the Commission decision 2009/339/EC concerning the waste management facilities - classification criteria - the Romanian Government issued GO 2042/2010, which states the procedures for approving the plan of waste management in extractive industries and its application norms. Law No. 22/2001 follows the regulations from the Espoo Convention on assessing the impact of mining on the environment sector in a cross-border context. This work is presented within the framework of the SUSMIN project.

  2. Coal mine site reclamation

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    2013-02-15

    Coal mine sites can have significant effects on local environments. In addition to the physical disruption of land forms and ecosystems, mining can also leave behind a legacy of secondary detrimental effects due to leaching of acid and trace elements from discarded materials. This report looks at the remediation of both deep mine and opencast mine sites, covering reclamation methods, back-filling issues, drainage and restoration. Examples of national variations in the applicable legislation and in the definition of rehabilitation are compared. Ultimately, mine site rehabilitation should return sites to conditions where land forms, soils, hydrology, and flora and fauna are self-sustaining and compatible with surrounding land uses. Case studies are given to show what can be achieved and how some landscapes can actually be improved as a result of mining activity.

  3. Implementation of Paste Backfill Mining Technology in Chinese Coal Mines

    Directory of Open Access Journals (Sweden)

    Qingliang Chang

    2014-01-01

    Full Text Available Implementation of clean mining technology at coal mines is crucial to protect the environment and maintain balance among energy resources, consumption, and ecology. After reviewing present coal clean mining technology, we introduce the technology principles and technological process of paste backfill mining in coal mines and discuss the components and features of backfill materials, the constitution of the backfill system, and the backfill process. Specific implementation of this technology and its application are analyzed for paste backfill mining in Daizhuang Coal Mine; a practical implementation shows that paste backfill mining can improve the safety and excavation rate of coal mining, which can effectively resolve surface subsidence problems caused by underground mining activities, by utilizing solid waste such as coal gangues as a resource. Therefore, paste backfill mining is an effective clean coal mining technology, which has widespread application.

  4. Implementation of paste backfill mining technology in Chinese coal mines.

    Science.gov (United States)

    Chang, Qingliang; Chen, Jianhang; Zhou, Huaqiang; Bai, Jianbiao

    2014-01-01

    Implementation of clean mining technology at coal mines is crucial to protect the environment and maintain balance among energy resources, consumption, and ecology. After reviewing present coal clean mining technology, we introduce the technology principles and technological process of paste backfill mining in coal mines and discuss the components and features of backfill materials, the constitution of the backfill system, and the backfill process. Specific implementation of this technology and its application are analyzed for paste backfill mining in Daizhuang Coal Mine; a practical implementation shows that paste backfill mining can improve the safety and excavation rate of coal mining, which can effectively resolve surface subsidence problems caused by underground mining activities, by utilizing solid waste such as coal gangues as a resource. Therefore, paste backfill mining is an effective clean coal mining technology, which has widespread application.

  5. National context

    DEFF Research Database (Denmark)

    Ravn, Tine

    2011-01-01

    This document forms part of the tasks of Work Package 3 for the European project PLACES (Platform of Local Authorities and Communicators Engaged in Science) whose main goal is to offer to a wide and diverse community of actors a common platform to structure their science communication activities, ... at a city/regional level (www.openplaces.eu). The main objective of this document is to have an overview of different contexts of scientific culture that are present around Europe with a particular focus on the local dimension of the initiatives and policies in science communication....

  6. Mining equipment manufacturers; 17th ed.

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    1996-12-31

    The report analyses and compares the financial ratios of mining equipment manufacturers in the United Kingdom. It includes companies who produce equipment for the mining industry such as cutting tools, drilling equipment, crushers, screens, centrifuges, tunnelling equipment, conveyors and other materials handling equipment, hydraulic equipment, filtration, pumping, and mixing equipment and ventilation equipment. The report presents business ratios analysing profitability, efficiency, liquidity, employee-based ratios and a series of figures on growth rate. Accounting data for the individual companies are given, together with principal ratios, areas of activity, names of directors and company secretary, trading address and holding companies.

  7. Mining equipment manufacturers. 13th edition

    Energy Technology Data Exchange (ETDEWEB)

    1992-01-01

    The report analyses and compares the financial ratios of mining equipment manufacturers in the United Kingdom. It includes companies who produce equipment for the mining industry such as cutting tools, drilling equipment, crushers, screens, centrifuges, tunnelling equipment, conveyors and other materials handling equipment, hydraulic equipment, filtration, pumping and mixing equipment and ventilation equipment. The report presents business ratios analysing profitability, efficiency, liquidity, employee-based ratios and a series of figures on growth rate. Accounting data for the individual companies are given, together with principal ratios, areas of activity, names of directors and company secretary, trading address and holding companies.

  8. Data Mining of NS-2 Trace File

    Directory of Open Access Journals (Sweden)

    Ahmed Jawad Kadhim

    2014-11-01

    Full Text Available Data mining is an important process for extracting useful information and patterns from huge amounts of data. NS-2 is an efficient tool for building a network environment. The result of simulating such an environment in NS-2 is a trace file that contains several columns, with lines representing the network events. This trace file can be used to analyse the network according to performance metrics, but it has redundant columns and rows. So, this paper performs data mining in order to find only the information necessary for the analysis operation, reducing the execution time and the storage size of the trace file.
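    The kind of reduction described above can be sketched as a simple filter over the standard 12-column NS-2 wired trace format. Which events and columns count as "necessary" depends on the metric being computed; the selection below (receive and drop events, five columns), the function name, and the sample lines are assumptions for illustration, not the paper's method.

```python
# Column layout of the standard NS-2 wired trace format
FIELDS = ["event", "time", "from_node", "to_node", "pkt_type", "pkt_size",
          "flags", "flow_id", "src", "dst", "seq", "pkt_id"]

# Hypothetical choice of columns to keep for a throughput/loss analysis
KEEP = ("event", "time", "pkt_type", "pkt_size", "pkt_id")

def reduce_trace(lines, keep=KEEP, events=("r", "d")):
    """Keep only selected events (rows) and selected fields (columns)."""
    reduced = []
    for line in lines:
        parts = line.split()
        if len(parts) != len(FIELDS):
            continue  # skip malformed or non-standard lines
        record = dict(zip(FIELDS, parts))
        if record["event"] not in events:
            continue  # drop enqueue/dequeue rows, keep receive/drop
        reduced.append(tuple(record[name] for name in keep))
    return reduced

# Made-up sample lines in the wired trace layout
sample = [
    "+ 1.0 0 2 cbr 210 ------- 0 0.0 3.1 0 0",
    "- 1.0 0 2 cbr 210 ------- 0 0.0 3.1 0 0",
    "r 1.0135 0 2 cbr 210 ------- 0 0.0 3.1 0 0",
    "d 1.5 2 3 cbr 210 ------- 0 0.0 3.1 7 7",
]
reduced = reduce_trace(sample)
```

    Dropping the enqueue and dequeue rows and the unused columns shrinks both the number of lines and the width of each record, which is the source of the time and storage savings the abstract refers to.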

  9. AWARE: mobile context instrumentation framework

    Directory of Open Access Journals (Sweden)

    Denzil eFerreira

    2015-04-01

    Full Text Available We present a mobile instrumentation toolkit, AWARE, an open-source effort to develop an extensible and reusable platform for capturing, inferring, and generating context on mobile devices. Mobile phones are sensor-rich but resource-constrained, and therefore several considerations need to be addressed when creating a research tool that ensures problem-free context collection. We demonstrate how AWARE can mitigate researchers’ effort when building mobile data-logging tools and context-aware applications, with minimal battery impact. By encapsulating implementation details of sensor data retrieval and exposing the sensed context as higher-level abstractions, AWARE shifts the focus from software development to data analysis, both quantitative and qualitative. We have evaluated AWARE in several case studies and discuss its use, power consumption, and scalability.

  10. Management of the water balance and quality in mining areas

    Science.gov (United States)

    Pasanen, Antti; Krogerus, Kirsti; Mroueh, Ulla-Maija; Turunen, Kaisa; Backnäs, Soile; Vento, Tiia; Veijalainen, Noora; Hentinen, Kimmo; Korkealaakso, Juhani

    2015-04-01

    Although mining companies have long been conscious of water-related risks, they still face environmental management problems. These problems mainly emerge because mine sites' water balances have not been adequately assessed at the mine planning stage. A more consistent approach is required to help mining companies identify risks and opportunities related to the management of water resources in all stages of mining. This approach requires that the water cycle of a mine site is interconnected with the general hydrologic water cycle. In addition to knowledge of hydrological conditions, the control of the water balance in the mining processes requires knowledge of mining processes, the ability to adjust process parameters to variable hydrological conditions, adaptation of suitable water management tools and systems, systematic monitoring of amounts and quality of water, adequate capacity in water management infrastructure to handle the variable water flows, best practices to assess the dispersion, mixing and dilution of mine water and pollutant loading to receiving water bodies, and dewatering and separation of water from tailings and precipitates. The WaterSmart project aims to improve the awareness of actual quantities of water and of water balances in mine areas in order to improve the forecasting and management of water volumes. The study is executed through hydrogeological and hydrological surveys and online monitoring procedures. One of the aims is to exploit online water quantity and quality monitoring for better management of the water balances. The target is to develop practical and end-user-specific online input and output procedures. The second objective is to develop mathematical models to calculate combined water balances including the surface, ground and process waters. WSFS, the Hydrological Modeling and Forecasting System of SYKE, is being modified for mining areas. New modelling tools are developed on spreadsheet and system dynamics platforms to

  11. Environmental consequences of the Retsof Salt Mine roof collapse

    Science.gov (United States)

    Yager, Richard M.

    2013-01-01

    In 1994, the largest salt mine in North America, which had been in operation for more than 100 years, catastrophically flooded when the mine ceiling collapsed. In addition to causing the loss of the mine and the mineral resources it provided, this event formed sinkholes, caused widespread subsidence to land, caused structures to crack and subside, and changed stream flow and erosion patterns. Subsequent flooding of the mine drained overlying aquifers, changed the groundwater salinity distribution (rendering domestic wells unusable), and allowed locally present natural gas to enter dwellings through water wells. Investigations including exploratory drilling, hydrologic and water-quality monitoring, geologic and geophysical studies, and numerical simulation of groundwater flow, salinity, and subsidence have been effective tools in understanding the environmental consequences of the mine collapse and informing decisions about management of those consequences for the future. Salt mines are generally dry, but are susceptible to leaks and can become flooded if groundwater from overlying aquifers or surface water finds a way downward into the mined cavity through hundreds of feet of rock. With its potential to flood the entire mine cavity, groundwater is a constant source of concern for mine operators. The problem is compounded by the viscous nature of salt and the fact that salt mines commonly lie beneath water-bearing aquifers. Salt (for example halite or potash) deforms and “creeps” into the mined openings over time spans that range from years to centuries. This movement of salt can destabilize the overlying rock layers and lead to their eventual sagging and collapse, creating permeable pathways for leakage of water and depressions or openings at land surface, such as sinkholes. Salt is also highly soluble in water; therefore, whenever water begins to flow into a salt mine, the channels through which it flows increase in diameter as the surrounding salt dissolves

  12. A REVIEW ON TEXT MINING IN DATA MINING

    OpenAIRE

    2016-01-01

    Data mining is knowledge discovery in databases, and the goal is to extract patterns and knowledge from large amounts of data. An important area within data mining is text mining. Text mining extracts high-quality information from text. Statistical pattern learning is used to derive high-quality information. High quality in text mining refers to a combination of relevance, novelty and interestingness. Tasks in text mining are text categorization, text clustering, entity extraction and sentim...

  13. The Challenge of Wireless Connectivity to Support Intelligent Mines

    DEFF Research Database (Denmark)

    Barbosa, Viviane S. B.; Garcia, Luis G. U.; Caldwell, George

    2016-01-01

    and increase productivity, from extraction all the way to the delivery of a processed product to the customer. In this context, one of the key enablers is wireless connectivity since it allows mining equipment to be remotely monitored and controlled. Simply put, dependable wireless connectivity is essential...... for unmanned mine operations. Although voice and narrowband data radios have been used for years to support several types of mining activities, such as fleet management (dispatch) and telemetry, the use of automated equipment introduces a new set of connectivity requirements and poses a set of challenges...... by mining automation and discuss the consequences of not providing connectivity for all applications. The work also discusses how the careful positioning of the heavy communications infrastructure (tall towers) from the early stages of the mine site project can make the provision of incremental capacity...

  14. High resolution seismic reflection, an exploration tool within an underground environment (example from Zimbabwe)

    Science.gov (United States)

    Mutyorauta, J. J.

    Metallurgical grade chromite ore in Zimbabwe is mined from two underground mines, Peak Mine and Railway Block Mine, in Shurugwi. Peak Mine is at present just over 800 m deep. In the search for new chromite ore bodies, such a depth limits the application of the conventional geophysical exploration tools. Exploration diamond drilling is becoming more and more an expensive resort. Alternative and effective geophysical techniques are therefore being actively sought after. The high resolution seismic reflection technique, carried out right within Peak Mine, has the potential to become a useful exploration tool.

  15. Risk from vibration in Indian mines

    Directory of Open Access Journals (Sweden)

    Mandal Bibhuti

    2006-01-01

    Full Text Available Equipment-induced vibration is widely recognized as a health hazard. It is a physical stressor to which many people are exposed in the workplace, and the mining industry is no exception. In spite of extensive research undertaken in the developed countries, information on the magnitude of the problem in India is not available. An estimated 1 million workers were engaged in the Indian mining industry in the year 2003. The actual figures could be much higher. Analysis of the employees' databases of several mines reveals that 18% of employees in the Indian mining industry are occupationally exposed to vibration. Large-scale mechanization considerably adds to the severity and complexity of the problem because of (1) an increase in the percentage of the exposed population and (2) a longer duration of exposure. The clinical picture and health outcomes of exposure to hand-arm vibration and whole-body vibration are scantily documented in the Indian context. In view of the health risk and action taken in other countries, we submit that there is an urgent need to develop a practical management strategy for evaluation, monitoring and control of equipment-induced vibration in the mining industry.

  16. Decision support methods for the environmental assessment of contamination at mining sites.

    Science.gov (United States)

    Jordan, Gyozo; Abdaal, Ahmed

    2013-09-01

    Polluting mine accidents and widespread environmental contamination associated with historic mining in Europe and elsewhere have triggered the improvement of related environmental legislation and of the environmental assessment and management methods for the mining industry. Mining has some unique features, such as natural background pollution associated with natural mineral deposits, industrial activities and contamination located in the three-dimensional sub-surface space, the problem of long-term remediation after mine closure, and the problem of secondary contaminated areas around mine sites and abandoned mines in historic regions like Europe. These mining-specific problems require special tools to address the complexity of the environmental problems of mining-related contamination. The objective of this paper is to review and evaluate some of the decision support methods that have been developed and applied to mining contamination. In this paper, only those methods that are both efficient decision support tools and provide a 'holistic' approach to the complex problem are considered. These tools are (1) landscape ecology, (2) industrial ecology, (3) landscape geochemistry, (4) geo-environmental models, (5) environmental impact assessment, (6) environmental risk assessment, (7) material flow analysis and (8) life cycle assessment. This unique inter-disciplinary study should enable both the researcher and the practitioner to obtain a broad view of the state of the art of decision support methods for the environmental assessment of contamination at mine sites. Documented examples and abundant references are also provided.

  17. Context Construction through Competition

    DEFF Research Database (Denmark)

    Kjær, Poul F.

    This paper examines the relation between the evolution of statehood and institutionalised competition in the European context. The first half of the paper develops a historical-sociological view on the evolution of modern political power in the state form in Europe while the second half of the paper...... reconstructs how the institutionalisation of competition as a specific type of policy tool which has been used by emerging modern states to establish its authority vis-à-vis competing claims to authority. It furthermore engages in an examination of corporatist and governance based attempts to respectively curb...... and expand the use of competition as a tool for organising social processes and the implications of these attempts for the state of statehood....

  18. Mutation extraction tools can be combined for robust recognition of genetic variants in the literature [v2; ref status: indexed, http://f1000r.es/3j5

    Directory of Open Access Journals (Sweden)

    Antonio Jimeno Yepes

    2014-06-01

    Full Text Available As the cost of genomic sequencing continues to fall, the amount of data being collected and studied for the purpose of understanding the genetic basis of disease is increasing dramatically. Much of the source information relevant to such efforts is available only from unstructured sources such as the scientific literature, and significant resources are expended in manually curating and structuring the information in the literature. As such, there have been a number of systems developed to target automatic extraction of mutations and other genetic variation from the literature using text mining tools. We have performed a broad survey of the existing publicly available tools for extraction of genetic variants from the scientific literature. We consider not just one tool but a number of different tools, individually and in combination, and apply the tools in two scenarios. First, they are compared in an intrinsic evaluation context, where the tools are tested for their ability to identify specific mentions of genetic variants in a corpus of manually annotated papers, the Variome corpus. Second, they are compared in an extrinsic evaluation context based on our previous study of text mining support for curation of the COSMIC and InSiGHT databases. Our results demonstrate that no single tool covers the full range of genetic variants mentioned in the literature. Rather, several tools have complementary coverage and can be used together effectively. In the intrinsic evaluation on the Variome corpus, the combined performance is above 0.95 in F-measure, while in the extrinsic evaluation the combined recall performance is above 0.71 for COSMIC and above 0.62 for InSiGHT, a substantial improvement over the performance of any individual tool. Based on the analysis of these results, we suggest several directions for the improvement of text mining tools for genetic variant extraction from the literature.
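
    How complementary coverage can raise a combined score may be easier to see with numbers. The sketch below assumes that tool outputs are combined by a simple set union and that mentions are compared by exact match; both are illustrative assumptions, since the abstract does not describe the combination or matching procedure. The tool names and mention strings are invented toy data, not results from the study.

```python
def prf(predicted, gold):
    """Precision, recall and F-measure for two sets of (doc_id, mention) pairs."""
    true_pos = len(predicted & gold)
    precision = true_pos / len(predicted) if predicted else 0.0
    recall = true_pos / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    return precision, recall, 2 * precision * recall / (precision + recall)


# Invented gold annotations and tool outputs (hypothetical toy data).
gold = {("d1", "p.V600E"), ("d1", "c.35G>A"),
        ("d2", "rs113488022"), ("d2", "p.G12D")}
tool_a = {("d1", "p.V600E"), ("d2", "p.G12D")}                          # protein-level only
tool_b = {("d1", "c.35G>A"), ("d2", "rs113488022"), ("d2", "p.Q61X")}   # one false positive

combined = tool_a | tool_b  # assumed combination strategy: plain union

scores = {name: prf(pred, gold)
          for name, pred in [("tool_a", tool_a), ("tool_b", tool_b),
                             ("combined", combined)]}
for name, (p, r, f) in scores.items():
    print(f"{name}: precision={p:.2f} recall={r:.2f} F={f:.2f}")
```

    A union can only raise recall, while precision may fall because false positives accumulate; the combined F-measure improves when the tools miss different mentions, as in this toy case.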

  19. Dendrochemistry Application as a Tool for Biomonitoring Environmental Pollution of Mining Areas (Dendrochemical tracing and its application in reconstructing the history of heavy metal pollution in mining areas)

    Institute of Scientific and Technical Information of China (English)

    雷梅; 郭立新; 张山岭

    2011-01-01

    Trees growing naturally in mining areas can take up heavy metals from air, soil, and irrigation water. The pathways by which heavy metals enter the trees may be roots, bark, and leaves. After entering the trees, heavy metals are deposited in the tree rings during cambium development. The long-term fluctuations of element contents in tree rings can be analysed by a laser ablation system coupled to an inductively coupled plasma mass spectrometer. Therefore, the pollution processes of mining areas can be recorded by the variation of heavy metal concentrations in the tree rings, and the influential ratios of different pollution routes can be semi-quantitatively determined. Cases of applications of dendrochemistry in biomonitoring of air, water and soil pollution were reviewed in this article. Based on the point pollution characteristics of mining areas, the possibility of applying dendrochemistry for pollution monitoring was discussed. To improve the accuracy of dendrochemical research on atmospheric and soil pollution in mining areas, the factors that influence heavy metal concentrations in tree rings, such as tree species, selective uptake of elements by trees, climate variation and nutrient supply, were reviewed. Geostatistics, stable isotope tracers, and synchrotron radiation X-ray fluorescence can improve the accuracy of dendrochemistry for environmental pollution recording. An atmospheric dispersion model predicting ground-level concentrations from a point source of metal emissions, such as a smelter, can be calibrated on tree rings in order to reconstruct the spatial and temporal Pb-contamination pattern. Dating of the historical record has been achieved using isotopic analysis, for example 210Pb and 137Cs. With the development of analytical techniques for heavy metals in tree rings, dendrochemistry will play a more important role in pollution monitoring in mining areas. This review highlights the strong potential for dendrochemistry to

  20. Data mining for service

    CERN Document Server

    2014-01-01

    Virtually all nontrivial and modern service related problems and systems involve data volumes and types that clearly fall into what is presently meant as "big data", that is, are huge, heterogeneous, complex, distributed, etc. Data mining is a series of processes which include collecting and accumulating data, modeling phenomena, and discovering new information, and it is one of the most important steps to scientific analysis of the processes of services.  Data mining application in services requires a thorough understanding of the characteristics of each service and knowledge of the compatibility of data mining technology within each particular service, rather than knowledge only in calculation speed and prediction accuracy. Varied examples of services provided in this book will help readers understand the relation between services and data mining technology. This book is intended to stimulate interest among researchers and practitioners in the relation between data mining technology and its application to ...

  1. International symposium on mine rescue

    Energy Technology Data Exchange (ETDEWEB)

    1988-01-01

    To celebrate the 80th anniversary of creation of organized mine rescue activities in Poland, a symposium organized by the Central Mine Rescue Station in Bytom and the Association of Mining Engineers and Technicians in Katowice took place on 28-30 September 1987 in Szczyrk and was attended by 53 representatives of mine rescue services from 15 countries. Nineteen papers by Polish participants and 21 papers by foreign participants were presented on three main topics: modern methods of fighting mining hazards; lines of improvement for mine rescue services; modern mine rescue equipment. Polish-made equipment for mine rescue, hazard detection and combating hazards was exhibited. The Central Mine Rescue Station in Bytom presented a tender for such services as technical consulting, expertise, development of technology and direct participation in rescue operations. Participants to the symposium addressed the Organizing Committee of the World Mining Congress to include mine rescue issues in the program of the next Congress.

  2. Knowledge modeling of coal mining equipments based on ontology

    Science.gov (United States)

    Zhang, Baolong; Wang, Xiangqian; Li, Huizong; Jiang, Miaomiao

    2017-06-01

    The problems of information redundancy and sharing are universal in coal mining equipment management. In order to improve the efficiency with which knowledge of coal mining equipment is used, this paper proposes a new method of knowledge modeling based on ontology. On the basis of an analysis of the structures and internal relations of coal mining equipment knowledge, and taking OWL as the ontology construction language, the ontology model of coal mining equipment knowledge is built with the help of the Protégé 4.3 software tools. The knowledge description method will lay the foundation for highly effective knowledge management and sharing, which is very significant for improving the production management level of coal mining enterprises.

  3. COMPARISON ANALYSIS OF WEB USAGE MINING USING PATTERN RECOGNITION TECHNIQUES

    Directory of Open Access Journals (Sweden)

    Nanhay Singh

    2013-07-01

    Full Text Available Web usage mining is the application of data mining techniques to better serve the needs of web-based applications on a web site. In this paper, we analyze web usage mining by applying pattern recognition techniques to web log data. Pattern recognition is defined as the act of taking in raw data and taking an action based on the ‘category’ of the pattern. Web usage mining is divided into three parts: preprocessing, pattern discovery and pattern analysis. Further, this paper includes experimental work in which web log data is used. We have taken the web log data from the “NASA” web server and analyzed it with “Web Log Explorer”. Web Log Explorer is a web usage mining tool that plays a vital role in carrying out this work.

  4. WEB STRUCTURE MINING USING PAGERANK, IMPROVED PAGERANK – AN OVERVIEW

    Directory of Open Access Journals (Sweden)

    V. Lakshmi Praba

    2011-03-01

    Full Text Available Web mining is the extraction of interesting and potentially useful patterns and information from the Web. It covers Web documents, hyperlinks between documents, and usage logs of web sites. The significant tasks in web mining can be listed as information retrieval, information selection/extraction, generalization and analysis. Web information retrieval tools consider only the text on pages and ignore information in the links. The goal of Web structure mining is to generate a structural summary of the web. Web structure mining, which focuses on link information, addresses an important aspect of web data. This paper presents an overview of PageRank and Improved PageRank and how they work in web structure mining.
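
    As an illustration of how link information alone can rank pages, here is a minimal sketch of standard PageRank computed by power iteration. It does not reproduce the "Improved PageRank" variant, which the abstract does not specify; the damping factor of 0.85, the uniform handling of pages without out-links, and the four-page graph are assumptions chosen for illustration.

```python
def pagerank(links, damping=0.85, tol=1e-10, max_iter=200):
    """Standard PageRank by power iteration.

    links maps each page to the list of pages it links to.
    Returns a dict of page -> rank, with ranks summing to 1.
    """
    pages = sorted(set(links) | {t for targets in links.values() for t in targets})
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(max_iter):
        # Rank held by pages with no out-links is spread uniformly (assumption).
        dangling = sum(rank[p] for p in pages if not links.get(p))
        new = {p: (1.0 - damping) / n + damping * dangling / n for p in pages}
        for p in pages:
            targets = links.get(p, [])
            for t in targets:
                new[t] += damping * rank[p] / len(targets)
        delta = sum(abs(new[p] - rank[p]) for p in pages)
        rank = new
        if delta < tol:
            break
    return rank


# Hypothetical four-page web: C is linked to by A, B and D.
graph = {"A": ["B", "C"], "B": ["C"], "C": ["A"], "D": ["C"]}
ranks = pagerank(graph)
for page in sorted(ranks, key=ranks.get, reverse=True):
    print(page, round(ranks[page], 4))
```

    Page D receives no in-links, so its rank stays at the baseline (1 - damping) / n, while C collects rank from three pages and ends up highest.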

  5. Efficient mining of association rules based on gravitational search algorithm

    Directory of Open Access Journals (Sweden)

    Fariba Khademolghorani

    2011-07-01

    Full Text Available Association rule mining is one of the most widely used tools for discovering relationships among attributes in a database. Many algorithms have been introduced for discovering these rules. These algorithms have to mine association rules in two separate stages. Most of them mine occurrence rules that are easily predictable by the users. Therefore, this paper discusses the application of the gravitational search algorithm to discovering interesting association rules. This evolutionary algorithm is based on Newtonian gravity and the laws of motion. Furthermore, contrary to previous methods, the method proposed in this study is able to mine the best association rules without generating frequent itemsets and is independent of the minimum support and confidence values. A comparison with association rule mining based on particle swarm optimization shows that our method is successful.
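
    For readers unfamiliar with the two measures named in the abstract, the sketch below computes the support and confidence of a single rule over a toy transaction list. It does not implement the gravitational search algorithm or the paper's fitness function; the function names and the shopping-basket data are hypothetical.

```python
def support(itemset, transactions):
    """Fraction of transactions that contain every item in itemset."""
    itemset = frozenset(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)


def confidence(antecedent, consequent, transactions):
    """support(antecedent and consequent) / support(antecedent)."""
    base = support(antecedent, transactions)
    if base == 0:
        return 0.0
    return support(set(antecedent) | set(consequent), transactions) / base


# Invented toy transactions.
transactions = [frozenset(t) for t in (
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
)]

# Rule under test: {bread} -> {milk}
rule_support = support({"bread", "milk"}, transactions)
rule_confidence = confidence({"bread"}, {"milk"}, transactions)
print(rule_support, round(rule_confidence, 3))
```

    Classical two-stage miners first filter itemsets by a minimum support and then filter rules by a minimum confidence; a search-based method could instead score each candidate rule directly from quantities like these.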

  6. Mining Land Subsidence Monitoring Using SENTINEL-1 SAR Data

    Science.gov (United States)

    Yuan, W.; Wang, Q.; Fan, J.; Li, H.

    2017-09-01

    In this paper, the DInSAR technique was used to monitor land subsidence in a mining area. The study area was a coal mine area located in Yuanbaoshan District, Chifeng City, and Sentinel-1 data were used to carry out the DInSAR technique. We analyzed the interferometric results from Sentinel-1 data from December 2015 to May 2016. A comparison of the DInSAR results with the location of the mine on optical images shows that the DInSAR technique can effectively monitor the land subsidence caused by underground mining, and that it is an effective tool for law enforcement against over-mining.

  7. Statistical methods of estimating mining costs

    Science.gov (United States)

    Long, K.R.

    2011-01-01

    Until it was defunded in 1995, the U.S. Bureau of Mines maintained a Cost Estimating System (CES) for prefeasibility-type economic evaluations of mineral deposits and estimating costs at producing and non-producing mines. This system had a significant role in mineral resource assessments to estimate costs of developing and operating known mineral deposits and predicted undiscovered deposits. For legal reasons, the U.S. Geological Survey cannot update and maintain CES. Instead, statistical tools are under development to estimate mining costs from basic properties of mineral deposits such as tonnage, grade, mineralogy, depth, strip ratio, distance from infrastructure, rock strength, and work index. The first step was to reestimate "Taylor's Rule" which relates operating rate to available ore tonnage. The second step was to estimate statistical models of capital and operating costs for open pit porphyry copper mines with flotation concentrators. For a sample of 27 proposed porphyry copper projects, capital costs can be estimated from three variables: mineral processing rate, strip ratio, and distance from nearest railroad before mine construction began. Of all the variables tested, operating costs were found to be significantly correlated only with strip ratio.
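
    Reestimating a rule that relates operating rate to ore tonnage can be sketched as a regression in log space. The example assumes the relationship is a power law, rate = a * tonnage^b, which is the form in which Taylor's Rule is usually stated; the data below are synthetic and noise-free, and the coefficients 0.01 and 0.75 are arbitrary illustration values, not the reestimated ones from this study.

```python
import math


def fit_power_law(tonnage, rate):
    """Least-squares fit of log(rate) = log(a) + b * log(tonnage).

    Returns the pair (a, b) of the assumed model rate = a * tonnage**b.
    """
    xs = [math.log(t) for t in tonnage]
    ys = [math.log(r) for r in rate]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    b = sxy / sxx
    a = math.exp(mean_y - b * mean_x)
    return a, b


# Synthetic deposits: tonnage in tonnes, rate generated from an arbitrary power law.
tonnage = [1e6, 1e7, 1e8, 1e9]
rate = [0.01 * t ** 0.75 for t in tonnage]

a, b = fit_power_law(tonnage, rate)
print(f"a = {a:.4f}, b = {b:.4f}")
```

    With real project data the points would scatter around the fitted line, and the residual variance in log space would indicate how much operating rate varies between deposits of the same size.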

  8. Seismic surveying for coal mine planning

    Energy Technology Data Exchange (ETDEWEB)

    Zhou, B. [CMTE/CSIRO Exploration and Mining, Kenmore, Qld. (Australia)

    2002-07-01

    More and more coal in Australia is extracted by underground mining methods especially by longwall mining. These methods can be particularly sensitive to relatively small-scale structural discontinuities and variations in roof and floor rock character. Traditionally, information on these features has been obtained through drilling. However, this is an expensive process and its relevance is limited to the immediate neighbourhood of the boreholes. Seismic surveying, especially by 3D seismic, is an alternative tool for geological structure delineation. It is one of the most effective geophysical methods available for identification of geological structures such as faults, folds, washouts, seam splits and thickness changes which are normally associated with potential mining hazards. Seismic data even can be used for stratigraphic identification. The information extracted from seismic data can be integrated into mine planning and design. In this paper, computer aided interpretation techniques for maximising the information from seismic data are demonstrated and the ability of seismic reflection methods to resolve localised geological features illustrated. Both synthetic and real seismic data obtained in recent 2D and 3D seismic surveys from Australian coal mines are used. 7 refs., 9 figs.

  9. The legalization of small scale mining in Colombia

    Directory of Open Access Journals (Sweden)

    Alexandra Urán

    2013-11-01

    Full Text Available This article presents conceptual and analytical elements that allow us to broaden the debate about the legalization of mining in Colombia. It looks for elements from which to propose alternatives for consolidating a new mining process that is socially and environmentally sustainable and that asserts the value of ancestral practices and redistributive forms of production. To this end, it is necessary to start with a discussion of the concepts of formalization and legality, so that we can generate a theoretical framework for exploring such a delicate matter; we then set out the socio-political framework on which that strategy is based. Next comes a brief account of the context of the reformulation of legal mining, focusing particularly on the standards that involve or affect artisanal and/or small-scale mining. There we encounter a debate on the typology of mining and the current difficulties in defining schemas for the legalization or formalization of small-scale mining in Colombia. We conclude with a proposal for formalization as a mechanism of transition to an administrative-legislative system that connects more effectively with the realities and skills of the ethnic communities that practice small-scale mining.

  10. Radioecological challenges for mining

    Energy Technology Data Exchange (ETDEWEB)

    Vesterbacka, P.; Ikaeheimonen, T.K.; Solatie, D. [Radiation and Nuclear Safety Authority (Finland)

    2014-07-01

    In Finland, mining became popular in the mid-1990s, when amendments to the mining law made mining activities easier for foreign companies. The price of minerals also rose and mining in Finland became economically profitable. The expanding mining industry brought new challenges for radiation safety, since radioactive substances occur in nearly all minerals. In Finnish soil and bedrock the average crustal abundances of uranium and thorium are 2.8 ppm and 10 ppm, respectively. It cannot be predicted beforehand how radionuclides behave in the mining processes, which is why they need to be taken into account in mining activities. The Radiation and Nuclear Safety Authority (STUK) has issued a national guide, ST 12.1, based on the Finnish Radiation Act. The guide sets the limits for radiation doses to the public, including those from mining activities. In general, no measures to limit the radiation exposure are needed if the dose from the operation liable to cause exposure to natural radiation is no greater than 0.1 mSv per year above the natural background radiation dose. If the exposure of the public may be higher than 0.1 mSv per year, the responsible party must provide STUK with a plan describing the measures by which the radiation exposure is to be kept as low as is reasonably achievable. In that case the responsible mining company has to carry out a radiological baseline study. The baseline study must focus on the environment that the mining activities may impact. The study describes the occurrence of natural radioactivity in the environment before any mining activities are started. The baseline study usually lasts for two to three years in natural circumstances. Based on the baseline study measurements, detailed information on the existing levels of radioactivity in the environment can be obtained. Once the mining activities begin, it is important that the limits are set for the wastewater discharges to the environment and environmental surveillance in the vicinity of

  11. Acid mine drainage risks - A modeling approach to siting mine facilities in Northern Minnesota USA

    Science.gov (United States)

    Myers, Tom

    2016-02-01

    Most watershed-scale planning for mine-caused contamination concerns remediation of past problems while future planning relies heavily on engineering controls. As an alternative, a watershed scale groundwater fate and transport model for the Rainy Headwaters, a northeastern Minnesota watershed, has been developed to examine the risks of leaks or spills to a pristine downstream watershed. The model shows that the risk depends on the location and whether the source of the leak is on the surface or from deeper underground facilities. Underground sources cause loads that last longer but arrive at rivers after a longer travel time and have lower concentrations due to dilution and attenuation. Surface contaminant sources could cause much more short-term damage to the resource. Because groundwater dominates baseflow, mine contaminant seepage would cause the most damage during low flow periods. Groundwater flow and transport modeling is a useful tool for decreasing the risk to downgradient sources by aiding in the placement of mine facilities. Although mines are located based on the minerals, advance planning and analysis could avoid siting mine facilities where failure or leaks would cause too much natural resource damage. Watershed scale transport modeling could help locate the facilities or decide in advance that the mine should not be constructed due to the risk to downstream resources.

  12. Text mining in cancer gene and pathway prioritization.

    Science.gov (United States)

    Luo, Yuan; Riedlinger, Gregory; Szolovits, Peter

    2014-01-01

    Prioritization of cancer implicated genes has received growing attention as an effective way to reduce wet lab cost by computational analysis that ranks candidate genes according to the likelihood that experimental verifications will succeed. A multitude of gene prioritization tools have been developed, each integrating different data sources covering gene sequences, differential expressions, function annotations, gene regulations, protein domains, protein interactions, and pathways. This review places existing gene prioritization tools against the backdrop of an integrative Omic hierarchy view toward cancer and focuses on the analysis of their text mining components. We explain the relatively slow progress of text mining in gene prioritization, identify several challenges to current text mining methods, and highlight a few directions where more effective text mining algorithms may improve the overall prioritization task and where prioritizing the pathways may be more desirable than prioritizing only genes.

  13. Application of text mining in the biomedical domain.

    Science.gov (United States)

    Fleuren, Wilco W M; Alkema, Wynand

    2015-03-01

    In recent years the amount of experimental data that is produced in biomedical research and the number of papers that are being published in this field have grown rapidly. In order to keep up to date with developments in their field of interest and to interpret the outcome of experiments in light of all available literature, researchers turn more and more to the use of automated literature mining. As a consequence, text mining tools have evolved considerably in number and quality and nowadays can be used to address a variety of research questions ranging from de novo drug target discovery to enhanced biological interpretation of the results from high throughput experiments. In this paper we introduce the most important techniques that are used for text mining and give an overview of the text mining tools that are currently being used and the types of problems they are typically applied to.

  14. Automatic detection of referral patients due to retinal pathologies through data mining.

    Science.gov (United States)

    Quellec, Gwenolé; Lamard, Mathieu; Erginay, Ali; Chabouis, Agnès; Massin, Pascale; Cochener, Béatrice; Cazuguel, Guy

    2016-04-01

    With the increased prevalence of retinal pathologies, automating the detection of these pathologies is becoming more and more relevant. In the past few years, many algorithms have been developed for the automated detection of a specific pathology, typically diabetic retinopathy, using eye fundus photography. No matter how good these algorithms are, we believe many clinicians would not use automatic detection tools focusing on a single pathology and ignoring any other pathology present in the patient's retinas. To solve this issue, an algorithm for characterizing the appearance of abnormal retinas, as well as the appearance of the normal ones, is presented. This algorithm does not focus on individual images: it considers examination records consisting of multiple photographs of each retina, together with contextual information about the patient. Specifically, it relies on data mining in order to learn diagnosis rules from characterizations of fundus examination records. The main novelty is that the content of examination records (images and context) is characterized at multiple levels of spatial and lexical granularity: 1) spatial flexibility is ensured by an adaptive decomposition of composite retinal images into a cascade of regions, 2) lexical granularity is ensured by an adaptive decomposition of the feature space into a cascade of visual words. This multigranular representation allows for great flexibility in automatically characterizing normality and abnormality: it is possible to generate diagnosis rules whose precision and generalization ability can be traded off depending on data availability. A variation on usual data mining algorithms, originally designed to mine static data, is proposed so that contextual and visual data at adaptive granularity levels can be mined. This framework was evaluated in e-ophtha, a dataset of 25,702 examination records from the OPHDIAT screening network, as well as in the publicly-available Messidor dataset. 
It was successfully

  15. Planetary science and exploration in the deep subsurface: results from the MINAR Program, Boulby Mine, UK

    Science.gov (United States)

    Payler, Samuel J.; Biddle, Jennifer F.; Coates, Andrew J.; Cousins, Claire R.; Cross, Rachel E.; Cullen, David C.; Downs, Michael T.; Direito, Susana O. L.; Edwards, Thomas; Gray, Amber L.; Genis, Jac; Gunn, Matthew; Hansford, Graeme M.; Harkness, Patrick; Holt, John; Josset, Jean-Luc; Li, Xuan; Lees, David S.; Lim, Darlene S. S.; McHugh, Melissa; McLuckie, David; Meehan, Emma; Paling, Sean M.; Souchon, Audrey; Yeoman, Louise; Cockell, Charles S.

    2017-04-01

    The subsurface exploration of other planetary bodies can be used to unravel their geological history and assess their habitability. On Mars in particular, present-day habitable conditions may be restricted to the subsurface. Using a deep subsurface mine, we carried out a program of extraterrestrial analog research - MINe Analog Research (MINAR). MINAR aims to carry out the scientific study of the deep subsurface and test instrumentation designed for planetary surface exploration by investigating deep subsurface geology, whilst establishing the potential this technology has to be transferred into the mining industry. An integrated multi-instrument suite was used to investigate samples of representative evaporite minerals from a subsurface Permian evaporite sequence, in particular to assess mineral and elemental variations which provide small-scale regions of enhanced habitability. The instruments used were the Panoramic Camera emulator, Close-Up Imager, Raman spectrometer, Small Planetary Linear Impulse Tool, Ultrasonic drill and handheld X-ray diffraction (XRD). We present science results from the analog research and show that these instruments can be used to investigate in situ the geological context and mineralogical variations of a deep subsurface environment, and thus habitability, from millimetre to metre scales. We also show that these instruments are complementary. For example, the identification of primary evaporite minerals such as NaCl and KCl, which are difficult to detect by portable Raman spectrometers, can be accomplished with XRD. By contrast, Raman is highly effective at locating and detecting mineral inclusions in primary evaporite minerals. MINAR demonstrates the effective use of a deep subsurface environment for planetary instrument development, understanding the habitability of extreme deep subsurface environments on Earth and other planetary bodies, and advancing the use of space technology in economic mining.

  16. Internal Fraud Risk Reduction: Results of a Data Mining Case Study

    OpenAIRE

    Jans, Mieke; LYBAERT, Nadine; Vanhoof, Koen

    2008-01-01

    Corporate fraud these days represents a huge cost to our economy. Academic literature already concentrated on how data mining techniques can be of value in the fight against fraud. All this research focusses on fraud detection, mostly in a context of external fraud. In this paper we discuss the use of a data mining approach to reduce the risk of internal fraud. Reducing fraud risk comprehends both detection and prevention, and therefore we apply descriptive data mining as opposed to the wi...

  17. Determining change in managerial practices implementation readiness in a gold mine / Yolandie Jansen van Vuuren

    OpenAIRE

    Jansen van Vuuren, Yolandie

    2011-01-01

    Performance Effectiveness Appraisal, Performance Effectiveness Review as well as Coaching and training are crucial elements of the performance management process included in a set of managerial competencies required by managers in a deep level gold mine in South Africa. This dissertation provides a contemporary review of performance management and gold mining in a South African deep level gold mine context. It provides a theoretical framework for the processes of Performance Effectiveness ...

  18. Context Construction Through Competition

    DEFF Research Database (Denmark)

    Kjær, Poul F.

    2015-01-01

    This article examines the relationship between the evolution of statehood and competition in the European context. To begin with, a particular take on the evolution of modern political power in the state form in Europe is developed. Against this background, the article reconstructs how the institutionalization of competition as a specific type of policy tool has been used by emerging modern states to establish their authority vis-à-vis competing claims to public authority in society. The article, furthermore, engages in an examination of (neo-)corporatist and governance-based attempts both to curb and to expand the use of competition as a tool for organizing social processes, and the implications of these attempts for the state of statehood.

  19. Gold-Mining

    DEFF Research Database (Denmark)

    Raaballe, J.; Grundy, B.D.

    2002-01-01

    of operating gold mines. Asymmetric information on the reserves in the mine implies that, at a high enough price of gold, the manager of high type finds the extraction value of the company to be higher than the current market value of the non-operating gold mine. Due to this undervaluation the maxim of market...... value maximization forces the manager of high type to extract the gold. The implications are three-fold. First, all managers (except the lowest type) extract the gold too soon compared to the first-best policy of leaving the gold in the mine forever. Second, a manager of high type extracts the gold...... sooner than a manager of lower type. Third, a non-operating gold mine is valued as being of the lowest type in the pool and, all else equal, high-asymmetry mines are valued lower than low-asymmetry mines. In a qualitative sense these results are robust with respect to different assumptions (re cost...

  20. Implementation of Paste Backfill Mining Technology in Chinese Coal Mines

    OpenAIRE

    Qingliang Chang; Jianhang Chen; Huaqiang Zhou; Jianbiao Bai

    2014-01-01

    Implementation of clean mining technology at coal mines is crucial to protect the environment and maintain balance among energy resources, consumption, and ecology. After reviewing present coal clean mining technology, we introduce the technology principles and technological process of paste backfill mining in coal mines and discuss the components and features of backfill materials, the constitution of the backfill system, and the backfill process. Specific implementation of this technology a...

  1. CANINE: a robotic mine dog

    Science.gov (United States)

    Stancil, Brian A.; Hyams, Jeffrey; Shelley, Jordan; Babu, Kartik; Badino, Hernán.; Bansal, Aayush; Huber, Daniel; Batavia, Parag

    2013-01-01

    Neya Systems, LLC competed in the CANINE program sponsored by the U.S. Army Tank Automotive Research Development and Engineering Center (TARDEC) which culminated in a competition held at Fort Benning as part of the 2012 Robotics Rodeo. As part of this program, we developed a robot with the capability to learn and recognize the appearance of target objects, conduct an area search amid distractor objects and obstacles, and relocate the target object in the same way that Mine dogs and Sentry dogs are used within military contexts for exploration and threat detection. Neya teamed with the Robotics Institute at Carnegie Mellon University to develop vision-based solutions for probabilistic target learning and recognition. In addition, we used a Mission Planning and Management System (MPMS) to orchestrate complex search and retrieval tasks using a general set of modular autonomous services relating to robot mobility, perception and grasping.

  2. Arabic Question Answering System Based On Data Mining

    Directory of Open Access Journals (Sweden)

    Waheeb Ahmed

    2015-08-01

    Full Text Available In this study we describe an Arabic Question Answering (QA) system based on a data mining approach. The system employs text mining techniques to determine the likely answers to factoid questions. It depends mainly on the use of lexical information and does not apply any complex language processing tools such as named entity recognizers, parsers and ontologies. The system achieved an accuracy of 61.5.

  3. Text mining with R a tidy approach

    CERN Document Server

    Silge, Julia

    2017-01-01

    Much of the data available today is unstructured and text-heavy, making it challenging for analysts to apply their usual data wrangling and visualization tools. With this practical book, you'll explore text-mining techniques with tidytext, a package that authors Julia Silge and David Robinson developed using the tidy principles behind R packages like ggraph and dplyr. You'll learn how tidytext and other tidy tools in R can make text analysis easier and more effective. The authors demonstrate how treating text as data frames enables you to manipulate, summarize, and visualize characteristics of text. You'll also learn how to integrate natural language processing (NLP) into effective workflows. Practical code examples and data explorations will help you generate real insights from literature, news, and social media. Learn how to apply the tidy text format to NLP. Use sentiment analysis to mine the emotional content of text. Identify a document's most important terms with frequency measurements. E...
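
    The idea of identifying a document's most important terms with frequency measurements can be sketched with a small tf-idf computation. The book itself works in R with tidytext; the Python below is only an illustrative stand-in, and the function name `tf_idf`, the whitespace tokenizer and the three toy documents are assumptions made for this example, not material from the book.

```python
import math
from collections import Counter


def tf_idf(documents):
    """Return one {term: tf-idf score} dict per document.

    tf  = term count / total number of terms in the document
    idf = ln(number of documents / number of documents containing the term)
    """
    # Naive tokenizer (an assumption): lowercase and split on whitespace.
    tokenized = [doc.lower().split() for doc in documents]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term appears.
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    scores = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        scores.append({
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return scores


# Hypothetical toy corpus.
docs = [
    "the mine produces gold",
    "the mine produces coal",
    "the text mentions gold and coal",
]
scores = tf_idf(docs)
```

    A term that occurs in every document (here "the") scores zero, while a term confined to one document scores highest within it, which is the behaviour that makes tf-idf useful for ranking terms.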

  4. Data Mining in Course Management Systems: Moodle Case Study and Tutorial

    Science.gov (United States)

    Romero, Cristobal; Ventura, Sebastian; Garcia, Enrique

    2008-01-01

    Educational data mining is an emerging discipline, concerned with developing methods for exploring the unique types of data that come from the educational context. This work is a survey of the specific application of data mining in learning management systems and a case study tutorial with the Moodle system. Our objective is to introduce it both…

  5. Data Mining in Course Management Systems: Moodle Case Study and Tutorial

    Science.gov (United States)

    Romero, Cristobal; Ventura, Sebastian; Garcia, Enrique

    2008-01-01

    Educational data mining is an emerging discipline, concerned with developing methods for exploring the unique types of data that come from the educational context. This work is a survey of the specific application of data mining in learning management systems and a case study tutorial with the Moodle system. Our objective is to introduce it both…

  6. The Traversella mining site as Piedmont geosite

    Science.gov (United States)

    Costa, Emanuele; Benna, Piera; Antonella Dino, Giovanna; Rossetti, Piergiorgio

    2017-04-01

    room and canteens, etc.); such signs are the tangible trace of a remarkable industrial activity and can be considered part of the cultural heritage of historical industrial activities ("industrial archeology"). To enrich this cultural heritage, a museum exhibiting minerals and mining tools is still active. Furthermore, as evidence of the importance of the Traversella mining site, outstanding mineralogical samples from the Traversella area are displayed in the most famous museums all over the world. The present research aims to emphasize the extraordinary importance of this mining site from both a scientific and a historical point of view, examining the methods and the amount of production during the last three centuries, and highlighting how these activities contributed to the industrial development of the surrounding area and of the whole Piedmont Region. We also want to illustrate the sociological and environmental impact of mining activities at the regional level, highlighting the importance of the site from a geotouristic point of view, thanks to the cultural exploitation of the mining site remains, the development and upgrading of the existing mining museum, and the organization of a geotouristic itinerary.

  7. High content image cytometry in the context of subnuclear organization.

    Science.gov (United States)

    De Vos, W H; Van Neste, L; Dieriks, B; Joss, G H; Van Oostveldt, P

    2010-01-01

    The organization of proteins in space and time is essential to their function. To accurately quantify subcellular protein characteristics in a population of cells with regard for the stochasticity of events in a natural context, there is a fast-growing need for image-based cytometry. Simultaneously, the massive amount of data that is generated by image-cytometric analyses calls for tools that enable pattern recognition and automated classification. In this article, we present a general approach for multivariate phenotypic profiling of individual cell nuclei and quantification of subnuclear spots using automated fluorescence mosaic microscopy, optimized image processing tools, and supervised classification. We demonstrate the efficiency of our analysis by determination of differential DNA damage repair patterns in response to genotoxic stress and radiation, and we show the potential of data mining in pinpointing specific phenotypes after transient transfection. The presented approach allowed for systematic analysis of subnuclear features in large image data sets and accurate classification of phenotypes at the level of the single cell. Consequently, this type of nuclear fingerprinting shows potential for high-throughput applications, such as functional protein assays or drug compound screening.

  8. 30 CFR 56.3202 - Scaling tools.

    Science.gov (United States)

    2010-07-01

    ... 30 Mineral Resources 1 2010-07-01 2010-07-01 false Scaling tools. 56.3202 Section 56.3202 Mineral... HEALTH SAFETY AND HEALTH STANDARDS-SURFACE METAL AND NONMETAL MINES Ground Control Scaling and Support § 56.3202 Scaling tools. Where manual scaling is performed, a scaling bar shall be provided. This...

  9. 30 CFR 57.3202 - Scaling tools.

    Science.gov (United States)

    2010-07-01

    ... 30 Mineral Resources 1 2010-07-01 2010-07-01 false Scaling tools. 57.3202 Section 57.3202 Mineral... HEALTH SAFETY AND HEALTH STANDARDS-UNDERGROUND METAL AND NONMETAL MINES Ground Control Scaling and Support-Surface and Underground § 57.3202 Scaling tools. Where manual scaling is performed, a scaling...

  10. A Data Mining Approach to Intelligence Operations

    DEFF Research Database (Denmark)

    Memon, Nasrullah; Hicks, David; Harkiolakis, Nicholas

    2008-01-01

    agencies. An emphasis in the paper is placed on Social Network Analysis and Investigative Data Mining, and the use of these technologies in the counterterrorism domain. Tools and techniques from both areas are described, along with the important tasks for which they can be used to assist...... with the investigation and analysis of terrorist organizations. The process of collecting data about these organizations is also considered along with the inherent difficulties that are involved....

  11. Understanding Customers' Evaluations Through Mining Airline Reviews

    OpenAIRE

    Ibrahim Yakut; Tugba Turkoglu; Fikriye Yakut

    2015-01-01

    Data mining can be evaluated as a strategic tool to determine the customer profiles in order to learn customer expectations and requirements. Airline customers have different characteristics and if passenger reviews about their trip experiences are correctly analyzed, companies can increase customer satisfaction by improving provided services. In this study, we investigate customer review data for in-flight services of airline companies and draw customer models with respect to suc...

  12. Design pattern mining using distributed learning automata and DNA sequence alignment.

    Directory of Open Access Journals (Sweden)

    Mansour Esmaeilpour

    Full Text Available CONTEXT: Over the last decade, design patterns have been used extensively to generate reusable solutions to frequently encountered problems in software engineering and object-oriented programming. A design pattern is a repeatable software design solution that provides a template for solving various instances of a general problem. OBJECTIVE: This paper describes a new method for pattern mining that isolates design patterns and the relationships between them, and a related tool, DLA-DNA, for all implemented patterns and all projects used for evaluation. DLA-DNA, which is based on distributed learning automata (DLA) and deoxyribonucleic acid (DNA) sequence alignment, achieves acceptable precision and recall compared with the other evaluated tools. METHOD: The proposed method mines structural design patterns in object-oriented source code and extracts the strong and weak relationships between them, enabling analyzers and programmers to determine the dependency rate of each object, component, and other section of the code for parameter passing and modular programming. The proposed model detects design patterns, and the strengths of their relationships, better than the other available tools, namely Pinot, PTIDEJ and DPJF. RESULTS: The results demonstrate that, whether or not the source code is built in a standard way with respect to the design patterns, the results of the proposed method are close to those of DPJF and better than those of Pinot and PTIDEJ. The proposed model was tested on several source codes and compared with other related models and available tools. On average, the precision and recall of the proposed method are 20% and 9.6% higher than Pinot, 27% and 31% higher than PTIDEJ, and 3.3% and 2% higher than DPJF, respectively. CONCLUSION: The proposed method is organized in two steps: in the first step, elemental design patterns are identified; in the second step, they are composed to recognize actual design patterns.

  13. Design Pattern Mining Using Distributed Learning Automata and DNA Sequence Alignment

    Science.gov (United States)

    Esmaeilpour, Mansour; Naderifar, Vahideh; Shukur, Zarina

    2014-01-01

    Context Over the last decade, design patterns have been used extensively to generate reusable solutions to frequently encountered problems in software engineering and object-oriented programming. A design pattern is a repeatable software design solution that provides a template for solving various instances of a general problem. Objective This paper describes a new method for pattern mining that isolates design patterns and the relationships between them, and a related tool, DLA-DNA, for all implemented patterns and all projects used for evaluation. DLA-DNA, which is based on distributed learning automata (DLA) and deoxyribonucleic acid (DNA) sequence alignment, achieves acceptable precision and recall compared with the other evaluated tools. Method The proposed method mines structural design patterns in object-oriented source code and extracts the strong and weak relationships between them, enabling analyzers and programmers to determine the dependency rate of each object, component, and other section of the code for parameter passing and modular programming. The proposed model detects design patterns, and the strengths of their relationships, better than the other available tools, namely Pinot, PTIDEJ and DPJF. Results The results demonstrate that, whether or not the source code is built in a standard way with respect to the design patterns, the results of the proposed method are close to those of DPJF and better than those of Pinot and PTIDEJ. The proposed model was tested on several source codes and compared with other related models and available tools. On average, the precision and recall of the proposed method are 20% and 9.6% higher than Pinot, 27% and 31% higher than PTIDEJ, and 3.3% and 2% higher than DPJF, respectively. Conclusion The proposed method is organized in two steps: in the first step, elemental design patterns are identified; in the second step, they are composed to recognize actual design patterns. PMID:25243670
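
    The precision and recall figures quoted in the two records above can be read with the standard definitions in mind. The following is a minimal sketch of how such scores are computed for a pattern detector; the helper `precision_recall` and the (class, pattern) pairs are hypothetical illustrations and are not taken from the paper or from DLA-DNA.

```python
def precision_recall(detected, actual):
    """Compute precision and recall for a set of detected pattern instances.

    precision = correct detections / all detections
    recall    = correct detections / all instances that really exist
    """
    detected, actual = set(detected), set(actual)
    true_positives = len(detected & actual)
    precision = true_positives / len(detected) if detected else 0.0
    recall = true_positives / len(actual) if actual else 0.0
    return precision, recall


# Hypothetical ground truth and tool output as (class, pattern) pairs;
# these names are invented for illustration only.
actual = {
    ("Logger", "Singleton"),
    ("ShapeFactory", "Factory Method"),
    ("TreeNode", "Composite"),
    ("EventBus", "Observer"),
}
detected = {
    ("Logger", "Singleton"),
    ("TreeNode", "Composite"),
    ("Cache", "Singleton"),  # a false positive
}
precision, recall = precision_recall(detected, actual)
```

    In this toy case two of the three detections are correct and two of the four real instances are found, so a tool can score well on one measure and poorly on the other, which is why the abstract reports both.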

  14. International mining forum 2004, new technologies in underground mining, safety in mines proceedings

    Energy Technology Data Exchange (ETDEWEB)

    Jerzy Kicki; Eugeniusz Sobczyk (eds.)

    2004-01-15

    The book comprises technical papers that were presented at the International Mining Forum 2004. This event aims to bring together scientists and engineers in mining, rock mechanics, and computer engineering, with a view to exploring and discussing international developments in the field. Topics discussed in this book are: trends in the mining industry; new solutions and tendencies in underground mines; rock engineering problems in underground mines; utilization and exploitation of methane; prevention measures for the control of rock bursts in Polish mines; and current problems in Ukrainian coal mines.

  15. SME mining engineering handbook

    National Research Council Canada - National Science Library

    Darling, Peter

    2011-01-01

    ...) 948-4200 / (800) 763-3132 www.smenet.org SME advances the worldwide mining and minerals community through information exchange and professional development. With members in more than 70 countrie...

  16. Ensemble Data Mining Methods

    Data.gov (United States)

    National Aeronautics and Space Administration — Ensemble Data Mining Methods, also known as Committee Methods or Model Combiners, are machine learning methods that leverage the power of multiple models to achieve...
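
    One simple way several models can be combined is majority voting, sketched below. This is only an assumed illustration of the general idea of a model combiner, not the method described in the NASA record; the function `majority_vote` and the three toy classifiers are hypothetical.

```python
from collections import Counter


def majority_vote(models, x):
    """Return the label predicted by most of the models for input x."""
    votes = [model(x) for model in models]
    # most_common(1) gives the (label, count) pair with the highest count.
    return Counter(votes).most_common(1)[0][0]


# Three deliberately simple, hypothetical classifiers for an integer input.
def by_threshold(x):
    return "high" if x > 10 else "low"


def by_parity(x):
    return "high" if x % 2 == 0 else "low"


def by_magnitude(x):
    return "high" if x > 100 else "low"


models = [by_threshold, by_parity, by_magnitude]
label_12 = majority_vote(models, 12)  # votes: high, high, low
label_7 = majority_vote(models, 7)    # votes: low, low, low
```

    With an odd number of two-class voters there can be no tie; with more classes or an even number of models a tie-breaking rule would have to be chosen.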

  17. Acid mine drainage

    Science.gov (United States)

    Bigham, Jerry M.; Cravotta, Charles A.

    2016-01-01

    Acid mine drainage (AMD) consists of metal-laden solutions produced by the oxidative dissolution of iron sulfide minerals exposed to air, moisture, and acidophilic microbes during the mining of coal and metal deposits. The pH of AMD is usually in the range of 2–6, but mine-impacted waters at circumneutral pH (5–8) are also common. Mine drainage usually contains elevated concentrations of sulfate, iron, aluminum, and other potentially toxic metals leached from rock that hydrolyze and coprecipitate to form rust-colored encrustations or sediments. When AMD is discharged into surface waters or groundwaters, degradation of water quality, injury to aquatic life, and corrosion or encrustation of engineered structures can occur for substantial distances. Prevention and remediation strategies should consider the biogeochemical complexity of the system, the longevity of AMD pollution, the predictive power of geochemical modeling, and the full range of available field technologies for problem mitigation.

  18. COLOMBIAN MINING SUSTAINABILITY

    National Research Council Canada - National Science Library

    JORGE MARTÍN MOLINA ESCOBAR; OSCAR JAIME RESTREPO BAENA

    2010-01-01

    This is a brief reference to mining sustainability in Colombia. First, it discusses the concept of sustainability and its relationship to mineral resources, covering topics of ethical, economical and environmental issues...

  19. Mining and robotized equipment

    Energy Technology Data Exchange (ETDEWEB)

    Krisztian, B.

    1984-01-01

    The general concepts concerning the expediency of using industrial robots (PR) in mining and the most rational fields for their use are presented. The achievements in creating industrial robots for the needs of the mining industry in the USSR, Sweden (the ASEA Company), the United States (the Westinghouse Electric and Cincinnati Milacron Companies) and Japan (the Fujitsu Fanuc Company) are noted. The need, in a number of cases, for a fundamental restructuring of production processes ahead of the planned introduction of industrial robots in mining enterprises is stressed. The changes that must be made to industrial process automation systems when industrial robots are introduced into them are also discussed. In conclusion, the prospects for the development, creation and introduction of industrial robots in the Hungarian (VNR) mining industry are indicated.

  20. Data Mining of Macromolecular Structures.

    Science.gov (United States)

    van Beusekom, Bart; Perrakis, Anastassis; Joosten, Robbie P

    2016-01-01

    The use of macromolecular structures is widespread for a variety of applications, from teaching protein structure principles all the way to ligand optimization in drug development. Applying data mining techniques on these experimentally determined structures requires a highly uniform, standardized structural data source. The Protein Data Bank (PDB) has evolved over the years toward becoming the standard resource for macromolecular structures. However, the process of selecting the data most suitable for specific applications is still very much based on personal preferences and understanding of the experimental techniques used to obtain these models. In this chapter, we will first explain the challenges with data standardization, annotation, and uniformity in the PDB entries determined by X-ray crystallography. We then discuss the specific effect that crystallographic data quality and model optimization methods have on structural models and how validation tools can be used to make informed choices. We also discuss specific advantages of using the PDB_REDO databank as a resource for structural data. Finally, we will provide guidelines on how to select the most suitable protein structure models for detailed analysis and how to select a set of structure models suitable for data mining.