WorldWideScience

Sample records for text mining-based approach

  1. Argo: an integrative, interactive, text mining-based workbench supporting curation

    Science.gov (United States)

    Rak, Rafal; Rowley, Andrew; Black, William; Ananiadou, Sophia

    2012-01-01

    Curation of biomedical literature is often supported by the automatic analysis of textual content that generally involves a sequence of individual processing components. Text mining (TM) has been used to enhance the process of manual biocuration, but has been focused on specific databases and tasks rather than an environment integrating TM tools into the curation pipeline, catering for a variety of tasks, types of information and applications. Processing components usually come from different sources and often lack interoperability. The well established Unstructured Information Management Architecture is a framework that addresses interoperability by defining common data structures and interfaces. However, most of the efforts are targeted towards software developers and are not suitable for curators, or are otherwise inconvenient to use on a higher level of abstraction. To overcome these issues we introduce Argo, an interoperable, integrative, interactive and collaborative system for text analysis with a convenient graphic user interface to ease the development of processing workflows and boost productivity in labour-intensive manual curation. Robust, scalable text analytics follow a modular approach, adopting component modules for distinct levels of text analysis. The user interface is available entirely through a web browser that saves the user from going through often complicated and platform-dependent installation procedures. Argo comes with a predefined set of processing components commonly used in text analysis, while giving the users the ability to deposit their own components. The system accommodates various areas and levels of user expertise, from TM and computational linguistics to ontology-based curation. One of the key functionalities of Argo is its ability to seamlessly incorporate user-interactive components, such as manual annotation editors, into otherwise completely automatic pipelines. As a use case, we demonstrate the functionality of an in

  2. Fuzzy OLAP association rules mining-based modular reinforcement learning approach for multiagent systems.

    Science.gov (United States)

    Kaya, Mehmet; Alhajj, Reda

    2005-04-01

    Multiagent systems and data mining have recently attracted considerable attention in the field of computing. Reinforcement learning is the most commonly used learning process for multiagent systems. However, it still has some drawbacks, including modeling other learning agents present in the domain as part of the state of the environment, and some states are experienced much less than others, or some state-action pairs are never visited during the learning phase. Further, before completing the learning process, an agent cannot exhibit a certain behavior in some states that may be experienced sufficiently. In this study, we propose a novel multiagent learning approach to handle these problems. Our approach is based on utilizing the mining process for modular cooperative learning systems. It incorporates fuzziness and online analytical processing (OLAP) based mining to effectively process the information reported by agents. First, we describe a fuzzy data cube OLAP architecture which facilitates effective storage and processing of the state information reported by agents. This way, the action of the other agent, not even in the visual environment. of the agent under consideration, can simply be predicted by extracting online association rules, a well-known data mining technique, from the constructed data cube. Second, we present a new action selection model, which is also based on association rules mining. Finally, we generalize not sufficiently experienced states, by mining multilevel association rules from the proposed fuzzy data cube. Experimental results obtained on two different versions of a well-known pursuit domain show the robustness and effectiveness of the proposed fuzzy OLAP mining based modular learning approach. Finally, we tested the scalability of the approach presented in this paper and compared it with our previous work on modular-fuzzy Q-learning and ordinary Q-learning.

  3. Text mining-based in silico drug discovery in oral mucositis caused by high-dose cancer therapy.

    Science.gov (United States)

    Kirk, Jon; Shah, Nirav; Noll, Braxton; Stevens, Craig B; Lawler, Marshall; Mougeot, Farah B; Mougeot, Jean-Luc C

    2018-02-23

    Oral mucositis (OM) is a major dose-limiting side effect of chemotherapy and radiation used in cancer treatment. Due to the complex nature of OM, currently available drug-based treatments are of limited efficacy. Our objectives were (i) to determine genes and molecular pathways associated with OM and wound healing using computational tools and publicly available data and (ii) to identify drugs formulated for topical use targeting the relevant OM molecular pathways. OM and wound healing-associated genes were determined by text mining, and the intersection of the two gene sets was selected for gene ontology analysis using the GeneCodis program. Protein interaction network analysis was performed using STRING-db. Enriched gene sets belonging to the identified pathways were queried against the Drug-Gene Interaction database to find drug candidates for topical use in OM. Our analysis identified 447 genes common to both the "OM" and "wound healing" text mining concepts. Gene enrichment analysis yielded 20 genes representing six pathways and targetable by a total of 32 drugs which could possibly be formulated for topical application. A manual search on ClinicalTrials.gov confirmed no relevant pathway/drug candidate had been overlooked. Twenty-five of the 32 drugs can directly affect the PTGS2 (COX-2) pathway, the pathway that has been targeted in previous clinical trials with limited success. Drug discovery using in silico text mining and pathway analysis tools can facilitate the identification of existing drugs that have the potential of topical administration to improve OM treatment.

  4. ASCOT: a text mining-based web-service for efficient search and assisted creation of clinical trials.

    Science.gov (United States)

    Korkontzelos, Ioannis; Mu, Tingting; Ananiadou, Sophia

    2012-04-30

    Clinical trials are mandatory protocols describing medical research on humans and among the most valuable sources of medical practice evidence. Searching for trials relevant to some query is laborious due to the immense number of existing protocols. Apart from search, writing new trials includes composing detailed eligibility criteria, which might be time-consuming, especially for new researchers. In this paper we present ASCOT, an efficient search application customised for clinical trials. ASCOT uses text mining and data mining methods to enrich clinical trials with metadata, that in turn serve as effective tools to narrow down search. In addition, ASCOT integrates a component for recommending eligibility criteria based on a set of selected protocols.

  5. ASCOT: a text mining-based web-service for efficient search and assisted creation of clinical trials

    Science.gov (United States)

    2012-01-01

    Clinical trials are mandatory protocols describing medical research on humans and among the most valuable sources of medical practice evidence. Searching for trials relevant to some query is laborious due to the immense number of existing protocols. Apart from search, writing new trials includes composing detailed eligibility criteria, which might be time-consuming, especially for new researchers. In this paper we present ASCOT, an efficient search application customised for clinical trials. ASCOT uses text mining and data mining methods to enrich clinical trials with metadata, that in turn serve as effective tools to narrow down search. In addition, ASCOT integrates a component for recommending eligibility criteria based on a set of selected protocols. PMID:22595088

  6. Grouping chemicals for health risk assessment: A text mining-based case study of polychlorinated biphenyls (PCBs).

    Science.gov (United States)

    Ali, Imran; Guo, Yufan; Silins, Ilona; Högberg, Johan; Stenius, Ulla; Korhonen, Anna

    2016-01-22

    As many chemicals act as carcinogens, chemical health risk assessment is critically important. A notoriously time consuming process, risk assessment could be greatly supported by classifying chemicals with similar toxicological profiles so that they can be assessed in groups rather than individually. We have previously developed a text mining (TM)-based tool that can automatically identify the mode of action (MOA) of a carcinogen based on the scientific evidence in literature, and it can measure the MOA similarity between chemicals on the basis of their literature profiles (Korhonen et al., 2009, 2012). A new version of the tool (2.0) was recently released and here we apply this tool for the first time to investigate and identify meaningful groups of chemicals for risk assessment. We used published literature on polychlorinated biphenyls (PCBs)-persistent, widely spread toxic organic compounds comprising of 209 different congeners. Although chemically similar, these compounds are heterogeneous in terms of MOA. We show that our TM tool, when applied to 1648 PubMed abstracts, produces a MOA profile for a subgroup of dioxin-like PCBs (DL-PCBs) which differs clearly from that for the rest of PCBs. This suggests that the tool could be used to effectively identify homogenous groups of chemicals and, when integrated in real-life risk assessment, could help and significantly improve the efficiency of the process. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  7. Working with text tools, techniques and approaches for text mining

    CERN Document Server

    Tourte, Gregory J L

    2016-01-01

    Text mining tools and technologies have long been a part of the repository world, where they have been applied to a variety of purposes, from pragmatic aims to support tools. Research areas as diverse as biology, chemistry, sociology and criminology have seen effective use made of text mining technologies. Working With Text collects a subset of the best contributions from the 'Working with text: Tools, techniques and approaches for text mining' workshop, alongside contributions from experts in the area. Text mining tools and technologies in support of academic research include supporting research on the basis of a large body of documents, facilitating access to and reuse of extant work, and bridging between the formal academic world and areas such as traditional and social media. Jisc have funded a number of projects, including NaCTem (the National Centre for Text Mining) and the ResDis programme. Contents are developed from workshop submissions and invited contributions, including: Legal considerations in te...

  8. Networks Models of Actin Dynamics during Spermatozoa Postejaculatory Life: A Comparison among Human-Made and Text Mining-Based Models

    Directory of Open Access Journals (Sweden)

    Nicola Bernabò

    2016-01-01

    Full Text Available Here we realized a networks-based model representing the process of actin remodelling that occurs during the acquisition of fertilizing ability of human spermatozoa (HumanMade_ActinSpermNetwork, HM_ASN. Then, we compared it with the networks provided by two different text mining tools: Agilent Literature Search (ALS and PESCADOR. As a reference, we used the data from the online repository Kyoto Encyclopaedia of Genes and Genomes (KEGG, referred to the actin dynamics in a more general biological context. We found that HM_ALS and the networks from KEGG data shared the same scale-free topology following the Barabasi-Albert model, thus suggesting that the information is spread within the network quickly and efficiently. On the contrary, the networks obtained by ALS and PESCADOR have a scale-free hierarchical architecture, which implies a different pattern of information transmission. Also, the hubs identified within the networks are different: HM_ALS and KEGG networks contain as hubs several molecules known to be involved in actin signalling; ALS was unable to find other hubs than “actin,” whereas PESCADOR gave some nonspecific result. This seems to suggest that the human-made information retrieval in the case of a specific event, such as actin dynamics in human spermatozoa, could be a reliable strategy.

  9. Text mining with R a tidy approach

    CERN Document Server

    Silge, Julia

    2017-01-01

    Much of the data available today is unstructured and text-heavy, making it challenging for analysts to apply their usual data wrangling and visualization tools. With this practical book, you'll explore text-mining techniques with tidytext, a package that authors Julia Silge and David Robinson developed using the tidy principles behind R packages like ggraph and dplyr. You'll learn how tidytext and other tidy tools in R can make text analysis easier and more effective. The authors demonstrate how treating text as data frames enables you to manipulate, summarize, and visualize characteristics of text. You'll also learn how to integrate natural language processing (NLP) into effective workflows. Practical code examples and data explorations will help you generate real insights from literature, news, and social media. Learn how to apply the tidy text format to NLP Use sentiment analysis to mine the emotional content of text Identify a document's most important terms with frequency measurements E...

  10. Texts, Transmissions, Receptions. Modern Approaches to Narratives

    NARCIS (Netherlands)

    Lardinois, A.P.M.H.; Levie, S.A.; Hoeken, H.; Lüthy, C.H.

    2015-01-01

    The papers collected in this volume study the function and meaning of narrative texts from a variety of perspectives. The word 'text' is used here in the broadest sense of the term: it denotes literary books, but also oral tales, speeches, newspaper articles and comics. One of the purposes of this

  11. Neural network approach to text processing

    Science.gov (United States)

    Sunthankar, S.

    1992-08-01

    There is a great need for fast accurate text retrieval systems to support many intelligent activities. The text search problem can be broken down into two main tasks; database searching and message routing. Database searching consists of searching through a large database of text from certain key words, phrases, or other simple functions of strings. Message routing is classifying incoming messages and sending them to the appropriate `mail box.'' These are actually very similar tasks. Both are really just pattern matching tasks. What matters are the methods used. In addition to searching and classifying, it would be nice to perform other tasks such as inferencing and prediction, so these are discussed briefly. We discuss and compare current leading edge solutions to this problem and introduce some new ideas based on recent neural network theories and experiments. All text-search and retrieval technology is predicted on the assumption that the semantic content of text can be predictd from its syntactic properties: specifically, the existence, frequency, or absence of certain character strings or words; the relationship clustering among words and phrases; the occurrence of particular patterns in particular fields within the document.

  12. Writing Treatment for Aphasia: A Texting Approach

    Science.gov (United States)

    Beeson, Pelagie M.; Higginson, Kristina; Rising, Kindle

    2013-01-01

    Purpose: Treatment studies have documented the therapeutic and functional value of lexical writing treatment for individuals with severe aphasia. The purpose of this study was to determine whether such retraining could be accomplished using the typing feature of a cellular telephone, with the ultimate goal of using text messaging for…

  13. Pedoinformatics Approach to Soil Text Analytics

    Science.gov (United States)

    Furey, J.; Seiter, J.; Davis, A.

    2017-12-01

    The several extant schema for the classification of soils rely on differing criteria, but the major soil science taxonomies, including the United States Department of Agriculture (USDA) and the international harmonized World Reference Base for Soil Resources systems, are based principally on inferred pedogenic properties. These taxonomies largely result from compiled individual observations of soil morphologies within soil profiles, and the vast majority of this pedologic information is contained in qualitative text descriptions. We present text mining analyses of hundreds of gigabytes of parsed text and other data in the digitally available USDA soil taxonomy documentation, the Soil Survey Geographic (SSURGO) database, and the National Cooperative Soil Survey (NCSS) soil characterization database. These analyses implemented iPython calls to Gensim modules for topic modelling, with latent semantic indexing completed down to the lowest taxon level (soil series) paragraphs. Via a custom extension of the Natural Language Toolkit (NLTK), approximately one percent of the USDA soil series descriptions were used to train a classifier for the remainder of the documents, essentially by treating soil science words as comprising a novel language. While location-specific descriptors at the soil series level are amenable to geomatics methods, unsupervised clustering of the occurrence of other soil science words did not closely follow the usual hierarchy of soil taxa. We present preliminary phrasal analyses that may account for some of these effects.

  14. Writing treatment for aphasia: a texting approach.

    Science.gov (United States)

    Beeson, Pélagie M; Higginson, Kristina; Rising, Kindle

    2013-06-01

    Treatment studies have documented the therapeutic and functional value of lexical writing treatment for individuals with severe aphasia. The purpose of this study was to determine whether such retraining could be accomplished using the typing feature of a cellular telephone, with the ultimate goal of using text messaging for communication. A 31-year-old man with persistent Broca's aphasia, severe apraxia of speech, global dysgraphia, and right hemiparesis participated in this study. Using a multiple baseline design, relearning and maintenance of single-word spellings (and oral naming) of targeted items were examined in response to traditional Copy and Recall Treatment (CART) for handwriting and a new paradigm using 1-handed typing on a cell phone keyboard (i.e., a texting version of CART referred to as T-CART). Marked improvements were documented in spelling and spoken naming trained in either modality, with stronger maintenance for handwriting than cell phone typing. Training resulted in functional use of texting that continued for 2 years after treatment. These results suggest that orthographic retraining using a cell phone keyboard has the potential to improve spelling knowledge and provide a means to improve functional communication skills. Combined training with both handwriting and cell phone typing should be considered in order to maximize the durability of treatment effects.

  15. Linguistic Deviances as a Stylistic Approach to Literary Texts: A ...

    African Journals Online (AJOL)

    ... of texts, show their functional significance for the interpretation of texts and relate literary effects to linguistics causes where they are felt to be relevant. This paper which focuses on Linguistic Deviances as a Stylistic Approach to Literary Texts: A Study of African Traditional Poem “Salute to the Elephant” shall be considered

  16. Comprehension des textes: Une demarche interactive (Text Comprehension: An Interactive Approach).

    Science.gov (United States)

    Cicurel, Francine

    1991-01-01

    An interactive approach to teaching French second-language reading comprehension is described. The method emphasizes involving the readers in the comprehension process and encouraging them to draw on prior learning to create hypotheses about the text's content. A four-stage instructional process is outlined. (MSE)

  17. Associated diacritical watermarking approach to protect sensitive arabic digital texts

    Science.gov (United States)

    Kamaruddin, Nurul Shamimi; Kamsin, Amirrudin; Hakak, Saqib

    2017-10-01

    Among multimedia content, one of the most predominant medium is text content. There have been lots of efforts to protect and secure text information over the Internet. The limitations of existing works have been identified in terms of watermark capacity, time complexity and memory complexity. In this work, an invisible digital watermarking approach has been proposed to protect and secure the most sensitive text i.e. Digital Holy Quran. The proposed approach works by XOR-ing only those Quranic letters that has certain diacritics associated with it. Due to sensitive nature of Holy Quran, diacritics play vital role in the meaning of the particular verse. Hence, securing letters with certain diacritics will preserve the original meaning of Quranic verses in case of alternation attempt. Initial results have shown that the proposed approach is promising with less memory complexity and time complexity compared to existing approaches.

  18. The semiotics of typography in literary texts. A multimodal approach

    DEFF Research Database (Denmark)

    Nørgaard, Nina

    2009-01-01

    to multimodal discourse proposed, for instance, by Kress & Van Leeuwen (2001) and Baldry & Thibault (2006), and, more specifically, the multimodal approach to typography suggested by Van Leeuwen (2005b; 2006), in order to sketch out a methodological framework applicable to the description and analysis...... of the semiotic potential of typography in literary texts....

  19. A concept-based approach to text categorization

    NARCIS (Netherlands)

    Schijvenaars, B.J.A.; Schuemie, M.J.; Mulligen, E.M. van; Weeber, M.; Jelier, R.; Mons, B.; Kors, J.A.; Kraaij, W.

    2005-01-01

    The Biosemantics group (Erasmus University Medical Center, Rotterdam) participated in the text categorization task of the Genomics Track. We followed a thesaurus-based approach, using the Collexis indexing system, in combination with a simple classification algorithm to assign a document to one of

  20. Building a glaucoma interaction network using a text mining approach.

    Science.gov (United States)

    Soliman, Maha; Nasraoui, Olfa; Cooper, Nigel G F

    2016-01-01

    The volume of biomedical literature and its underlying knowledge base is rapidly expanding, making it beyond the ability of a single human being to read through all the literature. Several automated methods have been developed to help make sense of this dilemma. The present study reports on the results of a text mining approach to extract gene interactions from the data warehouse of published experimental results which are then used to benchmark an interaction network associated with glaucoma. To the best of our knowledge, there is, as yet, no glaucoma interaction network derived solely from text mining approaches. The presence of such a network could provide a useful summative knowledge base to complement other forms of clinical information related to this disease. A glaucoma corpus was constructed from PubMed Central and a text mining approach was applied to extract genes and their relations from this corpus. The extracted relations between genes were checked using reference interaction databases and classified generally as known or new relations. The extracted genes and relations were then used to construct a glaucoma interaction network. Analysis of the resulting network indicated that it bears the characteristics of a small world interaction network. Our analysis showed the presence of seven glaucoma linked genes that defined the network modularity. A web-based system for browsing and visualizing the extracted glaucoma related interaction networks is made available at http://neurogene.spd.louisville.edu/GlaucomaINViewer/Form1.aspx. This study has reported the first version of a glaucoma interaction network using a text mining approach. The power of such an approach is in its ability to cover a wide range of glaucoma related studies published over many years. Hence, a bigger picture of the disease can be established. To the best of our knowledge, this is the first glaucoma interaction network to summarize the known literature. The major findings were a set of

  1. Approche globale de textes ecrits (A Global Approach to Written Texts)

    Science.gov (United States)

    Moirand, Sophie

    1976-01-01

    The hypothesis that beginners in a foreign language can learn to read specialized texts before they are able to express themselves adequately is explored. Examples of texts such as ads and news articles dealing with various specialties are given. Pedagogical procedures accompany each text. (Text is in French.) (AMH)

  2. Genre based Approach to Teach Writing Descriptive Text

    Directory of Open Access Journals (Sweden)

    Putu Ngurah Rusmawan

    2017-10-01

    Full Text Available This study aims to discuss how teaching and learning activities were carried out by using Genre based Approach in teaching writing descriptive text at junior high school. This study was conducted in the classroom of VII-1. Therefore, the appropriate design was qualitative research design. The subject of the study was the English teacher. To collect data, the researcher used observation and interview. The finding of the study described that the teaching and learning activities that were carried out by the teacher fulfilled the basic competencies. The teacher carried out the opening teaching activities by greeting, asking the students’ preparation during the lesson, checking the student’s attendance list, and informing the learning objective. The teacher carried out the main teaching activities by informing about how to write a descriptive text, giving, and asking opinions, eliciting the students’ understanding, prompting and directing to do exercises. The teacher carried out the closing teaching activities by directing the student to continue at home and eliciting the students’ reflection of what they could learn at that time.

  3. On the classification of emotional biosignals evoked while viewing affective pictures: an integrated data-mining-based approach for healthcare applications.

    Science.gov (United States)

    Frantzidis, Christos A; Bratsas, Charalampos; Klados, Manousos A; Konstantinidis, Evdokimos; Lithari, Chrysa D; Vivas, Ana B; Papadelis, Christos L; Kaldoudi, Eleni; Pappas, Costas; Bamidis, Panagiotis D

    2010-03-01

    Recent neuroscience findings demonstrate the fundamental role of emotion in the maintenance of physical and mental health. In the present study, a novel architecture is proposed for the robust discrimination of emotional physiological signals evoked upon viewing pictures selected from the International Affective Picture System (IAPS). Biosignals are multichannel recordings from both the central and the autonomic nervous systems. Following the bidirectional emotion theory model, IAPS pictures are rated along two dimensions, namely, their valence and arousal. Following this model, biosignals in this paper are initially differentiated according to their valence dimension by means of a data mining approach, which is the C4.5 decision tree algorithm. Then, the valence and the gender information serve as an input to a Mahalanobis distance classifier, which dissects the data into high and low arousing. Results are described in Extensible Markup Language (XML) format, thereby accounting for platform independency, easy interconnectivity, and information exchange. The average recognition (success) rate was 77.68% for the discrimination of four emotional states, differing both in their arousal and valence dimension. It is, therefore, envisaged that the proposed approach holds promise for the efficient discrimination of negative and positive emotions, and it is hereby discussed how future developments may be steered to serve for affective healthcare applications, such as the monitoring of the elderly or chronically ill people.

  4. Text Mining approaches for automated literature knowledge extraction and representation.

    Science.gov (United States)

    Nuzzo, Angelo; Mulas, Francesca; Gabetta, Matteo; Arbustini, Eloisa; Zupan, Blaz; Larizza, Cristiana; Bellazzi, Riccardo

    2010-01-01

    Due to the overwhelming volume of published scientific papers, information tools for automated literature analysis are essential to support current biomedical research. We have developed a knowledge extraction tool to help researcher in discovering useful information which can support their reasoning process. The tool is composed of a search engine based on Text Mining and Natural Language Processing techniques, and an analysis module which process the search results in order to build annotation similarity networks. We tested our approach on the available knowledge about the genetic mechanism of cardiac diseases, where the target is to find both known and possible hypothetical relations between specific candidate genes and the trait of interest. We show that the system i) is able to effectively retrieve medical concepts and genes and ii) plays a relevant role assisting researchers in the formulation and evaluation of novel literature-based hypotheses.

  5. Adjustable typography: an approach to enhancing low vision text accessibility.

    Science.gov (United States)

    Arditi, Aries

    2004-04-15

    Millions of people have low vision, a disability condition caused by uncorrectable or partially correctable disorders of the eye. The primary goal of low vision rehabilitation is increasing access to printed material. This paper describes how adjustable typography, a computer graphic approach to enhancing text accessibility, can play a role in this process, by allowing visually-impaired users to customize fonts to maximize legibility according to their own visual needs. Prototype software and initial testing of the concept is described. The results show that visually-impaired users tend to produce a variety of very distinct fonts, and that the adjustment process results in greatly enhanced legibility. But this initial testing has not yet demonstrated increases in legibility over and above the legibility of highly legible standard fonts such as Times New Roman.

  6. Trace of Knowledge: Benchmarking Novel Text Mining Based Measurements

    DEFF Research Database (Denmark)

    Woltmann, Sabrina

    2018-01-01

    basic and more advanced statistical learning tools from the field of computational linguistics and statistical learning to trace the knowledge fragments[2, 6]. In addition, we utilize a mixture of standard algebraic and probabilistic methods. Furthermore, pattern recognition, classification algorithms...

  7. Trace of Knowledge: Benchmarking Novel Text Mining Based Measurements

    DEFF Research Database (Denmark)

    Woltmann, Sabrina

    2018-01-01

    The impact of public research outcomes on economies, and societies, in particular, in terms of innovation and development is widely accepted and empirically investigated [9, 3]. However, many studies suggest a systematic underestimation of the impact and benefits of public research. Empirical stu...

  8. Une approche des textes scientifiques: le "par-coeur" (An Approach to Scientific Texts: "By Heart").

    Science.gov (United States)

    Descamps, Jean-Luc

    1980-01-01

    A method is provided for teaching reading comprehension of scientific or technical texts in a foreign language. The method involves analyzing language patterns and using some memorization for terminology. (MSE)

  9. A multiresolutional approach to fuzzy text meaning: A first attempt

    Energy Technology Data Exchange (ETDEWEB)

    Mehler, A.

    1996-12-31

    The present paper focuses on the connotative meaning aspect of language signs especially above the level of words. In this context the view is taken that texts can be defined as a kind of supersign, to which-in the same way as to other signs-a meaning can be assigned. A text can therefore be described as the result of a sign articulation which connects the material text sign with a corresponding meaning. For the constitution of the structural text meaning a kind of a semiotic composition principle is responsible, which leads to the emergence of interlocked levels of language units, demonstrating different grades of resolution. Starting on the level of words, and going through the level of sentences this principle reaches finally the level of texts by aggregating step by step the meaning of a unit on a higher level out of the meanings of all components one level below, which occur within this unit. Besides, this article will elaborate the hypothesis that the meaning constitution as a two-stage process, corresponding to the syntagmatic and paradigmatic restrictions of language elements among each other, obtains equally on the level of texts. On text level this two-levelledness leads to the constitution of the connotative text meaning, whose constituents are determined on word level by the syntagmatic and paradigmatic relations of the words. The formalization of the text meaning representation occurs with the help of fuzzy set theory.

  10. A Novel Approach For Syntactic Similarity Between Two Short Text

    Directory of Open Access Journals (Sweden)

    Anterpreet Kaur

    2015-06-01

    Full Text Available ABSTRACT Syntactic similarity is an important activity in the area of high field of text documents data mining natural language processing information retrieval. Natural language processing NLP is the intelligent machine where its ability is to translate the text into natural language such as English and other computer language such as c. Web mining used for task such as document clustering community mining etc to performed on web. However to find the similarity between the two documents is the difficult task. So with increasing scope in NLP require technique for dealing with many aspects of language in particular syntax semantics and paradigms.

  11. 'Texts' and 'signs': Criteria for choosing an analytical approach

    Directory of Open Access Journals (Sweden)

    Trocuk Irina V.

    2014-01-01

    Full Text Available At least for two and a half decades the concepts 'narrative' and 'narrative analysis', 'discourse' and 'discourse analysis', 'text', 'context', 'signs' and 'semiotic analysis' have become extremely popular in humanities and social sciences but still have not received precise definitions and are interpreted quite arbitrary based on the conceptual and methodological preferences of the researcher, as well as the goals and objectives of the particular applied or fundamental sociological research project. The author proposes a way to structure the field of textual analysis in sociology that goes far beyond even the broadest interpretations of the content analysis method. Undoubtedly, we need to develop clear criteria for at least the correct naming of different formats of analytical work with textual data; otherwise, we run the risk of writing not scientific articles but rather 'original discursive collages' skillfully juggling an ambiguous and diverse terminology of textual analysis. Key words:.

  12. Biomarker Identification Using Text Mining

    Directory of Open Access Journals (Sweden)

    Hui Li

    2012-01-01

    Full Text Available Identifying molecular biomarkers has become one of the important tasks for scientists to assess the different phenotypic states of cells or organisms correlated to the genotypes of diseases from large-scale biological data. In this paper, we proposed a text-mining-based method to discover biomarkers from PubMed. First, we construct a database based on a dictionary, and then we used a finite state machine to identify the biomarkers. Our method of text mining provides a highly reliable approach to discover the biomarkers in the PubMed database.

  13. The Application of Text Mining in Business Research

    DEFF Research Database (Denmark)

    Preuss, Bjørn

    2017-01-01

    The aim of this paper is to present a methodological concept in business research that has the potential to become one of the most powerful methods in the upcoming years when it comes to research qualitative phenomena in business and society. It presents a selection of algorithms as well elaborat...... on potential use cases for a text mining based approach to qualitative data analysis....

  14. Methodological principles and approaches to quality evaluation text portion of normative documents

    OpenAIRE

    Огродніча, Майя Леонідівна

    2013-01-01

    The paper considers the methodological approaches to estimating the quality of the text portion of normative documents. The research allowed revealing the main model provisions of the text portion of normative documents and analyzing their canonical forms. The main models of the text portion of the normative document are requirements, rules (recommendations, annexes, exclusions), concepts, comments. Formal approaches to quality assessing of the text portion of normative documents were propose...

  15. Pour une pratique de l'approche typologique des textes (Toward a Practical Application of the Typological Approach to Texts).

    Science.gov (United States)

    Richer, Jean-Jacques

    1991-01-01

    Text typologies, such as that of Jean-Michel Adam, that distinguish between narrative, descriptive, explanatory, injunctive, argumentative, and poetic texts offer promising possibilities for instruction of French as a second or foreign language. A number of diverse texts illustrate the potential for study of communicative, lexical, and syntactic…

  16. Notion de drame et approche du texte en civilisation (The Notion of Drama and an Approach Involving the Civilization Text).

    Science.gov (United States)

    Le Bihan, Andre

    1979-01-01

    Discusses the concept of the "social drama" and methods for teaching civilization on the basis of authentic texts. A response from the co-author (Pucheu, Rene) of an analysis of the election process in France is appended. (AM)

  17. The Diversity-Based Approach to Open-domain Text Summarization.

    Science.gov (United States)

    Nomoto, Tadashi; Matsumoto, Yuji

    2003-01-01

    Introduces a novel approach to unsupervised text summarization. Proposes an "information-centric" approach to evaluation, where the quality of summaries is judged not in terms of how well they match human-created summaries but in terms of how well they represent their source documents in information retrieval tasks such as document…

  18. Methodological Demonstration of a Text Analytics Approach to Country Logistics System Assessments

    DEFF Research Database (Denmark)

    Kinra, Aseem; Mukkamala, Raghava Rao; Vatrapu, Ravi

    2017-01-01

    The purpose of this study is to develop and demonstrate a semi-automated text analytics approach for the identification and categorization of information that can be used for country logistics assessments. In this paper, we develop the methodology on a set of documents for 21 countries using mach...... and the text analyst. Implications are discussed and future work is outlined....

  19. A probabilistic approach for mapping free-text queries to complex web forms

    NARCIS (Netherlands)

    Tjin-Kam-Jet, Kien; Trieschnigg, Rudolf Berend; Hiemstra, Djoerd

    Web applications with complex interfaces consisting of multiple input fields should understand free-text queries. We propose a probabilistic approach to map parts of a free-text query to the fields of a complex web form. Our method uses token models rather than only static dictionaries to create

  20. Opinion Mining in Latvian Text Using Semantic Polarity Analysis and Machine Learning Approach

    Directory of Open Access Journals (Sweden)

    Gatis Špats

    2016-07-01

    Full Text Available In this paper we demonstrate approaches for opinion mining in Latvian text. Authors have applied, combined and extended results of several previous studies and public resources to perform opinion mining in Latvian text using two approaches, namely, semantic polarity analysis and machine learning. One of the most significant constraints that make application of opinion mining for written content classification in Latvian text challenging is the limited publicly available text corpora for classifier training. We have joined several sources and created a publically available extended lexicon. Our results are comparable to or outperform current achievements in opinion mining in Latvian. Experiments show that lexicon-based methods provide more accurate opinion mining than the application of Naive Bayes machine learning classifier on Latvian tweets. Methods used during this study could be further extended using human annotators, unsupervised machine learning and bootstrapping to create larger corpora of classified text.

  1. Modelling text as process a dynamic approach to EFL classroom discourse

    CERN Document Server

    Yang, Xueyan

    2010-01-01

    A discourse analysis that is not based on grammar is likely to end up as a running commentary on a text, whereas a grammar-based one tends to treat text as a finished product rather than an on-going process. This book offers an approach to discourse analysis that is both grammar-based and oriented towards text as process. It proposes a model called TEXT TYPE within the framework of Hallidayan systemic-functional linguistics, which views grammatical choices in a text not as elements that combine to form a clause structure, but as semantic features that link successive clauses into an unfolding

  2. Interdisciplinary Approach to the Mental Lexicon: Neural Network and Text Extraction From Long-term Memory

    Directory of Open Access Journals (Sweden)

    Vardan G. Arutyunyan

    2013-01-01

    Full Text Available The paper touches upon the principles of mental lexicon organization in the light of recent research in psycho- and neurolinguistics. As a focal point of discussion two main approaches to mental lexicon functioning are considered: modular or dual-system approach, developed within generativism and opposite single-system approach, representatives of which are the connectionists and supporters of network models. The paper is an endeavor towards advocating the viewpoint that mental lexicon is complex psychological organization based upon specific composition of neural network. In this regard, the paper further elaborates on the matter of storing text in human mental space and introduces a model of text extraction from long-term memory. Based upon data available, the author develops a methodology of modeling structures of knowledge representation in the systems of artificial intelligence.

  3. A new approach for overlay text detection and extraction from complex video scene.

    Science.gov (United States)

    Kim, Wonjun; Kim, Changick

    2009-02-01

    Overlay text brings important semantic clues in video content analysis such as video information retrieval and summarization, since the content of the scene or the editor's intention can be well represented by using inserted text. Most of the previous approaches to extracting overlay text from videos are based on low-level features, such as edge, color, and texture information. However, existing methods experience difficulties in handling texts with various contrasts or inserted in a complex background. In this paper, we propose a novel framework to detect and extract the overlay text from the video scene. Based on our observation that there exist transient colors between inserted text and its adjacent background, a transition map is first generated. Then candidate regions are extracted by a reshaping method and the overlay text regions are determined based on the occurrence of overlay text in each candidate. The detected overlay text regions are localized accurately using the projection of overlay text pixels in the transition map and the text extraction is finally conducted. The proposed method is robust to different character size, position, contrast, and color. It is also language independent. Overlay text region update between frames is also employed to reduce the processing time. Experiments are performed on diverse videos to confirm the efficiency of the proposed method.

  4. Different approaches for identifying important concepts in probabilistic biomedical text summarization.

    Science.gov (United States)

    Moradi, Milad; Ghadiri, Nasser

    2018-01-01

    Automatic text summarization tools help users in the biomedical domain to acquire their intended information from various textual resources more efficiently. Some of biomedical text summarization systems put the basis of their sentence selection approach on the frequency of concepts extracted from the input text. However, it seems that exploring other measures rather than the raw frequency for identifying valuable contents within an input document, or considering correlations existing between concepts, may be more useful for this type of summarization. In this paper, we describe a Bayesian summarization method for biomedical text documents. The Bayesian summarizer initially maps the input text to the Unified Medical Language System (UMLS) concepts; then it selects the important ones to be used as classification features. We introduce six different feature selection approaches to identify the most important concepts of the text and select the most informative contents according to the distribution of these concepts. We show that with the use of an appropriate feature selection approach, the Bayesian summarizer can improve the performance of biomedical summarization. Using the Recall-Oriented Understudy for Gisting Evaluation (ROUGE) toolkit, we perform extensive evaluations on a corpus of scientific papers in the biomedical domain. The results show that when the Bayesian summarizer utilizes the feature selection methods that do not use the raw frequency, it can outperform the biomedical summarizers that rely on the frequency of concepts, domain-independent and baseline methods. Copyright © 2017 Elsevier B.V. All rights reserved.

  5. Two approaches to gathering text corpora from the WorldWideWeb

    CSIR Research Space (South Africa)

    Botha, G

    2005-11-01

    Full Text Available are available on the World Wide Web. We describe and compare two approaches to gathering language-specific corpora from this resource, and show that the use of a commercial search engine as a first stage leads to good results....

  6. A Functional Approach to Evaluating Content Knowledge and Language Development in ESL Students' Science Classification Texts.

    Science.gov (United States)

    Huang, Jingzi; Morgan, Glenn

    2003-01-01

    Investigates use of a functional approach to discourse analysis--knowledge structure analysis, which focuses on meaning, form, and function simultaneously--to evaluate both writing development and content learning. Examined written texts in science, produced by English-as-a-Second-Language students with limited to intermediate English language…

  7. Using a Text-Mining Approach to Evaluate the Quality of Nursing Records.

    Science.gov (United States)

    Chang, Hsiu-Mei; Chiou, Shwu-Fen; Liu, Hsiu-Yun; Yu, Hui-Chu

    2016-01-01

    Nursing records in Taiwan have been computerized, but their quality has rarely been discussed. Therefore, this study employed a text-mining approach and a cross-sectional retrospective research design to evaluate the quality of electronic nursing records at a medical center in Northern Taiwan. SAS Text Miner software Version 13.2 was employed to analyze unstructured nursing event records. The results show that SAS Text Miner is suitable for developing a textmining model for validating nursing records. The sensitivity of SAS Text Miner was approximately 0.94, and the specificity and accuracy were 0.99. Thus, SAS Text Miner software is an effective tool for auditing unstructured electronic nursing records.

  8. COMBINING PRODUCT AND PROCESS-BASED APPROACHES TO TEACHING WRITING DISCUSSION TEXTS

    Directory of Open Access Journals (Sweden)

    Vina Agustiana

    2016-06-01

    Full Text Available This study examines the activities of teaching-learning writing discussion texts when product and process-based approach combination is implemented in EFL writing classroom, the effects of applying the writing approach on EFL students’ writing skill, and the students’ attitude toward the implementation of writing approach in the classroom. It uses a mixed-methods through applying an embedded design by involving 24 second-grade students of a private university in West Java, Indonesia. There were four instruments used, namely field notes, videotapes, students’ tests (pre-test and post-test, and questionnaires. The findings show that the students were actively involved in class when the teacher applied the writing approach in writing classroom. There was also the improvement in the students’ writing skill based on the result taken from the students’ tests since the level of significance (two-tailed in paired t-test is less than the alpha (0.000<0.05. Qualitatively, the improvements were also found in generic features, textual language, and syntactical language aspects. In addition, the students showed highly positive attitude (4.35 average score toward the implementation of the approach in the classroom.

  9. A Novel Text Clustering Approach Using Deep-Learning Vocabulary Network

    Directory of Open Access Journals (Sweden)

    Junkai Yi

    2017-01-01

    Full Text Available Text clustering is an effective approach to collect and organize text documents into meaningful groups for mining valuable information on the Internet. However, there exist some issues to tackle such as feature extraction and data dimension reduction. To overcome these problems, we present a novel approach named deep-learning vocabulary network. The vocabulary network is constructed based on related-word set, which contains the “cooccurrence” relations of words or terms. We replace term frequency in feature vectors with the “importance” of words in terms of vocabulary network and PageRank, which can generate more precise feature vectors to represent the meaning of text clustering. Furthermore, sparse-group deep belief network is proposed to reduce the dimensionality of feature vectors, and we introduce coverage rate for similarity measure in Single-Pass clustering. To verify the effectiveness of our work, we compare the approach to the representative algorithms, and experimental results show that feature vectors in terms of deep-learning vocabulary network have better clustering performance.

  10. ON AN APPROACH TO MODELLING THE CONCEPTUAL SPACE OF LANGUAGE SIGNS AND TEXTS

    Directory of Open Access Journals (Sweden)

    Ivanova, E.V.

    2017-12-01

    Full Text Available The paper examines one of the possible approaches to exploring the conceptual space represented by language signs and texts. The notion of the cognitheme as a unit of knowledge in the form of a proposition, functional for modelling the conceptual space, is defined and some principles of the cognitheme analysis are discussed. The cognitheme is considered as a unit of modelling mental entities reflected in the language, for example, such as the concept or the conceptual space connected with a text, and at the same time as a unit of conceptualization significant in its own right, revealing elements of knowledge important for a language community and thus fixed in language signs and texts. A feasible classification of cognithemes is described, examples illustrating this classification are given.

  11. Task-based Language Teaching and Text Types in Teaching Writing Using Communicative Approach

    Directory of Open Access Journals (Sweden)

    Riyana Sari Ni Nyoman

    2018-01-01

    Full Text Available One of the most important language competencies in teaching learning process is writing. The present study focused on investigating the effect of communicative approach with task-based language teaching and communicative approach on the students’ writing competency at SMP N 2 Kediri viewed from text types(i.e. descriptive, recount, and narrative. To analyze the data, the design of the experimental study was posttest-only comparison groups by involving 60 students that were selected as the sample of the study through cluster random design. The sample’s post tests were assessed by using analytical scoring rubric. The data were then analyzed by using One-way ANOVA and the post hoc test was done by computing Multiple Comparison using Tukey HSD Test. The result showed that there was significant difference of the effect of communicative approach with task-based language teaching and communicative approach on the students’ writing competency. These findings are expected to give contribution in teaching English, particularly writing.

  12. Separation in Data Mining Based on Fractal Nature of Data

    Czech Academy of Sciences Publication Activity Database

    Jiřina, Marcel; Jiřina jr., M.

    2013-01-01

    Roč. 3, č. 1 (2013), s. 44-60 ISSN 2225-658X Institutional support: RVO:67985807 Keywords : nearest neighbor * fractal set * multifractal * IINC method * correlation dimension Subject RIV: JC - Computer Hardware ; Software http://sdiwc.net/digital-library/separation-in-data-mining-based-on- fractal -nature-of-data.html

  13. Separation in Data Mining Based on Fractal Nature of Data

    Czech Academy of Sciences Publication Activity Database

    Jiřina, Marcel; Jiřina jr., M.

    2013-01-01

    Roč. 3, č. 1 (2013), s. 44-60 ISSN 2225-658X Institutional support: RVO:67985807 Keywords : nearest neighbor * fractal set * multifractal * IINC method * correlation dimension Subject RIV: JC - Computer Hardware ; Software http://sdiwc.net/digital-library/separation-in-data-mining-based-on-fractal-nature-of-data.html

  14. A Network of Themes: A Qualitative Approach to Gerhard Richter's Text

    Directory of Open Access Journals (Sweden)

    Narvika Bovcon

    2017-07-01

    Full Text Available Gerhard Richter's books Text – a collection of painter's verbal statements about his artistic method – and Atlas – 783 sheets with images, mainly photographs and visual notations – are two archives that complement the understanding of his diverse artistic practice. The paper presents a textual model that experimentally simulates a possible ordering principle for archives. Richter's statements in the book Text are cut up and used as short quotations. Those that relate to multiple aspects of the painter's oeuvre are identified as hubs in the semantic network. The hubs are organized paratactically, as an array of different themes. The paper presents a methodological hypothesis and an experimental model that aim to connect the research of real networks with the paradigms of humanistic interpretation. We have to bear in mind that the network is a result of the researcher's interpretative approach, which is added to the initial archive included in the book Text. The breaking up of Richter's poetics into atoms of quotations is an experimental proposal of a new textuality in art history and humanities, which has its own history. In comparison to digital archives with complex interfaces that often tend to obscure the content, the elements in our experiment appear as specific configurations of the semantic network and are presented in a limited number of linear texts. The method of listing of quotations gathers the fragments into a potential “whole”, i.e. a narrativized gateway to an archive according to the researcher's interpretation.

  15. A Discourse-Based View in Interdisciplinary Approaches to Fictional Text Analysis

    Directory of Open Access Journals (Sweden)

    Альсина Соуса

    2017-12-01

    Full Text Available As patterns of communication change in a globalized society, literacy in foreign languages, especially English, becomes an issue of ever growing relevance to all those involved in the educational system, not to mention those who are to learn all their life long. As such, the goal of this article is to discuss how EFLit (English as a Foreign Literature students can gain in both linguistic competence and critical awareness thereof, should their teachers/lecturers abide to a discourse-based view on (literary language and approach the selected texts by following a pedagogical stylistics orientation also drawing eclectically on pragmatics and other areas of knowledge within the broader domain of applied linguistics. Here under focus will be a discussion of the topics on which literary and linguistic studies show greatest potential for (theoretical convergence and, above all, combined applications in lecture setting. Crucially, it will be argued that a pedagogical stylistics approach to EFLit teaching/learning both develops students’ linguistic competence and raises their awareness as to the meaning making potential of language in use in the texts at hand as well as in their larger historical and sociocultural settings. This will be illustrated by highlighting some textual features within a short extract of Fred D’Aguiar’s The Longest Memory (1995 and the linguistic competence that its comprehension would demand from students.

  16. Scaffolding in the Teaching of Writing Discussion Texts Based on SFL Genre-based Approach

    Directory of Open Access Journals (Sweden)

    Eva Fitriani Syarifah

    2015-12-01

    Full Text Available Writing in a second or foreign language seems to be the most difficult language skill for language learners to acquire (Laksmi, 2006; Lestari, 2008; Negari, 2011. Some scholars proposed the implementation of SFL – genre based approach in teaching writing (Derewianka, 1990; Rothery, 1996. However, SFL genre based approach seems to be product or teaching outcomes oriented (Ahn, 2012; Emilia, 2011. Therefore, the concept of scaffolding in which possible supports the process of students‟ individual development is important to be emerged in the teaching stages of SFL – GBA (Bodrova & Leong, 1998; Mulatsih, 2011. As a result, This study focuses on the issue of scaffoldings in the teaching of writing discussion texts based on SFL – Genre Based Approach. It particularly aims to investigate how scaffolding processes are implemented in the teaching of writing discussion texts based on SFL-GBA and how they improve the students‟ writing performance. The data rely on teaching and learning process in a classroom with six students in a tertiary level as the focus participants. The method used in the data analysis adopted a qualitative design with reference especially to the theory of the scaffolding and SFL-GBA. The results of analysis show that scaffolding processes are implemented in terms of macro and micro scaffoldings and able to improve the students‟ writing performance specifically in terms of social function, schematic structures, and language features of discussion genre. It is recommended that future related research should be conducted in more diverse of educational settings to see how scaffoldings are implemented in a variety of teaching practices.

  17. Neurolinguistic approach to natural language processing with applications to medical text analysis.

    Science.gov (United States)

    Duch, Włodzisław; Matykiewicz, Paweł; Pestian, John

    2008-12-01

    Understanding written or spoken language presumably involves spreading neural activation in the brain. This process may be approximated by spreading activation in semantic networks, providing enhanced representations that involve concepts not found directly in the text. The approximation of this process is of great practical and theoretical interest. Although activations of neural circuits involved in representation of words rapidly change in time snapshots of these activations spreading through associative networks may be captured in a vector model. Concepts of similar type activate larger clusters of neurons, priming areas in the left and right hemisphere. Analysis of recent brain imaging experiments shows the importance of the right hemisphere non-verbal clusterization. Medical ontologies enable development of a large-scale practical algorithm to re-create pathways of spreading neural activations. First concepts of specific semantic type are identified in the text, and then all related concepts of the same type are added to the text, providing expanded representations. To avoid rapid growth of the extended feature space after each step only the most useful features that increase document clusterization are retained. Short hospital discharge summaries are used to illustrate how this process works on a real, very noisy data. Expanded texts show significantly improved clustering and may be classified with much higher accuracy. Although better approximations to the spreading of neural activations may be devised a practical approach presented in this paper helps to discover pathways used by the brain to process specific concepts, and may be used in large-scale applications.

  18. Text mining approach to predict hospital admissions using early medical records from the emergency department.

    Science.gov (United States)

    Lucini, Filipe R; S Fogliatto, Flavio; C da Silveira, Giovani J; L Neyeloff, Jeruza; Anzanello, Michel J; de S Kuchenbecker, Ricardo; D Schaan, Beatriz

    2017-04-01

    Emergency department (ED) overcrowding is a serious issue for hospitals. Early information on short-term inward bed demand from patients receiving care at the ED may reduce the overcrowding problem, and optimize the use of hospital resources. In this study, we use text mining methods to process data from early ED patient records using the SOAP framework, and predict future hospitalizations and discharges. We try different approaches for pre-processing of text records and to predict hospitalization. Sets-of-words are obtained via binary representation, term frequency, and term frequency-inverse document frequency. Unigrams, bigrams and trigrams are tested for feature formation. Feature selection is based on χ 2 and F-score metrics. In the prediction module, eight text mining methods are tested: Decision Tree, Random Forest, Extremely Randomized Tree, AdaBoost, Logistic Regression, Multinomial Naïve Bayes, Support Vector Machine (Kernel linear) and Nu-Support Vector Machine (Kernel linear). Prediction performance is evaluated by F1-scores. Precision and Recall values are also informed for all text mining methods tested. Nu-Support Vector Machine was the text mining method with the best overall performance. Its average F1-score in predicting hospitalization was 77.70%, with a standard deviation (SD) of 0.66%. The method could be used to manage daily routines in EDs such as capacity planning and resource allocation. Text mining could provide valuable information and facilitate decision-making by inward bed management teams. Copyright © 2017 Elsevier Ireland Ltd. All rights reserved.

  19. Axiomatic Ontology Learning Approaches for English Translation of the Meaning of Quranic Texts

    Directory of Open Access Journals (Sweden)

    Saad Saidah

    2017-01-01

    Full Text Available Ontology learning (OL is the computational task of generating a knowledge base in the form of an ontology, given an unstructured corpus in natural language (NL. While most works in the field of ontology learning have been primarily based on a statistical approach to extract lightweight OL, very few attempts have been made to extract axiomatic OL (called heavyweight OL from NL text documents. Axiomatic OL supports more precise formal logic-based reasoning when compared to lightweight OL. Lexico-syntactic pattern matching and statisticsal one cannot lead to very accurate learning, mostly because of several linguistic nuances in the NL. Axiomatic OL is an alternative methodology that has not been explored much, where a deep linguistics analysis in computational linguistics is used to generate formal axioms and definitions instead of simply inducing a taxonomy. The ontology that is created not only stores the information about the application domain in explicit knowledge, but also can deduce the implicit knowledge from this ontology. This research will explore the English translation of the meaning of Quranic texts.

  20. Using text mining for study identification in systematic reviews: a systematic review of current approaches.

    Science.gov (United States)

    O'Mara-Eves, Alison; Thomas, James; McNaught, John; Miwa, Makoto; Ananiadou, Sophia

    2015-01-14

    The large and growing number of published studies, and their increasing rate of publication, makes the task of identifying relevant studies in an unbiased way for inclusion in systematic reviews both complex and time consuming. Text mining has been offered as a potential solution: through automating some of the screening process, reviewer time can be saved. The evidence base around the use of text mining for screening has not yet been pulled together systematically; this systematic review fills that research gap. Focusing mainly on non-technical issues, the review aims to increase awareness of the potential of these technologies and promote further collaborative research between the computer science and systematic review communities. Five research questions led our review: what is the state of the evidence base; how has workload reduction been evaluated; what are the purposes of semi-automation and how effective are they; how have key contextual problems of applying text mining to the systematic review field been addressed; and what challenges to implementation have emerged? We answered these questions using standard systematic review methods: systematic and exhaustive searching, quality-assured data extraction and a narrative synthesis to synthesise findings. The evidence base is active and diverse; there is almost no replication between studies or collaboration between research teams and, whilst it is difficult to establish any overall conclusions about best approaches, it is clear that efficiencies and reductions in workload are potentially achievable. On the whole, most suggested that a saving in workload of between 30% and 70% might be possible, though sometimes the saving in workload is accompanied by the loss of 5% of relevant studies (i.e. a 95% recall). Using text mining to prioritise the order in which items are screened should be considered safe and ready for use in 'live' reviews. The use of text mining as a 'second screener' may also be used cautiously

  1. A Lotmanian Approach to the Ideological Function of Honour in Early Modern English Texts

    Directory of Open Access Journals (Sweden)

    Jesús López-Peláez Casellas

    2013-06-01

    Full Text Available This essay aims at presenting a semiotic study of the ideological function of the concept of honour in early modern prose writing in England. By means of both a critical study of the major axiological and epistemological dimensions of sixteenth and seventeenth century honour, and a cultural semiotic approach to this notion (Lotman’s typology of cultures, it is my belief that it will be possible to clarify the social and ideological meaning and function of early modern honour, and to account for the transformations that the concept underwent during this period of transition, especially as can be observed in non-literary works (moral treatises and conduct books, mainly. Additionally, this is also intended to contribute to further study of the form and function of the code and concept of honour in the English literature (drama and poetry of this period.

  2. LIBERAL THOUGHT IN QUR’ANIC STUDIES: Tracing Humanistic Approach to Sacred Text in Islamic Scholarship

    Directory of Open Access Journals (Sweden)

    M. Nur Kholis Setiawan

    2007-03-01

    Full Text Available Literary approach to the Qur’an developed by al-Khuli created deep critiques from its opponents, in whose opinion, the usage of literary paradigm to the study of the Qur’an, according to them, implied a consequence of treating the Qur’an as a human text which clearly indicates a strong influence of a liberal mode of thinking that goes out of the line of the Qur’an’s spirit. This article shows a diametric fact compared to that they have claimed. The data proves that linguistic aspects of the Qur’an have succeeded in making an intellectual connection among progressive and liberal scholars in the classical and modern era. This supports the assumption that progressive and liberal thought whose one of its indicators is freedom of thought in accordance to Charles Kurzman term, is “children” of the Islamic civilization. Freedom of thought in the classical Islamic scholarship should be the élan of intellectualism including the field of Qur’anic studies.

  3. Comparative Study of Machine Learning Approach on Malay Translated Hadith Text Classification based on Sanad

    Directory of Open Access Journals (Sweden)

    Mohammad Najib Syuhairah Rahifah

    2017-01-01

    Full Text Available Sanad is one of important part used to determine the authentication of hadith. However, very little research work has been found on classification of Malay translated Hadith based on sanad. There are some researches done using machine learning approach on hadith classification based on sanad but using different objective with different language. This research is to see how Machine Learning techniques are used to classify Malay translated Hadith document based on sanad. In this paper, SVM, NB and k-NN are used to identify and evaluate the performance of Malay translated hadith based on sanad. The performances are evaluated based on standard performance metrics used in text classification which is accuracy and response time. The results show that SVM has the highest accuracy and k-NN has the best response time (time taken in process for classification data compare to other classifier. In future, we plan to extend this paper with the analysis on interclass similarity and also test on larger dataset.

  4. Electronic approaches to making sense of the text in the adverse event reporting system.

    Science.gov (United States)

    Benin, Andrea L; Fodeh, Samah Jamal; Lee, Kyle; Koss, Michele; Miller, Perry; Brandt, Cynthia

    2016-08-01

    Health care organizations working to eliminate preventable harm and to improve patient safety must have robust programs to collect and to analyze data on adverse events in order to use the information to affect improvement. Such adverse event reporting systems are based on frontline personnel reporting issues that arise in the course of their daily work. Limitations in how existing software systems handle these reports mean that use of this potentially rich information is resource intensive and prone to variable results. The aim of this study was to develop an electronic approach to processing the text in medical event reports that would be reliable enough to be used to improve patient safety. At Connecticut Children's Medical Center, staff manually enter reports of adverse events into a web-based software tool. We evaluated the ability of 2 electronic methods-rule-based query and semi-supervised machine learning-to identify specific types of events ("use cases") versus a reference standard. Rule-based query was tested on 5 use cases and machine learning on a subset of 2 using 9164 events reported from February 2012-January 2014. Machine learning found 93% of the weight-based errors and 92% of the errors in patient-identification. Rule-based query had accuracy of 99% or greater, high precision, and high recall for all use cases. Electronic approaches to streamlining the use of adverse event reports are feasible to automate and valuable for categorizing this important data for use in improving patient safety. © 2016 American Society for Healthcare Risk Management of the American Hospital Association.

  5. Chemical Topic Modeling: Exploring Molecular Data Sets Using a Common Text-Mining Approach.

    Science.gov (United States)

    Schneider, Nadine; Fechner, Nikolas; Landrum, Gregory A; Stiefl, Nikolaus

    2017-08-28

    Big data is one of the key transformative factors which increasingly influences all aspects of modern life. Although this transformation brings vast opportunities it also generates novel challenges, not the least of which is organizing and searching this data deluge. The field of medicinal chemistry is not different: more and more data are being generated, for instance, by technologies such as DNA encoded libraries, peptide libraries, text mining of large literature corpora, and new in silico enumeration methods. Handling those huge sets of molecules effectively is quite challenging and requires compromises that often come at the expense of the interpretability of the results. In order to find an intuitive and meaningful approach to organizing large molecular data sets, we adopted a probabilistic framework called "topic modeling" from the text-mining field. Here we present the first chemistry-related implementation of this method, which allows large molecule sets to be assigned to "chemical topics" and investigating the relationships between those. In this first study, we thoroughly evaluate this novel method in different experiments and discuss both its disadvantages and advantages. We show very promising results in reproducing human-assigned concepts using the approach to identify and retrieve chemical series from sets of molecules. We have also created an intuitive visualization of the chemical topics output by the algorithm. This is a huge benefit compared to other unsupervised machine-learning methods, like clustering, which are commonly used to group sets of molecules. Finally, we applied the new method to the 1.6 million molecules of the ChEMBL22 data set to test its robustness and efficiency. In about 1 h we built a 100-topic model of this large data set in which we could identify interesting topics like "proteins", "DNA", or "steroids". Along with this publication we provide our data sets and an open-source implementation of the new method (CheTo) which

  6. The Voice of Chinese Health Consumers: A Text Mining Approach to Web-Based Physician Reviews.

    Science.gov (United States)

    Hao, Haijing; Zhang, Kunpeng

    2016-05-10

    skills and bedside manner, general appreciation from patients, and description of various symptoms. To the best of our knowledge, our work is the first study using an automated text-mining approach to analyze a large amount of unstructured textual data of Web-based physician reviews in China. Based on our analysis, we found that Chinese reviewers mainly concentrate on a few popular topics. This is consistent with the goal of Chinese online health platforms and demonstrates the health care focus in China's health care system. Our text-mining approach reveals a new research area on how to use big data to help health care providers, health care administrators, and policy makers hear patient voices, target patient concerns, and improve the quality of care in this age of patient-centered care. Also, on the health care consumer side, our text mining technique helps patients make more informed decisions about which specialists to see without reading thousands of reviews, which is simply not feasible. In addition, our comparison analysis of Web-based physician reviews in China and the United States also indicates some cultural differences.

  7. Identification of new and emerging occupational risks using a text mining based information system

    NARCIS (Netherlands)

    Pronk, A.; Goede, H.; Lucas Luijckx, N.; Brug, F. van de; Cnossen, H.; Tielemans, E.

    2011-01-01

    Introduction On the internet and in scientific databases relevant information is available on new and emerging occupational risks. However, the amount of information is enormous and the information is scattered over multiple and diverse data sources complicating the full utilization of the data

  8. Improving Collaborative Learning in the Classroom: Text Mining Based Grouping and Representing

    Science.gov (United States)

    Erkens, Melanie; Bodemer, Daniel; Hoppe, H. Ulrich

    2016-01-01

    Orchestrating collaborative learning in the classroom involves tasks such as forming learning groups with heterogeneous knowledge and making learners aware of the knowledge differences. However, gathering information on which the formation of appropriate groups and the creation of graphical knowledge representations can be based is very effortful…

  9. A constructivist approach to e-text design for use in undergraduate physiology courses.

    Science.gov (United States)

    Rhodes, Ashley E; Rozell, Timothy G

    2015-09-01

    Electronic textbooks, or e-texts, will have an increasingly important role in college science courses within the next few years due to the rising costs of traditional texts and the increasing availability of software allowing instructors to create their own e-text. However, few guidelines exist in the literature to aid instructors in the development and design specifically of e-texts using sound learning theories; this is especially true for undergraduate physiology e-texts. In this article, we describe why constructivism is a very important educational theory for e-text design and how it may be applied in e-text development by instructors. We also provide examples of two undergraduate physiology e-texts that were designed in accordance with this educational theory but for learners of quite different backgrounds and prior knowledge levels. Copyright © 2015 The American Physiological Society.

  10. A Study of Readability of Texts in Bangla through Machine Learning Approaches

    Science.gov (United States)

    Sinha, Manjira; Basu, Anupam

    2016-01-01

    In this work, we have investigated text readability in Bangla language. Text readability is an indicator of the suitability of a given document with respect to a target reader group. Therefore, text readability has huge impact on educational content preparation. The advances in the field of natural language processing have enabled the automatic…

  11. A Constructivist Approach to E-Text Design for Use in Undergraduate Physiology Courses

    Science.gov (United States)

    Rhodes, Ashley E.; Rozell, Timothy G.

    2015-01-01

    Electronic textbooks, or e-texts, will have an increasingly important role in college science courses within the next few years due to the rising costs of traditional texts and the increasing availability of software allowing instructors to create their own e-text. However, few guidelines exist in the literature to aid instructors in the…

  12. DataToText: A Consumer-Oriented Approach to Data Analysis

    Science.gov (United States)

    Kenny, David A.

    2010-01-01

    DataToText is a project developed where the user communicates the relevant information for an analysis and DataToText computer routine produces text output that describes in words, tables, and figures the results from the analyses. Two extended examples are given, one an example of a moderator analysis and the other an example of a dyadic data…

  13. Individual differences in reading comprehension : A componential approach to eighth graders’ expository text comprehension

    NARCIS (Netherlands)

    Welie, C.J.M.

    2017-01-01

    Why do secondary school students differ in their text comprehension? This is an important question because many secondary school students are unable to achieve the level of text comprehension required to enable learning from their school book texts. This thesis contributes to answering this question

  14. Two Approaches to Surfacing Full-Text News on an Intranet

    Science.gov (United States)

    Jones, Barrett; Maslyukova, Elena

    2006-01-01

    Integrating news services into an intranet can be tricky. Even within related organizations, slight differences in subscriptions and end-user needs can necessitate different integration approaches. This article examines how the World Bank and the International Monetary Fund took separate approaches to tackle the same integration challenge in…

  15. Text in context: a textual-linguistic approach to Amos 4: 7-8

    Directory of Open Access Journals (Sweden)

    del Barco del Barco, Francisco Javier

    2002-12-01

    Full Text Available This article will study Amos 4:7-8 from a textlinguistic approach: the form of this section will be analyzed within the structure of the chapter in which it is inserted. Such an analysis is needed because the set of verb forms used seems to be different from the rest of verb forms used in the chapter. While the whole chapter tends to be structured as a brief chain of narrative passages with wayyiqtol, the structure of Amos 4:7-8 seems to be a predictive section -developed through weqatal- inserted or pasted in the middle of the chapter. Translations usually do not note the difference between the set of verb forms used. A textlinguistic analysis of Amos 4:7-8 will show that the kind of discourse used here is different from the one used in the rest of the chapter, and, therefore, this difference should be reflected in the translation. The specific function of some discourse types is also discussed.

    En este artículo se presenta un análisis de Amos 4:7-8 a partir de los presupuestos de la lingüística textual. La forma del texto se analizará tomando en cuenta la estructura del capítulo en el que se halla inserto. Este análisis resulta necesario porque el grupo de formas verbales utilizado en la sección propuesta no parece ser el mismo que el del resto del capítulo. Mientras el capítulo en su conjunto es un discurso narrativo estructurado en torno a wayyiqtol, Amos 4:7-8 parece responder al esquema del discurso predictivo desarrollado a partir de weqatal. Un análisis textual se hace necesario porque las traducciones bíblicas no parecen hacerse eco del cambio en el uso de las formas verbales. Además de este análisis, se trata también de la función específica de algunos tipos de discurso.

  16. Approach for Text Classification Based on the Similarity Measurement between Normal Cloud Models

    Directory of Open Access Journals (Sweden)

    Jin Dai

    2014-01-01

    Full Text Available The similarity between objects is the core research area of data mining. In order to reduce the interference of the uncertainty of nature language, a similarity measurement between normal cloud models is adopted to text classification research. On this basis, a novel text classifier based on cloud concept jumping up (CCJU-TC is proposed. It can efficiently accomplish conversion between qualitative concept and quantitative data. Through the conversion from text set to text information table based on VSM model, the text qualitative concept, which is extraction from the same category, is jumping up as a whole category concept. According to the cloud similarity between the test text and each category concept, the test text is assigned to the most similar category. By the comparison among different text classifiers in different feature selection set, it fully proves that not only does CCJU-TC have a strong ability to adapt to the different text features, but also the classification performance is also better than the traditional classifiers.

  17. An improved algorithm for information hiding based on features of Arabic text: A Unicode approach

    Directory of Open Access Journals (Sweden)

    A.A. Mohamed

    2014-07-01

    Full Text Available Steganography means how to hide secret information in a cover media, so that other individuals fail to realize their existence. Due to the lack of data redundancy in the text file in comparison with other carrier files, text steganography is a difficult problem to solve. In this paper, we proposed a new promised steganographic algorithm for Arabic text based on features of Arabic text. The focus is on more secure algorithm and high capacity of the carrier. Our extensive experiments using the proposed algorithm resulted in a high capacity of the carrier media. The embedding capacity rate ratio of the proposed algorithm is high. In addition, our algorithm can resist traditional attacking methods since it makes the changes in carrier text as minimum as possible.

  18. Journalistic Texts in Training Russian as Foreign Language Genre-Orientation Approach

    OpenAIRE

    Vorobjeva, Lyudmila V.; Kazakova, Olga A.; Frik, Tatyana B.

    2015-01-01

    This paper explains the use of journalistic texts on foreign language lessons. The contribution of journalistic texts to realization of the purposes of training is characterized. Need of continuous updating of methodical materials in journalistic texts is approved in the paper. For the first time authors suggest using in work with foreign students' genres of modern Russian Language internet mass-media 'travel sketch' and 'politic comment'. In this paper the role of genres in formation of fore...

  19. Semi-supervised probabilistics approach for normalising informal short text messages

    CSIR Research Space (South Africa)

    Modupe, A

    2017-03-01

    Full Text Available language processing (NLP) techniques. In this study, our contribution is to target non-standard words in the short text and propose a method to which the given word is likely to be transformed. Our method uses language model probability to characterise...

  20. Balancing Linguistic and Social Needs: Evaluating Texts Using a Critical Language Awareness Approach

    Science.gov (United States)

    Case, Rod E.; Ndura, Elavie; Righettini, Marielena

    2005-01-01

    English as a second language (ESL) content-based texts are often evaluated for their presentation of sound second-language teaching practices. While such reviews are important and valuable, they ignore an examination of the race, class, and gender issues introduced in the texts. A critical perspective on textbook evaluation organized around the…

  1. Algebra Word Problem Solving Approaches in a Chemistry Context: Equation Worked Examples versus Text Editing

    Science.gov (United States)

    Ngu, Bing Hiong; Yeung, Alexander Seeshing

    2013-01-01

    Text editing directs students' attention to the problem structure as they classify whether the texts of word problems contain sufficient, missing or irrelevant information for working out a solution. Equation worked examples emphasize the formation of a coherent problem structure to generate a solution. Its focus is on the construction of three…

  2. Extracting salient sublexical units from written texts: "Emophon," a corpus-based approach to phonological iconicity.

    Science.gov (United States)

    Aryani, Arash; Jacobs, Arthur M; Conrad, Markus

    2013-01-01

    A GROWING BODY OF LITERATURE IN PSYCHOLOGY, LINGUISTICS, AND THE NEUROSCIENCES HAS PAID INCREASING ATTENTION TO THE UNDERSTANDING OF THE RELATIONSHIPS BETWEEN PHONOLOGICAL REPRESENTATIONS OF WORDS AND THEIR MEANING: a phenomenon also known as phonological iconicity. In this article, we investigate how a text's intended emotional meaning, particularly in literature and poetry, may be reflected at the level of sublexical phonological salience and the use of foregrounded elements. To extract such elements from a given text, we developed a probabilistic model to predict the exceeding of a confidence interval for specific sublexical units concerning their frequency of occurrence within a given text contrasted with a reference linguistic corpus for the German language. Implementing this model in a computational application, we provide a text analysis tool which automatically delivers information about sublexical phonological salience allowing researchers, inter alia, to investigate effects of the sublexical emotional tone of texts based on current findings on phonological iconicity.

  3. A Multi-Method Approach to Understanding Behavior Change. The Case of Texting and Driving

    Directory of Open Access Journals (Sweden)

    Karen M. HOOD

    2017-12-01

    Full Text Available Distracted driving, specifically texting and driving, has become a nationwide public health problem in the U.S. with negative, and potentially fatal consequences. In an effort to combat the growing problem, non-profit organizations, corporations, and the federal government have all stepped in to try to increase public awareness and persuade drivers to cease texting while driving. These efforts have not had the desired impact as texting and driving has continued to increase in recent years. This research investigates the potential that the messages used to curb texting and driving behavior might not be properly constructed. Specifically, we test the potential for message sponsor and self-relevance of the message to influence message outcomes. Our results suggest that messages sponsored by a combination of company and government that are self-relevant to viewers will have different outcomes than other messages. We identify practical and theoretical implications as well as future research directions.

  4. Texting As A Discursive Approach For The Production Of Agricultural Solutions

    Directory of Open Access Journals (Sweden)

    Ronan G. Zagado

    2015-08-01

    Full Text Available This paper demonstrates how the short messaging service SMS popularly known as texting has facilitated production of solutions to farm issues using the Farmers Text Centre FTC of the Philippine Rice Research PhilRice as the case study. Text messages registered in the FTC database in 2010 covering one cropping season were discourse analyzed. Interpretive qualitative research particularly the Grounded Theory was employed to interprettheorize said data. Since texting is a new emerging discourse in agricultural development Grounded Theory allows the explication of theoretical accounts that explain its existence and impact. Results indicate that timing queries received within working days from 8am to 5pm get speedy response content the easier the question the faster it gets reply length the shorter the message the better and clarity of the querytext message as well as cultural factors such as greetings and terms of respect are all important governing factors in texting for farm use. Moreover analysis reveals that the series of text messages sent back and forth by farmers and agricultural specialist in FTC suggests a dynamic process of negotiation rather than passive information sharing. The analysis further reveals that texting has allowed farmers to have access to a negotiated knowledge rather than a standard scientific recommendation vis--vis the solution to their farm issues. The term negotiated implies that farmers are actively involved in knowledge production via texting. Textholder is coined in this paper to describe farmers and agricultural specialists as co-creators of knowledge in texting as opposed to their traditional role as knowledge generator and user respectively. From the analysis reflections implications and theoretical contributions are drawn in relation to the value of SMSing in agricultural extension and communication.

  5. Mentor Texts and the Coding of Academic Writing Structures: A Functional Approach

    Directory of Open Access Journals (Sweden)

    Wilder Yesid Escobar Alméciga

    2014-10-01

    Full Text Available The purpose of the present pedagogical experience was to address the English language writing needs of university-level students pursuing a degree in bilingual education with an emphasis in the teaching of English. Using mentor texts and coding academic writing structures, an instructional design was developed to directly address the shortcomings presented through a triangulated needs analysis. Through promoting awareness of international standards of writing as well as fostering an understanding of the inherent structures of academic texts, a methodology intended to increase academic writing proficiency was explored. The study suggests that mentor texts and the coding of academic writing structures can have a positive impact on the production of students’ academic writing.

  6. A new approach to the classification of African oral texts | Kam ...

    African Journals Online (AJOL)

    Toutes ces raisons ont conduit à un nouvel examen des différents genres oraux dans le cadre africain et à proposer une division de ces textes en cinq grandes catégories. Mots clés: littérature orale, genres oraux, textes oraux, discours, énoncés, jeux de plaisanterie, chercheurs en littérature orale. Tydskrif vir Letterkunde ...

  7. The Object Oriented Approach in Systems Analysis and Design Texts: Consistency within the IS Curriculum

    Science.gov (United States)

    Wood, David F.; Kohun, Frederick G.; Laverty, Joseph Packy

    2010-01-01

    This paper reports on a study of systems analysis textbooks in terms of topics covered and academic background of the authors. It addresses the consistency within IS curricula with respect to the content of a systems analysis and design course using the object-oriented approach. The research questions addressed were 1: Is there a consistency among…

  8. Linguacultural space “Man-Nature” in literary texts: cognitive and pragmatic approach

    Directory of Open Access Journals (Sweden)

    Eldarova Ruzanna Alievna

    2016-06-01

    Full Text Available The magnitude of representation of nature images, the links to the author’s mind, the hero, the reader can be considered in literary texts as one of the most important sources for identifying the parameters of the national picture of the world and the individually author’s transformation of its components. Researches that identify patterns of functioning linguacultural spaces in the texts are able to give new results projected in the linguistic picture of the ethnic group of the world due to reflections in literary texts of archetypal, stereotyped images of peculiar linguistic culture and ethnic group as a whole as well as individually-copyright, which characterize a particular linguistic identity and its conception of the world. Cognitive paradigm of modern linguistics, anthropocentric in nature allows to consider culture as a process modeling language, which naturally highlights the problem of linguistic linguaculture of predetermined value. Great importance in this regard is the concept of space as linguocultural cognitive model of objective reality. Cognitive-pragmatic potential of a literary text is deepening due to the introduction the descriptions of nature, since they always implement the ethical, aesthetic, and intellectual abilities of the creative subject.

  9. Design of Instrument Approach Procedure Charts Comprehension Speed of Missed Approach Instructions Coded in Text or Icons

    Science.gov (United States)

    1992-02-01

    Instrument approach procedure (IAP) charts are often cluttered and confusing. The quantified effects of chart design : changes on information transfer are needed by chart manufacturers to make changes uhich will enhance information transfer : and hum...

  10. Mentor Texts and the Coding of Academic Writing Structures: A Functional Approach

    Science.gov (United States)

    Escobar Alméciga, Wilder Yesid; Evans, Reid

    2014-01-01

    The purpose of the present pedagogical experience was to address the English language writing needs of university-level students pursuing a degree in bilingual education with an emphasis in the teaching of English. Using mentor texts and coding academic writing structures, an instructional design was developed to directly address the shortcomings…

  11. When the Text Is the Problem: A Postcolonial Approach to Biblical Pedagogy

    Science.gov (United States)

    Lee, Boyung

    2007-01-01

    Postcolonial biblical scholars use the hermeneutics of decolonization to reinterpret the biblical text. One goal is to find contemporary applications for an age-old message. This article explores the challenges and implications of postcolonial hermeneutics for biblical pedagogy. First, the author explores fundamental hermeneutical principles of…

  12. Using Short Texts to Teach English as Second Language: An Integrated Approach

    Science.gov (United States)

    Kembo, Jane

    2016-01-01

    The teacher of English Language is often hard pressed to find interesting and authentic ways to present language to target second language speakers. While language can be taught and learned, part of it must be acquired and short texts provide powerful tools for doing so and reinforcing what has been taught/learned. This paper starts from research,…

  13. Approaches to the Writing of Greek in Late Antique Latin Texts

    Directory of Open Access Journals (Sweden)

    Aaron Pelttari

    2011-08-01

    Full Text Available The treatment of Greek words in manuscripts of Augustine and of Ausonius suggests that late Latin writers employed transliteration, rather than writing Greek letters, more often than has been thought, both for familiar loan-words in Latin and for words perceived as still Greek.

  14. Introducing the interpretation of medieval Hindi texts into the Hindi curriculum: An alternative approach

    Czech Academy of Sciences Publication Activity Database

    Strnad, Jaroslav

    2010-01-01

    Roč. 9, č. 2 (2010), s. 25-38 ISSN 1648-2662. [Regional Conference on Indology for Central and Eastern Europe - New Perspectives in Education about India /2./. Vilnijus, 24.8.2006-26. 8.2006] Institutional research plan: CEZ:AV0Z90210515 Keywords : Hindi * texts * analysis Subject RIV: AI - Linguistics

  15. "What is relevant in a text document?": An interpretable machine learning approach.

    Directory of Open Access Journals (Sweden)

    Leila Arras

    Full Text Available Text documents can be described by a number of abstract concepts such as semantic category, writing style, or sentiment. Machine learning (ML models have been trained to automatically map documents to these abstract concepts, allowing to annotate very large text collections, more than could be processed by a human in a lifetime. Besides predicting the text's category very accurately, it is also highly desirable to understand how and why the categorization process takes place. In this paper, we demonstrate that such understanding can be achieved by tracing the classification decision back to individual words using layer-wise relevance propagation (LRP, a recently developed technique for explaining predictions of complex non-linear classifiers. We train two word-based ML models, a convolutional neural network (CNN and a bag-of-words SVM classifier, on a topic categorization task and adapt the LRP method to decompose the predictions of these models onto words. Resulting scores indicate how much individual words contribute to the overall classification decision. This enables one to distill relevant information from text documents without an explicit semantic information extraction step. We further use the word-wise relevance scores for generating novel vector-based document representations which capture semantic information. Based on these document vectors, we introduce a measure of model explanatory power and show that, although the SVM and CNN models perform similarly in terms of classification accuracy, the latter exhibits a higher level of explainability which makes it more comprehensible for humans and potentially more useful for other applications.

  16. A Novel Approach in Text-Independent Speaker Recognition in Noisy Environment

    Directory of Open Access Journals (Sweden)

    Nona Heydari Esfahani

    2014-10-01

    Full Text Available In this paper, robust text-independent speaker recognition is taken into consideration. The proposed method performs on manual silence-removed utterances that are segmented into smaller speech units containing few phones and at least one vowel. The segments are basic units for long-term feature extraction. Sub-band entropy is directly extracted in each segment. A robust vowel detection method is then applied on each segment to separate a high energy vowel that is used as unit for pitch frequency and formant extraction. By applying a clustering technique, extracted short-term features namely MFCC coefficients are combined with long term features. Experiments using MLP classifier show that the average speaker accuracy recognition rate is 97.33% for clean speech and 61.33% in noisy environment for -2db SNR, that shows improvement compared to other conventional methods.

  17. Tracing Knowledge Transfer from Universities to Industry: A Text Mining Approach

    DEFF Research Database (Denmark)

    Woltmann, Sabrina; Alkærsig, Lars

    2017-01-01

    that several websites contain very related and partly even traceable content from the university. The results show that university research is represented in the websites of industrial partners. We propose further improvements to enhance the results and potential areas for future implementation. This paper...... is the first step to enable the identification of common knowledge and knowledge transfer via text mining to increase its measurability....

  18. Stopping Antidepressants and Anxiolytics as Major Concerns Reported in Online Health Communities: A Text Mining Approach.

    Science.gov (United States)

    Abbe, Adeline; Falissard, Bruno

    2017-10-23

    Internet is a particularly dynamic way to quickly capture the perceptions of a population in real time. Complementary to traditional face-to-face communication, online social networks help patients to improve self-esteem and self-help. The aim of this study was to use text mining on material from an online forum exploring patients' concerns about treatment (antidepressants and anxiolytics). Concerns about treatment were collected from discussion titles in patients' online community related to antidepressants and anxiolytics. To examine the content of these titles automatically, we used text mining methods, such as word frequency in a document-term matrix and co-occurrence of words using a network analysis. It was thus possible to identify topics discussed on the forum. The forum included 2415 discussions on antidepressants and anxiolytics over a period of 3 years. After a preprocessing step, the text mining algorithm identified the 99 most frequently occurring words in titles, among which were escitalopram, withdrawal, antidepressant, venlafaxine, paroxetine, and effect. Patients' concerns were related to antidepressant withdrawal, the need to share experience about symptoms, effects, and questions on weight gain with some drugs. Patients' expression on the Internet is a potential additional resource in addressing patients' concerns about treatment. Patient profiles are close to that of patients treated in psychiatry. ©Adeline Abbe, Bruno Falissard. Originally published in JMIR Mental Health (http://mental.jmir.org), 23.10.2017.

  19. "What is relevant in a text document?": An interpretable machine learning approach

    Science.gov (United States)

    Arras, Leila; Horn, Franziska; Montavon, Grégoire; Müller, Klaus-Robert

    2017-01-01

    Text documents can be described by a number of abstract concepts such as semantic category, writing style, or sentiment. Machine learning (ML) models have been trained to automatically map documents to these abstract concepts, allowing to annotate very large text collections, more than could be processed by a human in a lifetime. Besides predicting the text’s category very accurately, it is also highly desirable to understand how and why the categorization process takes place. In this paper, we demonstrate that such understanding can be achieved by tracing the classification decision back to individual words using layer-wise relevance propagation (LRP), a recently developed technique for explaining predictions of complex non-linear classifiers. We train two word-based ML models, a convolutional neural network (CNN) and a bag-of-words SVM classifier, on a topic categorization task and adapt the LRP method to decompose the predictions of these models onto words. Resulting scores indicate how much individual words contribute to the overall classification decision. This enables one to distill relevant information from text documents without an explicit semantic information extraction step. We further use the word-wise relevance scores for generating novel vector-based document representations which capture semantic information. Based on these document vectors, we introduce a measure of model explanatory power and show that, although the SVM and CNN models perform similarly in terms of classification accuracy, the latter exhibits a higher level of explainability which makes it more comprehensible for humans and potentially more useful for other applications. PMID:28800619

  20. [Hygienic Assesment of Educational Texts: Methodical Approaches and Evaluation of Difficulties for Children of Secondary Textbooks].

    Science.gov (United States)

    Kuchma, V R; Tkachuk, E A

    2015-01-01

    The understandability and readability of the text are significant indicators of evaluation of textbooks. The aim of the study - rationale of improving the readability and understandability of textbooks. 60 modern textbooks for 5-11th classes on History, Physics, Biology and 23 textbooks of 1960-1980's edition. Flesch index was used to assess the readability, Fogh index - to evaluate understandability. The readability and understandability of texts in textbooks of 1960-1980's and modern editions have no differences and show the same complexity of old and modern textbooks for students. The indicator of understandability of textbooks for primary classes corresponds to age norm and is 4.4±0.2 points. The indicator of readability for these books is less age norm and is 53.8±2.9 points, which increases the physiological cost of educational activities of children of primary school age. Children's readability and understandability of school textbooks are a significant factor of intensity of training activities and can be objectively assessed by Flesch and Fogh indices, that it is appropriate for an objective hygienic assessment of the tension of the educational activities for children. The main direction of optimization of the tension of educational activity is to reduce the intellectual and emotional loads in children by increasing the easiness of reading textbooks due to their compliance with the age peculiarities of students.

  1. Systematic analysis of molecular mechanisms for HCC metastasis via text mining approach.

    Science.gov (United States)

    Zhen, Cheng; Zhu, Caizhong; Chen, Haoyang; Xiong, Yiru; Tan, Junyuan; Chen, Dong; Li, Jin

    2017-02-21

    To systematically explore the molecular mechanism for hepatocellular carcinoma (HCC) metastasis and identify regulatory genes with text mining methods. Genes with highest frequencies and significant pathways related to HCC metastasis were listed. A handful of proteins such as EGFR, MDM2, TP53 and APP, were identified as hub nodes in PPI (protein-protein interaction) network. Compared with unique genes for HBV-HCCs, genes particular to HCV-HCCs were less, but may participate in more extensive signaling processes. VEGFA, PI3KCA, MAPK1, MMP9 and other genes may play important roles in multiple phenotypes of metastasis. Genes in abstracts of HCC-metastasis literatures were identified. Word frequency analysis, KEGG pathway and PPI network analysis were performed. Then co-occurrence analysis between genes and metastasis-related phenotypes were carried out. Text mining is effective for revealing potential regulators or pathways, but the purpose of it should be specific, and the combination of various methods will be more useful.

  2. Geographical Text Analysis: A new approach to understanding nineteenth-century mortality.

    Science.gov (United States)

    Porter, Catherine; Atkinson, Paul; Gregory, Ian

    2015-11-01

    This paper uses a combination of Geographic Information Systems (GIS) and corpus linguistic analysis to extract and analyse disease related keywords from the Registrar-General's Decennial Supplements. Combined with known mortality figures, this provides, for the first time, a spatial picture of the relationship between the Registrar-General's discussion of disease and deaths in England and Wales in the nineteenth and early twentieth centuries. Techniques such as collocation, density analysis, the Hierarchical Regional Settlement matrix and regression analysis are employed to extract and analyse the data resulting in new insight into the relationship between the Registrar-General's published texts and the changing mortality patterns during this time. Copyright © 2015 Elsevier Ltd. All rights reserved.

  3. Deriving pathway maps from automated text analysis using a grammar-based approach.

    Science.gov (United States)

    Olsson, Björn; Gawronska, Barbara; Erlendsson, Björn

    2006-04-01

    We demonstrate how automated text analysis can be used to support the large-scale analysis of metabolic and regulatory pathways by deriving pathway maps from textual descriptions found in the scientific literature. The main assumption is that correct syntactic analysis combined with domain-specific heuristics provides a good basis for relation extraction. Our method uses an algorithm that searches through the syntactic trees produced by a parser based on a Referent Grammar formalism, identifies relations mentioned in the sentence, and classifies them with respect to their semantic class and epistemic status (facts, counterfactuals, hypotheses). The semantic categories used in the classification are based on the relation set used in KEGG (Kyoto Encyclopedia of Genes and Genomes), so that pathway maps using KEGG notation can be automatically generated. We present the current version of the relation extraction algorithm and an evaluation based on a corpus of abstracts obtained from PubMed. The results indicate that the method is able to combine a reasonable coverage with high accuracy. We found that 61% of all sentences were parsed, and 97% of the parse trees were judged to be correct. The extraction algorithm was tested on a sample of 300 parse trees and was found to produce correct extractions in 90.5% of the cases.

  4. Clustering more than two million biomedical publications: comparing the accuracies of nine text-based similarity approaches.

    Directory of Open Access Journals (Sweden)

    Kevin W Boyack

    2011-03-01

    Full Text Available We investigate the accuracy of different similarity approaches for clustering over two million biomedical documents. Clustering large sets of text documents is important for a variety of information needs and applications such as collection management and navigation, summary and analysis. The few comparisons of clustering results from different similarity approaches have focused on small literature sets and have given conflicting results. Our study was designed to seek a robust answer to the question of which similarity approach would generate the most coherent clusters of a biomedical literature set of over two million documents.We used a corpus of 2.15 million recent (2004-2008 records from MEDLINE, and generated nine different document-document similarity matrices from information extracted from their bibliographic records, including titles, abstracts and subject headings. The nine approaches were comprised of five different analytical techniques with two data sources. The five analytical techniques are cosine similarity using term frequency-inverse document frequency vectors (tf-idf cosine, latent semantic analysis (LSA, topic modeling, and two Poisson-based language models--BM25 and PMRA (PubMed Related Articles. The two data sources were a MeSH subject headings, and b words from titles and abstracts. Each similarity matrix was filtered to keep the top-n highest similarities per document and then clustered using a combination of graph layout and average-link clustering. Cluster results from the nine similarity approaches were compared using (1 within-cluster textual coherence based on the Jensen-Shannon divergence, and (2 two concentration measures based on grant-to-article linkages indexed in MEDLINE.PubMed's own related article approach (PMRA generated the most coherent and most concentrated cluster solution of the nine text-based similarity approaches tested, followed closely by the BM25 approach using titles and abstracts. Approaches

  5. Clustering more than two million biomedical publications: comparing the accuracies of nine text-based similarity approaches.

    Science.gov (United States)

    Boyack, Kevin W; Newman, David; Duhon, Russell J; Klavans, Richard; Patek, Michael; Biberstine, Joseph R; Schijvenaars, Bob; Skupin, André; Ma, Nianli; Börner, Katy

    2011-03-17

    We investigate the accuracy of different similarity approaches for clustering over two million biomedical documents. Clustering large sets of text documents is important for a variety of information needs and applications such as collection management and navigation, summary and analysis. The few comparisons of clustering results from different similarity approaches have focused on small literature sets and have given conflicting results. Our study was designed to seek a robust answer to the question of which similarity approach would generate the most coherent clusters of a biomedical literature set of over two million documents. We used a corpus of 2.15 million recent (2004-2008) records from MEDLINE, and generated nine different document-document similarity matrices from information extracted from their bibliographic records, including titles, abstracts and subject headings. The nine approaches were comprised of five different analytical techniques with two data sources. The five analytical techniques are cosine similarity using term frequency-inverse document frequency vectors (tf-idf cosine), latent semantic analysis (LSA), topic modeling, and two Poisson-based language models--BM25 and PMRA (PubMed Related Articles). The two data sources were a) MeSH subject headings, and b) words from titles and abstracts. Each similarity matrix was filtered to keep the top-n highest similarities per document and then clustered using a combination of graph layout and average-link clustering. Cluster results from the nine similarity approaches were compared using (1) within-cluster textual coherence based on the Jensen-Shannon divergence, and (2) two concentration measures based on grant-to-article linkages indexed in MEDLINE. PubMed's own related article approach (PMRA) generated the most coherent and most concentrated cluster solution of the nine text-based similarity approaches tested, followed closely by the BM25 approach using titles and abstracts. Approaches using only

  6. An Automatic Multidocument Text Summarization Approach Based on Naïve Bayesian Classifier Using Timestamp Strategy.

    Science.gov (United States)

    Ramanujam, Nedunchelian; Kaliappan, Manivannan

    2016-01-01

    Nowadays, automatic multidocument text summarization systems can successfully retrieve the summary sentences from the input documents. But, it has many limitations such as inaccurate extraction to essential sentences, low coverage, poor coherence among the sentences, and redundancy. This paper introduces a new concept of timestamp approach with Naïve Bayesian Classification approach for multidocument text summarization. The timestamp provides the summary an ordered look, which achieves the coherent looking summary. It extracts the more relevant information from the multiple documents. Here, scoring strategy is also used to calculate the score for the words to obtain the word frequency. The higher linguistic quality is estimated in terms of readability and comprehensibility. In order to show the efficiency of the proposed method, this paper presents the comparison between the proposed methods with the existing MEAD algorithm. The timestamp procedure is also applied on the MEAD algorithm and the results are examined with the proposed method. The results show that the proposed method results in lesser time than the existing MEAD algorithm to execute the summarization process. Moreover, the proposed method results in better precision, recall, and F-score than the existing clustering with lexical chaining approach.

  7. Interword and intraword pause threshold in the writing of texts by children and adolescents : a methodological approach

    Directory of Open Access Journals (Sweden)

    Florence eChenu

    2014-03-01

    Full Text Available Writing words in real life involves setting objectives, imagining a recipient, translating ideas into linguistic forms, managing grapho-motor gestures, etc. Understanding writing requires observation of the processes as they occur in real time. Analysis of pauses is one of the preferred methods for accessing the dynamics of writing and is based on the idea that pauses are behavioral correlates of cognitive processes. However, there is a need to clarify what we are observing when studying pause phenomena, as we will argue in the first section. This taken into account, the study of pause phenomena can be considered following two approaches. A first approach, driven by temporality, would define a threshold and observe where pauses, e.g. scriptural inactivity occurs. A second approach, linguistically driven, would define structural units and look for scriptural inactivity at the boundaries of these units or within these units. Taking a temporally driven approach, we present two methods which aim at the automatic identification of scriptural inactivity which is most likely not attributable to grapho-motor management in texts written by children and adolescents using digitizing tablets in association with Eye and Pen© (Chesnet & Alamargot, 2005. The first method is purely statistical and is based on the idea that the distribution of pauses exhibits different Gaussian components each of them corresponding to a different type of pause. After having reviewed the limits of this statistical method, we present a second method based on writing dynamics which attempts to identify breaking points in the writing dynamics rather than relying only on pause duration. This second method needs to be refined to overcome the fact that calculation is impossible when there is insufficient data which is often the case when working with young scriptors.

  8. Implicit prosody mining based on the human eye image capture technology

    Science.gov (United States)

    Gao, Pei-pei; Liu, Feng

    2013-08-01

    The technology of eye tracker has become the main methods of analyzing the recognition issues in human-computer interaction. Human eye image capture is the key problem of the eye tracking. Based on further research, a new human-computer interaction method introduced to enrich the form of speech synthetic. We propose a method of Implicit Prosody mining based on the human eye image capture technology to extract the parameters from the image of human eyes when reading, control and drive prosody generation in speech synthesis, and establish prosodic model with high simulation accuracy. Duration model is key issues for prosody generation. For the duration model, this paper put forward a new idea for obtaining gaze duration of eyes when reading based on the eye image capture technology, and synchronous controlling this duration and pronunciation duration in speech synthesis. The movement of human eyes during reading is a comprehensive multi-factor interactive process, such as gaze, twitching and backsight. Therefore, how to extract the appropriate information from the image of human eyes need to be considered and the gaze regularity of eyes need to be obtained as references of modeling. Based on the analysis of current three kinds of eye movement control model and the characteristics of the Implicit Prosody reading, relative independence between speech processing system of text and eye movement control system was discussed. It was proved that under the same text familiarity condition, gaze duration of eyes when reading and internal voice pronunciation duration are synchronous. The eye gaze duration model based on the Chinese language level prosodic structure was presented to change previous methods of machine learning and probability forecasting, obtain readers' real internal reading rhythm and to synthesize voice with personalized rhythm. This research will enrich human-computer interactive form, and will be practical significance and application prospect in terms of

  9. Computing symmetrical strength of N-grams: a two pass filtering approach in automatic classification of text documents.

    Science.gov (United States)

    Agnihotri, Deepak; Verma, Kesari; Tripathi, Priyanka

    2016-01-01

    The contiguous sequences of the terms (N-grams) in the documents are symmetrically distributed among different classes. The symmetrical distribution of the N-Grams raises uncertainty in the belongings of the N-Grams towards the class. In this paper, we focused on the selection of most discriminating N-Grams by reducing the effects of symmetrical distribution. In this context, a new text feature selection method named as the symmetrical strength of the N-Grams (SSNG) is proposed using a two pass filtering based feature selection (TPF) approach. Initially, in the first pass of the TPF, the SSNG method chooses various informative N-Grams from the entire extracted N-Grams of the corpus. Subsequently, in the second pass the well-known Chi Square (χ(2)) method is being used to select few most informative N-Grams. Further, to classify the documents the two standard classifiers Multinomial Naive Bayes and Linear Support Vector Machine have been applied on the ten standard text data sets. In most of the datasets, the experimental results state the performance and success rate of SSNG method using TPF approach is superior to the state-of-the-art methods viz. Mutual Information, Information Gain, Odds Ratio, Discriminating Feature Selection and χ(2).

  10. Development and testing of a text-mining approach to analyse patients' comments on their experiences of colorectal cancer care.

    Science.gov (United States)

    Wagland, Richard; Recio-Saucedo, Alejandra; Simon, Michael; Bracher, Michael; Hunt, Katherine; Foster, Claire; Downing, Amy; Glaser, Adam; Corner, Jessica

    2016-08-01

    Quality of cancer care may greatly impact on patients' health-related quality of life (HRQoL). Free-text responses to patient-reported outcome measures (PROMs) provide rich data but analysis is time and resource-intensive. This study developed and tested a learning-based text-mining approach to facilitate analysis of patients' experiences of care and develop an explanatory model illustrating impact on HRQoL. Respondents to a population-based survey of colorectal cancer survivors provided free-text comments regarding their experience of living with and beyond cancer. An existing coding framework was tested and adapted, which informed learning-based text mining of the data. Machine-learning algorithms were trained to identify comments relating to patients' specific experiences of service quality, which were verified by manual qualitative analysis. Comparisons between coded retrieved comments and a HRQoL measure (EQ5D) were explored. The survey response rate was 63.3% (21 802/34 467), of which 25.8% (n=5634) participants provided free-text comments. Of retrieved comments on experiences of care (n=1688), over half (n=1045, 62%) described positive care experiences. Most negative experiences concerned a lack of post-treatment care (n=191, 11% of retrieved comments) and insufficient information concerning self-management strategies (n=135, 8%) or treatment side effects (n=160, 9%). Associations existed between HRQoL scores and coded algorithm-retrieved comments. Analysis indicated that the mechanism by which service quality impacted on HRQoL was the extent to which services prevented or alleviated challenges associated with disease and treatment burdens. Learning-based text mining techniques were found useful and practical tools to identify specific free-text comments within a large dataset, facilitating resource-efficient qualitative analysis. This method should be considered for future PROM analysis to inform policy and practice. Study findings indicated that

  11. Personality and Education Mining based Job Advisory System

    Directory of Open Access Journals (Sweden)

    Rajendra S. Choudhary

    2014-09-01

    Full Text Available Every job demands an employee with some specific qualities in addition to the basic educational qualification. For example, an introvert person cannot be a good leader despite of a very good academic qualification. Thinking and logical ability is required for a person to be a successful software engineer. So, the aim of this paper is to present a novel approach for advising an ideal job to the job seeker while considering his personality trait and educational qualification both. Very well-known theories of personality like MBTI indicator and OCEAN theory, are used for personality mining. For education mining, score based system is used. The score based system captures the information from attributes like most scoring subject, dream job etc. After personality mining, the resultant values are coalesced with the information extracted from education mining. And finally, the most suited jobs, in terms of personality and educational qualification are recommended to the job seekers. The experiment is conducted on the students who have earned an engineering degree in the field of computer science, information technology and electronics. Nevertheless, the same architecture can easily be extended to other educational degrees also. To the best of the author’s knowledge, this is a first e-job advisory system that recommends the job best suited as per one’s personality using MBTI and OCEAN theory both.

  12. A Mine-Based Uranium Market Clearing Model

    Directory of Open Access Journals (Sweden)

    Aris Auzans

    2014-11-01

    Full Text Available Economic analysis and market simulation tools are used to evaluate uranium (U supply shocks, sale or purchase of uranium stockpiles, or market effects of new uranium mines or enrichment technologies. This work expands on an existing U market model that couples the market for primary U from uranium mines with those of secondary uranium, e.g., depleted uranium (DU upgrading or highly enriched uranium (HEU down blending, and enrichment services. This model accounts for the interdependence between the primary U supply on the U market price, the economic characteristics of each individual U mine, sources of secondary supply, and the U enrichment market. This work defines a procedure for developing an aggregate supply curve for primary uranium from marginal cost curves for individual firms (Uranium mines. Under this model, market conditions drive individual mines’ startup and short- and long-term shutdown decisions. It is applied to the uranium industry for the period 2010–2030 in order to illustrate the evolution of the front end markets under conditions of moderate growth in demand for nuclear fuel. The approach is applicable not only to uranium mines but also other facilities and reactors within the nuclear economy that may be modeled as independent, decision-making entities inside a nuclear fuel cycle simulator.

  13. Analyzing discourse and text complexity for learning and collaborating a cognitive approach based on natural language processing

    CERN Document Server

    Dascălu, Mihai

    2014-01-01

    With the advent and increasing popularity of Computer Supported Collaborative Learning (CSCL) and e-learning technologies, the need of automatic assessment and of teacher/tutor support for the two tightly intertwined activities of comprehension of reading materials and of collaboration among peers has grown significantly. In this context, a polyphonic model of discourse derived from Bakhtin’s work as a paradigm is used for analyzing both general texts and CSCL conversations in a unique framework focused on different facets of textual cohesion. As specificity of our analysis, the individual learning perspective is focused on the identification of reading strategies and on providing a multi-dimensional textual complexity model, whereas the collaborative learning dimension is centered on the evaluation of participants’ involvement, as well as on collaboration assessment. Our approach based on advanced Natural Language Processing techniques provides a qualitative estimation of the learning process and enhance...

  14. Text mining and natural language processing approaches for automatic categorization of lay requests to web-based expert forums.

    Science.gov (United States)

    Himmel, Wolfgang; Reincke, Ulrich; Michelmann, Hans Wilhelm

    2009-07-22

    Both healthy and sick people increasingly use electronic media to obtain medical information and advice. For example, Internet users may send requests to Web-based expert forums, or so-called "ask the doctor" services. To automatically classify lay requests to an Internet medical expert forum using a combination of different text-mining strategies. We first manually classified a sample of 988 requests directed to a involuntary childlessness forum on the German website "Rund ums Baby" ("Everything about Babies") into one or more of 38 categories belonging to two dimensions ("subject matter" and "expectations"). After creating start and synonym lists, we calculated the average Cramer's V statistic for the association of each word with each category. We also used principle component analysis and singular value decomposition as further text-mining strategies. With these measures we trained regression models and determined, on the basis of best regression models, for any request the probability of belonging to each of the 38 different categories, with a cutoff of 50%. Recall and precision of a test sample were calculated as a measure of quality for the automatic classification. According to the manual classification of 988 documents, 102 (10%) documents fell into the category "in vitro fertilization (IVF)," 81 (8%) into the category "ovulation," 79 (8%) into "cycle," and 57 (6%) into "semen analysis." These were the four most frequent categories in the subject matter dimension (consisting of 32 categories). The expectation dimension comprised six categories; we classified 533 documents (54%) as "general information" and 351 (36%) as a wish for "treatment recommendations." The generation of indicator variables based on the chi-square analysis and Cramer's V proved to be the best approach for automatic classification in about half of the categories. In combination with the two other approaches, 100% precision and 100% recall were realized in 18 (47%) out of the 38

  15. FOUR SQUARE WRITING METHOD APPLIED IN PRODUCT AND PROCESS BASED APPROACHES COMBINATION TO TEACHING WRITING DISCUSSION TEXT

    Directory of Open Access Journals (Sweden)

    Vina Agustiana

    2017-12-01

    Full Text Available Four Square Writing Method is a writing method which helps students in organizing concept to write by using a graphic organizer. This study aims to examine the influence of applying FSWM in combination of product and process based approaches to teaching writing discussion texts toward students’ writing skill, the teaching-learning writing process and the students’ attitude toward the implementation of the writing method. This study applies a mixed-method through applying an embedded design. 26 EFL university students of a private university in West Java, Indonesia, are involved in the study. There are 3 kinds of instrument used, namely tests (pre and post-test, field notes, and questionnaires. Data taken from students’ writing test are analyzed statistically to identify the influence of applying the writing method toward students’ writing skill; data taken from field notes are analyzed qualitatively to examine the learning writing activities at the time the writing method is implemented; and data taken from questionnaires are analyzed descriptive statistic to explore students’ attitude toward the implementation of the writing method. Regarding the result of paired t-test, the writing method is effective in improving students’ writing skill since level of significant (two-tailed is less than alpha (0.000<0.05. Furthermore, the result taken from field notes shows that each steps applied and graphic organizer used in the writing method lead students to compose discussion texts which meet a demand of genre. In addition, regard with the result taken from questionnaire, the students show highly positive attitude toward the treatment since the mean score is 4.32.

  16. Study on the Method of Association Rules Mining Based on Genetic Algorithm and Application in Analysis of Seawater Samples

    Directory of Open Access Journals (Sweden)

    Qiuhong Sun

    2014-04-01

    Full Text Available Based on the data mining research, the data mining based on genetic algorithm method, the genetic algorithm is briefly introduced, while the genetic algorithm based on two important theories and theoretical templates principle implicit parallelism is also discussed. Focuses on the application of genetic algorithms for association rule mining method based on association rule mining, this paper proposes a genetic algorithm fitness function structure, data encoding, such as the title of the improvement program, in particular through the early issues study, proposed the improved adaptive Pc, Pm algorithm is applied to the genetic algorithm, thereby improving efficiency of the algorithm. Finally, a genetic algorithm based association rule mining algorithm, and be applied in sea water samples database in data mining and prove its effective.

  17. Data Mining Based on Cloud-Computing Technology

    Directory of Open Access Journals (Sweden)

    Ren Ying

    2016-01-01

    Full Text Available There are performance bottlenecks and scalability problems when traditional data-mining system is used in cloud computing. In this paper, we present a data-mining platform based on cloud computing. Compared with a traditional data mining system, this platform is highly scalable, has massive data processing capacities, is service-oriented, and has low hardware cost. This platform can support the design and applications of a wide range of distributed data-mining systems.

  18. Text classification

    OpenAIRE

    Deveikis, Karolis

    2016-01-01

    This paper investigates the problem of text classification. The task of text classification is to assign a piece of text to one of several categories based on its content. Text classification is one of the tasks of natural language processing. Like the others, it is often solved using machine learning algorithms. There are many algorithms suitable for text classification. As a result, a problem of choice arises. In an effort to solve this problem, this paper analyzes various feature extractio...

  19. On Fixed and Fluid "Texts": "The Singer of Tales" and the Natural Approach of Tracy D. Terrell.

    Science.gov (United States)

    Worth, Frederick R.

    1990-01-01

    Relates language acquisition theories regarding comprehension, early speech, and speech emergence within the Natural Approach, which returns language learning to the living context and maintains the isolated fragments of language as a whole, to those theories expressed in a comparison between the experience of an apprentice singer of tales to that…

  20. Text Mining.

    Science.gov (United States)

    Trybula, Walter J.

    1999-01-01

    Reviews the state of research in text mining, focusing on newer developments. The intent is to describe the disparate investigations currently included under the term text mining and provide a cohesive structure for these efforts. A summary of research identifies key organizations responsible for pushing the development of text mining. A section…

  1. Resins and Gums in Historical Iatrosophia Texts from Cyprus – A Botanical and Medico-pharmacological Approach

    Science.gov (United States)

    Lardos, Andreas; Prieto-Garcia, José; Heinrich, Michael

    2011-01-01

    This study explores historical iatrosophia texts from Cyprus from a botanical and medico-pharmacological point of view focusing on remedies containing resins and gums. The iatrosophia are a genre of Greek medical literature of Byzantine origin and can be described as medicine handbooks which serve as therapeutic repositories containing recipes or advice. To extract and analyze information on plant usage in such sources – which are largely unedited texts and so far have not been translated – we investigate (i) the relationship of the iatrosophia to Dioscorides’ De Materia Medica as well as historic pharmaceutical books or standard texts on modern phytotherapy and (ii) the validity of the remedies by comparing them to modern scientific data on reported biological activities. In the six texts investigated 27 substances incorporating plant exudates are mentioned. They are obtained from over 43 taxa of higher plants and in particular are used to treat dermatological, gastrointestinal, and respiratory tract conditions. The comparison to historic pharmaceutical books and phytotherapy texts reflects the gradual decline of the use of plant exudates in Western medicine. While remarkable parallels to Dioscorides’ text exist, the non-Dioscoridean influence suggests a complex pattern of knowledge exchange. Overall, this resulted in an integration of knowledge from so far poorly understood sources. The comparison with bioscientific data reveals a fragmentary picture and highlights the potential of these unexplored substances and their uses. Where relevant bioscientific data are available, we generally found a confirmation. This points to a largely rational use of the associated remedies. Taken together, the iatrosophia are a valuable resource for ethnopharmacological and natural product research. Most importantly they contribute to the understanding of the development of herbal medicines in the (Eastern) Mediterranean and Europe. PMID:21772820

  2. A Text World Theory Approach to the Teaching of Short Stories in an EFL Context: A Pedagogical Stylistic Study

    Science.gov (United States)

    Mohammadzadeh, Behbood

    2017-01-01

    The present study attempts to examine how the stylistic aspects of Text World Theory (TWT) can be used in Literature and Language Teaching classrooms in order to help students to improve their critical understanding and interpretation. The pedagogical stylistic application of this theory can enhance ELT students' language awareness, creative…

  3. GENETIC RITES: A DISCURSIVE APPROACH TO LITERARY TEXT AND ITS CONTRIBUTIONS TO THE THEORY OF DISCOURSE ANALYSIS

    Directory of Open Access Journals (Sweden)

    Denise B. A. Aguiar

    2015-12-01

    Full Text Available In this paper a double objective is pursued: on the one hand, to evaluate the productivity of the concept of genetic rites (MAINGUENEAU, 2008 in the analysis of literary texts; on the other hand, to explicit the contributions of the analysis of literary texts to a redefinition of the very theoretical framework of discourse analysis. This dual path will be followed in the context of global semantics proposed by Maingueneau (2008.In the development of our reflections, the concept of genetic rites is brought forward in various literary productions of different periods of our cultural history, so that we will be able to apprehend it in the specificity of production of sense in artistic writing, and also to survey some of its modes of insertion in the universe of fictional construction, within a metalinguistic movement typical of literature. Among the achieved results we highlight the relevance of the analysis of genetic rites for the consolidation of the concept of discursive practice and the centrality of literary text in producing a memory establishing forms of enunciability involved with the very constitution of language.

  4. Environmental approach to geographic thinking development from the perspective of José Marti’s texts

    OpenAIRE

    González, Roeris

    2014-01-01

    The article presents findings of the research the project "Systematization of the theory and practice for a didactics of geographical thinking in natural sciences teacher currently running at the Faculty of Sciences. The study is intended to highlight the potentials of José Marti's text for environmentally focused geographical thinking junior high school students. It starts by assessing the importance of geographical thinking for contemporary society and the presence of this conception in the...

  5. How Does an Interactive Approach to Literary Texts Work in an English as a Foreign Language Context? Learners' Perspectives in Close-Up

    Science.gov (United States)

    Nguyen, Ha Thi Thu

    2016-01-01

    Interactive approaches to literary texts in second/foreign language education have enjoyed wide theoretical and empirical support. However, the teaching of literary texts in traditional English as a foreign language contexts still remains information-oriented, with a focus on the transmission and replication of an objectified interpretation of a…

  6. Research design: qualitative, quantitative and mixed methods approaches Research design: qualitative, quantitative and mixed methods approaches Creswell John W Sage 320 £29 0761924426 0761924426 [Formula: see text].

    Science.gov (United States)

    2004-09-01

    The second edition of Creswell's book has been significantly revised and updated. The author clearly sets out three approaches to research: quantitative, qualitative and mixed methods. As someone who has used mixed methods in my research, it is refreshing to read a textbook that addresses this. The differences between the approaches are clearly identified and a rationale for using each methodological stance provided.

  7. Texting for Health: An Evaluation of a Population Approach to Type 2 Diabetes Risk Reduction With a Personalized Message.

    Science.gov (United States)

    Khurshid, Anjum; Brown, Lisanne; Mukherjee, Snigdha; Abebe, Nebeyou; Kulick, David

    2015-11-01

    txt4health is an innovative, 14-week, interactive, population-based mobile health program for individuals at risk of type 2 diabetes, developed under the Beacon Community Program in the Greater New Orleans, La., area. A comprehensive social marketing campaign sought to enroll hard-to-reach, at-risk populations using a combination of mass media and face-to-face engagement in faith-based and retail environments. Little is known about the effectiveness of social marketing for mobile technology application in the general population. A systematic evaluation of the campaign identified successes and barriers to implementing a population-based mobile health program. Face-to-face engagement helped increase program enrollment after the initial launch; otherwise, enrollment leveled off over time. Results show positive trends in reaching target populations and in the use of mobile phones to record personal health information and set goals for reducing the risk of type 2 diabetes. The lessons from the txt4health campaign can help inform the development and programmatic strategies to provide a person-level intervention using a population-level approach for individuals at risk for diabetes as well as aid in chronic disease management.

  8. A abordagem do texto cristão em Erich Auerbach (Erich Auerbach’s approach to Christian texts

    Directory of Open Access Journals (Sweden)

    Victor de Oliveira Pinto Coelho

    2012-07-01

    Full Text Available Este trabalho é um breve estudo sobre a análise literária de textos cristãos elaborada por Erich Auerbach. O objetivo é destacar como, a partir do sermão 256 de Santo Agostinho e da Bíblia, Auerbach ilumina a articulação do sublime cristão com o sermo humilis, ou seja, incorpora a linguagem ordinária e temas prosaicos cotidianos para transmitir a mensagem religiosa. Do ponto de vista teórico-conceitual, faremos uma breve exposição sobre teoria da literatura, mais especificamente, sobre mimesis e literatura, como forma de pensar a abordagem de textos cristãos. Segundo Auerbach, o texto cristão, num mundo bastante conturbado, incorporou a vida e a linguagem simples das pessoas para, então, configurar uma nova forma do sublime. Concluímos que o texto cristão, visando a uma formulação religiosa (normativa, para isso trouxe para seu interior aquilo que definimos como dinâmica histórica, para trabalhá-la numa proposta de sentido. Palavras-chave: Erich Auerbach. Sermo humilis. Sublime. Literatura. Mímesis.

  9. Examining Thematic Similarity, Difference, and Membership in Three Online Mental Health Communities from Reddit: A Text Mining and Visualization Approach.

    Science.gov (United States)

    Park, Albert; Conway, Mike; Chen, Annie T

    2018-01-01

    Social media, including online health communities, have become popular platforms for individuals to discuss health challenges and exchange social support with others. These platforms can provide support for individuals who are concerned about social stigma and discrimination associated with their illness. Although mental health conditions can share similar symptoms and even co-occur, the extent to which discussion topics in online mental health communities are similar, different, or overlapping is unknown. Discovering the topical similarities and differences could potentially inform the design of related mental health communities and patient education programs. This study employs text mining, qualitative analysis, and visualization techniques to compare discussion topics in publicly accessible online mental health communities for three conditions: Anxiety, Depression and Post-Traumatic Stress Disorder. First, online discussion content for the three conditions was collected from three Reddit communities (r/Anxiety, r/Depression, and r/PTSD). Second, content was pre-processed, and then clustered using the k -means algorithm to identify themes that were commonly discussed by members. Third, we qualitatively examined the common themes to better understand them, as well as their similarities and differences. Fourth, we employed multiple visualization techniques to form a deeper understanding of the relationships among the identified themes for the three mental health conditions. The three mental health communities shared four themes: sharing of positive emotion, gratitude for receiving emotional support, and sleep- and work-related issues. Depression clusters tended to focus on self-expressed contextual aspects of depression, whereas the Anxiety Disorders and Post-Traumatic Stress Disorder clusters addressed more treatment- and medication-related issues. Visualizations showed that discussion topics from the Anxiety Disorders and Post-Traumatic Stress Disorder subreddits

  10. Reorganized text.

    Science.gov (United States)

    2015-05-01

    Reorganized Text: In the Original Investigation titled “Patterns of Hospital Utilization for Head and Neck Cancer Care: Changing Demographics” posted online in the January 29, 2015, issue of JAMA Otolaryngology–Head & Neck Surgery (doi:10.1001 /jamaoto.2014.3603), information was copied within sections and text rearranged to accommodate Continuing Medical Education quiz formatting. The information from the topic statements of each paragraph in the Hypothesis Testing subsection of the Methods section was collected in a new first paragraph for that subsection, which reads as follows: “Several hypotheses regarding the causes of regionalization of HNCA care were tested using the NIS data: (1) increasing patient comorbidities over time, causing a shift in care to teaching institutions that would theoretically be better equipped to handle such increased comorbidities; (2) shifting of payer status; (3) increased proportion of prior radiation therapy; and (4) a higher fraction of more complex procedures being referred and performed at teaching institutions.” In addition, the phrase "As summarized in Table3," was added to the beginning of paragraph 6 of the Discussion section, and the call-out to Table 3 in the middle of that paragraph was deleted. Finally, paragraphs 6 and 7 of the Discussion section were combined.

  11. Improving Students� Ability in Writing Hortatory Exposition Texts by Using Process-Genre Based Approach with YouTube Videos as the Media

    Directory of Open Access Journals (Sweden)

    fifin naili rizkiyah

    2017-06-01

    Full Text Available Abstract: This research is aimed at finding out how Process-Genre Based Approach strategy with YouTube Videos as the media are employed to improve the students� ability in writing hortatory exposition texts. This study uses collaborative classroom action research design following the procedures namely planning, implementing, observing, and reflecting. The procedures of carrying out the strategy are: (1 relating several issues/ cases to the students� background knowledge and introducing the generic structures and linguistic features of hortatory exposition text as the BKoF stage, (2 analyzing the generic structure and the language features used in the text and getting model on how to write a hortatory exposition text by using the YouTube Video as the MoT stage, (3 writing a hortatory exposition text collaboratively in a small group and in pairs through process writing as the JCoT stage, and (4 writing a hortatory exposition text individually as the ICoT stage. The result shows that the use of Process-Genre Based Approach and YouTube Videos can improve the students� ability in writing hortatory exposition texts. The percentage of the students achieving the score above the minimum passing grade (70 had improved from only 15.8% (3 out of 19 students in the preliminary study to 100% (22 students in the Cycle 1. Besides, the score of each aspect; content, organization, vocabulary, grammar, and mechanics also improved. � Key Words: writing ability, hortatory exposition text, process-genre based approach, youtube video

  12. Text Maps: Helping Students Navigate Informational Texts.

    Science.gov (United States)

    Spencer, Brenda H.

    2003-01-01

    Notes that a text map is an instructional approach designed to help students gain fluency in reading content area materials. Discusses how the goal is to teach students about the important features of the material and how the maps can be used to build new understandings. Presents the procedures for preparing and using a text map. (SG)

  13. Towards an Ethical Approach to Perspective-Taking and the Teaching of Multicultural Texts: Getting beyond Persuasion, Politeness and Political Correctness

    Science.gov (United States)

    Thein, Amanda Haertling; Sloan, DeAnn Long

    2012-01-01

    This paper aims to problematize perspective-taking -- an instructional practice widely thought to be useful in helping students develop the ability to better understand their own worlds and the worlds of others in multicultural texts. We provide examples that illustrate difficulties discovered in implementing a perspective-taking approach to…

  14. The Effects of Using Multimodal Approaches in Meaning-Making of 21st Century Literacy Texts Among ESL Students in a Private School in Malaysia

    Directory of Open Access Journals (Sweden)

    Malini Ganapathy

    2016-04-01

    Full Text Available In today’s globalised digital era, students are inevitably engaged in various multimodal texts due to their active participation in social media and frequent usage of mobile devices on a daily basis. Such daily activities advocate the need for a transformation in the teaching and learning of ESL lessons in order to promote students’ capabilities in making meaning of different literacy texts which students come across in their ESL learning activities. This paper puts forth the framework of Multimodality in the restructuring of the teaching and learning of ESL with the aim of investigating its effects and students perspectives on the use of multimodal approaches underlying the Multiliteracies theory. Using focus group interviews, this qualitative case study examines the effectiveness of ESL teaching and learning using the Multimodal approaches on literacy in meaning-making among 15 students in a private school in Penang, Malaysia. The results confirm the need to reorientate the teaching and learning of ESL with the focus on multimodal pedagogical practices as it promotes positive learning outcomes among students. The implications of this study suggest that the multimodal approaches integrated in the teaching and learning of ESL have the capacity to promote students’ autonomy in learning, improve motivation to learn and facilitate various learning styles. Keywords: Multimodal Approaches; Multiliteracies; Monomodal; Flipped Classroom; Literacy; Multimodal texts; Ipad

  15. Improve discrimination power of serum markers for diagnosis of cholangiocarcinoma using data mining-based approach.

    Science.gov (United States)

    Pattanapairoj, Sirorat; Silsirivanit, Atit; Muisuk, Kanha; Seubwai, Wunchana; Cha'on, Ubon; Vaeteewoottacharn, Kulthida; Sawanyawisuth, Kanlayanee; Chetchotsak, Danaipong; Wongkham, Sopit

    2015-07-01

    Cholangiocarcinoma (CCA) is usually fatal because of the absence of tests for early detection and lack of effective therapy. Tumor markers with adequate diagnostic values are of clinical significance. This study is aimed to improve the diagnostic power of serum markers using the computational data mining technique to develop a combined diagnostic model that yielded the best diagnostic values for CCA. Eight CCA-associated markers-carcinoembryonic antigen, carbohydrate antigen 19-9, alkaline phosphatase (ALP), and gamma glutamyl transferase, biliary-ALP, mucin5AC, CCA-associated carbohydrate antigen (CCA-CA) and CA-S27-were used as the inputs for the C4.5 decision tree classification model and the selected model was confirmed by ANN analyses. Eight serum markers for CCA were determined in the training set of 85 histologically proven-CCA patients and 82 control subjects. The chosen set of combined markers that gave the best diagnostic values for CCA was then validated in the testing set of 22 CCA patients and 60 controls. A decision tree diagram built by the C4.5 algorithm suggested the serial analysis of CCA-CA and ALP for distinguishing CCA patients from non-CCA subjects with all diagnostic parameters ≥95%. The combined tests showed a precise diagnosis in the testing set. The C4.5 model indicates the combined markers of CCA-CA and ALP that produced the more precise diagnosis for CCA. Copyright © 2015 The Canadian Society of Clinical Chemists. Published by Elsevier Inc. All rights reserved.

  16. Land Ecological Security Evaluation of Underground Iron Mine Based on PSR Model

    Science.gov (United States)

    Xiao, Xiao; Chen, Yong; Ruan, Jinghua; Hong, Qiang; Gan, Yong

    2018-01-01

    Iron ore mine provides an important strategic resource to the national economy while it also causes many serious ecological problems to the environment. The study summed up the characteristics of ecological environment problems of underground iron mine. Considering the mining process of underground iron mine, we analysis connections between mining production, resource, environment and economical background. The paper proposed a land ecological security evaluation system and method of underground iron mine based on Pressure-State-Response model. Our application in Chengchao iron mine proves its efficiency and promising guide on land ecological security evaluation.

  17. Systematic text condensation

    DEFF Research Database (Denmark)

    Malterud, Kirsti

    2012-01-01

    To present background, principles, and procedures for a strategy for qualitative analysis called systematic text condensation and discuss this approach compared with related strategies.......To present background, principles, and procedures for a strategy for qualitative analysis called systematic text condensation and discuss this approach compared with related strategies....

  18. A public theological approach to the (impossibility of forgiveness in Matthew 18:15-35: Reading the text through the lens of integral theory

    Directory of Open Access Journals (Sweden)

    Dion A. Forster

    2017-01-01

    Full Text Available Some 20 years after the dawn of participative democracy, there is little noticeable or substantial change in the living conditions of the average South African. The country remains divided by race, class and economics. Poverty, inequality and racial enmity remain looming challenges to human flourishing and social transformation. Some have begun to ask whether forgiveness for the sins of colonialism and apartheid are possible. This article engages with the (impossibilityof forgiveness as it is presented in Matthew 18:15-35. In particular, it does so from the bilingual perspective of a public theological engagement with the text and its contemporary readers in South Africa. By reading the text from an integral All Quadrants All Levels (AQAL approach this article extrapolates a textured understanding of forgiveness that ‘possibilises’ the (impossiblity of forgiveness between racially and socially divided groups of readers.

  19. Suggestions toward some discourse-analytic approaches to text difficulty: with special reference to ‘T-unit configuration’ in the textual unfolding

    Directory of Open Access Journals (Sweden)

    Kazem Lotfipour-Saedi

    2015-01-01

    Full Text Available This paper represents some suggestions towards discourse-analytic approaches for ESL/EFL education, with the focus on identifying the textual forms which can contribute to the textual difficulty. Textual difficulty / comprehensibility, rather than being purely text-based or reader-dependent, is certainly a matter of interaction between text and reader. The paper will look at some of the textual factors which can be argued to make a text more or less readable for the same reader. The main focus here will be on academic texts. The high cognitive load and low readability of the expository texts in various academic disciplines will be argued to belong to certain textual strategies as well as variations in the configurations of the T-units as the prime scaffolding for the textualization process. Different categories of these variations to be discussed here will be exemplified from a few academic and expository registers. More extensive textual analyses will, of course, be necessary in order to be able to make evidential suggestions for possible correlations between certain types and clusters of T-unit configurations on the one hand, and cognitive load and readability indices on the other, across various academic registers, genres and disciplines.

  20. Bilingual approach to online cancer genetics education for Deaf American Sign Language users produces greater knowledge and confidence than English text only: A randomized study.

    Science.gov (United States)

    Palmer, Christina G S; Boudreault, Patrick; Berman, Barbara A; Wolfson, Alicia; Duarte, Lionel; Venne, Vickie L; Sinsheimer, Janet S

    2017-01-01

    Deaf American Sign Language-users (ASL) have limited access to cancer genetics information they can readily understand, increasing risk for health disparities. We compared effectiveness of online cancer genetics information presented using a bilingual approach (ASL with English closed captioning) and a monolingual approach (English text). Bilingual modality would increase cancer genetics knowledge and confidence to create a family tree; education would interact with modality. We used a parallel 2:1 randomized pre-post study design stratified on education. 150 Deaf ASL-users ≥18 years old with computer and internet access participated online; 100 (70 high, 30 low education) and 50 (35 high, 15 low education) were randomized to the bilingual and monolingual modalities. Modalities provide virtually identical content on creating a family tree, using the family tree to identify inherited cancer risk factors, understanding how cancer predisposition can be inherited, and the role of genetic counseling and testing for prevention or treatment. 25 true/false items assessed knowledge; a Likert scale item assessed confidence. Data were collected within 2 weeks before and after viewing the information. Significant interaction of language modality, education, and change in knowledge scores was observed (p = .01). High education group increased knowledge regardless of modality (Bilingual: p education group increased knowledge with bilingual (p Bilingual modality yielded greater confidence creating a family tree (p = .03). Bilingual approach provides a better opportunity for lower educated Deaf ASL-users to access cancer genetics information than a monolingual approach. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.

  1. Time delay and profit accumulation effect on a mine-based uranium market clearing model

    International Nuclear Information System (INIS)

    Auzans, Aris; Teder, Allan; Tkaczyk, Alan H.

    2016-01-01

    Highlights: • Improved version of a mine-based uranium market clearing model for the front-end uranium market and enrichment industries is proposed. • A profit accumulation algorithm and time delay function provides more realistic uranium mine decision making process. • Operational decision delay increased uranium market price volatility. - Abstract: The mining industry faces a number of challenges such as market volatility, investment safety, issues surrounding employment and productivity. Therefore, computer simulations are highly relevant in order to reduce financial risks associated with these challenges. In the mining industry, each firm must compete with other mines and the basic target is profit maximization. The aim of this paper is to evaluate the world uranium (U) supply by simulating financial management challenges faced by an individual U mine that are caused by a variety of regulation issues. In this paper front-end nuclear fuel cycle tool is used to simulate market conditions and the effects they have on the stability of U supply. An individual U mine’s exit or entry in the market might cause changes in the U supply side which can increase or decrease the market price. In this paper we offer a more advanced version of a mine-based U market clearing model. The existing U market model incorporates the market of primary U from uranium mines with secondary uranium (depleted uranium DU), enriched uranium (HEU) and enrichment services. In the model each uranium mine acts as an independent agent that is able to make operational decisions based on the market price. This paper introduces a more realistic decision making algorithm of individual U mine that adds constraints to production decisions. The authors added an accumulated profit model, which allows for the profits accumulated to cover any possible future economic losses and the time-delay algorithm to simulate delayed process of reopening a U mine. The U market simulation covers time period 2010

  2. Time delay and profit accumulation effect on a mine-based uranium market clearing model

    Energy Technology Data Exchange (ETDEWEB)

    Auzans, Aris [Institute of Physics, University of Tartu, Ostwaldi 1, EE-50411 Tartu (Estonia); Teder, Allan [School of Economics and Business Administration, University of Tartu, Narva mnt 4, EE-51009 Tartu (Estonia); Tkaczyk, Alan H., E-mail: alan@ut.ee [Institute of Physics, University of Tartu, Ostwaldi 1, EE-50411 Tartu (Estonia)

    2016-12-15

    Highlights: • Improved version of a mine-based uranium market clearing model for the front-end uranium market and enrichment industries is proposed. • A profit accumulation algorithm and time delay function provides more realistic uranium mine decision making process. • Operational decision delay increased uranium market price volatility. - Abstract: The mining industry faces a number of challenges such as market volatility, investment safety, issues surrounding employment and productivity. Therefore, computer simulations are highly relevant in order to reduce financial risks associated with these challenges. In the mining industry, each firm must compete with other mines and the basic target is profit maximization. The aim of this paper is to evaluate the world uranium (U) supply by simulating financial management challenges faced by an individual U mine that are caused by a variety of regulation issues. In this paper front-end nuclear fuel cycle tool is used to simulate market conditions and the effects they have on the stability of U supply. An individual U mine’s exit or entry in the market might cause changes in the U supply side which can increase or decrease the market price. In this paper we offer a more advanced version of a mine-based U market clearing model. The existing U market model incorporates the market of primary U from uranium mines with secondary uranium (depleted uranium DU), enriched uranium (HEU) and enrichment services. In the model each uranium mine acts as an independent agent that is able to make operational decisions based on the market price. This paper introduces a more realistic decision making algorithm of individual U mine that adds constraints to production decisions. The authors added an accumulated profit model, which allows for the profits accumulated to cover any possible future economic losses and the time-delay algorithm to simulate delayed process of reopening a U mine. The U market simulation covers time period 2010

  3. Text Mining of the Electronic Health Record: An Information Extraction Approach for Automated Identification and Subphenotyping of HFpEF Patients for Clinical Trials.

    Science.gov (United States)

    Jonnalagadda, Siddhartha R; Adupa, Abhishek K; Garg, Ravi P; Corona-Cox, Jessica; Shah, Sanjiv J

    2017-06-01

    Precision medicine requires clinical trials that are able to efficiently enroll subtypes of patients in whom targeted therapies can be tested. To reduce the large amount of time spent screening, identifying, and recruiting patients with specific subtypes of heterogeneous clinical syndromes (such as heart failure with preserved ejection fraction [HFpEF]), we need prescreening systems that are able to automate data extraction and decision-making tasks. However, a major obstacle is the vast amount of unstructured free-form text in medical records. Here we describe an information extraction-based approach that automatically converts unstructured text into structured data, which is cross-referenced against eligibility criteria using a rule-based system to determine which patients qualify for a major HFpEF clinical trial (PARAGON). We show that we can achieve a sensitivity and positive predictive value of 0.95 and 0.86, respectively. Our open-source algorithm could be used to efficiently identify and subphenotype patients with HFpEF and other disorders.

  4. Data-Mining-Based Intelligent Differential Relaying for Transmission Lines Including UPFC and Wind Farms.

    Science.gov (United States)

    Jena, Manas Kumar; Samantaray, Subhransu Ranjan

    2016-01-01

    This paper presents a data-mining-based intelligent differential relaying scheme for transmission lines, including flexible ac transmission system device, such as unified power flow controller (UPFC) and wind farms. Initially, the current and voltage signals are processed through extended Kalman filter phasor measurement unit for phasor estimation, and 21 potential features are computed at both ends of the line. Once the features are extracted at both ends, the corresponding differential features are derived. These differential features are fed to a data-mining model known as decision tree (DT) to provide the final relaying decision. The proposed technique has been extensively tested for single-circuit transmission line, including UPFC and wind farms with in-feed, double-circuit line with UPFC on one line and wind farm as one of the substations with wide variations in operating parameters. The test results obtained from simulation as well as in real-time digital simulator testing indicate that the DT-based intelligent differential relaying scheme is highly reliable and accurate with a response time of 2.25 cycles from the fault inception.

  5. Semantic Mining based on graph theory and ontologies. Case Study: Cell Signaling Pathways

    Directory of Open Access Journals (Sweden)

    Carlos R. Rangel

    2016-08-01

    Full Text Available In this paper we use concepts from graph theory and cellular biology represented as ontologies, to carry out semantic mining tasks on signaling pathway networks. Specifically, the paper describes the semantic enrichment of signaling pathway networks. A cell signaling network describes the basic cellular activities and their interactions. The main contribution of this paper is in the signaling pathway research area, it proposes a new technique to analyze and understand how changes in these networks may affect the transmission and flow of information, which produce diseases such as cancer and diabetes. Our approach is based on three concepts from graph theory (modularity, clustering and centrality frequently used on social networks analysis. Our approach consists into two phases: the first uses the graph theory concepts to determine the cellular groups in the network, which we will call them communities; the second uses ontologies for the semantic enrichment of the cellular communities. The measures used from the graph theory allow us to determine the set of cells that are close (for example, in a disease, and the main cells in each community. We analyze our approach in two cases: TGF-ß and the Alzheimer Disease.

  6. Zum Bildungspotenzial biblischer Texte

    Directory of Open Access Journals (Sweden)

    Theis, Joachim

    2017-11-01

    Full Text Available Biblical education as a holistic process goes far beyond biblical learning. It must be understood as a lifelong process, in which both biblical texts and their understanders operate appropriating their counterpart in a dialogical way. – Neither does the recipient’s horizon of understanding appear as an empty room, which had to be filled with the text only, nor is the latter a dead material one could only examine cognitively. The recipient discovers the meaning of the biblical text recomposing it by existential appropriation. So the text is brought to live in each individual reality. Both scientific insights and subjective structures as well as the understanders’ community must be included to avoid potential one-sidednesses. Unfortunately, a special negative association obscures the approach of the bible very often: Still biblical work as part of religious education appears in a cognitively oriented habit, which is neither regarding the vitality and sovereignty of the biblical texts nor the students’ desire for meaning. Moreover, the bible is getting misused for teaching moral terms or pontifications. Such downfalls can be disrupted by biblical didactics which are empowerment didactics. Regarding the sovereignty of biblical texts, these didactics assist the understander with his/her individuation by opening the texts with focus on the understander’s otherness. Thus each the text and the recipient become subjects in a dialogue. The approach of the Biblical-Enabling-Didactics leads the Bible to become always new a book of life. Understanding them from within their hermeneutics, empowerment didactics could be raised to the principle of biblical didactics in general and grow into an essential element of holistic education.

  7. A Customizable Text Classifier for Text Mining

    Directory of Open Access Journals (Sweden)

    Yun-liang Zhang

    2007-12-01

    Full Text Available Text mining deals with complex and unstructured texts. Usually a particular collection of texts that is specified to one or more domains is necessary. We have developed a customizable text classifier for users to mine the collection automatically. It derives from the sentence category of the HNC theory and corresponding techniques. It can start with a few texts, and it can adjust automatically or be adjusted by user. The user can also control the number of domains chosen and decide the standard with which to choose the texts based on demand and abundance of materials. The performance of the classifier varies with the user's choice.

  8. The Effects of Using Multimodal Approaches in Meaning-Making of 21st Century Literacy Texts among ESL Students in a Private School in Malaysia

    Science.gov (United States)

    Ganapathy, Malina; Seetharam, Saundravalli A/P

    2016-01-01

    In today's globalised digital era, students are inevitably engaged in various multimodal texts due to their active participation in social media and frequent usage of mobile devices on a daily basis. Such daily activities advocate the need for a transformation in the teaching and learning of ESL lessons in order to promote students' capabilities…

  9. Knowledge Mining Based on Environmental Simulation Applied to Wind Farm Power Forecasting

    Directory of Open Access Journals (Sweden)

    Dongxiao Niu

    2013-01-01

    Full Text Available Considering the inherent variability and uncertainty of wind power generation, in this study, a self-organizing map (SOM combined with rough set theory clustering technique (RST is proposed to extract the relative knowledge and to choose the most similar history situation and efficient data for wind power forecasting with numerical weather prediction (NWP. Through integrating the SOM and RST methods to cluster the historical data into several classes, the approach could find the similar days and excavate the hidden rules. According to the data reprocessing, the selected samples will improve the forecast accuracy echo state network (ESN trained by the class of the forecasting day that is adopted to forecast the wind power output accordingly. The developed methods are applied to a case of power forecasting in a wind farm located in northwest of China with wind power data from April 1, 2008, to May 6, 2009. In order to verify its effectiveness, the performance of the proposed method is compared with the traditional backpropagation neural network (BP. The results demonstrated that knowledge mining led to a promising improvement in the performance for wind farm power forecasting.

  10. The Role of Stylistic Approach in Teaching Contemporary Adabi TextsThe Case of Elaik ya Valadi of Saad-e-Sabbah

    Directory of Open Access Journals (Sweden)

    Nouroddin Parvin

    2014-01-01

    In this research, we will analyze the "eleik Ya Valadi" elegy of Saad e Sabbah, one of the pioneers of contemporary Arabic Rassa poetry at the morphological and syntactic levels with discretional-analytical methods and in stylistic context to analyze the role of stylistics in teaching Arabic texts. One of the most important results of this research is that it is one of the best methods in education of literacy, because of the fact that it considers the consistency and coordination between method of teaching and literacy, and also increases the students’ motivation for understanding and communication. Keywords: Contemporary literacy, teaching literacy, stylistics, Saad e sabbah

  11. Compreensão textual em alunos de segunda e terceira séries: uma abordagem cognitiva Text comprehension in second and third graders: a cognitive approach

    Directory of Open Access Journals (Sweden)

    Jerusa Fumagalli de Salles

    2004-04-01

    Full Text Available Este estudo teve como objetivo analisar a compreensão de leitura textual de alunos de 2ª e 3ª séries. Participaram 76 crianças, com média de idade de 8,1 anos. Cada criança lia a história, recontava-a e, posteriormente, respondia a questões. Os recontos foram analisados segundo o Modelo de Compreensão Textual de Kintsch & van Dijk (1978 e Kintsch (1988, 1998. A amostra relatou, em média, 21,07% da estrutura proposicional da história, sendo mais freqüente o relato de macroproposições. Alunos da terceira série foram superiores aos da segunda série no relato de microproposições menos relevantes do texto e em responder a questões pontuais sobre a história. Foi encontrada uma correlação significativa entre idade e o reconto da macroestrutura textual. Os resultados sugerem que durante os primeiros anos de escolarização ocorreu uma melhora da memorização de detalhes, enquanto que a retenção das idéias essenciais foi influenciada pelas variações de idade das crianças.This study aimed to analyze text comprehension of students of the 2nd and 3rd grades. The sample was constituted by 76 children, at an average of 8.1 years old. Each child read the story, retold it and, afterwards, answered questions about it. The retellings were analyzed according to the model of Text Comprehension of Kintsch and van Dijk (1978 and Kintsch (1988, 1998. The sample recalled a mean of 21.07% of the proposition structure of the story, being the report of macropropositions more frequent. Students of the third grade told larger percentage of irrelevant micropropositions of the text and they were superior in answering to specific questions than students of the second grade. A significant correlation was found between age and macroproposition's retelling. The results suggest that during the first years of schooling there is an improvement of the detail-remembering, whereas the retention of the essential ideas is influenced by age differences.

  12. How did Popular Science Become a Legend? On the linguistic communication of “Science Culture” book series in 1990s Taiwan from the approach of text analysis

    Directory of Open Access Journals (Sweden)

    Ruey-Lin Chen

    2018-01-01

    Full Text Available Commonwealth Publishing Co. in Taiwan has published a series of popular science books, named Science Culture, since 1991. This series has achieved great success in publication and in marketing and up to the present has published over 164 volumes and sold out a great number of hard copies. It is well regarded as a publication legend. How did it succeed? What strategies has it adopted to become such a legend? This paper shows that the series’ success depends on two strategies: exciting subjects and strengthening the first impression. This research applies three related tactics or techniques publishing scientific biographies, literary rhetoric, and using romanticizing titles to realize two strategies of the series. This paper reveals these strategies and techniques by investigating the writing style of books in the series, comparing the titles of the series with other titles of popular science books before 1990, and conducting interviews with the editors of that series.

  13. Effect of a Connective Tissue Graft in Combination With a Single Flap Approach in the Regenerative Treatment of Intraosseous Defects [Formula: see text].

    Science.gov (United States)

    Trombelli, Leonardo; Simonelli, Anna; Minenna, Luigi; Rasperini, Giulio; Farina, Roberto

    2017-04-01

    In the attempt to limit the post-surgery increase in buccal gingival recession (bREC), effect of a connective tissue graft (CTG) when combined with a buccal single flap approach (SFA) in the regenerative treatment of intraosseous defects is evaluated. Data related to 30 patients with an intraosseous defect treated with a buccal SFA with (SFA+CTG group; n = 15) or without (SFA group; n = 15) placement of a CTG and regenerative treatment were retrospectively derived at three clinical centers. bREC and probing parameters were assessed at presurgery and 6 months post-surgery. In addition to a significant attachment gain and probing depth reduction, adjunctive use of a CTG to a buccal SFA in the regenerative treatment of periodontal intraosseous defects associated with a buccal bone dehiscence resulted in a limited post-surgery bREC, a lower prevalence of defects with a clinically detectable apical displacement of the gingival margin, and an increase in gingival width and thickness. Adjunctive use of a CTG in the regenerative treatment of intraosseous defects associated with buccal bone dehiscence accessed by buccal SFA may support the stability of the gingival profile.

  14. Unpacking the Black Box: A Formative Research Approach to the Development of Theory-Driven, Evidence-Based, and Culturally Safe Text Messages in Mobile Health Interventions.

    Science.gov (United States)

    Maar, Marion A; Yeates, Karen; Toth, Zsolt; Barron, Marcia; Boesch, Lisa; Hua-Stewart, Diane; Liu, Peter; Perkins, Nancy; Sleeth, Jessica; Wabano, Mary Jo; Williamson, Pamela; Tobe, Sheldon W

    2016-01-22

    evidence-based text message created by researchers and the message received by the recipient in mobile health interventions. These discrepancies were primarily generated by six mediators of meaning in SMS messages: (1) negative or non-affirming framing of advocacies, (2) fear- or stress-inducing content, (3) oppressive or authoritarian content, (4) incongruity with cultural and traditional practices, (5) disconnect with the reality of the social determinants of health and the diversity of cultures within a population, and (6) lack of clarity and/or practicality of content. These 6 mediators of meaning provide the basis for sound strategies for message development because they impact directly on the target populations' capability, opportunity, and motivation for behavior change. The quality of text messages impacts significantly on the effectiveness of a mobile health intervention. Our research underscores the urgent need for interventions to incorporate and evaluate the quality of SMS messages and to examine the mediators of meaning within each targeted cultural and demographic group. Reporting on this aspect of mobile health intervention research will allow researchers to move away from the current black box of SMS text message development, thus improving the transparency of the process as well as the quality of the outcomes.

  15. O texto bíblico e a igreja católica romana: aproximações pastorais = Bible text and Roman Catholic Church: approaches pastoral

    Directory of Open Access Journals (Sweden)

    Junqueira, Sérgio Rogério Azevedo

    2013-01-01

    Full Text Available O texto é parte de uma pesquisa qualitativa histórica sobre o uso do texto bíblico na pastoral. Articulado a partir do início da era cristã, perpassando pelo período medieval, renascimento, moderno e contemporâneo, este breve estudo histórico será pressuposto para outras etapas da pesquisa do uso pastoral da Bíblia. Significativamente pelo fato de que, ao longo dos séculos, o uso do texto bíblico vinha acompanhado de várias questões acerca de quem e como interpretá-lo, considerando a tradição e o magistério, de forma que houve uma restrição ao texto para a maioria dos cristãos. Trata-se de uma longa história da qual se pretende apresentar alguns acenos para levantar novas questões, sobretudo, quanto ao lugar da Escritura na pastoral da Igreja hoje, especialmente na pastoral escolar

  16. Avoid violence, rioting, and outrage; approach celebration, delight, and strength: Using large text corpora to compute valence, arousal, and the basic emotions.

    Science.gov (United States)

    Westbury, Chris; Keith, Jeff; Briesemeister, Benny B; Hofmann, Markus J; Jacobs, Arthur M

    2015-01-01

    Ever since Aristotle discussed the issue in Book II of his Rhetoric, humans have attempted to identify a set of "basic emotion labels". In this paper we propose an algorithmic method for evaluating sets of basic emotion labels that relies upon computed co-occurrence distances between words in a 12.7-billion-word corpus of unselected text from USENET discussion groups. Our method uses the relationship between human arousal and valence ratings collected for a large list of words, and the co-occurrence similarity between each word and emotion labels. We assess how well the words in each of 12 emotion label sets-proposed by various researchers over the past 118 years-predict the arousal and valence ratings on a test and validation dataset, each consisting of over 5970 items. We also assess how well these emotion labels predict lexical decision residuals (LDRTs), after co-varying out the effects attributable to basic lexical predictors. We then demonstrate a generalization of our method to determine the most predictive "basic" emotion labels from among all of the putative models of basic emotion that we considered. As well as contributing empirical data towards the development of a more rigorous definition of basic emotions, our method makes it possible to derive principled computational estimates of emotionality-specifically, of arousal and valence-for all words in the language.

  17. Interactive text visualization with Text Variation Explorer

    OpenAIRE

    Siirtola, Harri; Isokoski, Poika; Säily, Tanja; Nevalainen, Terttu

    2016-01-01

    Digitalization is changing how research is carried out in all areas of science. Humanities is no exception - materials that used to be hand-written or printed on paper are increasingly available in digital form. This development is changing how scholars are interacting with their material. We are addressing the problem of interactive text visualization in the context of sociolinguistic language study. When a scholar is reading and analyzing text from a computer screen instead of a paper, we c...

  18. Text Mining Genotype-Phenotype Relationships from Biomedical Literature for Database Curation and Precision Medicine.

    Directory of Open Access Journals (Sweden)

    Ayush Singhal

    2016-11-01

    Full Text Available The practice of precision medicine will ultimately require databases of genes and mutations for healthcare providers to reference in order to understand the clinical implications of each patient's genetic makeup. Although the highest quality databases require manual curation, text mining tools can facilitate the curation process, increasing accuracy, coverage, and productivity. However, to date there are no available text mining tools that offer high-accuracy performance for extracting such triplets from biomedical literature. In this paper we propose a high-performance machine learning approach to automate the extraction of disease-gene-variant triplets from biomedical literature. Our approach is unique because we identify the genes and protein products associated with each mutation from not just the local text content, but from a global context as well (from the Internet and from all literature in PubMed. Our approach also incorporates protein sequence validation and disease association using a novel text-mining-based machine learning approach. We extract disease-gene-variant triplets from all abstracts in PubMed related to a set of ten important diseases (breast cancer, prostate cancer, pancreatic cancer, lung cancer, acute myeloid leukemia, Alzheimer's disease, hemochromatosis, age-related macular degeneration (AMD, diabetes mellitus, and cystic fibrosis. We then evaluate our approach in two ways: (1 a direct comparison with the state of the art using benchmark datasets; (2 a validation study comparing the results of our approach with entries in a popular human-curated database (UniProt for each of the previously mentioned diseases. In the benchmark comparison, our full approach achieves a 28% improvement in F1-measure (from 0.62 to 0.79 over the state-of-the-art results. For the validation study with UniProt Knowledgebase (KB, we present a thorough analysis of the results and errors. Across all diseases, our approach returned 272 triplets

  19. Text Mining Genotype-Phenotype Relationships from Biomedical Literature for Database Curation and Precision Medicine.

    Science.gov (United States)

    Singhal, Ayush; Simmons, Michael; Lu, Zhiyong

    2016-11-01

    The practice of precision medicine will ultimately require databases of genes and mutations for healthcare providers to reference in order to understand the clinical implications of each patient's genetic makeup. Although the highest quality databases require manual curation, text mining tools can facilitate the curation process, increasing accuracy, coverage, and productivity. However, to date there are no available text mining tools that offer high-accuracy performance for extracting such triplets from biomedical literature. In this paper we propose a high-performance machine learning approach to automate the extraction of disease-gene-variant triplets from biomedical literature. Our approach is unique because we identify the genes and protein products associated with each mutation from not just the local text content, but from a global context as well (from the Internet and from all literature in PubMed). Our approach also incorporates protein sequence validation and disease association using a novel text-mining-based machine learning approach. We extract disease-gene-variant triplets from all abstracts in PubMed related to a set of ten important diseases (breast cancer, prostate cancer, pancreatic cancer, lung cancer, acute myeloid leukemia, Alzheimer's disease, hemochromatosis, age-related macular degeneration (AMD), diabetes mellitus, and cystic fibrosis). We then evaluate our approach in two ways: (1) a direct comparison with the state of the art using benchmark datasets; (2) a validation study comparing the results of our approach with entries in a popular human-curated database (UniProt) for each of the previously mentioned diseases. In the benchmark comparison, our full approach achieves a 28% improvement in F1-measure (from 0.62 to 0.79) over the state-of-the-art results. For the validation study with UniProt Knowledgebase (KB), we present a thorough analysis of the results and errors. Across all diseases, our approach returned 272 triplets (disease

  20. XML and Free Text.

    Science.gov (United States)

    Riggs, Ken Roger

    2002-01-01

    Discusses problems with marking free text, text that is either natural language or semigrammatical but unstructured, that prevent well-formed XML from marking text for readily available meaning. Proposes a solution to mark meaning in free text that is consistent with the intended simplicity of XML versus SGML. (Author/LRW)

  1. Contextual Text Mining

    Science.gov (United States)

    Mei, Qiaozhu

    2009-01-01

    With the dramatic growth of text information, there is an increasing need for powerful text mining systems that can automatically discover useful knowledge from text. Text is generally associated with all kinds of contextual information. Those contexts can be explicit, such as the time and the location where a blog article is written, and the…

  2. A method for integrating and ranking the evidence for biochemical pathways by mining reactions from text

    Science.gov (United States)

    Miwa, Makoto; Ohta, Tomoko; Rak, Rafal; Rowley, Andrew; Kell, Douglas B.; Pyysalo, Sampo; Ananiadou, Sophia

    2013-01-01

    Motivation: To create, verify and maintain pathway models, curators must discover and assess knowledge distributed over the vast body of biological literature. Methods supporting these tasks must understand both the pathway model representations and the natural language in the literature. These methods should identify and order documents by relevance to any given pathway reaction. No existing system has addressed all aspects of this challenge. Method: We present novel methods for associating pathway model reactions with relevant publications. Our approach extracts the reactions directly from the models and then turns them into queries for three text mining-based MEDLINE literature search systems. These queries are executed, and the resulting documents are combined and ranked according to their relevance to the reactions of interest. We manually annotate document-reaction pairs with the relevance of the document to the reaction and use this annotation to study several ranking methods, using various heuristic and machine-learning approaches. Results: Our evaluation shows that the annotated document-reaction pairs can be used to create a rule-based document ranking system, and that machine learning can be used to rank documents by their relevance to pathway reactions. We find that a Support Vector Machine-based system outperforms several baselines and matches the performance of the rule-based system. The success of the query extraction and ranking methods are used to update our existing pathway search system, PathText. Availability: An online demonstration of PathText 2 and the annotated corpus are available for research purposes at http://www.nactem.ac.uk/pathtext2/. Contact: makoto.miwa@manchester.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online. PMID:23813008

  3. E-text

    DEFF Research Database (Denmark)

    Finnemann, Niels Ole

    2018-01-01

    text can be defined by taking as point of departure the digital format in which everything is represented in the binary alphabet. While the notion of text, in most cases, lends itself to be independent of medium and embodiment, it is also often tacitly assumed that it is, in fact, modeled around...... the print medium, rather than written text or speech. In late 20th century, the notion of text was subject to increasing criticism as in the question raised within literary text theory: is there a text in this class? At the same time, the notion was expanded by including extra linguistic sign modalities...

  4. Searching for text documents

    NARCIS (Netherlands)

    Hiemstra, Djoerd; Blanken, Henk; de Vries, A.P.; Blok, H.E.; Feng, L.

    2007-01-01

    Many documents contain, besides text, also images, tables, and so on. This chapter concentrates on the text part only. Traditionally, systems handling text documents are called information storage and retrieval systems. Before the World-Wide Web emerged, such systems were almost exclusively used by

  5. Air Pollution Monitoring and Mining Based on Sensor Grid in London

    Directory of Open Access Journals (Sweden)

    John Hassard

    2008-06-01

    Full Text Available In this paper, we present a distributed infrastructure based on wireless sensors network and Grid computing technology for air pollution monitoring and mining, which aims to develop low-cost and ubiquitous sensor networks to collect real-time, large scale and comprehensive environmental data from road traffic emissions for air pollution monitoring in urban environment. The main informatics challenges in respect to constructing the high-throughput sensor Grid are discussed in this paper. We present a twolayer network framework, a P2P e-Science Grid architecture, and the distributed data mining algorithm as the solutions to address the challenges. We simulated the system in TinyOS to examine the operation of each sensor as well as the networking performance. We also present the distributed data mining result to examine the effectiveness of the algorithm.

  6. A DATA-MINING BASED METHOD FOR THE GAIT PATTERN ANALYSIS

    Directory of Open Access Journals (Sweden)

    Marcelo Rudek

    2015-12-01

    Full Text Available The paper presents a method developed for the gait classification based on the analysis of the trajectory of the pressure centres (CoP extracted from the contact points of the feet with the ground during walking. The data acquirement is performed ba means of a walkway with embedded tactile sensors. The proposed method includes capturing procedures, standardization of data, creation of an organized repository (data warehouse, and development of a process mining. A graphical analysis is applied to looking at the footprint signature patterns. The aim is to obtain a visual interpretation of the grouping by situating it into the normal walking patterns or deviations associated with an individual way of walking. The method consists of data classification automation which divides them into healthy and non-healthy subjects in order to assist in rehabilitation treatments for the people with related mobility problems.

  7. A Novel Method of Interestingness Measures for Association Rules Mining Based on Profit

    Directory of Open Access Journals (Sweden)

    Chunhua Ju

    2015-01-01

    Full Text Available Association rules mining is an important topic in the domain of data mining and knowledge discovering. Some papers have presented several interestingness measure methods; the most typical are Support, Confidence, Lift, Improve, and so forth. But their limitations are obvious, like no objective criterion, lack of statistical base, disability of defining negative relationship, and so forth. This paper proposes three new methods, Bi-lift, Bi-improve, and Bi-confidence, for Lift, Improve, and Confidence, respectively. Then, on the basis of utility function and the executing cost of rules, we propose interestingness function based on profit (IFBP considering subjective preferences and characteristics of specific application object. Finally, a novel measure framework is proposed to improve the traditional one through experimental analysis. In conclusion, the new methods and measure framework are prior to the traditional ones in the aspects of objective criterion, comprehensive definition, and practical application.

  8. Gas Emission Prediction Model of Coal Mine Based on CSBP Algorithm

    Directory of Open Access Journals (Sweden)

    Xiong Yan

    2016-01-01

    Full Text Available In view of the nonlinear characteristics of gas emission in a coal working face, a prediction method is proposed based on cuckoo search algorithm optimized BP neural network (CSBP. In the CSBP algorithm, the cuckoo search is adopted to optimize weight and threshold parameters of BP network, and obtains the global optimal solutions. Furthermore, the twelve main affecting factors of the gas emission in the coal working face are taken as input vectors of CSBP algorithm, the gas emission is acted as output vector, and then the prediction model of BP neural network with optimal parameters is established. The results show that the CSBP algorithm has batter generalization ability and higher prediction accuracy, and can be utilized effectively in the prediction of coal mine gas emission.

  9. Vocabulary Constraint on Texts

    Directory of Open Access Journals (Sweden)

    C. Sutarsyah

    2008-01-01

    Full Text Available This case study was carried out in the English Education Department of State University of Malang. The aim of the study was to identify and describe the vocabulary in the reading text and to seek if the text is useful for reading skill development. A descriptive qualitative design was applied to obtain the data. For this purpose, some available computer programs were used to find the description of vocabulary in the texts. It was found that the 20 texts containing 7,945 words are dominated by low frequency words which account for 16.97% of the words in the texts. The high frequency words occurring in the texts were dominated by function words. In the case of word levels, it was found that the texts have very limited number of words from GSL (General Service List of English Words (West, 1953. The proportion of the first 1,000 words of GSL only accounts for 44.6%. The data also show that the texts contain too large proportion of words which are not in the three levels (the first 2,000 and UWL. These words account for 26.44% of the running words in the texts.  It is believed that the constraints are due to the selection of the texts which are made of a series of short-unrelated texts. This kind of text is subject to the accumulation of low frequency words especially those of content words and limited of words from GSL. It could also defeat the development of students' reading skills and vocabulary enrichment.

  10. An association rule mining-based framework for understanding lifestyle risk behaviors.

    Directory of Open Access Journals (Sweden)

    So Hyun Park

    Full Text Available OBJECTIVES: This study investigated the prevalence and patterns of lifestyle risk behaviors in Korean adults. METHODS: We utilized data from the Fourth Korea National Health and Nutrition Examination Survey for 14,833 adults (>20 years of age. We used association rule mining to analyze patterns of lifestyle risk behaviors by characterizing non-adherence to public health recommendations related to the Alameda 7 health behaviors. The study variables were current smoking, heavy drinking, physical inactivity, obesity, inadequate sleep, breakfast skipping, and frequent snacking. RESULTS: Approximately 72% of Korean adults exhibited two or more lifestyle risk behaviors. Among women, current smoking, obesity, and breakfast skipping were associated with inadequate sleep. Among men, breakfast skipping with additional risk behaviors such as physical inactivity, obesity, and inadequate sleep was associated with current smoking. Current smoking with additional risk behaviors such as inadequate sleep or breakfast skipping was associated with physical inactivity. CONCLUSION: Lifestyle risk behaviors are intercorrelated in Korea. Information on patterns of lifestyle risk behaviors could assist in planning interventions targeted at multiple behaviors simultaneously.

  11. Topography and Data Mining Based Methods for Improving Satellite Precipitation in Mountainous Areas of China

    Directory of Open Access Journals (Sweden)

    Ting Xia

    2015-07-01

    Full Text Available Topography is a significant factor influencing the spatial distribution of precipitation. This study developed a new methodology to evaluate and calibrate the Tropical Rainfall Measuring Mission Multi-satellite Precipitation Analysis (TMPA products by merging geographic and topographic information. In the proposed method, firstly, the consistency rule was introduced to evaluate the fitness of satellite rainfall with measurements on the grids with and without ground gauges. Secondly, in order to improve the consistency rate of satellite rainfall, genetic programming was introduced to mine the relationship between the gauge rainfall and location, elevation and TMPA rainfall. The proof experiment and analysis for the mean annual satellite precipitation from 2001–2012, 3B43 (V7 of TMPA rainfall product, was carried out in eight mountainous areas of China. The result shows that the proposed method is significant and efficient both for the assessment and improvement of satellite precipitation. It is found that the satellite rainfall consistency rates in the gauged and ungauged grids are different in the study area. In addition, the mined correlation of location-elevation-TMPA rainfall can noticeably improve the satellite precipitation, both in the context of the new criterion of the consistency rate and the existing criteria such as Bias and RMSD. The proposed method is also efficient for correcting the monthly and mean monthly rainfall of 3B43 and 3B42RT.

  12. Groundwater Mixing Process Identification in Deep Mines Based on Hydrogeochemical Property Analysis

    Directory of Open Access Journals (Sweden)

    Bo Liu

    2016-12-01

    Full Text Available Karst collapse columns, as a potential water passageway for mine water inrush, are always considered a critical problem for the development of deep mining techniques. This study aims to identify the mixing process of groundwater deriving two different limestone karst-fissure aquifer systems. Based on analysis of mining groundwater hydrogeochemical properties, hydraulic connection between the karst-fissure objective aquifer systems was revealed. In this paper, piper diagram was used to calculate the mixing ratios at different sampling points in the aquifer systems, and PHREEQC Interactive model (Version 2.5, USGS, Reston, VA, USA, 2001 was applied to modify the mixing ratios and model the water–rock interactions during the mixing processes. The analysis results show that the highest mixing ratio is 0.905 in the C12 borehole that is located nearest to the #2 karst collapse column, and the mixing ratio decreases with the increase of the distance from the #2 karst collapse column. It demonstrated that groundwater of the two aquifers mixed through the passage of #2 karst collapse column. As a result, the proposed Piper-PHREEQC based method can provide accurate identification of karst collapse columns’ water conductivity, and can be applied to practical applications.

  13. Instant Sublime Text starter

    CERN Document Server

    Haughee, Eric

    2013-01-01

    A starter which teaches the basic tasks to be performed with Sublime Text with the necessary practical examples and screenshots. This book requires only basic knowledge of the Internet and basic familiarity with any one of the three major operating systems, Windows, Linux, or Mac OS X. However, as Sublime Text 2 is primarily a text editor for writing software, many of the topics discussed will be specifically relevant to software development. That being said, the Sublime Text 2 Starter is also suitable for someone without a programming background who may be looking to learn one of the tools of

  14. The Vicissitudes of Text

    Directory of Open Access Journals (Sweden)

    Jonathan CULLER

    2003-06-01

    Full Text Available The concept of text, which has been central to literary studies, has undergone many mutations, as it has traveled from the work of classical philologists, for whom it was and is the object of a powerful disciplinary formation, to postmodern theorists of the text, for whom, the concept might be summed up by the title of a fine book by John Mowatt: Text: the Genealogy of an Antidisciplinary Object. Of course, the interesting thing about a travelling concept is not that it travels — travelers, t...

  15. Data-mining Based Detection of Glaciers: Quantifying the Extent of Alpine Valley Glaciation

    Directory of Open Access Journals (Sweden)

    Wei Luo

    2015-07-01

    Full Text Available The extent of glaciation in alpine valleys often gives clues to past climates, plate movement, mountain landforms, bedrock geology and more. However, without field investigation, the degree to which a valley was affected by a glacier has been difficult to assess. We developed a model that uses quantitative parameters derived from digital elevations model (DEM data to predict whether a glacier was likely present in an alpine valley. The model's inputs are mainly derived from the basin hypsometry, and a new parameter termed the Hypothetical Basin Equilibrium Elevation (HBEE, which is based on the equilibrium elevation altitude (ELA of a glacier. We used data mining techniques that comb through large data sets to find patterns for classification and prediction as the basis for the model. Four classifiers were utilized, and each was tested with two different training set/test data ratios of nearly 150 basins that were previously delineated as fully- or non-glaciated. The classifiers had a predictive accuracy of up to 90% with none falling below 72%. Two of the classifiers, classification tree and naïve-Bayes, have graphical outputs that visually describe the classification process, predictive results, and in the naïve-Bayes case, the relative effectiveness towards the model of each attribute. In all scenarios, the HBEE was found to be an accurate predictor for the model. The model can be applied to any area where glaciation may have occurred, but is particularly useful in areas where the valley is inaccessible for detailed field investigation.

  16. Final state interactions in [Formula: see text] decays: [Formula: see text] rule vs. [Formula: see text].

    Science.gov (United States)

    Buras, Andrzej J; Gérard, Jean-Marc

    2017-01-01

    Dispersive effects from strong [Formula: see text] rescattering in the final state interaction (FSI) of weak [Formula: see text] decays are revisited with the goal to have a global view on their relative importance for the [Formula: see text] rule and the ratio [Formula: see text] in the standard model (SM). We point out that this goal cannot be reached within a pure effective (meson) field approach like chiral perturbation theory in which the dominant current-current operators governing the [Formula: see text] rule and the dominant density-density (four-quark) operators governing [Formula: see text] cannot be disentangled from each other. But in the context of a dual QCD approach, which includes both long-distance dynamics and the UV completion, that is, QCD at short-distance scales, such a distinction is possible. We find then that beyond the strict large N limit, N being the number of colours, FSIs are likely to be important for the [Formula: see text] rule but much less relevant for [Formula: see text]. The latter finding diminishes significantly hopes that improved calculations of [Formula: see text] would bring its SM prediction to agree with the experimental data, opening thereby an arena for important new physics contributions to this ratio.

  17. A data mining based model for selecting type of treatment for kidney stone patients

    Directory of Open Access Journals (Sweden)

    Sepehri MM

    2009-09-01

    Full Text Available "n Normal 0 false false false EN-US X-NONE AR-SA MicrosoftInternetExplorer4 /* Style Definitions */ table.MsoNormalTable {mso-style-name:"Table Normal"; mso-tstyle-rowband-size:0; mso-tstyle-colband-size:0; mso-style-noshow:yes; mso-style-priority:99; mso-style-qformat:yes; mso-style-parent:""; mso-padding-alt:0in 5.4pt 0in 5.4pt; mso-para-margin:0in; mso-para-margin-bottom:.0001pt; mso-pagination:widow-orphan; font-size:11.0pt; font-family:"Calibri","sans-serif"; mso-ascii-font-family:Calibri; mso-ascii-theme-font:minor-latin; mso-fareast-font-family:"Times New Roman"; mso-fareast-theme-font:minor-fareast; mso-hansi-font-family:Calibri; mso-hansi-theme-font:minor-latin; mso-bidi-font-family:Arial; mso-bidi-theme-font:minor-bidi;} Background: Data mining as a multidisciplinary field is rooted in the fields such as statistics, mathematics, computer science and artificial intelligence and has been gaining momentum in scientific, managerial, and executive applications in health care. Data mining can be defined as the automated extraction of valuable, practical and hidden knowledge and information from large data. Applying data mining in medical records and data is of utmost importance for health care givers and providers and brings vital and valuable outcomes. Data mining can help doctors come up with better recommendations and plans for treatment which actually in many respects have significant impact on patients' life and satisfaction In this paper we have proposed and utilized data mining methods to extract hidden information in medical records of pelvis stone patients with ureteral stone. We have tried to design a decision support system model to be applicable for selecting type of treatment for these groups of patients."n"nMethods: We gathered needed information from Shahid Hashemi Nejad hospital. In this research we have used decision tree as a data mining tool, for selecting suitable treatment for patients with ureteral stone. This

  18. Linguistics in Text Interpretation

    DEFF Research Database (Denmark)

    Togeby, Ole

    2011-01-01

    A model for how text interpretation proceeds from what is pronounced, through what is said to what is comunicated, and definition of the concepts 'presupposition' and 'implicature'.......A model for how text interpretation proceeds from what is pronounced, through what is said to what is comunicated, and definition of the concepts 'presupposition' and 'implicature'....

  19. Composing Texts, Composing Lives.

    Science.gov (United States)

    Perl, Sondra

    1994-01-01

    Using composition, reader response, critical, and feminist theories, a teacher demonstrates how adult students respond critically to literary texts and how teachers must critically analyze the texts of their teaching practice. Both students and teachers can use writing to bring their experiences to interpretation. (SK)

  20. Making Sense of Texts

    Science.gov (United States)

    Harper, Rebecca G.

    2014-01-01

    This article addresses the triadic nature regarding meaning construction of texts. Grounded in Rosenblatt's (1995; 1998; 2004) Transactional Theory, research conducted in an undergraduate Language Arts curriculum course revealed that when presented with unfamiliar texts, students used prior experiences, social interactions, and literary strategies…

  1. Text File Comparator

    Science.gov (United States)

    Kotler, R. S.

    1983-01-01

    File Comparator program IFCOMP, is text file comparator for IBM OS/VScompatable systems. IFCOMP accepts as input two text files and produces listing of differences in pseudo-update form. IFCOMP is very useful in monitoring changes made to software at the source code level.

  2. Dictionaries for text production

    DEFF Research Database (Denmark)

    Fuertes-Olivera, Pedro; Bergenholtz, Henning

    2018-01-01

    and free online dictionaries. The Diccionario español para la producción de textos is an example of a general text production dictionary that makes use of internet technologies, is based on a lexicographic theory, contains all the lexicographic data that users need in a production situation, and aims......Dictionaries for Text Production are information tools that are designed and constructed for helping users to produce (i.e. encode) texts, both oral and written texts. These can be broadly divided into two groups: (a) specialized text production dictionaries, i.e., dictionaries that only offer...... a small amount of lexicographic data, most or all of which are typically used in a production situation, e.g. synonym dictionaries, grammar and spelling dictionaries, collocation dictionaries, concept dictionaries such as the Longman Language Activator, which is advertised as the World’s First Production...

  3. The Vicissitudes of Text

    OpenAIRE

    Jonathan CULLER; Jonathan CULLER

    2003-01-01

    The concept of text, which has been central to literary studies, has undergone many mutations, as it has traveled from the work of classical philologists, for whom it was and is the object of a powerful disciplinary formation, to postmodern theorists of the text, for whom, the concept might be summed up by the title of a fine book by John Mowatt: Text: the Genealogy of an Antidisciplinary Object. Of course, the interesting thing about a travelling concept is not that it travels — travelers, t...

  4. Text mining for systems biology.

    Science.gov (United States)

    Fluck, Juliane; Hofmann-Apitius, Martin

    2014-02-01

    Scientific communication in biomedicine is, by and large, still text based. Text mining technologies for the automated extraction of useful biomedical information from unstructured text that can be directly used for systems biology modelling have been substantially improved over the past few years. In this review, we underline the importance of named entity recognition and relationship extraction as fundamental approaches that are relevant to systems biology. Furthermore, we emphasize the role of publicly organized scientific benchmarking challenges that reflect the current status of text-mining technology and are important in moving the entire field forward. Given further interdisciplinary development of systems biology-orientated ontologies and training corpora, we expect a steadily increasing impact of text-mining technology on systems biology in the future. Copyright © 2013 Elsevier Ltd. All rights reserved.

  5. Journalistic Text Production

    DEFF Research Database (Denmark)

    Haugaard, Rikke Hartmann

    JOURNALISTIC TEXT PRODUCTION: A CASE STUDY ON REVISIONS OF CONTENT AND FORM ABSTRACT • This paper provides insights into journalists’ revisions of content and linguistic form when producing newspaper articles. • By use of keystroke logging, participant observation and retrospective interviews...... different overall purposes: one in which the journalists produce (more) text for the newspaper article, and one in which they evaluate, and especially, reduce the length of the article. EXTENDED SUMMARY News products provide much of the foundation for what we know about the world we live in. However, we...... journalists make as regards content and linguistic form when composing news articles (NN 2016). More specifically, the study investigated revisions that were carried out by journalists during text production as these revisions yield insights into the progression of the text and thus contribute to our...

  6. Plagiarism in Academic Texts

    Directory of Open Access Journals (Sweden)

    Marta Eugenia Rojas-Porras

    2012-08-01

    Full Text Available The ethical and social responsibility of citing the sources in a scientific or artistic work is undeniable. This paper explores, in a preliminary way, academic plagiarism in its various forms. It includes findings based on a forensic analysis. The purpose of this paper is to raise awareness on the importance of considering these details when writing and publishing a text. Hopefully, this analysis may put the issue under discussion.

  7. Hermeneutic reading of classic texts.

    Science.gov (United States)

    Koskinen, Camilla A-L; Lindström, Unni Å

    2013-09-01

    The purpose of this article is to broaden the understandinfg of the hermeneutic reading of classic texts. The aim is to show how the choice of a specific scientific tradition in conjunction with a methodological approach creates the foundation that clarifies the actual realization of the reading. This hermeneutic reading of classic texts is inspired by Gadamer's notion that it is the researcher's own research tradition and a clearly formulated theoretical fundamental order that shape the researcher's attitude towards texts and create the starting point that guides all reading, uncovering and interpretation. The researcher's ethical position originates in a will to openness towards what is different in the text and which constantly sets the researcher's preunderstanding and research tradition in movement. It is the researcher's attitude towards the text that allows the text to address, touch and arouse wonder. Through a flexible, lingering and repeated reading of classic texts, what is different emerges with a timeless value. The reading of classic texts is an act that may rediscover and create understanding for essential dimensions and of human beings' reality on a deeper level. The hermeneutic reading of classic texts thus brings to light constantly new possibilities of uncovering for a new envisioning and interpretation for a new understanding of the essential concepts and phenomena within caring science. © 2012 The Authors Scandinavian Journal of Caring Sciences © 2012 Nordic College of Caring Science.

  8. UNPUBLISHED TEXTS / INEDITI

    African Journals Online (AJOL)

    (Pretoria). Her research interests concentrate – within the broader area of. Italian literature – on the re-interpretation and re-invention of myth by modern and contemporary writers from a Jungian archetypal perspective which privileges an interdisciplinary approach involving literature, anthropology and depth psychology.

  9. TEXT Energy Storage System

    International Nuclear Information System (INIS)

    Weldon, W.F.; Rylander, H.G.; Woodson, H.H.

    1977-01-01

    The Texas Experimental Tokamak (TEXT) Enery Storage System, designed by the Center for Electromechanics (CEM), consists of four 50 MJ, 125 V homopolar generators and their auxiliaries and is designed to power the toroidal and poloidal field coils of TEXT on a two-minute duty cycle. The four 50 MJ generators connected in series were chosen because they represent the minimum cost configuration and also represent a minimal scale up from the successful 5.0 MJ homopolar generator designed, built, and operated by the CEM

  10. Strategy as Texts

    DEFF Research Database (Denmark)

    Obed Madsen, Søren

    of the strategy into four categories. Second, the managers produce new texts based on the original strategy document by using four different ways of translation models. The study’s findings contribute to three areas. Firstly, it shows that translation is more than a sociological process. It is also...... a craftsmanship that requires knowledge and skills, which unfortunately seems to be overlooked in both the literature and in practice. Secondly, it shows that even though a strategy text is in singular, the translation makes strategy plural. Thirdly, the article proposes a way to open up the black box of what...

  11. Text analysis in R

    NARCIS (Netherlands)

    Welbers, K.; van Atteveldt, W.H.; Benoit, K.

    2017-01-01

    Computational text analysis has become an exciting research field with many applications in communication research. It can be a difficult method to apply, however, because it requires knowledge of various techniques, and the software required to perform most of these techniques is not readily

  12. Documents and legal texts

    International Nuclear Information System (INIS)

    2017-01-01

    This section treats of the following documents and legal texts: 1 - Belgium 29 June 2014 - Act amending the Act of 22 July 1985 on Third-Party Liability in the Field of Nuclear Energy; 2 - Belgium, 7 December 2016. - Act amending the Act of 22 July 1985 on Third-Party Liability in the Field of Nuclear Energy

  13. Texts in the landscape

    Directory of Open Access Journals (Sweden)

    James Graham-Campbell

    1998-11-01

    Full Text Available The Institute's members of UCL's "Celtic Inscribed Stones" project describe, in collaboration with Wendy Davies, Mark Handley and Paul Kershaw (Department of History, a major interdisciplinary study of inscriptions of the early middle ages from the Celtic areas of northwest Europe.

  14. Summarizing Expository Texts

    Science.gov (United States)

    Westby, Carol; Culatta, Barbara; Lawrence, Barbara; Hall-Kenyon, Kendra

    2010-01-01

    Purpose: This article reviews the literature on students' developing skills in summarizing expository texts and describes strategies for evaluating students' expository summaries. Evaluation outcomes are presented for a professional development project aimed at helping teachers develop new techniques for teaching summarization. Methods: Strategies…

  15. New mathematical cuneiform texts

    CERN Document Server

    Friberg, Jöran

    2016-01-01

    This monograph presents in great detail a large number of both unpublished and previously published Babylonian mathematical texts in the cuneiform script. It is a continuation of the work A Remarkable Collection of Babylonian Mathematical Texts (Springer 2007) written by Jöran Friberg, the leading expert on Babylonian mathematics. Focussing on the big picture, Friberg explores in this book several Late Babylonian arithmetical and metro-mathematical table texts from the sites of Babylon, Uruk and Sippar, collections of mathematical exercises from four Old Babylonian sites, as well as a new text from Early Dynastic/Early Sargonic Umma, which is the oldest known collection of mathematical exercises. A table of reciprocals from the end of the third millennium BC, differing radically from well-documented but younger tables of reciprocals from the Neo-Sumerian and Old-Babylonian periods, as well as a fragment of a Neo-Sumerian clay tablet showing a new type of a labyrinth are also discussed. The material is presen...

  16. Text Induced Spelling Correction

    NARCIS (Netherlands)

    Reynaert, M.W.C.

    2004-01-01

    We present TISC, a language-independent and context-sensitive spelling checking and correction system designed to facilitate the automatic removal of non-word spelling errors in large corpora. Its lexicon is derived from a very large corpus of raw text, without supervision, and contains word

  17. Aphasia and text writing.

    Science.gov (United States)

    Behrns, Ingrid; Ahlsén, Elisabeth; Wengelin, Asa

    2010-01-01

    Good writing skills are needed in almost every aspect of life today, and there is a growing interest in research into acquired writing difficulties. Most of the findings reported so far, however, are based on words produced in isolation. The present study deals with the production of entire texts. The aim was to characterize written narratives produced by a group of participants with aphasia. Eight persons aged 28-63 years with aphasia took part in the study. They were compared with a reference group consisting of ten participants aged 21-30 years. All participants were asked to write a personal narrative titled 'I have never been so afraid' and to perform a picture-based story-generation task called the 'Frog Story'. The texts were written on a computer. The group could be divided into participants with low, moderate, and high general performance, respectively. The texts written by the participants in the group with moderate and high writing performance had comparatively good narrative structure despite indications of difficulties on other linguistic levels. Aphasia appeared to influence text writing on different linguistic levels. The impact on overall structure and coherence was in line with earlier findings from the analysis of spoken and written discourse and the implication of this is that the written modality should also be included in language rehabilitation. 2010 Royal College of Speech & Language Therapists.

  18. Texts and Readers.

    Science.gov (United States)

    Iser, Wolfgang

    1980-01-01

    Notes that, since fictional discourse need not reflect prevailing systems of meaning and norms or values, readers gain detachment from their own presuppositions; by constituting and formulating text-sense, readers are constituting and formulating their own cognition and becoming aware of the operations for doing so. (FL)

  19. The Emar Lexical Texts

    NARCIS (Netherlands)

    Gantzert, Merijn

    2011-01-01

    This four-part work provides a philological analysis and a theoretical interpretation of the cuneiform lexical texts found in the Late Bronze Age city of Emar, in present-day Syria. These word and sign lists, commonly dated to around 1100 BC, were almost all found in the archive of a single school.

  20. Texts On-Line.

    Science.gov (United States)

    Thomas, Jean-Jacques

    1993-01-01

    Maintains that the study of signs is divided between those scholars who use the Saussurian binary sign (semiology) and those who prefer the Peirce tripartite sign (semiotics). Concludes that neither the Saussurian nor Peircian analysis methods can produce a semiotic interpretation based on a hierarchy of the text's various components. (CFR)

  1. download full text

    African Journals Online (AJOL)

    In a literary text, the stylistician is primarily concerned with the features that are stylistically significant in creating meanings. This study, which focuses mainly on .... Others sell such items to make money, thereby sabotaging the efforts of the government to meet the health needs of the people. For instance, Census, a theatre ...

  2. Automatic text summarization

    CERN Document Server

    Torres Moreno, Juan Manuel

    2014-01-01

    This new textbook examines the motivations and the different algorithms for automatic document summarization (ADS). We performed a recent state of the art. The book shows the main problems of ADS, difficulties and the solutions provided by the community. It presents recent advances in ADS, as well as current applications and trends. The approaches are statistical, linguistic and symbolic. Several exemples are included in order to clarify the theoretical concepts.  The books currently available in the area of Automatic Document Summarization are not recent. Powerful algorithms have been develop

  3. Weaving with text

    DEFF Research Database (Denmark)

    Hagedorn-Rasmussen, Peter

    This paper explores how a school principal by means of practical authorship creates reservoirs of language that provide a possible context for collective sensemaking. The paper draws upon a field study in which a school principal, and his managerial team, was shadowed in a period of intensive cha...... changes. The paper explores how the manager weaves with text, extracted from stakeholders, administration, politicians, employees, public discourse etc., as a means of creating a new fabric, a texture, of diverse perspectives that aims for collective sensemaking....

  4. Note on the Text

    OpenAIRE

    2013-01-01

    Since there is no complete modern edition of Shelley’s drama, I have used a variety of texts. For Prometheus Unbound, Tasso and The Cenci I have used The Poems of Shelley edited by Kelvin Everest and Geoffrey Matthews, but I have also noted the stage directions in BSMIX which comprises the intermediate fair copy of Prometheus Unbound which Shelley transcribed into three notebooks for safe-keeping. For Hellas I have used Shelley’s Poetry and Prose edited by Donald H. Reiman and Neil Fraistat (...

  5. Documents and legal texts

    International Nuclear Information System (INIS)

    2016-01-01

    This section treats of the following documents and legal texts: 1 - Brazil: Law No. 13,260 of 16 March 2016 (To regulate the provisions of item XLIII of Article 5 of the Federal Constitution on terrorism, dealing with investigative and procedural provisions and redefining the concept of a terrorist organisation; and amends Laws No. 7,960 of 21 December 1989 and No. 12,850 of 2 August 2013); 2 - India: The Atomic Energy (Amendment) Act, 2015; Department Of Atomic Energy Notification (Civil Liability for Nuclear Damage); 3 - Japan: Act on Subsidisation, etc. for Nuclear Damage Compensation Funds following the implementation of the Convention on Supplementary Compensation for Nuclear Damage

  6. Weitere Texte physiognomischen Inhalts

    Directory of Open Access Journals (Sweden)

    Böck, Barbara

    2004-12-01

    Full Text Available The present article offers the edition of three cuneiform texts belonging to the Akkadian handbook of omens drawn from the physical appearance as well as the morals and behaviour of man. The book comprising up to 27 chapters with more than 100 omens each was entitled in antiquity Alamdimmû. The edition of the three cuneiform tablets completes, thus, the author's monographic study on the ancient Mesopotamian divinatory discipline of physiognomy (Die babylonisch-assyrische Morphoskopie (Wien 2000 [=AfO Beih. 27].

    En este artículo se presenta la editio princeps de tres textos cuneiformes conservados en el British Museum (Londres y el Vorderasiatisches Museum (Berlín, que pertenecen al libro asirio-babilonio de presagios fisiognómicos. Este libro, titulado originalmente Alamdimmû ('forma, figura', consta de 27 capítulos, cada uno con más de cien presagios escritos en lengua acadia. Los tres textos completan así el estudio monográfico de la autora sobre la disciplina adivinatoria de la fisiognomía en el antiguo Oriente (Die babylonisch-assyrische Morphoskopie (Wien 2000 [=AfO Beih. 27].

  7. Reading Authentic Texts

    DEFF Research Database (Denmark)

    Balling, Laura Winther

    2013-01-01

    Most research on cognates has focused on words presented in isolation that are easily defined as cognate between L1 and L2. In contrast, this study investigates what counts as cognate in authentic texts and how such cognates are read. Participants with L1 Danish read news articles in their highly...... proficient L2, English, while their eye-movements were monitored. The experiment shows a cognate advantage for morphologically simple words, but only when cognateness is defined relative to translation equivalents that are appropriate in the context. For morphologically complex words, a cognate disadvantage...... is observed which may be due to problems of integrating cognate with non-cognate morphemes. The results show that fast non-selective access to the bilingual lexicon is conditioned by the communicative context. Importantly, a range of variables are statistically controlled in the regression analyses, including...

  8. Documents and legal texts

    International Nuclear Information System (INIS)

    2013-01-01

    This section reprints a selection of recently published legislative texts and documents: - Russian Federation: Federal Law No.170 of 21 November 1995 on the use of atomic energy, Adopted by the State Duma on 20 October 1995; - Uruguay: Law No.19.056 On the Radiological Protection and Safety of Persons, Property and the Environment (4 January 2013); - Japan: Third Supplement to Interim Guidelines on Determination of the Scope of Nuclear Damage resulting from the Accident at the Tokyo Electric Power Company Fukushima Daiichi and Daini Nuclear Power Plants (concerning Damages related to Rumour-Related Damage in the Agriculture, Forestry, Fishery and Food Industries), 30 January 2013; - France and the United States: Joint Statement on Liability for Nuclear Damage (Aug 2013); - Franco-Russian Nuclear Power Declaration (1 November 2013)

  9. Systematic characterizations of text similarity in full text biomedical publications.

    Directory of Open Access Journals (Sweden)

    Zhaohui Sun

    2010-09-01

    Full Text Available Computational methods have been used to find duplicate biomedical publications in MEDLINE. Full text articles are becoming increasingly available, yet the similarities among them have not been systematically studied. Here, we quantitatively investigated the full text similarity of biomedical publications in PubMed Central.72,011 full text articles from PubMed Central (PMC were parsed to generate three different datasets: full texts, sections, and paragraphs. Text similarity comparisons were performed on these datasets using the text similarity algorithm eTBLAST. We measured the frequency of similar text pairs and compared it among different datasets. We found that high abstract similarity can be used to predict high full text similarity with a specificity of 20.1% (95% CI [17.3%, 23.1%] and sensitivity of 99.999%. Abstract similarity and full text similarity have a moderate correlation (Pearson correlation coefficient: -0.423 when the similarity ratio is above 0.4. Among pairs of articles in PMC, method sections are found to be the most repetitive (frequency of similar pairs, methods: 0.029, introduction: 0.0076, results: 0.0043. In contrast, among a set of manually verified duplicate articles, results are the most repetitive sections (frequency of similar pairs, results: 0.94, methods: 0.89, introduction: 0.82. Repetition of introduction and methods sections is more likely to be committed by the same authors (odds of a highly similar pair having at least one shared author, introduction: 2.31, methods: 1.83, results: 1.03. There is also significantly more similarity in pairs of review articles than in pairs containing one review and one nonreview paper (frequency of similar pairs: 0.0167 and 0.0023, respectively.While quantifying abstract similarity is an effective approach for finding duplicate citations, a comprehensive full text analysis is necessary to uncover all potential duplicate citations in the scientific literature and is helpful when

  10. Text Analysis: Critical Component of Planning for Text-Based Discussion Focused on Comprehension of Informational Texts

    Science.gov (United States)

    Kucan, Linda; Palincsar, Annemarie Sullivan

    2018-01-01

    This investigation focuses on a tool used in a reading methods course to introduce reading specialist candidates to text analysis as a critical component of planning for text-based discussions. Unlike planning that focuses mainly on important text content or information, a text analysis approach focuses both on content and how that content is…

  11. Interconnectedness und digitale Texte

    Directory of Open Access Journals (Sweden)

    Detlev Doherr

    2013-04-01

    Full Text Available Zusammenfassung Die multimedialen Informationsdienste im Internet werden immer umfangreicher und umfassender, wobei auch die nur in gedruckter Form vorliegenden Dokumente von den Bibliotheken digitalisiert und ins Netz gestellt werden. Über Online-Dokumentenverwaltungen oder Suchmaschinen können diese Dokumente gefunden und dann in gängigen Formaten wie z.B. PDF bereitgestellt werden. Dieser Artikel beleuchtet die Funktionsweise der Humboldt Digital Library, die seit mehr als zehn Jahren Dokumente von Alexander von Humboldt in englischer Übersetzung im Web als HDL (Humboldt Digital Library kostenfrei zur Verfügung stellt. Anders als eine digitale Bibliothek werden dabei allerdings nicht nur digitalisierte Dokumente als Scan oder PDF bereitgestellt, sondern der Text als solcher und in vernetzter Form verfügbar gemacht. Das System gleicht damit eher einem Informationssystem als einer digitalen Bibliothek, was sich auch in den verfügbaren Funktionen zur Auffindung von Texten in unterschiedlichen Versionen und Übersetzungen, Vergleichen von Absätzen verschiedener Dokumente oder der Darstellung von Bilden in ihrem Kontext widerspiegelt. Die Entwicklung von dynamischen Hyperlinks auf der Basis der einzelnen Textabsätze der Humboldt‘schen Werke in Form von Media Assets ermöglicht eine Nutzung der Programmierschnittstelle von Google Maps zur geographischen wie auch textinhaltlichen Navigation. Über den Service einer digitalen Bibliothek hinausgehend, bietet die HDL den Prototypen eines mehrdimensionalen Informationssystems, das mit dynamischen Strukturen arbeitet und umfangreiche thematische Auswertungen und Vergleiche ermöglicht. Summary The multimedia information services on Internet are becoming more and more comprehensive, even the printed documents are digitized and republished as digital Web documents by the libraries. Those digital files can be found by search engines or management tools and provided as files in usual formats as

  12. Documents and legal texts

    International Nuclear Information System (INIS)

    2015-01-01

    This section treats of the following Documents and legal texts: 1 - Canada: Nuclear Liability and Compensation Act (An Act respecting civil liability and compensation for damage in case of a nuclear incident, repealing the Nuclear Liability Act and making consequential amendments to other acts); 2 - Japan: Act on Compensation for Nuclear Damage (The purpose of this act is to protect persons suffering from nuclear damage and to contribute to the sound development of the nuclear industry by establishing a basic system regarding compensation in case of nuclear damage caused by reactor operation etc.); Act on Indemnity Agreements for Compensation of Nuclear Damage; 3 - Slovak Republic: Act on Civil Liability for Nuclear Damage and on its Financial Coverage and on Changes and Amendments to Certain Laws (This Act regulates: a) The civil liability for nuclear damage incurred in the causation of a nuclear incident, b) The scope of powers of the Nuclear Regulatory Authority (hereinafter only as the 'Authority') in relation to the application of this Act, c) The competence of the National Bank of Slovakia in relation to the supervised financial market entities in the financial coverage of liability for nuclear damage; and d) The penalties for violation of this Act)

  13. Documents and legal texts

    International Nuclear Information System (INIS)

    2014-01-01

    This section of the Bulletin presents the recently published documents and legal texts sorted by country: - Brazil: Resolution No. 169 of 30 April 2014. - Japan: Act Concerning Exceptions to Interruption of Prescription Pertaining to Use of Settlement Mediation Procedures by the Dispute Reconciliation Committee for Nuclear Damage Compensation in relation to Nuclear Damage Compensation Disputes Pertaining to the Great East Japan Earthquake (Act No. 32 of 5 June 2013); Act Concerning Measures to Achieve Prompt and Assured Compensation for Nuclear Damage Arising from the Nuclear Plant Accident following the Great East Japan Earthquake and Exceptions to the Extinctive Prescription, etc. of the Right to Claim Compensation for Nuclear Damage (Act No. 97 of 11 December 2013); Fourth Supplement to Interim Guidelines on Determination of the Scope of Nuclear Damage Resulting from the Accident at the Tokyo Electric Power Company Fukushima Daiichi and Daini Nuclear Power Plants (Concerning Damages Associated with the Prolongation of Evacuation Orders, etc.); Outline of 'Fourth Supplement to Interim Guidelines (Concerning Damages Associated with the Prolongation of Evacuation Orders, etc.)'. - OECD Nuclear Energy Agency: Decision and Recommendation of the Steering Committee Concerning the Application of the Paris Convention to Nuclear Installations in the Process of Being Decommissioned; Joint Declaration on the Security of Supply of Medical Radioisotopes. - United Arab Emirates: Federal Decree No. (51) of 2014 Ratifying the Convention on Supplementary Compensation for Nuclear Damage; Ratification of the Federal Supreme Council of Federal Decree No. (51) of 2014 Ratifying the Convention on Supplementary Compensation for Nuclear Damage

  14. Learning with Text in the Primary Grades.

    Science.gov (United States)

    Guillaume, Andrea M.

    1998-01-01

    Provides a rationale for learning-with-text experiences for primary-grade children; lists 10 general approaches to foster primary-grade content area reading; and offers a sample lesson incorporating these approaches that promotes comprehension of text and content matter. Suggests that trade books, textbooks, realistic fiction, and other print…

  15. Text

    International Nuclear Information System (INIS)

    Anon.

    2009-01-01

    The purpose of this act is to safeguard against the dangers and harmful effects of radioactive waste and to contribute to public safety and environmental protection by laying down requirements for the safe and efficient management of radioactive waste. We will find definitions, interrelation with other legislation, responsibilities of the state and local governments, responsibilities of radioactive waste management companies and generators, formulation of the basic plan for the control of radioactive waste, radioactive waste management ( with public information, financing and part of spent fuel management), Korea radioactive waste management corporation ( business activities, budget), establishment of a radioactive waste fund in order to secure the financial resources required for radioactive waste management, and penalties in case of improper operation of radioactive waste management. (N.C.)

  16. A quick survey of text categorization algorithms

    Directory of Open Access Journals (Sweden)

    Dan MUNTEANU

    2007-12-01

    Full Text Available This paper contains an overview of basic formulations and approaches to text classification. This paper surveys the algorithms used in text categorization: handcrafted rules, decision trees, decision rules, on-line learning, linear classifier, Rocchio’s algorithm, k Nearest Neighbor (kNN, Support Vector Machines (SVM.

  17. Condensing biomedical journal texts through paragraph ranking.

    Science.gov (United States)

    Chiang, Jung-Hsien; Liu, Heng-Hui; Huang, Yi-Ting

    2011-04-15

    The growing availability of full-text scientific articles raises the important issue of how to most efficiently digest full-text content. Although article titles and abstracts provide accurate and concise information on an article's contents, their brevity inevitably entails the loss of detail. Full-text articles provide those details, but require more time to read. The primary goal of this study is to combine the advantages of concise abstracts and detail-rich full-texts to ease the burden of reading. We retrieved abstract-related paragraphs from full-text articles through shared keywords between the abstract and paragraphs from the main text. Significant paragraphs were then recommended by applying a proposed paragraph ranking approach. Finally, the user was provided with a condensed text consisting of these significant paragraphs, allowing the user to save time from perusing the whole article. We compared the performance of the proposed approach with a keyword counting approach and a PageRank-like approach. Evaluation was conducted in two aspects: the importance of each retrieved paragraph and the information coverage of a set of retrieved paragraphs. In both evaluations, the proposed approach outperformed the other approaches. jchiang@mail.ncku.edu.tw.

  18. Text Mining for Protein Docking.

    Directory of Open Access Journals (Sweden)

    Varsha D Badal

    2015-12-01

    Full Text Available The rapidly growing amount of publicly available information from biomedical research is readily accessible on the Internet, providing a powerful resource for predictive biomolecular modeling. The accumulated data on experimentally determined structures transformed structure prediction of proteins and protein complexes. Instead of exploring the enormous search space, predictive tools can simply proceed to the solution based on similarity to the existing, previously determined structures. A similar major paradigm shift is emerging due to the rapidly expanding amount of information, other than experimentally determined structures, which still can be used as constraints in biomolecular structure prediction. Automated text mining has been widely used in recreating protein interaction networks, as well as in detecting small ligand binding sites on protein structures. Combining and expanding these two well-developed areas of research, we applied the text mining to structural modeling of protein-protein complexes (protein docking. Protein docking can be significantly improved when constraints on the docking mode are available. We developed a procedure that retrieves published abstracts on a specific protein-protein interaction and extracts information relevant to docking. The procedure was assessed on protein complexes from Dockground (http://dockground.compbio.ku.edu. The results show that correct information on binding residues can be extracted for about half of the complexes. The amount of irrelevant information was reduced by conceptual analysis of a subset of the retrieved abstracts, based on the bag-of-words (features approach. Support Vector Machine models were trained and validated on the subset. The remaining abstracts were filtered by the best-performing models, which decreased the irrelevant information for ~ 25% complexes in the dataset. The extracted constraints were incorporated in the docking protocol and tested on the Dockground unbound

  19. Analyse de textes ecrits et apprentissage "grammatical" (Analysis of Written Texts and Learning of Grammar)

    Science.gov (United States)

    Moirand, Sophie

    1977-01-01

    Presents a method for teaching grammar through an understanding of the content of a written text, using a three-fold approach: sociolinguistic, linguistic, and logico-syntactical. (Text is in French.) (AM)

  20. Automatic Text Summarization for Indonesian Language Using TextTeaser

    Science.gov (United States)

    Gunawan, D.; Pasaribu, A.; Rahmat, R. F.; Budiarto, R.

    2017-04-01

    Text summarization is one of the solution for information overload. Reducing text without losing the meaning not only can save time to read, but also maintain the reader’s understanding. One of many algorithms to summarize text is TextTeaser. Originally, this algorithm is intended to be used for text in English. However, due to TextTeaser algorithm does not consider the meaning of the text, we implement this algorithm for text in Indonesian language. This algorithm calculates four elements, such as title feature, sentence length, sentence position and keyword frequency. We utilize TextRank, an unsupervised and language independent text summarization algorithm, to evaluate the summarized text yielded by TextTeaser. The result shows that the TextTeaser algorithm needs more improvement to obtain better accuracy.

  1. Ten Guidelines for Translating Legal Texts

    Directory of Open Access Journals (Sweden)

    Alenka Kocbek

    2017-12-01

    Full Text Available The paper proposes a targeted model for translating legal texts, developed by the author by combining translation science (i.e. functionalist approaches with the findings of comparative law and legal linguistics. It consists of ten guidelines directing the translator from defining the intended function of the target text and selecting the corresponding translation type, through comparing the legal systems involved in the translation and analysing the memetic structure of the source text and parallel texts in the target culture to designing the target text as a cultureme and ensuring its legal security.

  2. Medical Text Classification Using Convolutional Neural Networks.

    Science.gov (United States)

    Hughes, Mark; Li, Irene; Kotoulas, Spyros; Suzumura, Toyotaro

    2017-01-01

    We present an approach to automatically classify clinical text at a sentence level. We are using deep convolutional neural networks to represent complex features. We train the network on a dataset providing a broad categorization of health information. Through a detailed evaluation, we demonstrate that our method outperforms several approaches widely used in natural language processing tasks by about 15%.

  3. De l'exercice ecrit au texte litteraire (From Written Exercises to Literary Texts).

    Science.gov (United States)

    Chapotot, Franck

    1981-01-01

    Proposes a novel approach to the teaching of writing for advanced language classes. This approach leads students working in pairs through various exercises that enable them to produce a text with given style characteristics. Students' texts are then discussed in class and finally compared with suitable literary models. (MES)

  4. Texte et Co-texte. Discours – Contexte

    Directory of Open Access Journals (Sweden)

    Laura POPOVICI - ADUMITROAIE

    2007-12-01

    Full Text Available This article represents a travel inside, outside and surrounding the text, having the aim of finding the parallel configuration of a text in order to better understand the hallo effect of a text, to clearly see its shadows and to hear the echo of its forefront meaning. I will focus on the aspects regarding the co-text and context of a text, enabling to expose the text as a unit very similar to co-text and the discourse as superior unit resembling the context. All these issues will be treated in relation to the didactic communication, as a resourceful way of analyzing the dynamics of a text and the feed-back that arouses instructional and perceptive valences of communication.

  5. Unsupervised information extraction by text segmentation

    CERN Document Server

    Cortez, Eli

    2013-01-01

    A new unsupervised approach to the problem of Information Extraction by Text Segmentation (IETS) is proposed, implemented and evaluated herein. The authors' approach relies on information available on pre-existing data to learn how to associate segments in the input string with attributes of a given domain relying on a very effective set of content-based features. The effectiveness of the content-based features is also exploited to directly learn from test data structure-based features, with no previous human-driven training, a feature unique to the presented approach. Based on the approach, a

  6. Mining the Text: 34 Text Features that Can Ease or Obstruct Text Comprehension and Use

    Science.gov (United States)

    White, Sheida

    2012-01-01

    This article presents 34 characteristics of texts and tasks ("text features") that can make continuous (prose), noncontinuous (document), and quantitative texts easier or more difficult for adolescents and adults to comprehend and use. The text features were identified by examining the assessment tasks and associated texts in the national…

  7. L'ordre du texte (The Order of the Text)

    Science.gov (United States)

    Slakta, Denis

    1975-01-01

    This article outlines a model of the two basic components of a text, namely, the system of formal linguistic rules, and the realization of these rules into concrete discourse, by means of particular transformations. (Text is in French.) (CLK)

  8. Metamorphoses d'un texte (Metamorphoses of a Text).

    Science.gov (United States)

    Meitinger, Guy Roger

    1993-01-01

    A variety of exercises based on manipulation of a single text are described. The activities involve replacing words or phrases in the text with synonyms or opposites, transposing gender, changing tenses, filling in blanks, and answering multiple-choice questions about linguistic forms. Three brief sample texts are offered. (MSE)

  9. Litterature: Retour au texte (Literature: Return to the Text).

    Science.gov (United States)

    Noe, Alfred

    1993-01-01

    Choice of texts for use in French language instruction is discussed. It is argued that the text's format (e.g., advertising, figurative poetry, journal article, play, prose, etc.) is instrumental in bringing attention to the language in it, and this has implications for the best uses of different text types. (MSE)

  10. Text mining from ontology learning to automated text processing applications

    CERN Document Server

    Biemann, Chris

    2014-01-01

    This book comprises a set of articles that specify the methodology of text mining, describe the creation of lexical resources in the framework of text mining and use text mining for various tasks in natural language processing (NLP). The analysis of large amounts of textual data is a prerequisite to build lexical resources such as dictionaries and ontologies and also has direct applications in automated text processing in fields such as history, healthcare and mobile applications, just to name a few. This volume gives an update in terms of the recent gains in text mining methods and reflects

  11. TEXT TYPES AND THE PROBLEM OF TRANSLATABILITY

    Directory of Open Access Journals (Sweden)

    Kharitonova, E.V.

    2017-09-01

    Full Text Available The article aims at revealing the possibilities of a textual approach to the process and result of translation activity from a new perspective and stating the inviolability of the text as the main category of Translation Studies. The results of the conducted research show that the complex nature of translation requires con-sideration of a wide variety of factors, but the final set of parameters relevant to the translation process depends on the text, since it is the text that determines the primary and secondary communicative situations.

  12. Reading of Foreign Language Technical Texts

    Directory of Open Access Journals (Sweden)

    Metka Brkan

    1997-01-01

    Full Text Available An efficient foreign language reader is one who has approached the reading flexibility of a native speaker as he reads different texts presented in his environment: newspaper articles, magazins, personal letters, business correspondence, official documents, academic textbooks and scientific and technical texts. Flexibility in reading means increased speed as well as enhanced comprehension: an efficient re­ ader should read fast with needed comprehension. A poor reader is one who reacts everything slowly without getting much meaning from reading. The article focuses on techniques for developing foreign language reading skills of university students to cape with the reading of English technical texts.

  13. The Only Safe SMS Texting Is No SMS Texting.

    Science.gov (United States)

    Toth, Cheryl; Sacopulos, Michael J

    2015-01-01

    Many physicians and practice staff use short messaging service (SMS) text messaging to communicate with patients. But SMS text messaging is unencrypted, insecure, and does not meet HIPAA requirements. In addition, the short and abbreviated nature of text messages creates opportunities for misinterpretation, and can negatively impact patient safety and care. Until recently, asking patients to sign a statement that they understand and accept these risks--as well as having policies, device encryption, and cyber insurance in place--would have been enough to mitigate the risk of using SMS text in a medical practice. But new trends and policies have made SMS text messaging unsafe under any circumstance. This article explains these trends and policies, as well as why only secure texting or secure messaging should be used for physician-patient communication.

  14. Text Signals Influence Team Artifacts

    Science.gov (United States)

    Clariana, Roy B.; Rysavy, Monica D.; Taricani, Ellen

    2015-01-01

    This exploratory quasi-experimental investigation describes the influence of text signals on team visual map artifacts. In two course sections, four-member teams were given one of two print-based text passage versions on the course-related topic "Social influence in groups" downloaded from Wikipedia; this text had two paragraphs, each…

  15. Complex dynamics of text analysis

    Science.gov (United States)

    Ke, Xiaohua; Zeng, Yongqiang; Ma, Qinghua; Zhu, Lin

    2014-12-01

    This paper presents a novel method for the analysis of nonlinear text quality in Chinese language. Texts produced by university students in China were represented as scale-free networks (word adjacency model), from which typical network features such as the in/outdegree, clustering coefficient and network dynamics were obtained. The method integrates the classical concepts of network feature representation and text quality series variation. The analytical and numerical scheme leads to a parameter space representation that constitutes a valid alternative to represent the network features. The results reveal that complex network features of different text qualities can be clearly revealed and applied to potential applications in other instances of text analysis.

  16. TEXT DEIXIS IN NARRATIVE SEQUENCES

    Directory of Open Access Journals (Sweden)

    Josep Rivera

    2007-06-01

    Full Text Available This study looks at demonstrative descriptions, regarding them as text-deictic procedures which contribute to weave discourse reference. Text deixis is thought of as a metaphorical referential device which maps the ground of utterance onto the text itself. Demonstrative expressions with textual antecedent-triggers, considered as the most important text-deictic units, are identified in a narrative corpus consisting of J. M. Barrie’s Peter Pan and its translation into Catalan. Some linguistic and discourse variables related to DemNPs are analysed to characterise adequately text deixis. It is shown that this referential device is usually combined with abstract nouns, thus categorising and encapsulating (non-nominal complex discourse entities as nouns, while performing a referential cohesive function by means of the text deixis + general noun type of lexical cohesion.

  17. SparkText: Biomedical Text Mining on Big Data Framework.

    Directory of Open Access Journals (Sweden)

    Zhan Ye

    Full Text Available Many new biomedical research articles are published every day, accumulating rich information, such as genetic variants, genes, diseases, and treatments. Rapid yet accurate text mining on large-scale scientific literature can discover novel knowledge to better understand human diseases and to improve the quality of disease diagnosis, prevention, and treatment.In this study, we designed and developed an efficient text mining framework called SparkText on a Big Data infrastructure, which is composed of Apache Spark data streaming and machine learning methods, combined with a Cassandra NoSQL database. To demonstrate its performance for classifying cancer types, we extracted information (e.g., breast, prostate, and lung cancers from tens of thousands of articles downloaded from PubMed, and then employed Naïve Bayes, Support Vector Machine (SVM, and Logistic Regression to build prediction models to mine the articles. The accuracy of predicting a cancer type by SVM using the 29,437 full-text articles was 93.81%. While competing text-mining tools took more than 11 hours, SparkText mined the dataset in approximately 6 minutes.This study demonstrates the potential for mining large-scale scientific articles on a Big Data infrastructure, with real-time update from new articles published daily. SparkText can be extended to other areas of biomedical research.

  18. Les Textes aussi sont des images (Texts Are Also Pictures)

    Science.gov (United States)

    Moirand, Sophie

    1978-01-01

    A printed text can be considered a picture on which readers project their own image in order to understand its original meaning. This process is explained via several kinds of original documents. Implications for instruction in reading a foreign language are discussed and several examples are given. (Text is in French.) (AMH)

  19. Financial Statement Fraud Detection using Text Mining

    OpenAIRE

    Rajan Gupta; Nasib Singh Gill

    2013-01-01

    Data mining techniques have been used enormously by the researchers’ community in detecting financial statement fraud. Most of the research in this direction has used the numbers (quantitative information) i.e. financial ratios present in the financial statements for detecting fraud. There is very little or no research on the analysis of text such as auditor’s comments or notes present in published reports. In this study we propose a text mining approach for detecting financial statement frau...

  20. SparkText: Biomedical Text Mining on Big Data Framework.

    Science.gov (United States)

    Ye, Zhan; Tafti, Ahmad P; He, Karen Y; Wang, Kai; He, Max M

    Many new biomedical research articles are published every day, accumulating rich information, such as genetic variants, genes, diseases, and treatments. Rapid yet accurate text mining on large-scale scientific literature can discover novel knowledge to better understand human diseases and to improve the quality of disease diagnosis, prevention, and treatment. In this study, we designed and developed an efficient text mining framework called SparkText on a Big Data infrastructure, which is composed of Apache Spark data streaming and machine learning methods, combined with a Cassandra NoSQL database. To demonstrate its performance for classifying cancer types, we extracted information (e.g., breast, prostate, and lung cancers) from tens of thousands of articles downloaded from PubMed, and then employed Naïve Bayes, Support Vector Machine (SVM), and Logistic Regression to build prediction models to mine the articles. The accuracy of predicting a cancer type by SVM using the 29,437 full-text articles was 93.81%. While competing text-mining tools took more than 11 hours, SparkText mined the dataset in approximately 6 minutes. This study demonstrates the potential for mining large-scale scientific articles on a Big Data infrastructure, with real-time update from new articles published daily. SparkText can be extended to other areas of biomedical research.

  1. Knowledge Representation in Travelling Texts

    DEFF Research Database (Denmark)

    Mousten, Birthe; Locmele, Gunta

    2014-01-01

    Today, information travels fast. Texts travel, too. In a corporate context, the question is how to manage which knowledge elements should travel to a new language area or market and in which form? The decision to let knowledge elements travel or not travel highly depends on the limitation...... and the purpose of the text in a new context as well as on predefined parameters for text travel. For texts used in marketing and in technology, the question is whether culture-bound knowledge representation should be domesticated or kept as foreign elements, or should be mirrored or moulded—or should not travel...... at all! When should semantic and pragmatic elements in a text be replaced and by which other elements? The empirical basis of our work is marketing and technical texts in English, which travel into the Latvian and Danish markets, respectively....

  2. Text Mining Applications and Theory

    CERN Document Server

    Berry, Michael W

    2010-01-01

    Text Mining: Applications and Theory presents the state-of-the-art algorithms for text mining from both the academic and industrial perspectives.  The contributors span several countries and scientific domains: universities, industrial corporations, and government laboratories, and demonstrate the use of techniques from machine learning, knowledge discovery, natural language processing and information retrieval to design computational models for automated text analysis and mining. This volume demonstrates how advancements in the fields of applied mathematics, computer science, machine learning

  3. Texte et contre-texte en situation de diglossie

    OpenAIRE

    Carpanin Marimoutou, Jean-Claude

    2015-01-01

    Le texte en situation diglossique s'inscrit dans une relation dialogique conflictuelle indépassée qui produit le contre-texte et que le contre-texte reproduit en retour, déplaçant non pas le conflit, mais les pôles du conflit. Une vue d'ensemble de la littérature réunionnaise suffit à mettre en évidence ce jeu de miroir. Une étude des préfaces montre la conscience des producteurs de ce que le combat des textes cache d'enjeux et comment celui qui est posé comme Autre semble ne produire qu'une ...

  4. Zum Uebersetzen fachlicher Texte (On the Translation of Technical Texts)

    Science.gov (United States)

    Friederich, Wolf

    1975-01-01

    Reviews a 1974 East German publication on translation of scientific literature from Russian to German. Considers terminology, different standard levels of translation in East Germany, and other matters related to translation. (Text is in German.) (DH)

  5. Approach

    Directory of Open Access Journals (Sweden)

    Guido Pinto Aguirre

    2008-01-01

    Full Text Available El propósito de este documento es investigar y re-estimar los efectos de los patrones de lactancia, salud y estado nutricional de la mujer y consumo de energía sobre la duración del retorno de la fertilidad de postparto (es decir, retorno de la menstruación de postparto utilizando toda la información relevante en el estudio longitudinal del Instituto de Nutrición de Centroamérica y Panamá y un procedimiento de estimación más adecuado (modelos de riesgo. Los datos utilizados provienen del Estudio Longitudinal llevado a cabo en Guatemala entre 1967 y 1979. En este artículo se utiliza un modelo de riesgo con varios estados que reconoce diferentes caminos y estados en el proceso del retorno de la fertilidad de postparto. El modelo descansa en la existencia de cinco estados (lactancia total, lactancia parcial, destete, mortalidad infantil y menstruación. También incluye de manera explícita nutrición maternal y consumo de energía de la mujer como elementos estratégicos del modelo. El estudio encontró que los efectos de los patrones de lactancia, nutrición de la madre y patrones de trabajo de la mujer (consumo de energía sobre la fertilidad en las áreas rurales de Guatemala son fuertes y significativos. La contribución de este artículo es mostrar que la aplicación de los modelos de riesgo con múltiples estados proporciona estimados que son consistentes con hipótesis que relacionan patrones de lactancia, estado nutricional maternal y estresores maternales externos a procesos que aceleran (desaceleran el retorno de ciclos menstruales normales.

  6. English Metafunction Analysis in Chemistry Text: Characterization of Scientific Text

    Directory of Open Access Journals (Sweden)

    Ahmad Amin Dalimunte, M.Hum

    2013-09-01

    Full Text Available The objectives of this research are to identify what Metafunctions are applied in chemistry text and how they characterize a scientific text. It was conducted by applying content analysis. The data for this research was a twelve-paragraph chemistry text. The data were collected by applying a documentary technique. The document was read and analyzed to find out the Metafunction. The data were analyzed by some procedures: identifying the types of process, counting up the number of the processes, categorizing and counting up the cohesion devices, classifying the types of modulation and determining modality value, finally counting up the number of sentences and clauses, then scoring the grammatical intricacy index. The findings of the research show that Material process (71of 100 is mostly used, circumstance of spatial location (26 of 56 is more dominant than the others. Modality (5 is less used in order to avoid from subjectivity. Impersonality is implied through less use of reference either pronouns (7 or demonstrative (7, conjunctions (60 are applied to develop ideas, and the total number of the clauses are found much more dominant (109 than the total number of the sentences (40 which results high grammatical intricacy index. The Metafunction found indicate that the chemistry text has fulfilled the characteristics of scientific or academic text which truly reflects it as a natural science.

  7. The Case for Multiple Texts

    Science.gov (United States)

    Cummins, Sunday

    2017-01-01

    Reading just one text on any topic, Cummins argues, isn't enough if we expect students to learn at deep levels about the topic, synthesize various sources of information, and gain the knowledge they need to write and speak seriously about the topic. Reading a second or third text expands a reader's knowledge on any topic or story--and the why…

  8. Text Genres in Information Organization

    Science.gov (United States)

    Nahotko, Marek

    2016-01-01

    Introduction: Text genres used by so-called information organizers in the processes of information organization in information systems were explored in this research. Method: The research employed text genre socio-functional analysis. Five genre groups in information organization were distinguished. Every genre group used in information…

  9. Linguistic Dating of Biblical Texts

    DEFF Research Database (Denmark)

    Ehrensvärd, Martin Gustaf

    2003-01-01

    For two centuries, scholars have pointed to consistent differences in the Hebrew of certain biblical texts and interpreted these differences as reflecting the date of composition of the texts. Until the 1980s, this was quite uncontroversial as the linguistic findings largely confirmed...... the chronology of the texts established by other means: the Hebrew of Genesis-2 Kings was judged to be early and that of Esther, Daniel, Ezra, Nehemiah, and Chronicles to be late. In the current debate where revisionists have questioned the traditional dating, linguistic arguments in the dating of texts have...... come more into focus. The study critically examines some linguistic arguments adduced to support the traditional position, and reviewing the arguments it points to weaknesses in the linguistic dating of EBH texts to pre-exilic times. When viewing the linguistic evidence in isolation it will be clear...

  10. Text-Filled Stacked Area Graphs

    DEFF Research Database (Denmark)

    Kraus, Martin

    2011-01-01

    to consider a visualization a detailed enrichment of their personal experience instead of an abstract representation of anonymous numbers. However, the integration of textual detail into a visualization is often very challenging. This work discusses one particular approach to this problem, namely text...

  11. Linguistic Dating of Biblical Texts

    DEFF Research Database (Denmark)

    Ehrensvärd, Martin Gustaf

    2003-01-01

    For two centuries, scholars have pointed to consistent differences in the Hebrew of certain biblical texts and interpreted these differences as reflecting the date of composition of the texts. Until the 1980s, this was quite uncontroversial as the linguistic findings largely confirmed the chronol......For two centuries, scholars have pointed to consistent differences in the Hebrew of certain biblical texts and interpreted these differences as reflecting the date of composition of the texts. Until the 1980s, this was quite uncontroversial as the linguistic findings largely confirmed...... the chronology of the texts established by other means: the Hebrew of Genesis-2 Kings was judged to be early and that of Esther, Daniel, Ezra, Nehemiah, and Chronicles to be late. In the current debate where revisionists have questioned the traditional dating, linguistic arguments in the dating of texts have...... come more into focus. The study critically examines some linguistic arguments adduced to support the traditional position, and reviewing the arguments it points to weaknesses in the linguistic dating of EBH texts to pre-exilic times. When viewing the linguistic evidence in isolation it will be clear...

  12. Outer Texts in Bilingual Dictionaries

    Directory of Open Access Journals (Sweden)

    Rufus H. Gouws

    2011-10-01

    Full Text Available

    Abstract: Dictionaries often display a central list bias with little or no attention to the use ofouter texts. This article focuses on dictionaries as text compounds and carriers of different texttypes. Utilising either a partial or a complete frame structure, a variety of outer text types can beused to enhance the data distribution structure of a dictionary and to ensure a better informationretrieval by the intended target user. A distinction is made between primary frame structures andsecondary frame structures and attention is drawn to the use of complex outer texts and the need ofan extended complex outer text with its own table of contents to guide the user to the relevant textsin the complex outer text. It is emphasised that outer texts need to be planned in a meticulous wayand that they should participate in the lexicographic functions of the specific dictionary, bothknowledge-orientated and communication-orientated functions, to ensure a transtextual functionalapproach.

    Keywords: BACK MATTER, CENTRAL LIST, COMMUNICATION-ORIENTATED FUNCTIONS,COMPLEX TEXT, CULTURAL DATA, EXTENDED COMPLEX TEXT, EXTENDED TEXTS,FRONT MATTER, FRAME STRUCTURE, KNOWLEDGE-ORIENTATED FUNCTIONS, LEXICOGRAPHICFUNCTIONS, OUTER TEXTS, PRIMARY FRAME, SECONDARY FRAME

    Opsomming: Buitetekste in tweetalige woordeboeke. Woordeboeke vertoondikwels 'n partydigheid ten gunste van die sentrale lys met min of geen aandag aan die buitetekstenie. Hierdie artikel fokus op woordeboeke as tekssamestellings en draers van verskillende tekssoorte.Met die benutting van óf 'n gedeeltelike óf 'n volledige raamstruktuur kan 'n verskeidenheidbuitetekste aangewend word om die dataverspreidingstruktuur van 'n woordeboek te verbeteren om 'n beter herwinning van inligting deur die teikengebruiker te verseker. 'n Onderskeidword gemaak tussen primêre en sekondêre raamstrukture en die aandag word gevestig op kompleksebuitetekste en die behoefte aan 'n uitgebreide komplekse

  13. Stemming Malay Text and Its Application in Automatic Text Categorization

    Science.gov (United States)

    Yasukawa, Michiko; Lim, Hui Tian; Yokoo, Hidetoshi

    In Malay language, there are no conjugations and declensions and affixes have important grammatical functions. In Malay, the same word may function as a noun, an adjective, an adverb, or, a verb, depending on its position in the sentence. Although extensively simple root words are used in informal conversations, it is essential to use the precise words in formal speech or written texts. In Malay, to make sentences clear, derivative words are used. Derivation is achieved mainly by the use of affixes. There are approximately a hundred possible derivative forms of a root word in written language of the educated Malay. Therefore, the composition of Malay words may be complicated. Although there are several types of stemming algorithms available for text processing in English and some other languages, they cannot be used to overcome the difficulties in Malay word stemming. Stemming is the process of reducing various words to their root forms in order to improve the effectiveness of text processing in information systems. It is essential to avoid both over-stemming and under-stemming errors. We have developed a new Malay stemmer (stemming algorithm) for removing inflectional and derivational affixes. Our stemmer uses a set of affix rules and two types of dictionaries: a root-word dictionary and a derivative-word dictionary. The use of set of rules is aimed at reducing the occurrence of under-stemming errors, while that of the dictionaries is believed to reduce the occurrence of over-stemming errors. We performed an experiment to evaluate the application of our stemmer in text mining software. For the experiment, text data used were actual web pages collected from the World Wide Web to demonstrate the effectiveness of our Malay stemming algorithm. The experimental results showed that our stemmer can effectively increase the precision of the extracted Boolean expressions for text categorization.

  14. Anomaly Detection with Text Mining

    Data.gov (United States)

    National Aeronautics and Space Administration — Many existing complex space systems have a significant amount of historical maintenance and problem data bases that are stored in unstructured text forms. The...

  15. An Experimental Text-Commentary

    Science.gov (United States)

    O'Brien, Joan

    1976-01-01

    An experimental text-commentary of selected passages from Sophocles'"Antigone" is described. The commentary is intended for students seeking more than a conventional translation who do not know enough Greek to use a standard commentary. (RM)

  16. Text segmentation with character-level text embeddings

    NARCIS (Netherlands)

    Chrupała, Grzegorz

    2013-01-01

    Learning word representations has recently seen much success in computational linguistics. However, assuming sequences of word tokens as input to linguistic analysis is often unjustified. For many languages word segmentation is a non-trivial task and naturally occurring text is sometimes a mixture

  17. The loyalty of the literary reviser: Author, source text, target text or ...

    African Journals Online (AJOL)

    The processes of revision and translation, according to Mossop (2010:112-113), can address the problem of conflicting interests, goals and needs by taking different approaches. Translation, he suggests, should seek to achieve a balance between loyalty to the source text author and to the target text readers, whereas ...

  18. Chapter 16: text mining for translational bioinformatics.

    Science.gov (United States)

    Cohen, K Bretonnel; Hunter, Lawrence E

    2013-04-01

    Text mining for translational bioinformatics is a new field with tremendous research potential. It is a subfield of biomedical natural language processing that concerns itself directly with the problem of relating basic biomedical research to clinical practice, and vice versa. Applications of text mining fall both into the category of T1 translational research-translating basic science results into new interventions-and T2 translational research, or translational research for public health. Potential use cases include better phenotyping of research subjects, and pharmacogenomic research. A variety of methods for evaluating text mining applications exist, including corpora, structured test suites, and post hoc judging. Two basic principles of linguistic structure are relevant for building text mining applications. One is that linguistic structure consists of multiple levels. The other is that every level of linguistic structure is characterized by ambiguity. There are two basic approaches to text mining: rule-based, also known as knowledge-based; and machine-learning-based, also known as statistical. Many systems are hybrids of the two approaches. Shared tasks have had a strong effect on the direction of the field. Like all translational bioinformatics software, text mining software for translational bioinformatics can be considered health-critical and should be subject to the strictest standards of quality assurance and software testing.

  19. A Guide Text or Many Texts? "That is the Question”

    Directory of Open Access Journals (Sweden)

    Delgado de Valencia Sonia

    2001-08-01

    Full Text Available The use of supplementary materials in the classroom has always been an essential part of the teaching and learning process. To restrict our teaching to the scope of one single textbook means to stand behind the advances of knowledge, in any area and context. Young learners appreciate any new and varied support that expands their knowledge of the world: diaries, letters, panels, free texts, magazines, short stories, poems or literary excerpts, and articles taken from Internet are materials that will allow learnersto share more and work more collaboratively. In this article we are going to deal with some of these materials, with the criteria to select, adapt, and create them that may be of interest to the learner and that may promote reading and writing processes. Since no text can entirely satisfy the needs of students and teachers, the creativity of both parties will be necessary to improve the quality of teaching through the adequate use and adaptation of supplementary materials.

  20. Cluster Based Text Classification Model

    DEFF Research Database (Denmark)

    Nizamani, Sarwat; Memon, Nasrullah; Wiil, Uffe Kock

    2011-01-01

    We propose a cluster based classification model for suspicious email detection and other text classification tasks. The text classification tasks comprise many training examples that require a complex classification model. Using clusters for classification makes the model simpler and increases......, the classifier is trained on each cluster having reduced dimensionality and less number of examples. The experimental results show that the proposed model outperforms the existing classification models for the task of suspicious email detection and topic categorization on the Reuters-21578 and 20 Newsgroups...... datasets. Our model also outperforms A Decision Cluster Classification (ADCC) and the Decision Cluster Forest Classification (DCFC) models on the Reuters-21578 dataset....

  1. Identifying issue frames in text.

    Directory of Open Access Journals (Sweden)

    Eyal Sagi

    Full Text Available Framing, the effect of context on cognitive processes, is a prominent topic of research in psychology and public opinion research. Research on framing has traditionally relied on controlled experiments and manually annotated document collections. In this paper we present a method that allows for quantifying the relative strengths of competing linguistic frames based on corpus analysis. This method requires little human intervention and can therefore be efficiently applied to large bodies of text. We demonstrate its effectiveness by tracking changes in the framing of terror over time and comparing the framing of abortion by Democrats and Republicans in the U.S.

  2. Text linguistics: memory and representation

    Directory of Open Access Journals (Sweden)

    Leonor Lopes Fávero

    2012-12-01

    Full Text Available Text Linguistics originates in Brazil in the 80s of the twentieth century. The first work that we know of is from 1981, authored by Prof. Ignacio Antonio Neiss, entitled Por uma gramática textua, which was followed by two other in 1983: Linguística textual: o que é e como se faz, by Prof. Luiz Antônio Marcuschi and Linguística textual: introdução by Leonor Lopes Favero and Ingedore Villaça Koch. Professor Neiss shows how initial attempts to textual linguistics, were generally related to structural and generative grammars. The work of Prof. Marcuschi focuses on the analysis of some text definitions and on the study of theoretical aspects in relation to their applicability. Leonor Lopes Favero and Ingedore V. Koch aim to provide the Brazilian reader with an overview of text linguistics in Europe, a recent branch of language science then. This work is part of the History of Linguistic Ideas, part of the Cultural History, which seeks to identify how at different times , a social reality is constructed, designed, and enlightened (Chartier, 1990.

  3. Multilingual text induced spelling correction

    NARCIS (Netherlands)

    Reynaert, M.W.C.

    2004-01-01

    We present TISC, a multilingual, language-independent and context-sensitive spelling checking and correction system designed to facilitate the automatic removal of non-word spelling errors in large corpora. Its lexicon is derived from raw text corpora, without supervision, and contains word unigrams

  4. Text as an Autopoietic System

    DEFF Research Database (Denmark)

    Nicolaisen, Maria Skou

    2016-01-01

    The aim of the present research article is to discuss the possibilities and limitations in addressing text as an autopoietic system. The theory of autopoiesis originated in the field of biology in order to explain the dynamic processes entailed in sustaining living organisms at cellular level...

  5. Answering Questions from Oceanography Texts: Learner, Task and Text Characteristics.

    Science.gov (United States)

    1987-09-15

    12IMaERIN KESTISUS FMS OCEUWUNRPIW TOMTS: LEME / TW M TEXT aWAC.. (U) CLIFSWII IWly bdwTNUU S Rt ULIU ET L 15 SEP 8? CMSITIVE SCIENCI -Th471,8 wL07...indicate that the learner ought to read on for the complete answer: "The first item (time) could be read from your seismograph, but there is no direct ...terribly "-" direct but instead of resuming a search process, she retreated back to the same answer she had originally retrieved from memory and the

  6. AHP 45: REVIEW: TIBETAN LITERARY GENRES, TEXTS, AND TEXT TYPES

    Directory of Open Access Journals (Sweden)

    Zoe Tribur

    2017-03-01

    Full Text Available Following the quantitative tradition of sociolinguistic research pioneered by such scholars as William Labov, Walt Wolfram, and Penelope Eckert, Reynolds presents a detailed, coherent analysis of the social parameters behind a specific on-going sound change, the merger of syllable final bilabial nasal (m with aveolar coronal nasal (n, in one small farming community in Qinghai Province. His is certainly not the first such study on Tibetan sound change. It is also not the first study to investigate the merger of (m into (n, which is a prominent feature of so-called "farmer" dialects of Amdo Tibetan (Hua 2005. ...

  7. CCM: A Text Classification Method by Clustering

    DEFF Research Database (Denmark)

    Nizamani, Sarwat; Memon, Nasrullah; Wiil, Uffe Kock

    2011-01-01

    In this paper, a new Cluster based Classification Model (CCM) for suspicious email detection and other text classification tasks, is presented. Comparative experiments of the proposed model against traditional classification models and the boosting algorithm are also discussed. Experimental results...... approach to text classification tasks simplifies the model and at the same time increases the accuracy....... show that the CCM outperforms traditional classification models as well as the boosting algorithm for the task of suspicious email detection on terrorism domain email dataset and topic categorization on the Reuters-21578 and 20 Newsgroups datasets. The overall finding is that applying a cluster based...

  8. Extracting Conceptual Feature Structures from Text

    DEFF Research Database (Denmark)

    Andreasen, Troels; Bulskov, Henrik; Lassen, Tine

    2011-01-01

    This paper describes an approach to indexing texts by their conceptual content using ontologies along with lexico-syntactic information and semantic role assignment provided by lexical resources. The conceptual content of meaningful chunks of text is transformed into conceptual feature structures...... and mapped into concepts in a generative ontology. Synonymous but linguistically quite distinct expressions are mapped to the same concept in the ontology. This allows us to perform a content-based search which will retrieve relevant documents independently of the linguistic form of the query as well...

  9. Locative inferences in medical texts.

    Science.gov (United States)

    Mayer, P S; Bailey, G H; Mayer, R J; Hillis, A; Dvoracek, J E

    1987-06-01

    Medical research relies on epidemiological studies conducted on a large set of clinical records that have been collected from physicians recording individual patient observations. These clinical records are recorded for the purpose of individual care of the patient with little consideration for their use by a biostatistician interested in studying a disease over a large population. Natural language processing of clinical records for epidemiological studies must deal with temporal, locative, and conceptual issues. This makes text understanding and data extraction of clinical records an excellent area for applied research. While much has been done in making temporal or conceptual inferences in medical texts, parallel work in locative inferences has not been done. This paper examines the locative inferences as well as the integration of temporal, locative, and conceptual issues in the clinical record understanding domain by presenting an application that utilizes two key concepts in its parsing strategy--a knowledge-based parsing strategy and a minimal lexicon.

  10. Quality Inspection of Printed Texts

    DEFF Research Database (Denmark)

    Pedersen, Jesper Ballisager; Nasrollahi, Kamal; Moeslund, Thomas B.

    2016-01-01

    Inspecting the quality of printed texts has its own importance in many industrial applications. To do so, this paper proposes a grading system which evaluates the performance of the printing task using some quality measures for each character and symbols. The purpose of these grading system is two......-folded: for costumers of the printing and verification system, the overall grade used to verify if the text is of sufficient quality, while for printer's manufacturer, the detailed character/symbols grades and quality measurements are used for the improvement and optimization of the printing task. The proposed system...... has been tested on images from a real industrial environment and the obtained results are promising....

  11. The TEXT upgrade vertical interferometer

    International Nuclear Information System (INIS)

    Hallock, G.A.; Gartman, M.L.; Li, W.; Chiang, K.; Shin, S.; Castles, R.L.; Chatterjee, R.; Rahman, A.S.

    1992-01-01

    A far-infrared interferometer has been installed on TEXT upgrade to obtain electron density profiles. The primary system views the plasma vertically through a set of large (60-cm radialx7.62-cm toroidal) diagnostic ports. A 1-cm channel spacing (59 channels total) and fast electronic time response is used, to provide high resolution for radial profiles and perturbation experiments. Initial operation of the vertical system was obtained late in 1991, with six operating channels

  12. An Elementary Approach to Thinking under Uncertainty: A Prototype Text

    Science.gov (United States)

    1982-10-01

    the writer rich materialistically . This is advertising language: "Things go better with Coke," "Marlboro Country," or "Reach out and touch someone...million un- employed men. An estimator more knowledgeable about such demographics would make an even more detailed estimate that includes teenagers with...percentage may not seem high if we recall how common it is for teenagers to go through a transient rebellious period. Most of them do not become delinquents

  13. Factor Analytic Approach to Transitive Text Mining using Medline Descriptors

    Science.gov (United States)

    Stegmann, J.; Grohmann, G.

    Matrix decomposition methods were applied to examples of noninteractive literature sets sharing implicit relations. Document-by-term matrices were created from downloaded PubMed literature sets, the terms being the Medical Subject Headings (MeSH descriptors) assigned to the documents. The loadings of the factors derived from singular value or eigenvalue matrix decomposition were sorted according to absolute values and subsequently inspected for positions of terms relevant to the discovery of hidden connections. It was found that only a small number of factors had to be screened to find key terms in close neighbourhood, being separated by a small number of terms only.

  14. Clustering Analysis within Text Classification Techniques

    Directory of Open Access Journals (Sweden)

    Madalina ZURINI

    2011-01-01

    Full Text Available The paper represents a personal approach upon the main applications of classification which are presented in the area of knowledge based society by means of methods and techniques widely spread in the literature. Text classification is underlined in chapter two where the main techniques used are described, along with an integrated taxonomy. The transition is made through the concept of spatial representation. Having the elementary elements of geometry and the artificial intelligence analysis, spatial representation models are presented. Using a parallel approach, spatial dimension is introduced in the process of classification. The main clustering methods are described in an aggregated taxonomy. For an example, spam and ham words are clustered and spatial represented, when the concepts of spam, ham and common and linkage word are presented and explained in the xOy space representation.

  15. [On two antique medical texts].

    Science.gov (United States)

    Rosa, Maria Carlota

    2005-01-01

    The two texts presented here--Regimento proueytoso contra ha pestenença [literally, "useful regime against pestilence"] and Modus curandi cum balsamo ["curing method using balm"]--represent the extent of Portugal's known medical library until circa 1530, produced in gothic letters by foreign printers: Germany's Valentim Fernandes, perhaps the era's most important printer, who worked in Lisbon between 1495 and 1518, and Germdo Galharde, a Frenchman who practiced his trade in Lisbon and Coimbra between 1519 and 1560. Modus curandi, which came to light in 1974 thanks to bibliophile José de Pina Martins, is anonymous. Johannes Jacobi is believed to be the author of Regimento proueytoso, which was translated into Latin (Regimen contra pestilentiam), French, and English. Both texts are presented here in facsimile and in modern Portuguese, while the first has also been reproduced in archaic Portuguese using modern typographical characters. This philological venture into sixteenth-century medicine is supplemented by a scholarly glossary which serves as a valuable tool in interpreting not only Regimento proueytoso but also other texts from the era. Two articles place these documents in historical perspective.

  16. Discursive paradigm of research of a literary text

    OpenAIRE

    Кондратенко, Н. В.

    2015-01-01

    The article defines discursive status of a literary text and outlines the role of the text in the act of literary communication. It also shows the dynamics study of a literary text in linguistics in terms of different methodological approaches and presents communicatively oriented approach to the analysis of literary communication.

  17. [Symbol: see text]2 Optimized predictive image coding with [Symbol: see text]∞ bound.

    Science.gov (United States)

    Chuah, Sceuchin; Dumitrescu, Sorina; Wu, Xiaolin

    2013-12-01

    In many scientific, medical, and defense applications of image/video compression, an [Symbol: see text]∞ error bound is required. However, pure[Symbol: see text]∞-optimized image coding, colloquially known as near-lossless image coding, is prone to structured errors such as contours and speckles if the bit rate is not sufficiently high; moreover, most of the previous [Symbol: see text]∞-based image coding methods suffer from poor rate control. In contrast, the [Symbol: see text]2 error metric aims for average fidelity and hence preserves the subtlety of smooth waveforms better than the ∞ error metric and it offers fine granularity in rate control, but pure [Symbol: see text]2-based image coding methods (e.g., JPEG 2000) cannot bound individual errors as the [Symbol: see text]∞-based methods can. This paper presents a new compression approach to retain the benefits and circumvent the pitfalls of the two error metrics. A common approach of near-lossless image coding is to embed into a DPCM prediction loop a uniform scalar quantizer of residual errors. The said uniform scalar quantizer is replaced, in the proposed new approach, by a set of context-based [Symbol: see text]2-optimized quantizers. The optimization criterion is to minimize a weighted sum of the [Symbol: see text]2 distortion and the entropy while maintaining a strict [Symbol: see text]∞ error bound. The resulting method obtains good rate-distortion performance in both [Symbol: see text]2 and [Symbol: see text]∞ metrics and also increases the rate granularity. Compared with JPEG 2000, the new method not only guarantees lower [Symbol: see text]∞ error for all bit rates, but also it achieves higher PSNR for relatively high bit rates.

  18. The Balinese Unicode Text Processing

    Directory of Open Access Journals (Sweden)

    Imam Habibi

    2009-06-01

    Full Text Available In principal, the computer only recognizes numbers as the representation of a character. Therefore, there are many encoding systems to allocate these numbers although not all characters are covered. In Europe, every single language even needs more than one encoding system. Hence, a new encoding system known as Unicode has been established to overcome this problem. Unicode provides unique id for each different characters which does not depend on platform, program, and language. Unicode standard has been applied in a number of industries, such as Apple, HP, IBM, JustSystem, Microsoft, Oracle, SAP, Sun, Sybase, and Unisys. In addition, language standards and modern information exchanges such as XML, Java, ECMA Script (JavaScript, LDAP, CORBA 3.0, and WML make use of Unicode as an official tool for implementing ISO/IEC 10646. There are four things to do according to Balinese script: the algorithm of transliteration, searching, sorting, and word boundary analysis (spell checking. To verify the truth of algorithm, some applications are made. These applications can run on Linux/Windows OS platform using J2SDK 1.5 and J2ME WTK2 library. The input and output of the algorithm/application are character sequence that is obtained from keyboard punch and external file. This research produces a module or a library which is able to process the Balinese text based on Unicode standard. The output of this research is the ability, skill, and mastering of 1. Unicode standard (21-bit as a substitution to ASCII (7-bit and ISO8859-1 (8-bit as the former default character set in many applications. 2. The Balinese Unicode text processing algorithm. 3. An experience of working with and learning from an international team that consists of the foremost experts in the area: Michael Everson (Ireland, Peter Constable (Microsoft US, I Made Suatjana, and Ida Bagus Adi Sudewa.

  19. Biased limiter experiments on text

    International Nuclear Information System (INIS)

    Phillips, P.E.; Wootton, A.J.; Rowan, W.L.; Ritz, C.P.; Rhodes, T.L.; Bengtson, R.D.; Hodge, W.L.; Durst, R.D.; McCool, S.C.; Richards, B.; Gentle, K.W.; Schoch, P.; Forster, J.C.; Hickok, R.L.; Evans, T.E.

    1987-01-01

    Experiments using an electrically biased limiter have been performed on the Texas Experimental Tokamak (TEXT). A small movable limiter is inserted past the main poloidal ring limiter (which is electrically connected to the vacuum vessel) and biased at V Lim with respect to it. The floating potential, plasma potential and shear layer position can be controlled. With vertical strokeV Lim vertical stroke ≥ 50 V the plasma density increases. For V Lim Lim > 0 the results obtained are inconclusive. Variation of V Lim changes the electrostatic turbulence which may explain the observed total flux changes. (orig.)

  20. Text mining by Tsallis entropy

    Science.gov (United States)

    Jamaati, Maryam; Mehri, Ali

    2018-01-01

    Long-range correlations between the elements of natural languages enable them to convey very complex information. Complex structure of human language, as a manifestation of natural languages, motivates us to apply nonextensive statistical mechanics in text mining. Tsallis entropy appropriately ranks the terms' relevance to document subject, taking advantage of their spatial correlation length. We apply this statistical concept as a new powerful word ranking metric in order to extract keywords of a single document. We carry out an experimental evaluation, which shows capability of the presented method in keyword extraction. We find that, Tsallis entropy has reliable word ranking performance, at the same level of the best previous ranking methods.

  1. Automatic segmentation of clinical texts.

    Science.gov (United States)

    Apostolova, Emilia; Channin, David S; Demner-Fushman, Dina; Furst, Jacob; Lytinen, Steven; Raicu, Daniela

    2009-01-01

    Clinical narratives, such as radiology and pathology reports, are commonly available in electronic form. However, they are also commonly entered and stored as free text. Knowledge of the structure of clinical narratives is necessary for enhancing the productivity of healthcare departments and facilitating research. This study attempts to automatically segment medical reports into semantic sections. Our goal is to develop a robust and scalable medical report segmentation system requiring minimum user input for efficient retrieval and extraction of information from free-text clinical narratives. Hand-crafted rules were used to automatically identify a high-confidence training set. This automatically created training dataset was later used to develop metrics and an algorithm that determines the semantic structure of the medical reports. A word-vector cosine similarity metric combined with several heuristics was used to classify each report sentence into one of several pre-defined semantic sections. This baseline algorithm achieved 79% accuracy. A Support Vector Machine (SVM) classifier trained on additional formatting and contextual features was able to achieve 90% accuracy. Plans for future work include developing a configurable system that could accommodate various medical report formatting and content standards.

  2. Transfer Learning beyond Text Classification

    Science.gov (United States)

    Yang, Qiang

    Transfer learning is a new machine learning and data mining framework that allows the training and test data to come from different distributions or feature spaces. We can find many novel applications of machine learning and data mining where transfer learning is necessary. While much has been done in transfer learning in text classification and reinforcement learning, there has been a lack of documented success stories of novel applications of transfer learning in other areas. In this invited article, I will argue that transfer learning is in fact quite ubiquitous in many real world applications. In this article, I will illustrate this point through an overview of a broad spectrum of applications of transfer learning that range from collaborative filtering to sensor based location estimation and logical action model learning for AI planning. I will also discuss some potential future directions of transfer learning.

  3. Dynamic Chemical Model for $\\text {H} _2 $/$\\text {O} _2 $ Combustion Developed Through a Community Workflow

    KAUST Repository

    Oreluk, James

    2018-01-30

    Elementary-reaction models for $\\\\text{H}_2$/$\\\\text{O}_2$ combustion were evaluated and optimized through a collaborative workflow, establishing accuracy and characterizing uncertainties. Quantitative findings were the optimized model, the importance of $\\\\text{H}_2 + \\\\text{O}_2(1\\\\Delta) = \\\\text{H} + \\\\text{HO}_2$ in high-pressure flames, and the inconsistency of certain low-temperature shock-tube data. The workflow described here is proposed to be even more important because the approach and publicly available cyberinfrastructure allows future community development of evolving improvements. The workflow steps applied here were to develop an initial reaction set using Burke et al. [2012], Burke et al. [2013], Sellevag et al. [2009], and Konnov [2015]; test it for thermodynamic and kinetics consistency and plausibility against other sets in the literature; assign estimated uncertainties where not stated in the sources; select key data targets (

  4. Computer-Aided Generation of Result Text for Clinical Laboratory Texts

    OpenAIRE

    Kuzmak, Peter M.; Miller, R. E.

    1983-01-01

    Efficient processing of non-numeric textual data is a frequent requirement with medical computer applications such as clinical laboratory result reporting. In such instances, it is often desirable that the computer control the generation of the text to ensure that the intended meaning is conveyed. This paper describes a technique for interactively selecting predefined text segments to form complex textual reports for laboratory tests. The approach, which uses algorithms based on augmented tra...

  5. Audio Steganography with Embedded Text

    Science.gov (United States)

    Teck Jian, Chua; Chai Wen, Chuah; Rahman, Nurul Hidayah Binti Ab.; Hamid, Isredza Rahmi Binti A.

    2017-08-01

    Audio steganography is about hiding the secret message into the audio. It is a technique uses to secure the transmission of secret information or hide their existence. It also may provide confidentiality to secret message if the message is encrypted. To date most of the steganography software such as Mp3Stego and DeepSound use block cipher such as Advanced Encryption Standard or Data Encryption Standard to encrypt the secret message. It is a good practice for security. However, the encrypted message may become too long to embed in audio and cause distortion of cover audio if the secret message is too long. Hence, there is a need to encrypt the message with stream cipher before embedding the message into the audio. This is because stream cipher provides bit by bit encryption meanwhile block cipher provide a fixed length of bits encryption which result a longer output compare to stream cipher. Hence, an audio steganography with embedding text with Rivest Cipher 4 encryption cipher is design, develop and test in this project.

  6. A programmed text in statistics

    CERN Document Server

    Hine, J

    1975-01-01

    Exercises for Section 2 42 Physical sciences and engineering 42 43 Biological sciences 45 Social sciences Solutions to Exercises, Section 1 47 Physical sciences and engineering 47 49 Biological sciences 49 Social sciences Solutions to Exercises, Section 2 51 51 PhYSical sciences and engineering 55 Biological sciences 58 Social sciences 62 Tables 2 62 x - tests involving variances 2 63,64 x - one tailed tests 2 65 x - two tailed tests F-distribution 66-69 Preface This project started some years ago when the Nuffield Foundation kindly gave a grant for writing a pro­ grammed text to use with service courses in statistics. The work carried out by Mrs. Joan Hine and Professor G. B. Wetherill at Bath University, together with some other help from time to time by colleagues at Bath University and elsewhere. Testing was done at various colleges and universities, and some helpful comments were received, but we particularly mention King Edwards School, Bath, who provided some sixth formers as 'guinea pigs' for the fir...

  7. ERRORS AND DIFFICULTIES IN TRANSLATING LEGAL TEXTS

    Directory of Open Access Journals (Sweden)

    Camelia, CHIRILA

    2014-11-01

    Full Text Available Nowadays the accurate translation of legal texts has become highly important as the mistranslation of a passage in a contract, for example, could lead to lawsuits and loss of money. Consequently, the translation of legal texts to other languages faces many difficulties and only professional translators specialised in legal translation should deal with the translation of legal documents and scholarly writings. The purpose of this paper is to analyze translation from three perspectives: translation quality, errors and difficulties encountered in translating legal texts and consequences of such errors in professional translation. First of all, the paper points out the importance of performing a good and correct translation, which is one of the most important elements to be considered when discussing translation. Furthermore, the paper presents an overview of the errors and difficulties in translating texts and of the consequences of errors in professional translation, with applications to the field of law. The paper is also an approach to the differences between languages (English and Romanian that can hinder comprehension for those who have embarked upon the difficult task of translation. The research method that I have used to achieve the objectives of the paper was the content analysis of various Romanian and foreign authors' works.

  8. NEWS TEXT GENRE OF THE BALI TIMES

    Directory of Open Access Journals (Sweden)

    Johanes Sutomo

    2015-01-01

    Full Text Available This study aims at portraying some aspects of the contextual configuration of Bali Times’s news texts. The study focuses on the communicative purposes, linguistic features and schematic structures of the news text genre. The method used is descriptive, and the approaches are quantitative and qualitative. The results revealed that the news texts could be categorized as the genre of news service. The Generic Structure Potential (GSP deriving from the selected news texts may accomplish the service. The GSP is a condensed statement and a powerful device that generates a large number of possible actual structures.  Penelitian ini dilakukan untuk memotret beberapa aspek dari konfigurasi kontekstual dari teks berita Bali Times. Penelitian ini difokuskan pada tujuan komunikatif, fitur linguistik dan struktur skematik dari genre teks berita. Metode yang digunakan deskriptif, dengan pendekatan kuantitatif dan kualitatif. Hasil penelitian menunjukkan bahwa teks berita dapat dikategorikan sebagai genre layanan berita. Struktur Generik Potensial yang merupakan derivasi dari teks berita terpilih dimungkinkan mampu memenuhi layanan ini. Struktur ini merupakan pernyataan ringkas dan sebagai perangkat yang hebat yang mampu menggeneralisasi struktur aktual yang memungkinkan dalam jumlah yang besar

  9. Difficulties in translation of socio-political texts

    Directory of Open Access Journals (Sweden)

    Артур Нарманович Мамедов

    2013-12-01

    Full Text Available Belonging of Russian socio-political texts to publicistic style assumes being guided by functional approach in order to find most adequate linguistic means by transfer of pragmatic meaning of the source text. Intralinguistic meaning can slightly remain by the interpretation of German texts. Lexical and grammatical transformations help preserving semantic-syntactic structure of the target text which means achievement of the same communicative effect by the translate which is being achieved by the source text.

  10. Comprehension de Texte et Actes de Parole (Understanding a Text and Speech Acts).

    Science.gov (United States)

    Menting, Jan P.

    Oral production (the speech act) and the comprehension of written and oral texts have long been treated as separate entities when they could be used in an integrated approach to teach a broader range of language skills. The receptive aspects of the speech act should be brought into language instruction and used to select materials for teaching…

  11. Usages possible d'un texte litteraire (Possible Uses of a Literary Text).

    Science.gov (United States)

    Kirpalani, Marie-Claudette

    1984-01-01

    A French short story is used to illustrate several possible levels of study for a single literary text: linguistic analysis, textual analysis, and analysis of uses of the romantic form. The approach suggested integrates these levels so that each is enriched by the perspectives offered by the others. (MSE)

  12. Didactique du texte litteraire: Un parcours a etapes (Teaching Literary Text: A Journey in Stages).

    Science.gov (United States)

    Gruca, Isabelle

    1996-01-01

    Outlines a three-stage strategy for teaching appreciation of French literature. The first stage is a global approach to the work, giving an overview of its structure and presentation. The second stage looks at the text type and generic characteristics, and the third focuses on the specific treatment given these characteristics and the details of…

  13. Extraction of information from unstructured text

    Energy Technology Data Exchange (ETDEWEB)

    Irwin, N.H.; DeLand, S.M.; Crowder, S.V.

    1995-11-01

    Extracting information from unstructured text has become an emphasis in recent years due to the large amount of text now electronically available. This status report describes the findings and work done by the end of the first year of a two-year LDRD. Requirements of the approach included that it model the information in a domain independent way. This means that it would differ from current systems by not relying on previously built domain knowledge and that it would do more than keyword identification. Three areas that are discussed and expected to contribute to a solution include (1) identifying key entities through document level profiling and preprocessing, (2) identifying relationships between entities through sentence level syntax, and (3) combining the first two with semantic knowledge about the terms.

  14. INNER DIALOGICITY OF MEDICAL SCIENTIFIC TEXTS

    Directory of Open Access Journals (Sweden)

    Efremova Nataliya Vladimirovna

    2015-06-01

    Full Text Available The author studies inner dialogicity as an integral property of a scientist's thinking activity, a way of a scientific idea development, one of the cognitive and discursive mechanisms of new knowledge formation, its crystallization and dementalisation in a text, as a way of search for truth. Such approach to dialogicity in the study of a scientific text makes it possible to analyze the cogitative processes proceeding in human consciousness and cognitive activity, allows to fully understand the stated scientific concept, to define pragmatic strategies of the author, to plunge into his reflexive world. On the material of medical scientific texts of N.M. Amosov and F. G. Uglov, famous scientists in the field of cardio surgery, it is established that traces of internal dialogicity manifestation in the textual space of scientists actualize the origin of new knowledge, the change of author's semantic positions, his ability to reflect, compare, analyze his own thoughts and actions, to estimate oneself and the features of thinking process which are realized in logic of a statement of the scientific concept, an explanation of concepts, terms at judgment of the points of view of contemporaries and predecessors, adherents and scientist's opponents, and also orientation to the addressee's presupposition, activization of his cogitative activity. Linguistic, discursive, verbal analysis singles out the impact on the addressee, his mental activity.

  15. TAG LINES EXTRACTION AND RANKING IN TEXT ANNOTATION

    Directory of Open Access Journals (Sweden)

    S. V. Popova

    2013-01-01

    Full Text Available The article deals with comparative analysis of two approaches for tag lines ranking in text annotation task. The first approach is based on a weight evaluation of extracted phrases using TextRank algorithm, the second one is based on tf-idf estimation. Experiments were carried out on the base of INSPEC dataset collection. Experiment descriptions and comparative results are given. It was shown experimentally that tf-idf approach gives better results than the one based on TextRank.

  16. Analysis of the equivalence relationship between [Formula: see text]-minimization and [Formula: see text]-minimization.

    Science.gov (United States)

    Wang, Changlong; Peng, Jigen

    2017-01-01

    In signal processing theory, [Formula: see text]-minimization is an important mathematical model. Unfortunately, [Formula: see text]-minimization is actually NP-hard. The most widely studied approach to this NP-hard problem is based on solving [Formula: see text]-minimization ([Formula: see text]). In this paper, we present an analytic expression of [Formula: see text], which is formulated by the dimension of the matrix [Formula: see text], the eigenvalue of the matrix [Formula: see text], and the vector [Formula: see text], such that every k -sparse vector [Formula: see text] can be exactly recovered via [Formula: see text]-minimization whenever [Formula: see text], that is, [Formula: see text]-minimization is equivalent to [Formula: see text]-minimization whenever [Formula: see text]. The superiority of our results is that the analytic expression and each its part can be easily calculated. Finally, we give two examples to confirm the validity of our conclusions.

  17. Making School Development Credible. Text, Context, Irony

    Directory of Open Access Journals (Sweden)

    Mats Börjesson

    2012-01-01

    Full Text Available

    The article argues for the importance of an open, reflexive-methodological approach when switching between studying text, context and researcher activity. Close linguistic analysis can benefit from being linked with the researcher’s contextualisation of his empirical material as well as with more distanced readings. The more specific starting point for this article is that school development, like other similar terms such as school improvement and the like, makes use of linguistic building blocks with which whole narratives about today’s and tomorrow’s schools can be constructed. The subject of the study is a short text issued by the Swedish Schools Inspectorate (Skolinspektionen. Government language changes according to the authorities’ role in society and their own definitions of their functions, and an important aspect here is the legitimacy of the authorities’ texts. By means of various kinds of close linguistic analysis, the above-mentioned text is studied with regard to choice of categories, hierarchies of modalisation and the rhetorical effects of different types of formulations in a broader political-social landscape. The article concludes with a reflective discussion on the relationship between government language and irony as a stylistic device – a device that is based on the results of the close empirical analysis.[i]



    [i] The article is part of the project ”School  Development as Narrative”, funded by the Swedish Research Council. The author would like to thank the two reviewers for very valuable comments.

  18. Empirical Studies On Machine Learning Based Text Classification Algorithms

    OpenAIRE

    Shweta C. Dharmadhikari; Maya Ingle; Parag Kulkarni

    2011-01-01

    Automatic classification of text documents has become an important research issue now days. Properclassification of text documents requires information retrieval, machine learning and Natural languageprocessing (NLP) techniques. Our aim is to focus on important approaches to automatic textclassification based on machine learning techniques viz. supervised, unsupervised and semi supervised.In this paper we present a review of various text classification approaches under machine learningparadig...

  19. Helios: Understanding Solar Evolution Through Text Analytics

    Energy Technology Data Exchange (ETDEWEB)

    Randazzese, Lucien [SRI International, Menlo Park, CA (United States)

    2016-12-02

    This proof-of-concept project focused on developing, testing, and validating a range of bibliometric, text analytic, and machine-learning based methods to explore the evolution of three photovoltaic (PV) technologies: Cadmium Telluride (CdTe), Dye-Sensitized solar cells (DSSC), and Multi-junction solar cells. The analytical approach to the work was inspired by previous work by the same team to measure and predict the scientific prominence of terms and entities within specific research domains. The goal was to create tools that could assist domain-knowledgeable analysts in investigating the history and path of technological developments in general, with a focus on analyzing step-function changes in performance, or “breakthroughs,” in particular. The text-analytics platform developed during this project was dubbed Helios. The project relied on computational methods for analyzing large corpora of technical documents. For this project we ingested technical documents from the following sources into Helios: Thomson Scientific Web of Science (papers), the U.S. Patent & Trademark Office (patents), the U.S. Department of Energy (technical documents), the U.S. National Science Foundation (project funding summaries), and a hand curated set of full-text documents from Thomson Scientific and other sources.

  20. Named entity recognition in Slovene text

    Directory of Open Access Journals (Sweden)

    Tadej Štajner

    2013-12-01

    Full Text Available This paper presents an approach and an implementation of a named entity extractor for Slovene language, based on a machine learning approach. It is designed as a supervised algorithm based on Conditional Random Fields and is trained on the ssj500k annotated corpus of Slovene. The corpus, which is available under a Creative Commons CC-BY-NC-SA licence, is annotated with morphosyntactic tags, as well as named entities for people, locations, organisations, and miscellaneous names. The paper discusses the influence of morphosyntactic tags, lexicons and conjunctions of features of neighbouring words. An important contribution of this investigation is that morphosyntactic tags benefit named entity extraction. Using all the best-performing features the recognizer reaches a precision of 74% and a recall of 72%, having stronger performance on personal and geographical named entities, followed by organizations, but performs poorly on the miscellaneous entities, since this class is very diverse and consequently difficult to predict. A major contribution of the paper is also showing the benefits of splitting the class of miscellaneous entities into organizations and other entities, which in turn improves performance even on personal and organizational names. The software, developed in this research is freely available under the Apache 2.0 licence at http://ailab.ijs.si/~tadej/slner.zip, while development versions are available at https://github.com/tadejs/slner.

  1. Measurement of prompt and nonprompt [Formula: see text] production in [Formula: see text] and [Formula: see text] collisions at [Formula: see text].

    Science.gov (United States)

    Sirunyan, A M; Tumasyan, A; Adam, W; Asilar, E; Bergauer, T; Brandstetter, J; Brondolin, E; Dragicevic, M; Erö, J; Flechl, M; Friedl, M; Frühwirth, R; Ghete, V M; Hartl, C; Hörmann, N; Hrubec, J; Jeitler, M; König, A; Krätschmer, I; Liko, D; Matsushita, T; Mikulec, I; Rabady, D; Rad, N; Rahbaran, B; Rohringer, H; Schieck, J; Strauss, J; Waltenberger, W; Wulz, C-E; Dvornikov, O; Makarenko, V; Mossolov, V; Suarez Gonzalez, J; Zykunov, V; Shumeiko, N; Alderweireldt, S; De Wolf, E A; Janssen, X; Lauwers, J; Van De Klundert, M; Van Haevermaet, H; Van Mechelen, P; Van Remortel, N; Van Spilbeeck, A; Abu Zeid, S; Blekman, F; D'Hondt, J; Daci, N; De Bruyn, I; Deroover, K; Lowette, S; Moortgat, S; Moreels, L; Olbrechts, A; Python, Q; Skovpen, K; Tavernier, S; Van Doninck, W; Van Mulders, P; Van Parijs, I; Brun, H; Clerbaux, B; De Lentdecker, G; Delannoy, H; Fasanella, G; Favart, L; Goldouzian, R; Grebenyuk, A; Karapostoli, G; Lenzi, T; Léonard, A; Luetic, J; Maerschalk, T; Marinov, A; Randle-Conde, A; Seva, T; Vander Velde, C; Vanlaer, P; Vannerom, D; Yonamine, R; Zenoni, F; Zhang, F; Cimmino, A; Cornelis, T; Dobur, D; Fagot, A; Gul, M; Khvastunov, I; Poyraz, D; Salva, S; Schöfbeck, R; Tytgat, M; Van Driessche, W; Yazgan, E; Zaganidis, N; Bakhshiansohi, H; Beluffi, C; Bondu, O; Brochet, S; Bruno, G; Caudron, A; De Visscher, S; Delaere, C; Delcourt, M; Francois, B; Giammanco, A; Jafari, A; Komm, M; Krintiras, G; Lemaitre, V; Magitteri, A; Mertens, A; Musich, M; Piotrzkowski, K; Quertenmont, L; Selvaggi, M; Vidal Marono, M; Wertz, S; Beliy, N; Aldá Júnior, W L; Alves, F L; Alves, G A; Brito, L; Hensel, C; Moraes, A; Pol, M E; Rebello Teles, P; Belchior Batista Das Chagas, E; Carvalho, W; Chinellato, J; Custódio, A; Da Costa, E M; Da Silveira, G G; De Jesus Damiao, D; De Oliveira Martins, C; Fonseca De Souza, S; Huertas Guativa, L M; Malbouisson, H; Matos Figueiredo, D; Mora Herrera, C; Mundim, L; Nogima, H; Prado Da Silva, W L; Santoro, A; Sznajder, A; Tonelli Manganote, E J; Torres Da Silva De Araujo, F; Vilela Pereira, A; Ahuja, S; Bernardes, C A; Dogra, S; Fernandez Perez Tomei, T R; Gregores, E M; Mercadante, P G; Moon, C S; Novaes, S F; Padula, Sandra S; Romero Abad, D; Ruiz Vargas, J C; Aleksandrov, A; Hadjiiska, R; Iaydjiev, P; Rodozov, M; Stoykova, S; Sultanov, G; Vutova, M; Dimitrov, A; Glushkov, I; Litov, L; Pavlov, B; Petkov, P; Fang, W; Ahmad, M; Bian, J G; Chen, G M; Chen, H S; Chen, M; Chen, Y; Cheng, T; Jiang, C H; Leggat, D; Liu, Z; Romeo, F; Ruan, M; Shaheen, S M; Spiezia, A; Tao, J; Wang, C; Wang, Z; Zhang, H; Zhao, J; Ban, Y; Chen, G; Li, Q; Liu, S; Mao, Y; Qian, S J; Wang, D; Xu, Z; Avila, C; Cabrera, A; Chaparro Sierra, L F; Florez, C; Gomez, J P; González Hernández, C F; Ruiz Alvarez, J D; Sanabria, J C; Godinovic, N; Lelas, D; Puljak, I; Ribeiro Cipriano, P M; Sculac, T; Antunovic, Z; Kovac, M; Brigljevic, V; Ferencek, D; Kadija, K; Mesic, B; Susa, T; Attikis, A; Mavromanolakis, G; Mousa, J; Nicolaou, C; Ptochos, F; Razis, P A; Rykaczewski, H; Tsiakkouri, D; Finger, M; Finger, M; Carrera Jarrin, E; Assran, Y; Elkafrawy, T; Mahrous, A; Kadastik, M; Perrini, L; Raidal, M; Tiko, A; Veelken, C; Eerola, P; Pekkanen, J; Voutilainen, M; Härkönen, J; Järvinen, T; Karimäki, V; Kinnunen, R; Lampén, T; Lassila-Perini, K; Lehti, S; Lindén, T; Luukka, P; Tuominiemi, J; Tuovinen, E; Wendland, L; Talvitie, J; Tuuva, T; Besancon, M; Couderc, F; Dejardin, M; Denegri, D; Fabbro, B; Faure, J L; Favaro, C; Ferri, F; Ganjour, S; Ghosh, S; Givernaud, A; Gras, P; Hamel de Monchenault, G; Jarry, P; Kucher, I; Locci, E; Machet, M; Malcles, J; Rander, J; Rosowsky, A; Titov, M; Abdulsalam, A; Antropov, I; Arleo, F; Baffioni, S; Beaudette, F; Busson, P; Cadamuro, L; Chapon, E; Charlot, C; Davignon, O; Granier de Cassagnac, R; Jo, M; Lisniak, S; Miné, P; Nguyen, M; Ochando, C; Ortona, G; Paganini, P; Pigard, P; Regnard, S; Salerno, R; Sirois, Y; Strebler, T; Yilmaz, Y; Zabi, A; Zghiche, A; Agram, J-L; Andrea, J; Aubin, A; Bloch, D; Brom, J-M; Buttignol, M; Chabert, E C; Chanon, N; Collard, C; Conte, E; Coubez, X; Fontaine, J-C; Gelé, D; Goerlach, U; Le Bihan, A-C; Van Hove, P; Gadrat, S; Beauceron, S; Bernet, C; Boudoul, G; Carrillo Montoya, C A; Chierici, R; Contardo, D; Courbon, B; Depasse, P; El Mamouni, H; Fay, J; Gascon, S; Gouzevitch, M; Grenier, G; Ille, B; Lagarde, F; Laktineh, I B; Lethuillier, M; Mirabito, L; Pequegnot, A L; Perries, S; Popov, A; Sabes, D; Sordini, V; Vander Donckt, M; Verdier, P; Viret, S; Khvedelidze, A; Tsamalaidze, Z; Autermann, C; Beranek, S; Feld, L; Kiesel, M K; Klein, K; Lipinski, M; Preuten, M; Schomakers, C; Schulz, J; Verlage, T; Albert, A; Brodski, M; Dietz-Laursonn, E; Duchardt, D; Endres, M; Erdmann, M; Erdweg, S; Esch, T; Fischer, R; Güth, A; Hamer, M; Hebbeker, T; Heidemann, C; Hoepfner, K; Knutzen, S; Merschmeyer, M; Meyer, A; Millet, P; Mukherjee, S; Olschewski, M; Padeken, K; Pook, T; Radziej, M; Reithler, H; Rieger, M; Scheuch, F; Sonnenschein, L; Teyssier, D; Thüer, S; Cherepanov, V; Flügge, G; Kargoll, B; Kress, T; Künsken, A; Lingemann, J; Müller, T; Nehrkorn, A; Nowack, A; Pistone, C; Pooth, O; Stahl, A; Aldaya Martin, M; Arndt, T; Asawatangtrakuldee, C; Beernaert, K; Behnke, O; Behrens, U; Bin Anuar, A A; Borras, K; Campbell, A; Connor, P; Contreras-Campana, C; Costanza, F; Diez Pardos, C; Dolinska, G; Eckerlin, G; Eckstein, D; Eichhorn, T; Eren, E; Gallo, E; Garay Garcia, J; Geiser, A; Gizhko, A; Grados Luyando, J M; Grohsjean, A; Gunnellini, P; Harb, A; Hauk, J; Hempel, M; Jung, H; Kalogeropoulos, A; Karacheban, O; Kasemann, M; Keaveney, J; Kleinwort, C; Korol, I; Krücker, D; Lange, W; Lelek, A; Lenz, T; Leonard, J; Lipka, K; Lobanov, A; Lohmann, W; Mankel, R; Melzer-Pellmann, I-A; Meyer, A B; Mittag, G; Mnich, J; Mussgiller, A; Pitzl, D; Placakyte, R; Raspereza, A; Roland, B; Sahin, M Ö; Saxena, P; Schoerner-Sadenius, T; Spannagel, S; Stefaniuk, N; Van Onsem, G P; Walsh, R; Wissing, C; Blobel, V; Centis Vignali, M; Draeger, A R; Dreyer, T; Garutti, E; Gonzalez, D; Haller, J; Hoffmann, M; Junkes, A; Klanner, R; Kogler, R; Kovalchuk, N; Lapsien, T; Marchesini, I; Marconi, D; Meyer, M; Niedziela, M; Nowatschin, D; Pantaleo, F; Peiffer, T; Perieanu, A; Poehlsen, J; Scharf, C; Schleper, P; Schmidt, A; Schumann, S; Schwandt, J; Stadie, H; Steinbrück, G; Stober, F M; Stöver, M; Tholen, H; Troendle, D; Usai, E; Vanelderen, L; Vanhoefer, A; Vormwald, B; Akbiyik, M; Barth, C; Baur, S; Baus, C; Berger, J; Butz, E; Caspart, R; Chwalek, T; Colombo, F; De Boer, W; Dierlamm, A; Fink, S; Freund, B; Friese, R; Giffels, M; Gilbert, A; Goldenzweig, P; Haitz, D; Hartmann, F; Heindl, S M; Husemann, U; Katkov, I; Kudella, S; Mildner, H; Mozer, M U; Müller, Th; Plagge, M; Quast, G; Rabbertz, K; Röcker, S; Roscher, F; Schröder, M; Shvetsov, I; Sieber, G; Simonis, H J; Ulrich, R; Wayand, S; Weber, M; Weiler, T; Williamson, S; Wöhrmann, C; Wolf, R; Anagnostou, G; Daskalakis, G; Geralis, T; Giakoumopoulou, V A; Kyriakis, A; Loukas, D; Topsis-Giotis, I; Kesisoglou, S; Panagiotou, A; Saoulidou, N; Tziaferi, E; Evangelou, I; Flouris, G; Foudas, C; Kokkas, P; Loukas, N; Manthos, N; Papadopoulos, I; Paradas, E; Filipovic, N; Pasztor, G; Bencze, G; Hajdu, C; Horvath, D; Sikler, F; Veszpremi, V; Vesztergombi, G; Zsigmond, A J; Beni, N; Czellar, S; Karancsi, J; Makovec, A; Molnar, J; Szillasi, Z; Bartók, M; Raics, P; Trocsanyi, Z L; Ujvari, B; Komaragiri, J R; Bahinipati, S; Bhowmik, S; Choudhury, S; Mal, P; Mandal, K; Nayak, A; Sahoo, D K; Sahoo, N; Swain, S K; Bansal, S; Beri, S B; Bhatnagar, V; Chawla, R; Bhawandeep, U; Kalsi, A K; Kaur, A; Kaur, M; Kumar, R; Kumari, P; Mehta, A; Mittal, M; Singh, J B; Walia, G; Kumar, Ashok; Bhardwaj, A; Choudhary, B C; Garg, R B; Keshri, S; Malhotra, S; Naimuddin, M; Ranjan, K; Sharma, R; Sharma, V; Bhattacharya, R; Bhattacharya, S; Chatterjee, K; Dey, S; Dutt, S; Dutta, S; Ghosh, S; Majumdar, N; Modak, A; Mondal, K; Mukhopadhyay, S; Nandan, S; Purohit, A; Roy, A; Roy, D; Roy Chowdhury, S; Sarkar, S; Sharan, M; Thakur, S; Behera, P K; Chudasama, R; Dutta, D; Jha, V; Kumar, V; Mohanty, A K; Netrakanti, P K; Pant, L M; Shukla, P; Topkar, A; Aziz, T; Dugad, S; Kole, G; Mahakud, B; Mitra, S; Mohanty, G B; Parida, B; Sur, N; Sutar, B; Banerjee, S; Dewanjee, R K; Ganguly, S; Guchait, M; Jain, Sa; Kumar, S; Maity, M; Majumder, G; Mazumdar, K; Sarkar, T; Wickramage, N; Chauhan, S; Dube, S; Hegde, V; Kapoor, A; Kothekar, K; Pandey, S; Rane, A; Sharma, S; Chenarani, S; Eskandari Tadavani, E; Etesami, S M; Khakzad, M; Mohammadi Najafabadi, M; Naseri, M; Paktinat Mehdiabadi, S; Rezaei Hosseinabadi, F; Safarzadeh, B; Zeinali, M; Felcini, M; Grunewald, M; Abbrescia, M; Calabria, C; Caputo, C; Colaleo, A; Creanza, D; Cristella, L; De Filippis, N; De Palma, M; Fiore, L; Iaselli, G; Maggi, G; Maggi, M; Miniello, G; My, S; Nuzzo, S; Pompili, A; Pugliese, G; Radogna, R; Ranieri, A; Selvaggi, G; Sharma, A; Silvestris, L; Venditti, R; Verwilligen, P; Abbiendi, G; Battilana, C; Bonacorsi, D; Braibant-Giacomelli, S; Brigliadori, L; Campanini, R; Capiluppi, P; Castro, A; Cavallo, F R; Chhibra, S S; Codispoti, G; Cuffiani, M; Dallavalle, G M; Fabbri, F; Fanfani, A; Fasanella, D; Giacomelli, P; Grandi, C; Guiducci, L; Marcellini, S; Masetti, G; Montanari, A; Navarria, F L; Perrotta, A; Rossi, A M; Rovelli, T; Siroli, G P; Tosi, N; Albergo, S; Costa, S; Di Mattia, A; Giordano, F; Potenza, R; Tricomi, A; Tuve, C; Barbagli, G; Ciulli, V; Civinini, C; D'Alessandro, R; Focardi, E; Lenzi, P; Meschini, M; Paoletti, S; Russo, L; Sguazzoni, G; Strom, D; Viliani, L; Benussi, L; Bianco, S; Fabbri, F; Piccolo, D; Primavera, F; Calvelli, V; Ferro, F; Monge, M R; Robutti, E; Tosi, S; Brianza, L; Brivio, F; Ciriolo, V; Dinardo, M E; Fiorendi, S; Gennai, S; Ghezzi, A; Govoni, P; Malberti, M; Malvezzi, S; Manzoni, R A; Menasce, D; Moroni, L; Paganoni, M; Pedrini, D; Pigazzini, S; Ragazzi, S; Tabarelli de Fatis, T; Buontempo, S; Cavallo, N; De Nardo, G; Di Guida, S; Esposito, M; Fabozzi, F; Fienga, F; Iorio, A O M; Lanza, G; Lista, L; Meola, S; Paolucci, P; Sciacca, C; Thyssen, F; Azzi, P; Bacchetta, N; Benato, L; Boletti, A; Carlin, R; Checchia, P; Dall'Osso, M; De Castro Manzano, P; Dorigo, T; Dosselli, U; Gasparini, F; Gasparini, U; Gozzelino, A; Lacaprara, S; Margoni, M; Meneguzzo, A T; Pazzini, J; Pegoraro, M; Pozzobon, N; Ronchese, P; Sgaravatto, M; Simonetto, F; Torassa, E; Ventura, S; Zanetti, M; Zotto, P; Braghieri, A; Fallavollita, F; Magnani, A; Montagna, P; Ratti, S P; Re, V; Riccardi, C; Salvini, P; Vai, I; Vitulo, P; Alunni Solestizi, L; Bilei, G M; Ciangottini, D; Fanò, L; Lariccia, P; Leonardi, R; Mantovani, G; Menichelli, M; Saha, A; Santocchia, A; Androsov, K; Azzurri, P; Bagliesi, G; Bernardini, J; Boccali, T; Castaldi, R; Ciocci, M A; Dell'Orso, R; Donato, S; Fedi, G; Giassi, A; Grippo, M T; Ligabue, F; Lomtadze, T; Martini, L; Messineo, A; Palla, F; Rizzi, A; Savoy-Navarro, A; Spagnolo, P; Tenchini, R; Tonelli, G; Venturi, A; Verdini, P G; Barone, L; Cavallari, F; Cipriani, M; Del Re, D; Diemoz, M; Gelli, S; Longo, E; Margaroli, F; Marzocchi, B; Meridiani, P; Organtini, G; Paramatti, R; Preiato, F; Rahatlou, S; Rovelli, C; Santanastasio, F; Amapane, N; Arcidiacono, R; Argiro, S; Arneodo, M; Bartosik, N; Bellan, R; Biino, C; Cartiglia, N; Cenna, F; Costa, M; Covarelli, R; Degano, A; Demaria, N; Finco, L; Kiani, B; Mariotti, C; Maselli, S; Migliore, E; Monaco, V; Monteil, E; Monteno, M; Obertino, M M; Pacher, L; Pastrone, N; Pelliccioni, M; Pinna Angioni, G L; Ravera, F; Romero, A; Ruspa, M; Sacchi, R; Shchelina, K; Sola, V; Solano, A; Staiano, A; Traczyk, P; Belforte, S; Casarsa, M; Cossutti, F; Della Ricca, G; Zanetti, A; Kim, D H; Kim, G N; Kim, M S; Lee, S; Lee, S W; Oh, Y D; Sekmen, S; Son, D C; Yang, Y C; Lee, A; Kim, H; Brochero Cifuentes, J A; Kim, T J; Cho, S; Choi, S; Go, Y; Gyun, D; Ha, S; Hong, B; Jo, Y; Kim, Y; Lee, K; Lee, K S; Lee, S; Lim, J; Park, S K; Roh, Y; Almond, J; Kim, J; Lee, H; Oh, S B; Radburn-Smith, B C; Seo, S H; Yang, U K; Yoo, H D; Yu, G B; Choi, M; Kim, H; Kim, J H; Lee, J S H; Park, I C; Ryu, G; Ryu, M S; Choi, Y; Goh, J; Hwang, C; Lee, J; Yu, I; Dudenas, V; Juodagalvis, A; Vaitkus, J; Ahmed, I; Ibrahim, Z A; Md Ali, M A B; Mohamad Idris, F; Wan Abdullah, W A T; Yusli, M N; Zolkapli, Z; Castilla-Valdez, H; De La Cruz-Burelo, E; Heredia-De La Cruz, I; Hernandez-Almada, A; Lopez-Fernandez, R; Magaña Villalba, R; Mejia Guisao, J; Sanchez-Hernandez, A; Carrillo Moreno, S; Oropeza Barrera, C; Vazquez Valencia, F; Carpinteyro, S; Pedraza, I; Salazar Ibarguen, H A; Uribe Estrada, C; Morelos Pineda, A; Krofcheck, D; Butler, P H; Ahmad, A; Ahmad, M; Hassan, Q; Hoorani, H R; Khan, W A; Saddique, A; Shah, M A; Shoaib, M; Waqas, M; Bialkowska, H; Bluj, M; Boimska, B; Frueboes, T; Górski, M; Kazana, M; Nawrocki, K; Romanowska-Rybinska, K; Szleper, M; Zalewski, P; Bunkowski, K; Byszuk, A; Doroba, K; Kalinowski, A; Konecki, M; Krolikowski, J; Misiura, M; Olszewski, M; Walczak, M; Bargassa, P; Beirão Da Cruz E Silva, C; Calpas, B; Di Francesco, A; Faccioli, P; Ferreira Parracho, P G; Gallinaro, M; Hollar, J; Leonardo, N; Lloret Iglesias, L; Nemallapudi, M V; Rodrigues Antunes, J; Seixas, J; Toldaiev, O; Vadruccio, D; Varela, J; Vischia, P; Afanasiev, S; Bunin, P; Gavrilenko, M; Golutvin, I; Gorbunov, I; Kamenev, A; Karjavin, V; Lanev, A; Malakhov, A; Matveev, V; Palichik, V; Perelygin, V; Shmatov, S; Shulha, S; Skatchkov, N; Smirnov, V; Voytishin, N; Zarubin, A; Chtchipounov, L; Golovtsov, V; Ivanov, Y; Kim, V; Kuznetsova, E; Murzin, V; Oreshkin, V; Sulimov, V; Vorobyev, A; Andreev, Yu; Dermenev, A; Gninenko, S; Golubev, N; Karneyeu, A; Kirsanov, M; Krasnikov, N; Pashenkov, A; Tlisov, D; Toropin, A; Epshteyn, V; Gavrilov, V; Lychkovskaya, N; Popov, V; Pozdnyakov, I; Safronov, G; Spiridonov, A; Toms, M; Vlasov, E; Zhokin, A; Aushev, T; Bylinkin, A; Chadeeva, M; Chistov, R; Polikarpov, S; Andreev, V; Azarkin, M; Dremin, I; Kirakosyan, M; Leonidov, A; Terkulov, A; Baskakov, A; Belyaev, A; Boos, E; Ershov, A; Gribushin, A; Kaminskiy, A; Kodolova, O; Korotkikh, V; Lokhtin, I; Miagkov, I; Obraztsov, S; Petrushanko, S; Savrin, V; Snigirev, A; Vardanyan, I; Blinov, V; Skovpen, Y; Shtol, D; Azhgirey, I; Bayshev, I; Bitioukov, S; Elumakhov, D; Kachanov, V; Kalinin, A; Konstantinov, D; Krychkine, V; Petrov, V; Ryutin, R; Sobol, A; Troshin, S; Tyurin, N; Uzunian, A; Volkov, A; Adzic, P; Cirkovic, P; Devetak, D; Dordevic, M; Milosevic, J; Rekovic, V; Alcaraz Maestre, J; Barrio Luna, M; Calvo, E; Cerrada, M; Chamizo Llatas, M; Colino, N; De La Cruz, B; Delgado Peris, A; Escalante Del Valle, A; Fernandez Bedoya, C; Fernández Ramos, J P; Flix, J; Fouz, M C; Garcia-Abia, P; Gonzalez Lopez, O; Goy Lopez, S; Hernandez, J M; Josa, M I; Navarro De Martino, E; Pérez-Calero Yzquierdo, A; Puerta Pelayo, J; Quintario Olmeda, A; Redondo, I; Romero, L; Soares, M S; de Trocóniz, J F; Missiroli, M; Moran, D; Cuevas, J; Fernandez Menendez, J; Gonzalez Caballero, I; González Fernández, J R; Palencia Cortezon, E; Sanchez Cruz, S; Suárez Andrés, I; Vizan Garcia, J M; Cabrillo, I J; Calderon, A; Curras, E; Fernandez, M; Garcia-Ferrero, J; Gomez, G; Lopez Virto, A; Marco, J; Martinez Rivero, C; Matorras, F; Piedra Gomez, J; Rodrigo, T; Ruiz-Jimeno, A; Scodellaro, L; Trevisani, N; Vila, I; Vilar Cortabitarte, R; Abbaneo, D; Auffray, E; Auzinger, G; Baillon, P; Ball, A H; Barney, D; Bloch, P; Bocci, A; Botta, C; Camporesi, T; Castello, R; Cepeda, M; Cerminara, G; Chen, Y; d'Enterria, D; Dabrowski, A; Daponte, V; David, A; De Gruttola, M; De Roeck, A; Di Marco, E; Dobson, M; Dorney, B; du Pree, T; Duggan, D; Dünser, M; Dupont, N; Elliott-Peisert, A; Everaerts, P; Fartoukh, S; Franzoni, G; Fulcher, J; Funk, W; Gigi, D; Gill, K; Girone, M; Glege, F; Gulhan, D; Gundacker, S; Guthoff, M; Harris, P; Hegeman, J; Innocente, V; Janot, P; Kieseler, J; Kirschenmann, H; Knünz, V; Kornmayer, A; Kortelainen, M J; Kousouris, K; Krammer, M; Lange, C; Lecoq, P; Lourenço, C; Lucchini, M T; Malgeri, L; Mannelli, M; Martelli, A; Meijers, F; Merlin, J A; Mersi, S; Meschi, E; Milenovic, P; Moortgat, F; Morovic, S; Mulders, M; Neugebauer, H; Orfanelli, S; Orsini, L; Pape, L; Perez, E; Peruzzi, M; Petrilli, A; Petrucciani, G; Pfeiffer, A; Pierini, M; Racz, A; Reis, T; Rolandi, G; Rovere, M; Sakulin, H; Sauvan, J B; Schäfer, C; Schwick, C; Seidel, M; Sharma, A; Silva, P; Sphicas, P; Steggemann, J; Stoye, M; Takahashi, Y; Tosi, M; Treille, D; Triossi, A; Tsirou, A; Veckalns, V; Veres, G I; Verweij, M; Wardle, N; Wöhri, H K; Zagozdzinska, A; Zeuner, W D; Bertl, W; Deiters, K; Erdmann, W; Horisberger, R; Ingram, Q; Kaestli, H C; Kotlinski, D; Langenegger, U; Rohe, T; Wiederkehr, S A; Bachmair, F; Bäni, L; Bianchini, L; Casal, B; Dissertori, G; Dittmar, M; Donegà, M; Grab, C; Heidegger, C; Hits, D; Hoss, J; Kasieczka, G; Lustermann, W; Mangano, B; Marionneau, M; Martinez Ruiz Del Arbol, P; Masciovecchio, M; Meinhard, M T; Meister, D; Micheli, F; Musella, P; Nessi-Tedaldi, F; Pandolfi, F; Pata, J; Pauss, F; Perrin, G; Perrozzi, L; Quittnat, M; Rossini, M; Schönenberger, M; Starodumov, A; Tavolaro, V R; Theofilatos, K; Wallny, R; Aarrestad, T K; Amsler, C; Caminada, L; Canelli, M F; De Cosa, A; Galloni, C; Hinzmann, A; Hreus, T; Kilminster, B; Ngadiuba, J; Pinna, D; Rauco, G; Robmann, P; Salerno, D; Seitz, C; Yang, Y; Zucchetta, A; Candelise, V; Doan, T H; Jain, Sh; Khurana, R; Konyushikhin, M; Kuo, C M; Lin, W; Pozdnyakov, A; Yu, S S; Kumar, Arun; Chang, P; Chang, Y H; Chao, Y; Chen, K F; Chen, P H; Fiori, F; Hou, W-S; Hsiung, Y; Liu, Y F; Lu, R-S; Miñano Moya, M; Paganis, E; Psallidas, A; Tsai, J F; Asavapibhop, B; Singh, G; Srimanobhas, N; Suwonjandee, N; Adiguzel, A; Cerci, S; Damarseckin, S; Demiroglu, Z S; Dozen, C; Dumanoglu, I; Girgis, S; Gokbulut, G; Guler, Y; Hos, I; Kangal, E E; Kara, O; Kayis Topaksu, A; Kiminsu, U; Oglakci, M; Onengut, G; Ozdemir, K; Sunar Cerci, D; Topakli, H; Turkcapar, S; Zorbakir, I S; Zorbilmez, C; Bilin, B; Bilmis, S; Isildak, B; Karapinar, G; Yalvac, M; Zeyrek, M; Gülmez, E; Kaya, M; Kaya, O; Yetkin, E A; Yetkin, T; Cakir, A; Cankocak, K; Sen, S; Grynyov, B; Levchuk, L; Sorokin, P; Aggleton, R; Ball, F; Beck, L; Brooke, J J; Burns, D; Clement, E; Cussans, D; Flacher, H; Goldstein, J; Grimes, M; Heath, G P; Heath, H F; Jacob, J; Kreczko, L; Lucas, C; Newbold, D M; Paramesvaran, S; Poll, A; Sakuma, T; Seif El Nasr-Storey, S; Smith, D; Smith, V J; Belyaev, A; Brew, C; Brown, R M; Calligaris, L; Cieri, D; Cockerill, D J A; Coughlan, J A; Harder, K; Harper, S; Olaiya, E; Petyt, D; Shepherd-Themistocleous, C H; Thea, A; Tomalin, I R; Williams, T; Baber, M; Bainbridge, R; Buchmuller, O; Bundock, A; Burton, D; Casasso, S; Citron, M; Colling, D; Corpe, L; Dauncey, P; Davies, G; De Wit, A; Della Negra, M; Di Maria, R; Dunne, P; Elwood, A; Futyan, D; Haddad, Y; Hall, G; Iles, G; James, T; Lane, R; Laner, C; Lucas, R; Lyons, L; Magnan, A-M; Malik, S; Mastrolorenzo, L; Nash, J; Nikitenko, A; Pela, J; Penning, B; Pesaresi, M; Raymond, D M; Richards, A; Rose, A; Scott, E; Seez, C; Summers, S; Tapper, A; Uchida, K; Vazquez Acosta, M; Virdee, T; Wright, J; Zenz, S C; Cole, J E; Hobson, P R; Khan, A; Kyberd, P; Reid, I D; Symonds, P; Teodorescu, L; Turner, M; Borzou, A; Call, K; Dittmann, J; Hatakeyama, K; Liu, H; Pastika, N; Bartek, R; Dominguez, A; Buccilli, A; Cooper, S I; Henderson, C; Rumerio, P; West, C; Arcaro, D; Avetisyan, A; Bose, T; Gastler, D; Rankin, D; Richardson, C; Rohlf, J; Sulak, L; Zou, D; Benelli, G; Cutts, D; Garabedian, A; Hakala, J; Heintz, U; Hogan, J M; Jesus, O; Kwok, K H M; Laird, E; Landsberg, G; Mao, Z; Narain, M; Piperov, S; Sagir, S; Spencer, E; Syarif, R; Breedon, R; Burns, D; Calderon De La Barca Sanchez, M; Chauhan, S; Chertok, M; Conway, J; Conway, R; Cox, P T; Erbacher, R; Flores, C; Funk, G; Gardner, M; Ko, W; Lander, R; Mclean, C; Mulhearn, M; Pellett, D; Pilot, J; Shalhout, S; Shi, M; Smith, J; Squires, M; Stolp, D; Tos, K; Tripathi, M; Bachtis, M; Bravo, C; Cousins, R; Dasgupta, A; Florent, A; Hauser, J; Ignatenko, M; Mccoll, N; Saltzberg, D; Schnaible, C; Valuev, V; Weber, M; Bouvier, E; Burt, K; Clare, R; Ellison, J; Gary, J W; Ghiasi Shirazi, S M A; Hanson, G; Heilman, J; Jandir, P; Kennedy, E; Lacroix, F; Long, O R; Olmedo Negrete, M; Paneva, M I; Shrinivas, A; Si, W; Wei, H; Wimpenny, S; Yates, B R; Branson, J G; Cerati, G B; Cittolin, S; Derdzinski, M; Gerosa, R; Holzner, A; Klein, D; Krutelyov, V; Letts, J; Macneill, I; Olivito, D; Padhi, S; Pieri, M; Sani, M; Sharma, V; Simon, S; Tadel, M; Vartak, A; Wasserbaech, S; Welke, C; Wood, J; Würthwein, F; Yagil, A; Zevi Della Porta, G; Amin, N; Bhandari, R; Bradmiller-Feld, J; Campagnari, C; Dishaw, A; Dutta, V; Franco Sevilla, M; George, C; Golf, F; Gouskos, L; Gran, J; Heller, R; Incandela, J; Mullin, S D; Ovcharova, A; Qu, H; Richman, J; Stuart, D; Suarez, I; Yoo, J; Anderson, D; Bendavid, J; Bornheim, A; Bunn, J; Duarte, J; Lawhorn, J M; Mott, A; Newman, H B; Pena, C; Spiropulu, M; Vlimant, J R; Xie, S; Zhu, R Y; Andrews, M B; Ferguson, T; Paulini, M; Russ, J; Sun, M; Vogel, H; Vorobiev, I; Weinberg, M; Cumalat, J P; Ford, W T; Jensen, F; Johnson, A; Krohn, M; Leontsinis, S; Mulholland, T; Stenson, K; Wagner, S R; Alexander, J; Chaves, J; Chu, J; Dittmer, S; Mcdermott, K; Mirman, N; Nicolas Kaufman, G; Patterson, J R; Rinkevicius, A; Ryd, A; Skinnari, L; Soffi, L; Tan, S M; Tao, Z; Thom, J; Tucker, J; Wittich, P; Zientek, M; Winn, D; Abdullin, S; Albrow, M; Apollinari, G; Apresyan, A; Banerjee, S; Bauerdick, L A T; Beretvas, A; Berryhill, J; Bhat, P C; Bolla, G; Burkett, K; Butler, J N; Cheung, H W K; Chlebana, F; Cihangir, S; Cremonesi, M; Elvira, V D; Fisk, I; Freeman, J; Gottschalk, E; Gray, L; Green, D; Grünendahl, S; Gutsche, O; Hare, D; Harris, R M; Hasegawa, S; Hirschauer, J; Hu, Z; Jayatilaka, B; Jindariani, S; Johnson, M; Joshi, U; Klima, B; Kreis, B; Lammel, S; Linacre, J; Lincoln, D; Lipton, R; Liu, M; Liu, T; Lopes De Sá, R; Lykken, J; Maeshima, K; Magini, N; Marraffino, J M; Maruyama, S; Mason, D; McBride, P; Merkel, P; Mrenna, S; Nahn, S; O'Dell, V; Pedro, K; Prokofyev, O; Rakness, G; Ristori, L; Sexton-Kennedy, E; Soha, A; Spalding, W J; Spiegel, L; Stoynev, S; Strait, J; Strobbe, N; Taylor, L; Tkaczyk, S; Tran, N V; Uplegger, L; Vaandering, E W; Vernieri, C; Verzocchi, M; Vidal, R; Wang, M; Weber, H A; Whitbeck, A; Wu, Y; Acosta, D; Avery, P; Bortignon, P; Bourilkov, D; Brinkerhoff, A; Carnes, A; Carver, M; Curry, D; Das, S; Field, R D; Furic, I K; Konigsberg, J; Korytov, A; Low, J F; Ma, P; Matchev, K; Mei, H; Mitselmakher, G; Rank, D; Shchutska, L; Sperka, D; Thomas, L; Wang, J; Wang, S; Yelton, J; Linn, S; Markowitz, P; Martinez, G; Rodriguez, J L; Ackert, A; Adams, T; Askew, A; Bein, S; Hagopian, S; Hagopian, V; Johnson, K F; Prosper, H; Santra, A; Yohay, R; Baarmand, M M; Bhopatkar, V; Colafranceschi, S; Hohlmann, M; Noonan, D; Roy, T; Yumiceva, F; Adams, M R; Apanasevich, L; Berry, D; Betts, R R; Bucinskaite, I; Cavanaugh, R; Evdokimov, O; Gauthier, L; Gerber, C E; Hofman, D J; Jung, K; Sandoval Gonzalez, I D; Varelas, N; Wang, H; Wu, Z; Zakaria, M; Zhang, J; Bilki, B; Clarida, W; Dilsiz, K; Durgut, S; Gandrajula, R P; Haytmyradov, M; Khristenko, V; Merlo, J-P; Mermerkaya, H; Mestvirishvili, A; Moeller, A; Nachtman, J; Ogul, H; Onel, Y; Ozok, F; Penzo, A; Snyder, C; Tiras, E; Wetzel, J; Yi, K; Anderson, I; Blumenfeld, B; Cocoros, A; Eminizer, N; Fehling, D; Feng, L; Gritsan, A V; Maksimovic, P; Roskes, J; Sarica, U; Swartz, M; Xiao, M; Xin, Y; You, C; Al-Bataineh, A; Baringer, P; Bean, A; Boren, S; Bowen, J; Castle, J; Forthomme, L; Kenny, R P; Khalil, S; Kropivnitskaya, A; Majumder, D; Mcbrayer, W; Murray, M; Sanders, S; Stringer, R; Tapia Takaki, J D; Wang, Q; Ivanov, A; Kaadze, K; Maravin, Y; Mohammadi, A; Saini, L K; Skhirtladze, N; Toda, S; Rebassoo, F; Wright, D; Anelli, C; Baden, A; Baron, O; Belloni, A; Calvert, B; Eno, S C; Ferraioli, C; Gomez, J A; Hadley, N J; Jabeen, S; Jeng, G Y; Kellogg, R G; Kolberg, T; Kunkle, J; Mignerey, A C; Ricci-Tam, F; Shin, Y H; Skuja, A; Tonjes, M B; Tonwar, S C; Abercrombie, D; Allen, B; Apyan, A; Azzolini, V; Barbieri, R; Baty, A; Bi, R; Bierwagen, K; Brandt, S; Busza, W; Cali, I A; D'Alfonso, M; Demiragli, Z; Di Matteo, L; Gomez Ceballos, G; Goncharov, M; Hsu, D; Iiyama, Y; Innocenti, G M; Klute, M; Kovalskyi, D; Krajczar, K; Lai, Y S; Lee, Y-J; Levin, A; Luckey, P D; Maier, B; Marini, A C; Mcginn, C; Mironov, C; Narayanan, S; Niu, X; Paus, C; Roland, C; Roland, G; Salfeld-Nebgen, J; Stephans, G S F; Tatar, K; Varma, M; Velicanu, D; Veverka, J; Wang, J; Wang, T W; Wyslouch, B; Yang, M; Benvenuti, A C; Chatterjee, R M; Evans, A; Hansen, P; Kalafut, S; Kao, S C; Kubota, Y; Lesko, Z; Mans, J; Nourbakhsh, S; Ruckstuhl, N; Rusack, R; Tambe, N; Turkewitz, J; Acosta, J G; Oliveros, S; Avdeeva, E; Bloom, K; Claes, D R; Fangmeier, C; Gonzalez Suarez, R; Kamalieddin, R; Kravchenko, I; Malta Rodrigues, A; Monroy, J; Siado, J E; Snow, G R; Stieger, B; Alyari, M; Dolen, J; Godshalk, A; Harrington, C; Iashvili, I; Kaisen, J; Nguyen, D; Parker, A; Rappoccio, S; Roozbahani, B; Alverson, G; Barberis, E; Hortiangtham, A; Massironi, A; Morse, D M; Nash, D; Orimoto, T; Teixeira De Lima, R; Trocino, D; Wang, R-J; Wood, D; Bhattacharya, S; Charaf, O; Hahn, K A; Kumar, A; Mucia, N; Odell, N; Pollack, B; Schmitt, M H; Sung, K; Trovato, M; Velasco, M; Dev, N; Hildreth, M; Hurtado Anampa, K; Jessop, C; Karmgard, D J; Kellams, N; Lannon, K; Marinelli, N; Meng, F; Mueller, C; Musienko, Y; Planer, M; Reinsvold, A; Ruchti, R; Rupprecht, N; Smith, G; Taroni, S; Wayne, M; Wolf, M; Woodard, A; Alimena, J; Antonelli, L; Bylsma, B; Durkin, L S; Flowers, S; Francis, B; Hart, A; Hill, C; Hughes, R; Ji, W; Liu, B; Luo, W; Puigh, D; Winer, B L; Wulsin, H W; Cooperstein, S; Driga, O; Elmer, P; Hardenbrook, J; Hebda, P; Lange, D; Luo, J; Marlow, D; Medvedeva, T; Mei, K; Ojalvo, I; Olsen, J; Palmer, C; Piroué, P; Stickland, D; Svyatkovskiy, A; Tully, C; Malik, S; Barker, A; Barnes, V E; Folgueras, S; Gutay, L; Jha, M K; Jones, M; Jung, A W; Khatiwada, A; Miller, D H; Neumeister, N; Schulte, J F; Shi, X; Sun, J; Wang, F; Xie, W; Parashar, N; Stupak, J; Adair, A; Akgun, B; Chen, Z; Ecklund, K M; Geurts, F J M; Guilbaud, M; Li, W; Michlin, B; Northup, M; Padley, B P; Roberts, J; Rorie, J; Tu, Z; Zabel, J; Betchart, B; Bodek, A; de Barbaro, P; Demina, R; Duh, Y T; Ferbel, T; Galanti, M; Garcia-Bellido, A; Han, J; Hindrichs, O; Khukhunaishvili, A; Lo, K H; Tan, P; Verzetti, M; Agapitos, A; Chou, J P; Gershtein, Y; Gómez Espinosa, T A; Halkiadakis, E; Heindl, M; Hughes, E; Kaplan, S; Kunnawalkam Elayavalli, R; Kyriacou, S; Lath, A; Nash, K; Osherson, M; Saka, H; Salur, S; Schnetzer, S; Sheffield, D; Somalwar, S; Stone, R; Thomas, S; Thomassen, P; Walker, M; Delannoy, A G; Foerster, M; Heideman, J; Riley, G; Rose, K; Spanier, S; Thapa, K; Bouhali, O; Celik, A; Dalchenko, M; De Mattia, M; Delgado, A; Dildick, S; Eusebi, R; Gilmore, J; Huang, T; Juska, E; Kamon, T; Mueller, R; Pakhotin, Y; Patel, R; Perloff, A; Perniè, L; Rathjens, D; Safonov, A; Tatarinov, A; Ulmer, K A; Akchurin, N; Cowden, C; Damgov, J; De Guio, F; Dragoiu, C; Dudero, P R; Faulkner, J; Gurpinar, E; Kunori, S; Lamichhane, K; Lee, S W; Libeiro, T; Peltola, T; Undleeb, S; Volobouev, I; Wang, Z; Greene, S; Gurrola, A; Janjam, R; Johns, W; Maguire, C; Melo, A; Ni, H; Sheldon, P; Tuo, S; Velkovska, J; Xu, Q; Arenton, M W; Barria, P; Cox, B; Goodell, J; Hirosky, R; Ledovskoy, A; Li, H; Neu, C; Sinthuprasith, T; Sun, X; Wang, Y; Wolfe, E; Xia, F; Clarke, C; Harr, R; Karchin, P E; Sturdy, J; Belknap, D A; Buchanan, J; Caillol, C; Dasu, S; Dodd, L; Duric, S; Gomber, B; Grothe, M; Herndon, M; Hervé, A; Klabbers, P; Lanaro, A; Levine, A; Long, K; Loveless, R; Perry, T; Pierro, G A; Polese, G; Ruggles, T; Savin, A; Smith, N; Smith, W H; Taylor, D; Woods, N

    2017-01-01

    This paper reports the measurement of [Formula: see text] meson production in proton-proton ([Formula: see text]) and proton-lead ([Formula: see text]) collisions at a center-of-mass energy per nucleon pair of [Formula: see text] by the CMS experiment at the LHC. The data samples used in the analysis correspond to integrated luminosities of 28[Formula: see text] and 35[Formula: see text] for [Formula: see text] and [Formula: see text] collisions, respectively. Prompt and nonprompt [Formula: see text] mesons, the latter produced in the decay of [Formula: see text] hadrons, are measured in their dimuon decay channels. Differential cross sections are measured in the transverse momentum range of [Formula: see text], and center-of-mass rapidity ranges of [Formula: see text] ([Formula: see text]) and [Formula: see text] ([Formula: see text]). The nuclear modification factor, [Formula: see text], is measured as a function of both [Formula: see text] and [Formula: see text]. Small modifications to the [Formula: see text] cross sections are observed in [Formula: see text] relative to [Formula: see text] collisions. The ratio of [Formula: see text] production cross sections in [Formula: see text]-going and Pb-going directions, [Formula: see text], studied as functions of [Formula: see text] and [Formula: see text], shows a significant decrease for increasing transverse energy deposited at large pseudorapidities. These results, which cover a wide kinematic range, provide new insight on the role of cold nuclear matter effects on prompt and nonprompt [Formula: see text] production.

  2. Chemical-text hybrid search engines.

    Science.gov (United States)

    Zhou, Yingyao; Zhou, Bin; Jiang, Shumei; King, Frederick J

    2010-01-01

    As the amount of chemical literature increases, it is critical that researchers be enabled to accurately locate documents related to a particular aspect of a given compound. Existing solutions, based on text and chemical search engines alone, suffer from the inclusion of "false negative" and "false positive" results, and cannot accommodate diverse repertoire of formats currently available for chemical documents. To address these concerns, we developed an approach called Entity-Canonical Keyword Indexing (ECKI), which converts a chemical entity embedded in a data source into its canonical keyword representation prior to being indexed by text search engines. We implemented ECKI using Microsoft Office SharePoint Server Search, and the resultant hybrid search engine not only supported complex mixed chemical and keyword queries but also was applied to both intranet and Internet environments. We envision that the adoption of ECKI will empower researchers to pose more complex search questions that were not readily attainable previously and to obtain answers at much improved speed and accuracy.

  3. Lexical Density Of English Reading Texts For Senior High School

    OpenAIRE

    Nesia, Bersyebah Herljimsi; Ginting, Siti Aisah

    2014-01-01

    This study deals with the lexical density especially the lexical items of English reading texts in the textbook for senior high school. The objectives of the study are to find out the lexical density especially the lexical items which formed in the reading texts of Look Ahead textbook and the type of genre which has the highest lexical density of the reading texts. This study was conducted by descriptive method with qualitative approach. The data of this research were the English reading text...

  4. Integrated Clustering and Feature Selection Scheme for Text Documents.

    OpenAIRE

    M. Thangamani; P. Thangaraj

    2010-01-01

    Problem statement: Text documents are the unstructured databases that contain raw data collection. The clustering techniques are used group up the text documents with reference to its similarity. Approach: The feature selection techniques were used to improve the efficiency and accuracy of clustering process. The feature selection was done by eliminate the redundant and irrelevant items from the text document contents. Statistical methods were used in the text clustering and feature selection...

  5. Text summarization as a decision support aid

    Directory of Open Access Journals (Sweden)

    Workman T

    2012-05-01

    Full Text Available Abstract Background PubMed data potentially can provide decision support information, but PubMed was not exclusively designed to be a point-of-care tool. Natural language processing applications that summarize PubMed citations hold promise for extracting decision support information. The objective of this study was to evaluate the efficiency of a text summarization application called Semantic MEDLINE, enhanced with a novel dynamic summarization method, in identifying decision support data. Methods We downloaded PubMed citations addressing the prevention and drug treatment of four disease topics. We then processed the citations with Semantic MEDLINE, enhanced with the dynamic summarization method. We also processed the citations with a conventional summarization method, as well as with a baseline procedure. We evaluated the results using clinician-vetted reference standards built from recommendations in a commercial decision support product, DynaMed. Results For the drug treatment data, Semantic MEDLINE enhanced with dynamic summarization achieved average recall and precision scores of 0.848 and 0.377, while conventional summarization produced 0.583 average recall and 0.712 average precision, and the baseline method yielded average recall and precision values of 0.252 and 0.277. For the prevention data, Semantic MEDLINE enhanced with dynamic summarization achieved average recall and precision scores of 0.655 and 0.329. The baseline technique resulted in recall and precision scores of 0.269 and 0.247. No conventional Semantic MEDLINE method accommodating summarization for prevention exists. Conclusion Semantic MEDLINE with dynamic summarization outperformed conventional summarization in terms of recall, and outperformed the baseline method in both recall and precision. This new approach to text summarization demonstrates potential in identifying decision support data for multiple needs.

  6. Doing Mathematics with Purpose: Mathematical Text Types

    Science.gov (United States)

    Dostal, Hannah M.; Robinson, Richard

    2018-01-01

    Mathematical literacy includes learning to read and write different types of mathematical texts as part of purposeful mathematical meaning making. Thus in this article, we describe how learning to read and write mathematical texts (proof text, algorithmic text, algebraic/symbolic text, and visual text) supports the development of students'…

  7. The Holy Text and Violence : Levinas and Fundamentalism

    NARCIS (Netherlands)

    Poorthuis, Marcel; Breitlin, Andris; Bremmers, Chris; Cools, Arthur

    2015-01-01

    Levinas'rejection of a historical ciritcal approach to sacred texts as well as his depreciation of Spinoza's view of the Bible might bring him close to fundamentalism. A thorough analysis is necessary to demonstrate essential differences. Levinas'rejection of a historical ciritcal approach to sacred

  8. Text summarization as a decision support aid.

    Science.gov (United States)

    Workman, T Elizabeth; Fiszman, Marcelo; Hurdle, John F

    2012-05-23

    PubMed data potentially can provide decision support information, but PubMed was not exclusively designed to be a point-of-care tool. Natural language processing applications that summarize PubMed citations hold promise for extracting decision support information. The objective of this study was to evaluate the efficiency of a text summarization application called Semantic MEDLINE, enhanced with a novel dynamic summarization method, in identifying decision support data. We downloaded PubMed citations addressing the prevention and drug treatment of four disease topics. We then processed the citations with Semantic MEDLINE, enhanced with the dynamic summarization method. We also processed the citations with a conventional summarization method, as well as with a baseline procedure. We evaluated the results using clinician-vetted reference standards built from recommendations in a commercial decision support product, DynaMed. For the drug treatment data, Semantic MEDLINE enhanced with dynamic summarization achieved average recall and precision scores of 0.848 and 0.377, while conventional summarization produced 0.583 average recall and 0.712 average precision, and the baseline method yielded average recall and precision values of 0.252 and 0.277. For the prevention data, Semantic MEDLINE enhanced with dynamic summarization achieved average recall and precision scores of 0.655 and 0.329. The baseline technique resulted in recall and precision scores of 0.269 and 0.247. No conventional Semantic MEDLINE method accommodating summarization for prevention exists. Semantic MEDLINE with dynamic summarization outperformed conventional summarization in terms of recall, and outperformed the baseline method in both recall and precision. This new approach to text summarization demonstrates potential in identifying decision support data for multiple needs.

  9. Conversation Analysis and Orality in Written Texts

    Directory of Open Access Journals (Sweden)

    Luiz Antônio da Silva

    2015-02-01

    Full Text Available Marcuschi (1977 points out that orality is an important topic to be developed in the classroom. Lamentably, however, it has been left aside, because teachers and those responsible for education do not consider it as an important feature to be emphasized in the mother tongue teaching. The main reason is the focus given to the language teaching in Brazilian schools: the school is supposed to teach writing, and how to write well. Despite the advances of Linguistic studies on speaking and writing; despite the contributions of Sociolinguistics and Conversation Analysis; and despite the overcoming of prejudices, especially on the strict distinction between the two modes, there is still a long way to go. Thus, it is beneficial to bring up a discussion on speaking and writing. After several years of Marcuschi´s findings (1977, textbook authors, teachers, researchers and those responsible for the Portuguese language teaching have another theoretical approach. Nonetheless, in practice, there is still a lot to be accomplished since writing continues to be the focus of the Portuguese language teaching in Brazilian schools. It seems that most of the teachers know the theory, but they experience difficulties when it comes to the practices of everyday school life. This paper aims to analyze oral marks or effects of orality in written literary texts, more precisely in dialogues produced. These analyzes will aid us in giving subsidies to a Portuguese teacher, so that he/she can work consistently and productively. To illustrate our observations, we have chosen fragments of chronicles written by Brazilian writer Luís Fernando Verissimo, published in three of his works: Comédias para se ler na escola, Sexo na cabeça e Amor Veríssimo.

  10. Changes are Afoot in Physics Introductory Texts of Today

    Science.gov (United States)

    Khoon, Koh Aik; Jalal, Azman; Daud, Abdul Razak; Abd-Shukor, Roslan; Samat, Supian; Talib, Ibrahim Abu; Othman, Mazlan; Yatim, Baharudin

    2008-01-01

    Among the many changes that have taken place in physics education in recent years is the fact that physics introductory texts have undergone some drastic changes in layout, content, approach and presentation. It is a total breath of fresh air compared with the drab physics texts of yesteryear. This paper takes a closer look on the changes that…

  11. MANAGING THE TRANSLATION OF ECONOMIC TEXTS

    Directory of Open Access Journals (Sweden)

    Pop Anamaria Mirabela

    2012-12-01

    Full Text Available Theoretically, translation may pass as science; practically, it seems closer to art. Translation is a challenging activity requiring a set of abilities and posing few difficulties that appear during the translation process. This paper investigates the extent to which sub-technical vocabulary can constitute a problem to Romanian students of economics reading in English, by looking at the translations produced as independent or pair work during English classes and analyzing the various errors which may appeared. The exigencies required by the efficient business communication have increased in the past few decades because of rising international trade, increased migration, globalization, the recognition of linguistic minorities, and the expansion of the mass media and technology. All these led us to approach the topic of translation which is actually a job that requires skills, stages of research necessary for disclosure of transfer characteristic into the target language, training, experience and a good sense of languages. The paper defines the theoretical issues and terminology: translation, types of translation, economic texts and then focuses on the presentation of the practical work carried out throughout the academic year of second year students. Considering that only 28% of the entire European population can read English, and even less people in South America and Asia can, it is obvious that an effective communication of business matters relies on an accurate understanding of terminology. Economics is a field of knowledge in accelerated scientific and technological development. As there is a permanent and ever increasing need to quickly update their knowledge, economists read and learn directly in the original language of the publication and stick to it in daily usage, including conferences, scientific events and articles written in Romanian. Besides researching properly the markets, finding distribution channels, and dealing with legal

  12. Measurement of [Formula: see text] polarisation in [Formula: see text] collisions at [Formula: see text] = 7 TeV.

    Science.gov (United States)

    Aaij, R; Adeva, B; Adinolfi, M; Affolder, A; Ajaltouni, Z; Albrecht, J; Alessio, F; Alexander, M; Ali, S; Alkhazov, G; Alvarez Cartelle, P; Alves, A A; Amato, S; Amerio, S; Amhis, Y; An, L; Anderlini, L; Anderson, J; Andreassen, R; Andreotti, M; Andrews, J E; Appleby, R B; Aquines Gutierrez, O; Archilli, F; Artamonov, A; Artuso, M; Aslanides, E; Auriemma, G; Baalouch, M; Bachmann, S; Back, J J; Badalov, A; Balagura, V; Baldini, W; Barlow, R J; Barschel, C; Barsuk, S; Barter, W; Batozskaya, V; Bauer, Th; Bay, A; Beddow, J; Bedeschi, F; Bediaga, I; Belogurov, S; Belous, K; Belyaev, I; Ben-Haim, E; Bencivenni, G; Benson, S; Benton, J; Berezhnoy, A; Bernet, R; Bettler, M-O; van Beuzekom, M; Bien, A; Bifani, S; Bird, T; Bizzeti, A; Bjørnstad, P M; Blake, T; Blanc, F; Blouw, J; Blusk, S; Bocci, V; Bondar, A; Bondar, N; Bonivento, W; Borghi, S; Borgia, A; Borsato, M; Bowcock, T J V; Bowen, E; Bozzi, C; Brambach, T; van den Brand, J; Bressieux, J; Brett, D; Britsch, M; Britton, T; Brook, N H; Brown, H; Bursche, A; Busetto, G; Buytaert, J; Cadeddu, S; Calabrese, R; Callot, O; Calvi, M; Calvo Gomez, M; Camboni, A; Campana, P; Campora Perez, D; Carbone, A; Carboni, G; Cardinale, R; Cardini, A; Carranza-Mejia, H; Carson, L; Carvalho Akiba, K; Casse, G; Cassina, L; Castillo Garcia, L; Cattaneo, M; Cauet, Ch; Cenci, R; Charles, M; Charpentier, Ph; Cheung, S-F; Chiapolini, N; Chrzaszcz, M; Ciba, K; Cid Vidal, X; Ciezarek, G; Clarke, P E L; Clemencic, M; Cliff, H V; Closier, J; Coca, C; Coco, V; Cogan, J; Cogneras, E; Collins, P; Comerma-Montells, A; Contu, A; Cook, A; Coombes, M; Coquereau, S; Corti, G; Corvo, M; Counts, I; Couturier, B; Cowan, G A; Craik, D C; Cruz Torres, M; Cunliffe, S; Currie, R; D'Ambrosio, C; Dalseno, J; David, P; David, P N Y; Davis, A; De Bruyn, K; De Capua, S; De Cian, M; De Miranda, J M; De Paula, L; De Silva, W; De Simone, P; Decamp, D; Deckenhoff, M; Del Buono, L; Déléage, N; Derkach, D; Deschamps, O; Dettori, F; Di Canto, A; Dijkstra, H; Donleavy, S; Dordei, F; Dorigo, M; Dosil Suárez, A; Dossett, D; Dovbnya, A; Dupertuis, F; Durante, P; Dzhelyadin, R; Dziurda, A; Dzyuba, A; Easo, S; Egede, U; Egorychev, V; Eidelman, S; Eisenhardt, S; Eitschberger, U; Ekelhof, R; Eklund, L; El Rifai, I; Elsasser, Ch; Esen, S; Evans, T; Falabella, A; Färber, C; Farinelli, C; Farry, S; Ferguson, D; Fernandez Albor, V; Ferreira Rodrigues, F; Ferro-Luzzi, M; Filippov, S; Fiore, M; Fiorini, M; Firlej, M; Fitzpatrick, C; Fiutowski, T; Fontana, M; Fontanelli, F; Forty, R; Francisco, O; Frank, M; Frei, C; Frosini, M; Fu, J; Furfaro, E; Gallas Torreira, A; Galli, D; Gandelman, M; Gandini, P; Gao, Y; Garofoli, J; Garra Tico, J; Garrido, L; Gaspar, C; Gauld, R; Gavardi, L; Gersabeck, E; Gersabeck, M; Gershon, T; Ghez, Ph; Gianelle, A; Giani, S; Gibson, V; Giubega, L; Gligorov, V V; Göbel, C; Golubkov, D; Golutvin, A; Gomes, A; Gordon, H; Gotti, C; Grabalosa Gándara, M; Graciani Diaz, R; Granado Cardoso, L A; Graugés, E; Graziani, G; Grecu, A; Greening, E; Gregson, S; Griffith, P; Grillo, L; Grünberg, O; Gui, B; Gushchin, E; Guz, Yu; Gys, T; Hadjivasiliou, C; Haefeli, G; Haen, C; Haines, S C; Hall, S; Hamilton, B; Hampson, T; Han, X; Hansmann-Menzemer, S; Harnew, N; Harnew, S T; Harrison, J; Hartmann, T; He, J; Head, T; Heijne, V; Hennessy, K; Henrard, P; Henry, L; Hernando Morata, J A; van Herwijnen, E; Heß, M; Hicheur, A; Hill, D; Hoballah, M; Hombach, C; Hulsbergen, W; Hunt, P; Hussain, N; Hutchcroft, D; Hynds, D; Iakovenko, V; Idzik, M; Ilten, P; Jacobsson, R; Jaeger, A; Jalocha, J; Jans, E; Jaton, P; Jawahery, A; Jezabek, M; Jing, F; John, M; Johnson, D; Jones, C R; Joram, C; Jost, B; Jurik, N; Kaballo, M; Kandybei, S; Kanso, W; Karacson, M; Karbach, T M; Kelsey, M; Kenyon, I R; Ketel, T; Khanji, B; Khurewathanakul, C; Klaver, S; Kochebina, O; Kolpin, M; Komarov, I; Koopman, R F; Koppenburg, P; Korolev, M; Kozlinskiy, A; Kravchuk, L; Kreplin, K; Kreps, M; Krocker, G; Krokovny, P; Kruse, F; Kucharczyk, M; Kudryavtsev, V; Kurek, K; Kvaratskheliya, T; La Thi, V N; Lacarrere, D; Lafferty, G; Lai, A; Lambert, D; Lambert, R W; Lanciotti, E; Lanfranchi, G; Langenbruch, C; Latham, T; Lazzeroni, C; Le Gac, R; van Leerdam, J; Lees, J-P; Lefèvre, R; Leflat, A; Lefrançois, J; Leo, S; Leroy, O; Lesiak, T; Leverington, B; Li, Y; Liles, M; Lindner, R; Linn, C; Lionetto, F; Liu, B; Liu, G; Lohn, S; Longstaff, I; Longstaff, I; Lopes, J H; Lopez-March, N; Lowdon, P; Lu, H; Lucchesi, D; Luisier, J; Luo, H; Lupato, A; Luppi, E; Lupton, O; Machefert, F; Machikhiliyan, I V; Maciuc, F; Maev, O; Malde, S; Manca, G; Mancinelli, G; Manzali, M; Maratas, J; Marchand, J F; Marconi, U; Marino, P; Märki, R; Marks, J; Martellotti, G; Martens, A; Martín Sánchez, A; Martinelli, M; Martinez Santos, D; Martinez Vidal, F; Martins Tostes, D; Massafferri, A; Matev, R; Mathe, Z; Matteuzzi, C; Mazurov, A; McCann, M; McCarthy, J; McNab, A; McNulty, R; McSkelly, B; Meadows, B; Meier, F; Meissner, M; Merk, M; Milanes, D A; Minard, M-N; Molina Rodriguez, J; Monteil, S; Moran, D; Morandin, M; Morawski, P; Mordà, A; Morello, M J; Moron, J; Mountain, R; Muheim, F; Müller, K; Muresan, R; Muster, B; Naik, P; Nakada, T; Nandakumar, R; Nasteva, I; Needham, M; Neri, N; Neubert, S; Neufeld, N; Neuner, M; Nguyen, A D; Nguyen, T D; Nguyen-Mau, C; Nicol, M; Niess, V; Niet, R; Nikitin, N; Nikodem, T; Novoselov, A; Oblakowska-Mucha, A; Obraztsov, V; Oggero, S; Ogilvy, S; Okhrimenko, O; Oldeman, R; Onderwater, G; Orlandea, M; Otalora Goicochea, J M; Owen, P; Oyanguren, A; Pal, B K; Palano, A; Palombo, F; Palutan, M; Panman, J; Papanestis, A; Pappagallo, M; Parkes, C; Parkinson, C J; Passaleva, G; Patel, G D; Patel, M; Patrignani, C; Pazos Alvarez, A; Pearce, A; Pellegrino, A; Penso, G; Pepe Altarelli, M; Perazzini, S; Perez Trigo, E; Perret, P; Perrin-Terrin, M; Pescatore, L; Pesen, E; Petridis, K; Petrolini, A; Picatoste Olloqui, E; Pietrzyk, B; Pilař, T; Pinci, D; Pistone, A; Playfer, S; Plo Casasus, M; Polci, F; Polok, G; Poluektov, A; Polycarpo, E; Popov, A; Popov, D; Popovici, B; Potterat, C; Powell, A; Prisciandaro, J; Pritchard, A; Prouve, C; Pugatch, V; Puig Navarro, A; Punzi, G; Qian, W; Rachwal, B; Rademacker, J H; Rakotomiaramanana, B; Rama, M; Rangel, M S; Raniuk, I; Rauschmayr, N; Raven, G; Redford, S; Reichert, S; Reid, M M; Dos Reis, A C; Ricciardi, S; Richards, A; Rinnert, K; Rives Molina, V; Roa Romero, D A; Robbe, P; Rodrigues, A B; Rodrigues, E; Rodriguez Perez, P; Roiser, S; Romanovsky, V; Romero Vidal, A; Rotondo, M; Rouvinet, J; Ruf, T; Ruffini, F; Ruiz, H; Ruiz Valls, P; Sabatino, G; Saborido Silva, J J; Sagidova, N; Sail, P; Saitta, B; Salustino Guimaraes, V; Sanchez Mayordomo, C; Sanmartin Sedes, B; Santacesaria, R; Santamarina Rios, C; Santovetti, E; Sapunov, M; Sarti, A; Satriano, C; Satta, A; Savrie, M; Savrina, D; Schiller, M; Schindler, H; Schlupp, M; Schmelling, M; Schmidt, B; Schneider, O; Schopper, A; Schune, M-H; Schwemmer, R; Sciascia, B; Sciubba, A; Seco, M; Semennikov, A; Senderowska, K; Sepp, I; Serra, N; Serrano, J; Sestini, L; Seyfert, P; Shapkin, M; Shapoval, I; Shcheglov, Y; Shears, T; Shekhtman, L; Shevchenko, V; Shires, A; Silva Coutinho, R; Simi, G; Sirendi, M; Skidmore, N; Skwarnicki, T; Smith, N A; Smith, E; Smith, E; Smith, J; Smith, M; Snoek, H; Sokoloff, M D; Soler, F J P; Soomro, F; Souza, D; Souza De Paula, B; Spaan, B; Sparkes, A; Spinella, F; Spradlin, P; Stagni, F; Stahl, S; Steinkamp, O; Stenyakin, O; Stevenson, S; Stoica, S; Stone, S; Storaci, B; Stracka, S; Straticiuc, M; Straumann, U; Stroili, R; Subbiah, V K; Sun, L; Sutcliffe, W; Swientek, K; Swientek, S; Syropoulos, V; Szczekowski, M; Szczypka, P; Szilard, D; Szumlak, T; T'Jampens, S; Teklishyn, M; Tellarini, G; Teodorescu, E; Teubert, F; Thomas, C; Thomas, E; van Tilburg, J; Tisserand, V; Tobin, M; Tolk, S; Tomassetti, L; Tonelli, D; Topp-Joergensen, S; Torr, N; Tournefier, E; Tourneur, S; Tran, M T; Tresch, M; Tsaregorodtsev, A; Tsopelas, P; Tuning, N; Ubeda Garcia, M; Ukleja, A; Ustyuzhanin, A; Uwer, U; Vagnoni, V; Valenti, G; Vallier, A; Vazquez Gomez, R; Vazquez Regueiro, P; Vázquez Sierra, C; Vecchi, S; Velthuis, J J; Veltri, M; Veneziano, G; Vesterinen, M; Viaud, B; Vieira, D; Vieites Diaz, M; Vilasis-Cardona, X; Vollhardt, A; Volyanskyy, D; Voong, D; Vorobyev, A; Vorobyev, V; Voß, C; Voss, H; de Vries, J A; Waldi, R; Wallace, C; Wallace, R; Walsh, J; Wandernoth, S; Wang, J; Ward, D R; Watson, N K; Webber, A D; Websdale, D; Whitehead, M; Wicht, J; Wiedner, D; Wiggers, L; Wilkinson, G; Williams, M P; Williams, M; Wilson, F F; Wimberley, J; Wishahi, J; Wislicki, W; Witek, M; Wormser, G; Wotton, S A; Wright, S; Wu, S; Wyllie, K; Xie, Y; Xing, Z; Xu, Z; Yang, Z; Yuan, X; Yushchenko, O; Zangoli, M; Zavertyaev, M; Zhang, F; Zhang, L; Zhang, W C; Zhang, Y; Zhelezov, A; Zhokhov, A; Zhong, L; Zvyagin, A

    The polarisation of prompt [Formula: see text] mesons is measured by performing an angular analysis of [Formula: see text] decays using proton-proton collision data, corresponding to an integrated luminosity of 1.0[Formula: see text], collected by the LHCb detector at a centre-of-mass energy of 7 TeV. The polarisation is measured in bins of transverse momentum [Formula: see text] and rapidity [Formula: see text] in the kinematic region [Formula: see text] and [Formula: see text], and is compared to theoretical models. No significant polarisation is observed.

  13. SIAM 2007 Text Mining Competition dataset

    Data.gov (United States)

    National Aeronautics and Space Administration — Subject Area: Text Mining Description: This is the dataset used for the SIAM 2007 Text Mining competition. This competition focused on developing text mining...

  14. The classical dramatic text and its value in contemporary theatre

    Directory of Open Access Journals (Sweden)

    Nina Žavbi Milojević

    2013-06-01

    Full Text Available This paper deals with the classical dramatic text and its staging in contemporary theatre. Specifically, it aims to show that classical texts can address topical issues. This is illustrated by the example of several stagings of Ivan Cankar’s Hlapci, one of the most influential dramatic texts in Slovene literature. The history of this dramatic text is presented from its first publication and reception to the different stagings in various Slovene professional theatres. The focus is on how the situation in Slovene society is reflected in each examined staging. The drama Hlapci was first staged almost one hundred years ago, when the staging followed closely the dramatic text. However, after 1980 stagings became more independent from the text and more artistic freedom was allowed. The paper will prove that classical dramatic texts are very appropriate for staging in contemporary theatre, especially with an innovative director’s approach.

  15. The text plan concept: contributions to the writing planning process

    Directory of Open Access Journals (Sweden)

    Ana Lúcia Tinoco Cabral

    2013-12-01

    Full Text Available Students - at different levels, ranging from early grades up to PhD - face problems both on comprehension and text production. This paper focuses on the text plan concept according to the DTA (Discourse Text Analysis approach, i.e., a principle of organization that allows students to put into practice the production intention as well as to arrange text information while producing; being responsible for the text compositional structure (Adam, 2008. The study analyzes the relation between text plan and the writing planning process, in which the first one provides the second with theoretical support. In order to develop such research, the study covers some issues related to the reading skill, analyzes an argumentative text as per its textual plan, and presents some reflections on the writing process, focusing on the relation between textual plan and the writing planning process.

  16. Handwriting segmentation of unconstrained Oriya text

    Indian Academy of Sciences (India)

    Segmentation of handwritten text into lines, words and characters is one of the important steps in the handwritten text recognition process. In this paper we propose a water reservoir concept-based scheme for segmentation of unconstrained Oriya handwritten text into individual characters. Here, at first, the text image is ...

  17. Theoretical simulation of CO2 capture by an \\text{A}{{\\text{l}}_{11}}\\text{Mg}_{3}^{-} cluster

    Science.gov (United States)

    Jiang, Yuanyuan; Xie, Xuefang; Hamid, Ilyar; Chen, Chu; Duan, Haiming

    2017-04-01

    In order to have an impact on carbon emissions, new stable materials for carbon capture should be able to adsorb CO2 from a mixture of other gases efficiently. Based on density functional theory calculations, we showed that the \\text{A}{{\\text{l}}11}\\text{Mg}3- cluster has an excellent capture capacity of CO2 and high CO2 selectivity under ambient conditions. \\text{A}{{\\text{l}}11}\\text{Mg}3- has an O2-resist property because this cluster is similar to \\text{Al}13- which contains 40 electrons with a larger energy gap. The \\text{A}{{\\text{l}}11}\\text{Mg}3- cluster prefers to adsorb CO2 compared with CH4, H2 and N2, and the CO2 molecule can be chemically adsorbed on the cluster by overcoming a lower barrier, which originates from the introduction of the Mg atom. When seven CO2 molecules are chemically adsorbed on the cluster, the capture capacity of CO2 can reach up to 18.99 mol kg-1 this means that the \\text{A}{{\\text{l}}11}\\text{Mg}3- cluster can be viewed as a potential candidate material for CO2 capture.

  18. Text mining in livestock animal science: introducing the potential of text mining to animal sciences.

    Science.gov (United States)

    Sahadevan, S; Hofmann-Apitius, M; Schellander, K; Tesfaye, D; Fluck, J; Friedrich, C M

    2012-10-01

    In biological research, establishing the prior art by searching and collecting information already present in the domain has equal importance as the experiments done. To obtain a complete overview about the relevant knowledge, researchers mainly rely on 2 major information sources: i) various biological databases and ii) scientific publications in the field. The major difference between the 2 information sources is that information from databases is available, typically well structured and condensed. The information content in scientific literature is vastly unstructured; that is, dispersed among the many different sections of scientific text. The traditional method of information extraction from scientific literature occurs by generating a list of relevant publications in the field of interest and manually scanning these texts for relevant information, which is very time consuming. It is more than likely that in using this "classical" approach the researcher misses some relevant information mentioned in the literature or has to go through biological databases to extract further information. Text mining and named entity recognition methods have already been used in human genomics and related fields as a solution to this problem. These methods can process and extract information from large volumes of scientific text. Text mining is defined as the automatic extraction of previously unknown and potentially useful information from text. Named entity recognition (NER) is defined as the method of identifying named entities (names of real world objects; for example, gene/protein names, drugs, enzymes) in text. In animal sciences, text mining and related methods have been briefly used in murine genomics and associated fields, leaving behind other fields of animal sciences, such as livestock genomics. The aim of this work was to develop an information retrieval platform in the livestock domain focusing on livestock publications and the recognition of relevant data from

  19. SAW Classification Algorithm for Chinese Text Classification

    OpenAIRE

    Xiaoli Guo; Huiyu Sun; Tiehua Zhou; Ling Wang; Zhaoyang Qu; Jiannan Zang

    2015-01-01

    Considering the explosive growth of data, the increased amount of text data’s effect on the performance of text categorization forward the need for higher requirements, such that the existing classification method cannot be satisfied. Based on the study of existing text classification technology and semantics, this paper puts forward a kind of Chinese text classification oriented SAW (Structural Auxiliary Word) algorithm. The algorithm uses the special space effect of Chinese text where words...

  20. The socio-demographics of texting

    DEFF Research Database (Denmark)

    Ling, Richard; Bertel, Troels Fibæk; Sundsøy, Pål

    2012-01-01

    partners for different age groups? 3) To which degree are texting relationships characterized by age and gender homophily? We find that that texting is hugely popular among teens compared to other age groups. Further, the number of persons with whom people text is quite small. About half of all text...... messages go to only five other persons. Finally, we find that there is pronounced homophily in terms of age and gender in texting relationships. These findings support previous claims that texting is an important element of teen culture and is an element in the construction of a bounded solidarity....

  1. Text analysis with R for students of literature

    CERN Document Server

    Jockers, Matthew L

    2014-01-01

    Text Analysis with R for Students of Literature is written with students and scholars of literature in mind but will be applicable to other humanists and social scientists wishing to extend their methodological tool kit to include quantitative and computational approaches to the study of text. Computation provides access to information in text that we simply cannot gather using traditional qualitative methods of close reading and human synthesis. Text Analysis with R for Students of Literature provides a practical introduction to computational text analysis using the open source programming language R. R is extremely popular throughout the sciences and because of its accessibility, R is now used increasingly in other research areas. Readers begin working with text right away and each chapter works through a new technique or process such that readers gain a broad exposure to core R procedures and a basic understanding of the possibilities of computational text analysis at both the micro and macro scale. Each c...

  2. Arabic text classification using Polynomial Networks

    Directory of Open Access Journals (Sweden)

    Mayy M. Al-Tahrawi

    2015-10-01

    Full Text Available In this paper, an Arabic statistical learning-based text classification system has been developed using Polynomial Neural Networks. Polynomial Networks have been recently applied to English text classification, but they were never used for Arabic text classification. In this research, we investigate the performance of Polynomial Networks in classifying Arabic texts. Experiments are conducted on a widely used Arabic dataset in text classification: Al-Jazeera News dataset. We chose this dataset to enable direct comparisons of the performance of Polynomial Networks classifier versus other well-known classifiers on this dataset in the literature of Arabic text classification. Results of experiments show that Polynomial Networks classifier is a competitive algorithm to the state-of-the-art ones in the field of Arabic text classification.

  3. A Proposed Arabic Handwritten Text Normalization Method

    Directory of Open Access Journals (Sweden)

    Tarik Abu-Ain

    2014-11-01

    Full Text Available Text normalization is an important technique in document image analysis and recognition. It consists of many preprocessing stages, which include slope correction, text padding, skew correction, and straight the writing line. In this side, text normalization has an important role in many procedures such as text segmentation, feature extraction and characters recognition. In the present article, a new method for text baseline detection, straightening, and slant correction for Arabic handwritten texts is proposed. The method comprises a set of sequential steps: first components segmentation is done followed by components text thinning; then, the direction features of the skeletons are extracted, and the candidate baseline regions are determined. After that, selection of the correct baseline region is done, and finally, the baselines of all components are aligned with the writing line.  The experiments are conducted on IFN/ENIT benchmark Arabic dataset. The results show that the proposed method has a promising and encouraging performance.

  4. Text comprehension dependence on reading experience

    OpenAIRE

    Tilmantaitė, Kamilė

    2016-01-01

    In bachelor thesis „Text comprehension dependence on reading experience“ – is researching, how students text comprehension is dependent on reading experience. In theoretical part discussed the reading conception and reading methods are discussed as well as the text comprehension, models and reading capacity. The practical part contains of pupils of eighth and tenth classes text comprehension test analysis, questionnaire about reading experience analysis and how they both interdependent. In th...

  5. Automatic prediction of text aesthetics and interestingness

    OpenAIRE

    Ganguly, Debasis; Leveling, Johannes; Jones, Gareth J.F.

    2014-01-01

    This paper investigates the problem of automated text aesthetics prediction. The availability of user generated content and ratings, e.g. Flickr, has induced research in aesthetics prediction for non-text domains, particularly for photographic images. This problem, however, has yet not been explored for the text domain. Due to the very subjective nature of text aesthetics, it is dicult to compile human annotated data by methods such as crowd sourcing with a fair degree of inter-annotator agre...

  6. Multimodal Diversity of Postmodernist Fiction Text

    Directory of Open Access Journals (Sweden)

    U. I. Tykha

    2016-12-01

    Full Text Available The article is devoted to the analysis of structural and functional manifestations of multimodal diversity in postmodernist fiction texts. Multimodality is defined as the coexistence of more than one semiotic mode within a certain context. Multimodal texts feature a diversity of semiotic modes in the communication and development of their narrative. Such experimental texts subvert conventional patterns by introducing various semiotic resources – verbal or non-verbal.

  7. Creating and Using Culturally Sustaining Informational Texts

    Science.gov (United States)

    Watanabe Kganetso, Lynne M.

    2017-01-01

    Current standards and assessments emphasize the importance of a variety of genres in students' literacy diets, which has placed increased attention on informational texts. Unfortunately, young students' current exposure to and experiences with informational texts are often limited by the texts' availability, quality, and relevance to children's…

  8. Does Writing Summaries Improve Memory for Text?

    Science.gov (United States)

    Spirgel, Arie S.; Delaney, Peter F.

    2016-01-01

    In five experiments, we consistently found that items included in summaries were better remembered than items omitted from summaries. We did not, however, find evidence that summary writing was better than merely restudying the text. These patterns held with shorter and longer texts, when the text was present or absent during the summary writing,…

  9. Female gender stereotype in French advertising texts

    Directory of Open Access Journals (Sweden)

    А С Борисова

    2008-06-01

    Full Text Available This article deals with the problem of female gender stereotypes in French advertising texts. On the ground of the practical analysis of advertising texts published in some modern French periodicals, we managed to expose and define general and national-cultural female gender stereotypes fixed in collective consciousness of the French.

  10. Effects of Text Messaging on Academic Performance

    OpenAIRE

    Barks Amanda; Searight H. Russell; Ratwik Susan

    2011-01-01

    University students frequently send and receive cellular phone text messages during classroominstruction. Cognitive psychology research indicates that multi-tasking is frequently associatedwith performance cost. However, university students often have considerable experience withelectronic multi-tasking and may believe that they can devote necessary attention to a classroomlecture while sending and receiving text messages. In the current study, university students whoused text messaging were ...

  11. Effects of Text Messaging on Academic Performance

    Directory of Open Access Journals (Sweden)

    Barks Amanda

    2011-12-01

    Full Text Available University students frequently send and receive cellular phone text messages during classroominstruction. Cognitive psychology research indicates that multi-tasking is frequently associatedwith performance cost. However, university students often have considerable experience withelectronic multi-tasking and may believe that they can devote necessary attention to a classroomlecture while sending and receiving text messages. In the current study, university students whoused text messaging were randomly assigned to one of two conditions: 1. a group that sent andreceived text messages during a lecture or, 2. a group that did not engage in text messagingduring the lecture. Participants who engaged in text messaging demonstrated significantlypoorer performance on a test covering lecture content compared with the group that did notsend and receive text messages. Participants exhibiting higher levels of text messaging skill hadsignificantly lower test scores than participants who were less proficient at text messaging. It ishypothesized that in terms of retention of lecture material, more frequent task shifting by thosewith greater text messaging proficiency contributed to poorer performance. Overall, the findingsdo not support the view, held by many university students, that this form of multitasking has littleeffect on the acquisition of lecture content. Results provide empirical support for teachers andprofessors who ban text messaging in the classroom.

  12. Center for Electronic Texts in the Humanities.

    Science.gov (United States)

    Gaunt, Marianne I.

    1994-01-01

    Describes the development and activities of the Center for Electronic Texts in the Humanities, established by Princeton University and Rutgers University to provide a national focus for the development, dissemination, and use of electronic texts in the humanities. Sidebars explain the Text Encoding Initiative and Standard Generalized Markup…

  13. Refutation Texts for Effective Climate Change Education

    Science.gov (United States)

    Nussbaum, E. Michael; Cordova, Jacqueline R.; Rehmat, Abeera P.

    2017-01-01

    Refutation texts, which are texts that rebut scientific misconceptions and explain the normative concept, can be effective devices for addressing misconceptions and affecting conceptual change. However, few, if any, refutation texts specifically related to climate change have been validated for effectiveness. In this project, we developed and…

  14. Rational kernels for Arabic Root Extraction and Text Classification

    Directory of Open Access Journals (Sweden)

    Attia Nehar

    2016-04-01

    Full Text Available In this paper, we address the problems of Arabic Text Classification and root extraction using transducers and rational kernels. We introduce a new root extraction approach on the basis of the use of Arabic patterns (Pattern Based Stemmer. Transducers are used to model these patterns and root extraction is done without relying on any dictionary. Using transducers for extracting roots, documents are transformed into finite state transducers. This document representation allows us to use and explore rational kernels as a framework for Arabic Text Classification. Root extraction experiments are conducted on three word collections and yield 75.6% of accuracy. Classification experiments are done on the Saudi Press Agency dataset and N-gram kernels are tested with different values of N. Accuracy and F1 report 90.79% and 62.93% respectively. These results show that our approach, when compared with other approaches, is promising specially in terms of accuracy and F1.

  15. Bilingual Text Messaging Translation: Translating Text Messages From English Into Spanish for the Text4Walking Program.

    Science.gov (United States)

    Buchholz, Susan Weber; Sandi, Giselle; Ingram, Diana; Welch, Mary Jane; Ocampo, Edith V

    2015-05-06

    Hispanic adults in the United States are at particular risk for diabetes and inadequate blood pressure control. Physical activity improves these health problems; however Hispanic adults also have a low rate of recommended aerobic physical activity. To address improving physical inactivity, one area of rapidly growing technology that can be utilized is text messaging (short message service, SMS). A physical activity research team, Text4Walking, had previously developed an initial database of motivational physical activity text messages in English that could be used for physical activity text messaging interventions. However, the team needed to translate these existing English physical activity text messages into Spanish in order to have culturally meaningful and useful text messages for those adults within the Hispanic population who would prefer to receive text messages in Spanish. The aim of this study was to translate a database of English motivational physical activity messages into Spanish and review these text messages with a group of Spanish speaking adults to inform the use of these text messages in an intervention study. The consent form and study documents, including the existing English physical activity text messages, were translated from English into Spanish, and received translation certification as well as Institutional Review Board approval. The translated text messages were placed into PowerPoint, accompanied by a set of culturally appropriate photos depicting barriers to walking, as well as walking scenarios. At the focus group, eligibility criteria for this study included being an adult between 30 to 65 years old who spoke Spanish as their primary language. After a general group introduction, participants were placed into smaller groups of two or three. Each small group was asked to review a segment of the translated text messages for accuracy and meaningfulness. After the break out, the group was brought back together to review the text messages

  16. The Instructional Text like a Textual Genre

    Directory of Open Access Journals (Sweden)

    Adiane Fogali Marinello

    2011-07-01

    Full Text Available This article analyses the instructional text as a textual genre and is part of the research called Reading and text production from the textual genre perspective, done at Universidade de Caxias do Sul, Campus Universitário da Região dos Vinhedos. Firstly, some theoretical assumptions about textual genre are presented, then, the instructional text is characterized. After that an instructional text is analyzed and, finally, some activities related to reading and writing of the mentioned genre directed to High School and University students are suggested.

  17. An Embedded Application for Degraded Text Recognition

    Directory of Open Access Journals (Sweden)

    Thillou Céline

    2005-01-01

    Full Text Available This paper describes a mobile device which tries to give the blind or visually impaired access to text information. Three key technologies are required for this system: text detection, optical character recognition, and speech synthesis. Blind users and the mobile environment imply two strong constraints. First, pictures will be taken without control on camera settings and a priori information on text (font or size and background. The second issue is to link several techniques together with an optimal compromise between computational constraints and recognition efficiency. We will present the overall description of the system from text detection to OCR error correction.

  18. Text segmentation in degraded historical document images

    Directory of Open Access Journals (Sweden)

    A.S. Kavitha

    2016-07-01

    Full Text Available Text segmentation from degraded Historical Indus script images helps Optical Character Recognizer (OCR to achieve good recognition rates for Hindus scripts; however, it is challenging due to complex background in such images. In this paper, we present a new method for segmenting text and non-text in Indus documents based on the fact that text components are less cursive compared to non-text ones. To achieve this, we propose a new combination of Sobel and Laplacian for enhancing degraded low contrast pixels. Then the proposed method generates skeletons for text components in enhanced images to reduce computational burdens, which in turn helps in studying component structures efficiently. We propose to study the cursiveness of components based on branch information to remove false text components. The proposed method introduces the nearest neighbor criterion for grouping components in the same line, which results in clusters. Furthermore, the proposed method classifies these clusters into text and non-text cluster based on characteristics of text components. We evaluate the proposed method on a large dataset containing varieties of images. The results are compared with the existing methods to show that the proposed method is effective in terms of recall and precision.

  19. Text-speak processing impairs tactile location.

    Science.gov (United States)

    Head, James; Helton, William; Russell, Paul; Neumann, Ewald

    2012-09-01

    Dual task experiments have highlighted that driving while having a conversation on a cell phone can have negative impacts on driving (Strayer & Drews, 2007). It has also been noted that this negative impact is greater when reading a text-message (Lee, 2007). Commonly used in text-messaging are shortening devices collectively known as text-speak (e.g.,Ys I wll ttyl 2nite, Yes I will talk to you later tonight). To the authors' knowledge, there has been no investigation into the potential negative impacts of reading text-speak on concurrent performance on other tasks. Forty participants read a correctly spelled story and a story presented in text-speak while concurrently monitoring for a vibration around their waist. Slower reaction times and fewer correct vibration detections occurred while reading text-speak than while reading a correctly spelled story. The results suggest that reading text-speak imposes greater cognitive load than reading correctly spelled text. These findings suggest that the negative impact of text messaging on driving may be compounded by the messages being in text-speak, instead of orthographically correct text. Copyright © 2012 Elsevier B.V. All rights reserved.

  20. The nuclear modification of charged particles in Pb-Pb at $\\sqrt{\\text{s}_\\text{NN}} = \\text{5.02}\\,\\text{TeV}$ measured with ALICE

    CERN Document Server

    Gronefeld, Julius

    2016-09-21

    The study of inclusive charged-particle production in heavy-ion collisions provides insights into the density of the medium and the energy-loss mechanisms. The observed suppression of high-$\\textit{p}_\\text{T}$ yield is generally attributed to energy loss of partons as they propagate through a deconfined state of quarks and gluons - Quark-Gluon Plasma (QGP) - predicted by QCD. Such measurements allow the characterization of the QGP by comparison with models. In these proceedings, results on high-$\\textit{p}_\\text{T}$ particle production measured by ALICE in Pb-Pb collisions at $ \\sqrt{\\text{s}_\\text{NN}}\\, = 5.02\\ \\rm{TeV}$ as well as well in pp at $\\sqrt{\\text{s}}\\,=5.02\\ \\rm{TeV}$ are presented for the first time. The nuclear modification factors ($\\text{R}_\\text{AA}$) in Pb-Pb collisions are presented and compared with model calculations.

  1. Extractive text summarization system to aid data extraction from full text in systematic review development.

    Science.gov (United States)

    Bui, Duy Duc An; Del Fiol, Guilherme; Hurdle, John F; Jonnalagadda, Siddhartha

    2016-12-01

    Extracting data from publication reports is a standard process in systematic review (SR) development. However, the data extraction process still relies too much on manual effort which is slow, costly, and subject to human error. In this study, we developed a text summarization system aimed at enhancing productivity and reducing errors in the traditional data extraction process. We developed a computer system that used machine learning and natural language processing approaches to automatically generate summaries of full-text scientific publications. The summaries at the sentence and fragment levels were evaluated in finding common clinical SR data elements such as sample size, group size, and PICO values. We compared the computer-generated summaries with human written summaries (title and abstract) in terms of the presence of necessary information for the data extraction as presented in the Cochrane review's study characteristics tables. At the sentence level, the computer-generated summaries covered more information than humans do for systematic reviews (recall 91.2% vs. 83.8%, psummarization system. Copyright © 2016 Elsevier Inc. All rights reserved.

  2. Intertextuality: On the use of the Bible in mystical texts

    Directory of Open Access Journals (Sweden)

    Kees Waaijman

    2010-11-01

    Full Text Available This article discussed the use of the Bible in mystical texts by focusing on intertextuality as a literary approach which analyses the intersection of texts. It investigated how mystical texts, as phenotexts, relate to the Bible as archetext: firstly, the intertextual relations affect the surface of the text in a mono-causal way and secondly, they govern the production of meaning reciprocally. The article also discussed forms of intersection (quotations, collage, allusions and reproduction before it analysed the three intertextual strategies producing meaning: participation, detachment and change or rearrangement. Finally, six functions and dimensions of meaning were delineated in the intertextual dynamic between the Bible and the mystical texts. In these the Bible serves as an authoritative framework for argumentation, as a guide and blueprint of the mystical way, as a vocabulary of mystical experience, as an initiation into the divine infinity, as the place of mystical transformation in love and as the articulation of transformation in glory.

  3. Considerations on a methodological framework for the analysis of texts

    Directory of Open Access Journals (Sweden)

    David Andrés Camargo Mayorga

    2017-03-01

    Full Text Available This article presents a review of relevant literature for the construction of a methodological framework for the analysis of texts in applied social sciences, such as economics, which we have supported in the main hermeneutical approaches from philosophy, linguistics and social sciences. In essence, they assume that every discourse carries meaning - be it truthful or not - and that they express complex social relations. Thus, any analysis of content happens finally to be a certain type of hermeneutics (interpretation, while trying to account for multiple phenomena immersed in the production, application, use and reproduction of knowledge within the text. When applying discourse analysis in teaching texts in economic sciences, we find traces of legalistic, political, ethnocentric tendencies, among other discourses hidden from the text. For this reason, the analysis of the internal discourse of the text allows us to delve inside the state ideology and its underlying or latent discourses.

  4. Gender Analysis On Islamic Texts: A Study On Its Accuracy

    Directory of Open Access Journals (Sweden)

    Muchammad Ichsan

    2014-06-01

    Full Text Available Gender equality movement is spreading all over the world, including in Indonesia where Muslim gender activists have made hard efforts to ensure gender fairness and equality among people. One of their efforts is emphasizing the urgency of reinterpreting Islamic texts. They insist on the reinterpretation of Islamic texts based on gender perspective and analysis due to the existence of many Islamic texts that trespass the principles of gender equality and fairness they have been fighting for. This paper aims at assuring and examining the accuracy of using gender perspective as a tool for analyzing the Islamic text. It is found that using gender perspective and analysis for reinterpreting Islamic texts is not in line with the Islamic principles and will only produce laws and points of views which deviate from Islamic teachings. To reach the goals of this study, a descriptive-analytical approach is employed.

  5. Visualization of text documents based on conceptual spaces

    OpenAIRE

    Vidmar, Kaja

    2010-01-01

    In my thesis I am presenting an approach of conceptual spaces for vizulalization of text corpora. Thesis is divided into two parts. First part is overview of methods for text corpora analysis and the second one presents some ways for result vizualization. Due to increasing number of eletronic data, we tend to automatic analisys and organisation of this data into various, pre-unknown groups. Some algorithms, that are providing us ways to do this, are presented (such as latent semant...

  6. Gene prioritization and clustering by multi-view text mining.

    Science.gov (United States)

    Yu, Shi; Tranchevent, Leon-Charles; De Moor, Bart; Moreau, Yves

    2010-01-14

    Text mining has become a useful tool for biologists trying to understand the genetics of diseases. In particular, it can help identify the most interesting candidate genes for a disease for further experimental analysis. Many text mining approaches have been introduced, but the effect of disease-gene identification varies in different text mining models. Thus, the idea of incorporating more text mining models may be beneficial to obtain more refined and accurate knowledge. However, how to effectively combine these models still remains a challenging question in machine learning. In particular, it is a non-trivial issue to guarantee that the integrated model performs better than the best individual model. We present a multi-view approach to retrieve biomedical knowledge using different controlled vocabularies. These controlled vocabularies are selected on the basis of nine well-known bio-ontologies and are applied to index the vast amounts of gene-based free-text information available in the MEDLINE repository. The text mining result specified by a vocabulary is considered as a view and the obtained multiple views are integrated by multi-source learning algorithms. We investigate the effect of integration in two fundamental computational disease gene identification tasks: gene prioritization and gene clustering. The performance of the proposed approach is systematically evaluated and compared on real benchmark data sets. In both tasks, the multi-view approach demonstrates significantly better performance than other comparing methods. In practical research, the relevance of specific vocabulary pertaining to the task is usually unknown. In such case, multi-view text mining is a superior and promising strategy for text-based disease gene identification.

  7. Adaptive Text Entry for Mobile Devices

    DEFF Research Database (Denmark)

    Proschowsky, Morten Smidt

    The reduced size of many mobile devices makes it difficult to enter text with them. The text entry methods are often slow or complicated to use. This affects the performance and user experience of all applications and services on the device. This work introduces new easy-to-use text entry methods...... for mobile devices and a framework for adaptive context-aware language models. Based on analysis of current text entry methods, the requirements to the new text entry methods are established. Transparent User guided Prediction (TUP) is a text entry method for devices with one dimensional touch input. It can...... be touch sensitive wheels, sliders or similar input devices. The interaction design of TUP is done with a combination of high level task models and low level models of human motor behaviour. Three prototypes of TUP are designed and evaluated by more than 30 users. Observations from the evaluations are used...

  8. A Query System for Texts with Macros

    Science.gov (United States)

    Kwon, Keehang; Kang, Dae-Seong; Kim, Jinsoo

    We propose a query language based on extended regular expressions. This language extends texts with text-generating macros. These macros make it possible to define languages in a compressed, elegant way. This paper also extends queries with linear implications and additive (classical) conjunctions. To be precise, it allows goals of the form D _??_ G and G1 & G2 where D is a text or a macro and G is a query. The first goal is solved by adding D to the current text and then solving G. This goal is flexible in controlling the current text dynamically. The second goal is solved by solving both G1 and G2 from the current text. This goal is particularly useful for internet search.

  9. HELPING STUDENTS UNDERSTAND THE TEXT THROUGH SCAFFOLDING

    Directory of Open Access Journals (Sweden)

    Deni Sapta Nugraha

    2015-12-01

    Full Text Available This study reported the practice of helping adult students to comprehend the texts in Indonesian Civil Aviation Institute majoring at Air traffic controller programme, Curug - Tangerang. The article demonstrated of how teacher helped them to comprehend the text during 100 minutes reading class in three meetings. It was employed as their input session to acquire context, knowledge and specific vocabulary in aviation or what so called as phraseology. Students were asked to construct some questions dealing with the text both literal and inferential comprehension suggested by Barrett (in Eanes 1997. The result showed that students attained three main bonuses; they get used to build questions that impact to their grammatical awareness, they get used to communicate orally, and they are successful to comprehend the text thoroughly by acquiring new knowledge, vocabulary as well as context. Keywords: reading comprehension, text, scaffolding

  10. Choices of texts for literary education

    DEFF Research Database (Denmark)

    Skyggebjerg, Anna Karlskov

    . The teaching of literature has a double bind. On the one hand, there is a subject (Danish) and a curriculum with a certain type of texts with cultural and even national connotations, and the limits of the choice of texts and curriculum are decided by the state. On the other hand, there are some concrete......This paper charts the general implications of the choice of texts for literature teaching in the Danish school system, especially in Grades 8 and 9. It will analyze and discuss the premises of the choice of texts, and the possibilities of a certain choice of text in a concrete classroom situation...... readers with literary interests, competences, possibilities, needs, etc. Generally speaking the criteria for the choice of texts for teaching literature in Danish schools have been dominated by considerations for the subject and Literature in itself. The predominant view of literature comes from...

  11. Arabic text preprocessing for the natural language processing applications

    International Nuclear Information System (INIS)

    Awajan, A.

    2007-01-01

    A new approach for processing vowelized and unvowelized Arabic texts in order to prepare them for Natural Language Processing (NLP) purposes is described. The developed approach is rule-based and made up of four phases: text tokenization, word light stemming, word's morphological analysis and text annotation. The first phase preprocesses the input text in order to isolate the words and represent them in a formal way. The second phase applies a light stemmer in order to extract the stem of each word by eliminating the prefixes and suffixes. The third phase is a rule-based morphological analyzer that determines the root and the morphological pattern for each extracted stem. The last phase produces an annotated text where each word is tagged with its morphological attributes. The preprocessor presented in this paper is capable of dealing with vowelized and unvowelized words, and provides the input words along with relevant linguistics information needed by different applications. It is designed to be used with different NLP applications such as machine translation text summarization, text correction, information retrieval and automatic vowelization of Arabic Text. (author)

  12. Science and Technology Text Mining Basic Concepts

    National Research Council Canada - National Science Library

    Losiewicz, Paul

    2003-01-01

    ...). It then presents some of the most widely used data and text mining techniques, including clustering and classification methods, such as nearest neighbor, relational learning models, and genetic...

  13. Using Unlabeled Data to Improve Text Classification

    National Research Council Canada - National Science Library

    Nigam, Kamal P

    2001-01-01

    .... This dissertation demonstrates that supervised learning algorithms that use a small number of labeled examples and many inexpensive unlabeled examples can create high-accuracy text classifiers...

  14. Diode and Diode Circuits, a Programmed Text.

    Science.gov (United States)

    Balabanian, Norman; Kirwin, Gerald J.

    This programed text on diode and diode circuits was developed under contract with the United States Office of Education as Number 4 in a series of materials for use in an electrical engineering sequence. It is intended as a supplement to a regular text and other instructional material. (DH)

  15. Readability Revisited? The Implications of Text Complexity

    Science.gov (United States)

    Wray, David; Janan, Dahlia

    2013-01-01

    The concept of readability has had a variable history, moving from a position where it was considered as a very important topic for those responsible for producing texts and matching those texts to the abilities and needs of learners, to its current declining visibility in the education literature. Some important work has been coming from the USA…

  16. MORPHOLOGICAL STRATEGIES IN TEXT MESSAGING AMONG ...

    African Journals Online (AJOL)

    Text messaging is the application of abridged morphological forms in order to communicate and it is one of the fastest means of communication since the emergence of the Global System for Mobile Communication (GSM) in the world. In text messaging, we apply innovative language forms with morpho-syntactic structures ...

  17. Text Fabric: What, How, and Why

    NARCIS (Netherlands)

    Erwich, C.M.; Kingham, Cody

    Text-Fabric (TF) is a promising new framework for the Eep Talstra Center for Bible and Computer corpus plus (linguistic) annotations. TF is a Python 3.x software package that provides scientific, accessible and reproducible ways of processing Biblical Hebrew text data. It also allows sharing the

  18. Tagalog for Beginners. PALI Language Texts: Philippines.

    Science.gov (United States)

    Ramos, Teresita V.; de Guzman, Videa

    This language textbook is designed for beginning students of Tagalog, the principal language spoken on the island of Luzon in the Philippines. The introduction discusses the history of Tagalog and certain features of the language. An explanation of the text is given, along with notes for the teacher. The text itself is divided into nine sections:…

  19. An Intelligent System For Arabic Text Categorization

    NARCIS (Netherlands)

    Syiam, M.M.; Tolba, Mohamed F.; Fayed, Z.T.; Abdel-Wahab, Mohamed S.; Ghoniemy, Said A.; Habib, Mena Badieh

    Text Categorization (classification) is the process of classifying documents into a predefined set of categories based on their content. In this paper, an intelligent Arabic text categorization system is presented. Machine learning algorithms are used in this system. Many algorithms for stemming and

  20. Using Digital Texts to Promote Fluent Reading

    Science.gov (United States)

    Thoermer, Andrea; Williams, Lunetta

    2012-01-01

    Fluency is a critical skill of adept readers. As listening to read alouds and performing Readers Theatre scripts are two prevalent strategies that can increase students' fluency skills, this article provides suggestions in using these strategies with digital texts through free, online resources. Digital texts can be accessed using a desktop,…

  1. A text in Romani from 1622

    DEFF Research Database (Denmark)

    Bakker, Peter

    2015-01-01

    this is a reprint of a 2012 article: A new old text in Romani: Lord's Prayer, 1622. International Journal of Romani Language and Culture 2 (2011): 193-212.......this is a reprint of a 2012 article: A new old text in Romani: Lord's Prayer, 1622. International Journal of Romani Language and Culture 2 (2011): 193-212....

  2. The Patchwork Text in Teaching Greek Tragedy.

    Science.gov (United States)

    Parker, Jan

    2003-01-01

    Describes the rewards and challenges of using the Patchwork Text to teach Greek Tragedy to Cambridge University English final-year students. The article uses close reading of the students' texts, analysis and reflection to discuss both the products and the process of Patchwork writing. (Author/AEF)

  3. The Medline/full-text research project.

    Science.gov (United States)

    McKinin, E J; Sievert, M; Johnson, E D; Mitchell, J A

    1991-05-01

    This project was designed to test the relative efficacy of index terms and full-text for the retrieval of documents in those MEDLINE journals for which full-text searching was also available. The full-text files used were MEDIS from Mead Data Central and CCML from BRS Information Technologies. One hundred clinical medical topics were searched in these two files as well as the MEDLINE file to accumulate the necessary data. It was found that full-text identified significantly more relevant articles than did the indexed file, MEDLINE. The full-text searches, however, lacked the precision of searches done in the indexed file. Most relevant items missed in the full-text files, but identified in MEDLINE, were missed because the searcher failed to account for some aspect of natural language, used a logical or positional operator that was too restrictive, or included a concept which was implied, but not expressed in the natural language. Very few of the unique relevant full-text citations would have been retrieved by title or abstract alone. Finally, as of July, 1990 the more current issue of a journal was just as likely to appear in MEDLINE as in one of the full-text files.

  4. Classifying Written Texts Through Rhythmic Features

    NARCIS (Netherlands)

    Balint, Mihaela; Dascalu, Mihai; Trausan-Matu, Stefan

    2016-01-01

    Rhythm analysis of written texts focuses on literary analysis and it mainly considers poetry. In this paper we investigate the relevance of rhythmic features for categorizing texts in prosaic form pertaining to different genres. Our contribution is threefold. First, we define a set of rhythmic

  5. Text comprehension strategy instruction with poor readers

    NARCIS (Netherlands)

    Van den Bos, K.P.; Aarnoudse, C.C.; Brand-Gruwel, S.

    1998-01-01

    The goal of this study was to investigate the effects of teaching text comprehension strategies to children with decoding and reading comprehension problems and with a poor or normal listening ability. Two experiments are reported. Four text comprehension strategies, viz., question generation,

  6. Text mining for the biocuration workflow.

    Science.gov (United States)

    Hirschman, Lynette; Burns, Gully A P C; Krallinger, Martin; Arighi, Cecilia; Cohen, K Bretonnel; Valencia, Alfonso; Wu, Cathy H; Chatr-Aryamontri, Andrew; Dowell, Karen G; Huala, Eva; Lourenço, Anália; Nash, Robert; Veuthey, Anne-Lise; Wiegers, Thomas; Winter, Andrew G

    2012-01-01

    Molecular biology has become heavily dependent on biological knowledge encoded in expert curated biological databases. As the volume of biological literature increases, biocurators need help in keeping up with the literature; (semi-) automated aids for biocuration would seem to be an ideal application for natural language processing and text mining. However, to date, there have been few documented successes for improving biocuration throughput using text mining. Our initial investigations took place for the workshop on 'Text Mining for the BioCuration Workflow' at the third International Biocuration Conference (Berlin, 2009). We interviewed biocurators to obtain workflows from eight biological databases. This initial study revealed high-level commonalities, including (i) selection of documents for curation; (ii) indexing of documents with biologically relevant entities (e.g. genes); and (iii) detailed curation of specific relations (e.g. interactions); however, the detailed workflows also showed many variabilities. Following the workshop, we conducted a survey of biocurators. The survey identified biocurator priorities, including the handling of full text indexed with biological entities and support for the identification and prioritization of documents for curation. It also indicated that two-thirds of the biocuration teams had experimented with text mining and almost half were using text mining at that time. Analysis of our interviews and survey provide a set of requirements for the integration of text mining into the biocuration workflow. These can guide the identification of common needs across curated databases and encourage joint experimentation involving biocurators, text mining developers and the larger biomedical research community.

  7. NOTICING HYBRID RECASTS IN TEXT CHAT

    Directory of Open Access Journals (Sweden)

    Mark J. Oliver

    2016-12-01

    Full Text Available This study examined ten EFL learners’ noticing of the corrective nature of a form of text-based SCMC (text chat feedback that combined a recast of a grammatical error with metalinguistic information. The feedback, termed a hybrid recast, was provided by a native-speaker interlocutor during two text chat activities: a spot-the-difference and picture-ordering task. Data was collected in two ways: analysis of task-based dyadic text chat interaction in which uptake was used as an indicator of learner noticing, and a post-task questionnaire containing questions that identified evidence of learner noticing. Interaction analysis showed that learners responded to almost two thirds of the hybrid recasts with uptake. In addition, every learner provided evidence that they had correctly perceived at least some of the hybrid recasts as corrective in their post-task questionnaire responses.

  8. Using Genetic Algorithms for Texts Classification Problems

    Directory of Open Access Journals (Sweden)

    A. A. Shumeyko

    2009-01-01

    Full Text Available The avalanche quantity of the information developed by mankind has led to concept of automation of knowledge extraction – Data Mining ([1]. This direction is connected with a wide spectrum of problems - from recognition of the fuzzy set to creation of search machines. Important component of Data Mining is processing of the text information. Such problems lean on concept of classification and clustering ([2]. Classification consists in definition of an accessory of some element (text to one of in advance created classes. Clustering means splitting a set of elements (texts on clusters which quantity are defined by localization of elements of the given set in vicinities of these some natural centers of these clusters. Realization of a problem of classification initially should lean on the given postulates, basic of which – the aprioristic information on primary set of texts and a measure of affinity of elements and classes.

  9. Frontiers of biomedical text mining: current progress

    Science.gov (United States)

    Zweigenbaum, Pierre; Demner-Fushman, Dina; Yu, Hong; Cohen, Kevin B.

    2008-01-01

    It is now almost 15 years since the publication of the first paper on text mining in the genomics domain, and decades since the first paper on text mining in the medical domain. Enormous progress has been made in the areas of information retrieval, evaluation methodologies and resource construction. Some problems, such as abbreviation-handling, can essentially be considered solved problems, and others, such as identification of gene mentions in text, seem likely to be solved soon. However, a number of problems at the frontiers of biomedical text mining continue to present interesting challenges and opportunities for great improvements and interesting research. In this article we review the current state of the art in biomedical text mining or ‘BioNLP’ in general, focusing primarily on papers published within the past year. PMID:17977867

  10. Creating texts an introduction to the study of composition

    CERN Document Server

    Nash, Walter

    2014-01-01

    Creating Texts emphasises a practical approach to composition and enables students to understand what is involved in the creation of a text and to learn from the practice of other writers. Extensively rewritten and updated from Walter Nash's earlier volume, Designs in Prose, attention is paid to the general theory of composition, in both traditional and original terms, so that students are made familiar with the basic resources of composition, in grammar and in the lexicon.The essence of every chapter is the discussion of examples of text, sometimes devised by the authors

  11. Text corrections in the school context from a dialogical perspective

    Directory of Open Access Journals (Sweden)

    Carla Avena Camilotto

    2015-02-01

    Full Text Available This article discusses a teacher’s text correction approach with students enrolled in the  4th period  of a Young and Adult Education course at a municipal school located in Itapema, Santa Catarina. The “corpus” used in this article made up of transcriptions of the teacher’s speech, in which her pedagogical conceptions became quite evident, especially in regards to normativism in  language use. Theoreticalreferences include: Costa Val, Ilari, Geraldi, Elias, Gil Neto, Cavalcante, Antunes, Passarelli, amongothers. The analysis showed that the text correction practice carried out by the teacher does not help students to think about how language works.

  12. EU external relations law : text, cases and materials

    NARCIS (Netherlands)

    Van Vooren, Bart; Wessel, Ramses A.

    2014-01-01

    This major new textbook for students in European law uses a text, cases and materials approach to explore the law, politics, policy and practice of EU external relations, and navigates the complex questions at the interface of these areas. The subject is explored by explaining major constitutional

  13. A survey of text clustering techniques used for web mining

    Directory of Open Access Journals (Sweden)

    Dan MUNTEANU

    2005-12-01

    Full Text Available This paper contains an overview of basic formulations and approaches to clustering. Then it presents two important clustering paradigms: a bottom-up agglomerative technique, which collects similar documents into larger and larger groups, and a top-down partitioning technique, which divides a corpus into topic-oriented partitions.

  14. Effectiveness of Conceptual Change Texts: A Meta Analysis

    Science.gov (United States)

    Armagan, Fulya Öner; Keskin, Melike Özer; Akin, Beril Salman

    2017-01-01

    The purpose of this study was to determine the overall effectiveness of conceptual change texts (CCTs) on academic achievement and to find out if effectiveness was related to some characteristics of the study. It followed up a Meta-analysis research approach. 42 published and unpublished studies, published between 1995 and 2010, and 42 experiment…

  15. Bona , Barometer of the Decades | Khuzwayo | Current Writing: Text ...

    African Journals Online (AJOL)

    The article examines practices of descriptive translation studies (DTS), pertaining particularly to the use of substitution and omission, in articles of a socially sensitive nature in the English and isiZulu texts of the monthly magazine Bona. The argument is that in its cautious approach to political matters Bona, paradoxically, ...

  16. Reconstructing rhetorical strategies from the text of galatians ...

    African Journals Online (AJOL)

    This paper focuses on areas of overlap between linguistic and rhetorical analyses of Paul's Letter to the Galatians. The question is raised whether and to what extent conclusions drawn from a text immanent linguistic approach, on the one hand, and those drawn from rhetorical analyses, on the other, are compatible and ...

  17. The translation of biblical texts into South African Sign Language ...

    African Journals Online (AJOL)

    SASL) are more accessible than written or printed biblical texts for deaf-born South African people who use sign language as their first language. The study made use of the functionalist approach in translation to translate six parts from the Bible into ...

  18. Extracting bimodal representations for language-based image text retrieval

    NARCIS (Netherlands)

    Westerveld, T.H.W.; Hiemstra, Djoerd; de Jong, Franciska M.G.; Correia, N.; Chambel, T.; Davenport, G.

    2000-01-01

    This paper explores two approaches to multimedia indexing that might contribute to the advancement of text-based conceptual search for pictorial information. Insights from relatively mature retrieval areas (spoken document retrieval and cross-language retrieval) are taken as a starting point for an

  19. Intertextuality and Dialogic Interaction in Students' Online Text Construction

    Science.gov (United States)

    Ronan, Briana

    2015-01-01

    This study examines the online writing practices of adolescent emergent bilinguals through the mediating lenses of dialogic interaction and intertextuality. Using a multimodal discourse analysis approach, the study traces how three students develop online academic texts through intertextual moves that traverse modal boundaries. The analysis…

  20. Text mining of web-based medical content

    CERN Document Server

    Neustein, Amy

    2014-01-01

    Text Mining of Web-Based Medical Content examines web mining for extracting useful information that can be used for treating and monitoring the healthcare of patients. This work provides methodological approaches to designing mapping tools that exploit data found in social media postings. Specific linguistic features of medical postings are analyzed vis-a-vis available data extraction tools for culling useful information.

  1. Advances in Text Mining and Visualization for Precision Medicine.

    Science.gov (United States)

    Gonzalez-Hernandez, Graciela; Sarker, Abeed; O'Connor, Karen; Greene, Casey; Liu, Hongfang

    2018-01-01

    According to the National Institutes of Health (NIH), precision medicine is "an emerging approach for disease treatment and prevention that takes into account individual variability in genes, environment, and lifestyle for each person." Although the text mining community has explored this realm for some years, the official endorsement and funding launched in 2015 with the Precision Medicine Initiative are beginning to bear fruit. This session sought to elicit participation of researchers with strong background in text mining and/or visualization who are actively collaborating with bench scientists and clinicians for the deployment of integrative approaches in precision medicine that could impact scientific discovery and advance the vision of precision medicine as a universal, accessible approach at the point of care.

  2. A Network Text Analysis of David Ayer’s Fury

    Directory of Open Access Journals (Sweden)

    Starling David Hunter

    2015-12-01

    Full Text Available Network Text Analysis (NTA involves the creation of networks of words and/or concepts from linguistic data. Its key insight is that the position of words and concepts in a text network provides vital clues to the central and underlying themes of the text as a whole. Recent research has relied on inductive approaches to identify these themes. In this study we demonstrate a deductive approach that we apply to the screenplay of the 2014 World War II-era film Fury. Specifically, we first use genre expectations theory to establish prior expectations as to the key themes associated with war films. We then empirically test whether words and concepts associated with the most influentially-positioned nodes are consistent with themes common to the war-film genre. As predicted, we find that words and concepts associated with the least constrained nodes in the text network were significantly more likely to be associated with the war, action, and biography genres and significantly less likely to be associated with the mystery, science-fiction, fantasy, and film-noir genres. Keywords: content analysis, text analysis, network text analysis, semantic network analysis, film studies, screenplay, screenwriting, war movies, World War II, tanks

  3. Application of LSP texts in translator training

    Directory of Open Access Journals (Sweden)

    Larisa Ilynska

    2017-06-01

    Full Text Available The paper presents discussion of the results of extensive empirical research into efficient methods of educating and training translators of LSP (language for special purposes texts. The methodology is based on using popular LSP texts in the respective fields as one of the main media for translator training. The aim of the paper is to investigate the efficiency of this methodology in developing thematic, linguistic and cultural competences of the students, following Bloom’s revised taxonomy and European Master in Translation Network (EMT translator training competences. The methodology has been tested on the students of a professional Master study programme called Technical Translation implemented by the Institute of Applied Linguistics, Riga Technical University, Latvia. The group of students included representatives of different nationalities, translating from English into Latvian, Russian and French. Analysis of popular LSP texts provides an opportunity to structure student background knowledge and expand it to account for linguistic innovation. Application of popular LSP texts instead of purely technical or scientific texts characterised by neutral style and rigid genre conventions provides an opportunity for student translators to develop advanced text processing and decoding skills, to develop awareness of expressive resources of the source and target languages and to develop understanding of socio-pragmatic language use.

  4. Figure-associated text summarization and evaluation.

    Directory of Open Access Journals (Sweden)

    Balaji Polepalli Ramesh

    Full Text Available Biomedical literature incorporates millions of figures, which are a rich and important knowledge resource for biomedical researchers. Scientists need access to the figures and the knowledge they represent in order to validate research findings and to generate new hypotheses. By themselves, these figures are nearly always incomprehensible to both humans and machines and their associated texts are therefore essential for full comprehension. The associated text of a figure, however, is scattered throughout its full-text article and contains redundant information content. In this paper, we report the continued development and evaluation of several figure summarization systems, the FigSum+ systems, that automatically identify associated texts, remove redundant information, and generate a text summary for every figure in an article. Using a set of 94 annotated figures selected from 19 different journals, we conducted an intrinsic evaluation of FigSum+. We evaluate the performance by precision, recall, F1, and ROUGE scores. The best FigSum+ system is based on an unsupervised method, achieving F1 score of 0.66 and ROUGE-1 score of 0.97. The annotated data is available at figshare.com (http://figshare.com/articles/Figure_Associated_Text_Summarization_and_Evaluation/858903.

  5. LITURGICAL TEXT IN RUSSIAN LITERATURE. PROBLEM STATEMENT

    Directory of Open Access Journals (Sweden)

    Avetis Serezhaevich Seropyan

    2012-11-01

    Full Text Available The article analyses artistic expressions of liturgical language in the literary text and its interaction of the Holy Tradition. Many Russian authors knew the liturgical text well. Studying it reveals the crucial meaning of the Gospel and liturgical texts (as part of the Holy Tradition for Russian literature. Authors saw the essence of every phenomenon in the word for it, and the nature of God in His name. Some ideas and sayings of the authors and their characters find their sources in liturgical texts. The article focuses on liturgical sources of some characters' commemorations and invocations, as well as poetical topics of the symbolists, Dostoevsky's famous dictum on beauty which will save the world (The Idiot, etc. De-cyphering this liturgical code will help us learn and comprehend the hidden endless meaning of a literary text. The specific feature of Russian literature is its pursuit of the spiritual liturgical exploration of the world, an exploration when truth takes shape and thus becomes real in both literary text and history.

  6. DIFFICULTIES AND STRATEGIES IN THE PROCESS OF LEGAL TEXTS TRANSLATION

    Directory of Open Access Journals (Sweden)

    Adela-Elena, DUMITRESCU

    2014-11-01

    Full Text Available This article aims to identify the difficulties and find approaches in translating legal texts which involve a lot of different types of translation problems. The translator has the task to discover proper strategies to render the translated text comprehensible for the reader in the target language simultaneously reflecting the unique character of the legal system from the source language country. Some of the necessary strategies which the translator should take into account are: the borrowing of original terms, the naturalization of specific terms into the target language, the language calques usage, or the introduction of descriptive translation. Even if a translator tries to solve any difficulty when he translates a legal text, he must maintain the source culture characteristics and do not deprive the texts of their specific character.

  7. Du texte au texte: Ou l'on dit ce qu'il faut faire (From Text to Text: Where One Is Told What to Do).

    Science.gov (United States)

    Bertocchini, Paola; Costanzo, Edwige

    1991-01-01

    A variety of authentic materials drawn from explanatory texts or offering instructions for performing some daily or common task are presented, and classroom activities for building foreign language competence based on the materials are outlined. (MSE)

  8. Cohesion and Metaphor Aspects in Andabhuana Text

    Directory of Open Access Journals (Sweden)

    Ida Bagus Mahardika

    2015-02-01

    Full Text Available Cohesion and metaphor are the unique and interesting parts of language aspects in Andhabhuan text to research. They are quite dominant aspects in the story in developing its literature aesthetic. This research is based on the arts technical and analytical method. The result of the research on those two aspects shows that traditional aesthetic style in arts, as described in Andabhuana verses emphasize on the reference, meaning, selection and variation of words. The language parts used are aimed at bringing the text ideology to humanity perspective, especially the ?iwatattwa values as parts of Hindu teaching. Hence the cohesion and metaphor in Andabhuana text  are  semiotic description to transform to Balinese Hindus as most of them follow ?iwatattwa belief.

  9. Punctuation effects in english and esperanto texts

    Science.gov (United States)

    Ausloos, M.

    2010-07-01

    A statistical physics study of punctuation effects on sentence lengths is presented for written texts: Alice in wonderland and Through a looking glass. The translation of the first text into esperanto is also considered as a test for the role of punctuation in defining a style, and for contrasting natural and artificial, but written, languages. Several log-log plots of the sentence-length-rank relationship are presented for the major punctuation marks. Different power laws are observed with characteristic exponents. The exponent can take a value much less than unity ( ca. 0.50 or 0.30) depending on how a sentence is defined. The texts are also mapped into time series based on the word frequencies. The quantitative differences between the original and translated texts are very minutes, at the exponent level. It is argued that sentences seem to be more reliable than word distributions in discussing an author style.

  10. The Relationship between Paraphrasing and Text Analysis

    Directory of Open Access Journals (Sweden)

    María Luisa Cepeda Islas

    2013-04-01

    Full Text Available Given the importance of paraphrasing in the process of comprehension for college students, this study assessed the level of implementation of text analysis and paraphrases the response of a sample of senior students of the career psychology. We selected a group of freshmen to the Psychology course, which was asked to answer a questionnaire and carry out the summary of an empirical article. The results showed that participants have a low level of text analysis, at the same time had low levels of paraphrasing. It was seen that the predominant textual copy. They envision some possibilities for the structure of a training workshop not only paraphrasing but on the analysis of text.

  11. User's Epistle on Text Chat Tool Acquisition

    National Research Council Canada - National Science Library

    Simpson, Jr, Marvin L

    2006-01-01

    .... With a revolutionary technology like text chat, a monopoly of naysayers produce a litany of obstacles that predict inevitable failure and a monopoly of ideologues insist that only the purest implementation can succeed...

  12. Discovery of Recurring Anomalies in Text Reports

    Data.gov (United States)

    National Aeronautics and Space Administration — This paper describes the results of a significant research and development effort conducted at NASA Ames Research Center to develop new text mining algorithms to...

  13. Strategies to Increase Accuracy in Text Classification

    NARCIS (Netherlands)

    D. Blommesteijn (Dennis)

    2014-01-01

    htmlabstractText classification via supervised learning involves various steps from processing raw data, features extraction to training and validating classifiers. Within these steps implementation decisions are critical to the resulting classifier accuracy. This paper contains a report of the

  14. Figures of thought mathematics and mathematical texts

    CERN Document Server

    Reed, David

    2003-01-01

    Examines the ways in which mathematical works can be read as texts, examines their textual strategiesand demonstrates that such readings provide a rich source of philosophical debate regarding mathematics.

  15. Building Fluency through the Phrased Text Lesson

    Science.gov (United States)

    Rasinski, Timothy; Yildirim, Kasim; Nageldinger, James

    2012-01-01

    This Teaching Tip article explores the importance of phrasing while reading. It also presents an instructional intervention strategy for helping students develop greater proficiency in reading with phrases that reflect the meaning of the text.

  16. Text mining for the biocuration workflow

    Science.gov (United States)

    Hirschman, Lynette; Burns, Gully A. P. C; Krallinger, Martin; Arighi, Cecilia; Cohen, K. Bretonnel; Valencia, Alfonso; Wu, Cathy H.; Chatr-Aryamontri, Andrew; Dowell, Karen G.; Huala, Eva; Lourenço, Anália; Nash, Robert; Veuthey, Anne-Lise; Wiegers, Thomas; Winter, Andrew G.

    2012-01-01

    Molecular biology has become heavily dependent on biological knowledge encoded in expert curated biological databases. As the volume of biological literature increases, biocurators need help in keeping up with the literature; (semi-) automated aids for biocuration would seem to be an ideal application for natural language processing and text mining. However, to date, there have been few documented successes for improving biocuration throughput using text mining. Our initial investigations took place for the workshop on ‘Text Mining for the BioCuration Workflow’ at the third International Biocuration Conference (Berlin, 2009). We interviewed biocurators to obtain workflows from eight biological databases. This initial study revealed high-level commonalities, including (i) selection of documents for curation; (ii) indexing of documents with biologically relevant entities (e.g. genes); and (iii) detailed curation of specific relations (e.g. interactions); however, the detailed workflows also showed many variabilities. Following the workshop, we conducted a survey of biocurators. The survey identified biocurator priorities, including the handling of full text indexed with biological entities and support for the identification and prioritization of documents for curation. It also indicated that two-thirds of the biocuration teams had experimented with text mining and almost half were using text mining at that time. Analysis of our interviews and survey provide a set of requirements for the integration of text mining into the biocuration workflow. These can guide the identification of common needs across curated databases and encourage joint experimentation involving biocurators, text mining developers and the larger biomedical research community. PMID:22513129

  17. Text document classification based on mixture models

    Czech Academy of Sciences Publication Activity Database

    Novovičová, Jana; Malík, Antonín

    2004-01-01

    Roč. 40, č. 3 (2004), s. 293-304 ISSN 0023-5954 R&D Projects: GA AV ČR IAA2075302; GA ČR GA102/03/0049; GA AV ČR KSK1019101 Institutional research plan: CEZ:AV0Z1075907 Keywords : text classification * text categorization * multinomial mixture model Subject RIV: BB - Applied Statistics, Operational Research Impact factor: 0.224, year: 2004

  18. Modeling text with generalizable Gaussian mixtures

    DEFF Research Database (Denmark)

    Hansen, Lars Kai; Sigurdsson, Sigurdur; Kolenda, Thomas

    2000-01-01

    We apply and discuss generalizable Gaussian mixture (GGM) models for text mining. The model automatically adapts model complexity for a given text representation. We show that the generalizability of these models depends on the dimensionality of the representation and the sample size. We discuss ...... the relation between supervised and unsupervised learning in the test data. Finally, we implement a novelty detector based on the density model....

  19. Preserved Network Metrics across Translated Texts

    Science.gov (United States)

    Cabatbat, Josephine Jill T.; Monsanto, Jica P.; Tapang, Giovanni A.

    2014-09-01

    Co-occurrence language networks based on Bible translations and the Universal Declaration of Human Rights (UDHR) translations in different languages were constructed and compared with random text networks. Among the considered network metrics, the network size, N, the normalized betweenness centrality (BC), and the average k-nearest neighbors, knn, were found to be the most preserved across translations. Moreover, similar frequency distributions of co-occurring network motifs were observed for translated texts networks.

  20. Text Entry by Gazing and Smiling

    Directory of Open Access Journals (Sweden)

    Outi Tuisku

    2013-01-01

    Full Text Available Face Interface is a wearable prototype that combines the use of voluntary gaze direction and facial activations, for pointing and selecting objects on a computer screen, respectively. The aim was to investigate the functionality of the prototype for entering text. First, three on-screen keyboard layout designs were developed and tested (n=10 to find a layout that would be more suitable for text entry with the prototype than traditional QWERTY layout. The task was to enter one word ten times with each of the layouts by pointing letters with gaze and select them by smiling. Subjective ratings showed that a layout with large keys on the edge and small keys near the center of the keyboard was rated as the most enjoyable, clearest, and most functional. Second, using this layout, the aim of the second experiment (n=12 was to compare entering text with Face Interface to entering text with mouse. The results showed that text entry rate for Face Interface was 20 characters per minute (cpm and 27 cpm for the mouse. For Face Interface, keystrokes per character (KSPC value was 1.1 and minimum string distance (MSD error rate was 0.12. These values compare especially well with other similar techniques.

  1. Monolingual accounting dictionaries for EFL text production

    Directory of Open Access Journals (Sweden)

    Sandro Nielsen

    2006-10-01

    Full Text Available Monolingual accounting dictionaries are important for producing financial reporting texts in English in an international setting, because of the lack of specialised bilingual dictionaries. As the intended user groups have different factual and linguistic competences, they require specific types of information. By identifying and analysing the users' factual and linguistic competences, user needs, use-situations and the stages involved in producing accounting texts in English as a foreign language, lexicographers will have a sound basis for designing the optimal English accounting dictionary for EFL text production. The monolingual accounting dictionary needs to include information about UK, US and international accounting terms, their grammatical properties, their potential for being combined with other words in collocations, phrases and sentences in order to meet user requirements. Data items that deal with these aspects are necessary for the international user group as they produce subject-field specific and register-specific texts in a foreign language, and the data items are relevant for the various stages in text production: draft writing, copyediting, stylistic editing and proofreading.

  2. Basic philosophical texts in Medieval Serbia

    Directory of Open Access Journals (Sweden)

    Milosavljević Boris

    2008-01-01

    Full Text Available Medieval Serbian philosophy took shape mostly through the process of translating Byzantine texts and revising the Slavic translations. Apart from the Aristotelian terminological tradition, introduced via the translation of Damascene’s Dialectic, there also was, under the influence of the Corpus Areopagiticum and ascetic literature, notably of John Climacus’ Ladder, another strain of thought originating from Christian Platonism. Damascene’s philosophical chapters, or Dialectic, translated into medieval Serbian in the third quarter of the fourteenth century, not only shows the high standards of translation technique developed in Serbian monastic scriptoria, but testifies to a highly educated readership interested in such a complex theologico-philosophical text with its nuanced terminology. A new theological debate about the impossibility of knowing God led to Gregory Palamas’ complex text, The Exposition of the Orthodox Faith. Philosophical texts were frequently copied and much worked on in medieval Serbia, but it is difficult to infer about the actual scope of their influence on the formation and articulation of the worldview of medieval society. As a result of their demanding theoretical complexity, the study of philosophy was restricted to quite narrow monastic, court and urban circles. However, the strongest aspect of the influence of Byzantine thought on medieval society was the liturgy as the central social event of the community. It was through the liturgy that the wording of the translated texts influenced the life of medieval Serbian society.

  3. Inspiration and the Texts of the Bible

    Directory of Open Access Journals (Sweden)

    Dirk Buchner

    1997-12-01

    Full Text Available This article seeks to explore what the inspired text of the Old Testament was as it existed for the New Testament authors, particularly for the author of the book of Hebrews. A quick look at the facts makes. it clear that there was, at the time, more than one 'inspired' text, among these were the Septuagint and the Masoretic Text 'to name but two'. The latter eventually gained ascendancy which is why it forms the basis of our translated Old Testament today. Yet we have to ask: what do we make of that other text that was the inspired Bible to the early Church, especially to the writer of the book of Hebrews, who ignored the Masoretic text? This article will take a brief look at some suggestions for a doctrine of inspiration that keeps up with the facts of Scripture. Allied to this, the article is something of a bibliographical study of recent developments in textual research following the discovery of the Dead Sea scrolls.

  4. Figure-associated text summarization and evaluation.

    Science.gov (United States)

    Polepalli Ramesh, Balaji; Sethi, Ricky J; Yu, Hong

    2015-01-01

    Biomedical literature incorporates millions of figures, which are a rich and important knowledge resource for biomedical researchers. Scientists need access to the figures and the knowledge they represent in order to validate research findings and to generate new hypotheses. By themselves, these figures are nearly always incomprehensible to both humans and machines and their associated texts are therefore essential for full comprehension. The associated text of a figure, however, is scattered throughout its full-text article and contains redundant information content. In this paper, we report the continued development and evaluation of several figure summarization systems, the FigSum+ systems, that automatically identify associated texts, remove redundant information, and generate a text summary for every figure in an article. Using a set of 94 annotated figures selected from 19 different journals, we conducted an intrinsic evaluation of FigSum+. We evaluate the performance by precision, recall, F1, and ROUGE scores. The best FigSum+ system is based on an unsupervised method, achieving F1 score of 0.66 and ROUGE-1 score of 0.97. The annotated data is available at figshare.com (http://figshare.com/articles/Figure_Associated_Text_Summarization_and_Evaluation/858903).

  5. Ergonomic recommendations when texting on mobile phones.

    Science.gov (United States)

    Gustafsson, Ewa

    2012-01-01

    The aim of this report was to give ergonomic recommendations in order to prevent musculoskeletal symptoms/disorders among young people due to intensive texting on mobile phones. In a study of 56 Swedish young adults (19-25 years, 41 with musculoskeletal symptoms in neck and/or upper extremities and 15 without symptoms) registration of thumb movements with electrogoniometry, muscle activity with electromyography and observation of texting technique were conducted during texting on mobile phones. The results showed differences in physical load between the group with musculoskeletal symptoms and the group without symptoms. There were also found differences in muscle activity and kinematics between different texting techniques. These differences could not be explained by the asymptomatic group having symptoms but may be a possible contribution to their symptoms. According to these results it can be recommended to support the forearms, to use both thumbs, to avoid sitting with the head bent forward and to avoid texting with high velocity in order to prevent musculoskeletal disorders when using mobile phones for texting.

  6. Text mining resources for the life sciences.

    Science.gov (United States)

    Przybyła, Piotr; Shardlow, Matthew; Aubin, Sophie; Bossy, Robert; Eckart de Castilho, Richard; Piperidis, Stelios; McNaught, John; Ananiadou, Sophia

    2016-01-01

    Text mining is a powerful technology for quickly distilling key information from vast quantities of biomedical literature. However, to harness this power the researcher must be well versed in the availability, suitability, adaptability, interoperability and comparative accuracy of current text mining resources. In this survey, we give an overview of the text mining resources that exist in the life sciences to help researchers, especially those employed in biocuration, to engage with text mining in their own work. We categorize the various resources under three sections: Content Discovery looks at where and how to find biomedical publications for text mining; Knowledge Encoding describes the formats used to represent the different levels of information associated with content that enable text mining, including those formats used to carry such information between processes; Tools and Services gives an overview of workflow management systems that can be used to rapidly configure and compare domain- and task-specific processes, via access to a wide range of pre-built tools. We also provide links to relevant repositories in each section to enable the reader to find resources relevant to their own area of interest. Throughout this work we give a special focus to resources that are interoperable-those that have the crucial ability to share information, enabling smooth integration and reusability. © The Author(s) 2016. Published by Oxford University Press.

  7. Text mining resources for the life sciences

    Science.gov (United States)

    Shardlow, Matthew; Aubin, Sophie; Bossy, Robert; Eckart de Castilho, Richard; Piperidis, Stelios; McNaught, John; Ananiadou, Sophia

    2016-01-01

    Text mining is a powerful technology for quickly distilling key information from vast quantities of biomedical literature. However, to harness this power the researcher must be well versed in the availability, suitability, adaptability, interoperability and comparative accuracy of current text mining resources. In this survey, we give an overview of the text mining resources that exist in the life sciences to help researchers, especially those employed in biocuration, to engage with text mining in their own work. We categorize the various resources under three sections: Content Discovery looks at where and how to find biomedical publications for text mining; Knowledge Encoding describes the formats used to represent the different levels of information associated with content that enable text mining, including those formats used to carry such information between processes; Tools and Services gives an overview of workflow management systems that can be used to rapidly configure and compare domain- and task-specific processes, via access to a wide range of pre-built tools. We also provide links to relevant repositories in each section to enable the reader to find resources relevant to their own area of interest. Throughout this work we give a special focus to resources that are interoperable—those that have the crucial ability to share information, enabling smooth integration and reusability. PMID:27888231

  8. De l'analyse de contextes a la pedagogie des textes (From Analysis of Context to Pedagogy of Texts)

    Science.gov (United States)

    Lehmann, Denis

    1976-01-01

    A detailed description of the "Dictionnaire contextual de francais pour la geologie." This study is used as an example of such a dictionary serving as a basic tool in the study of functional French. Such dictionaries can be useful in conceptual approaches to language learning. (Text is in French.) (AMH)

  9. Learners misperceive benefits of redundant text in multimedia learning

    Directory of Open Access Journals (Sweden)

    Barbara eFenesi

    2014-07-01

    Full Text Available Research on metacognition has consistently demonstrated that learners fail to endorse instructional designs that produce benefits to memory, and often prefer designs that actually impair comprehension. Unlike previous studies in which learners were only exposed to a single multimedia design, the current study used a within–subjects approach to examine whether exposure to both redundant text and non-redundant text multimedia presentations improved learners’ metacognitive judgments about presentation styles that promote better understanding. A redundant text multimedia presentation containing narration paired with verbatim on–screen text (Redundant was contrasted with two non-redundant text multimedia presentations: (1 narration paired with images and minimal text (Complementary or (2 narration paired with minimal text (Sparse. Learners watched presentation pairs of either Redundant + Complementary, or Redundant + Sparse. Results demonstrate that Complementary and Sparse presentations produced highest overall performance on the final comprehension assessment, but the Redundant presentation produced highest perceived understanding and engagement ratings. These findings suggest that learners misperceive the benefits of redundant text, even after direct exposure to a non-redundant, effective presentation.

  10. Assessing semantic similarity of texts - Methods and algorithms

    Science.gov (United States)

    Rozeva, Anna; Zerkova, Silvia

    2017-12-01

    Assessing the semantic similarity of texts is an important part of different text-related applications like educational systems, information retrieval, text summarization, etc. This task is performed by sophisticated analysis, which implements text-mining techniques. Text mining involves several pre-processing steps, which provide for obtaining structured representative model of the documents in a corpus by means of extracting and selecting the features, characterizing their content. Generally the model is vector-based and enables further analysis with knowledge discovery approaches. Algorithms and measures are used for assessing texts at syntactical and semantic level. An important text-mining method and similarity measure is latent semantic analysis (LSA). It provides for reducing the dimensionality of the document vector space and better capturing the text semantics. The mathematical background of LSA for deriving the meaning of the words in a given text by exploring their co-occurrence is examined. The algorithm for obtaining the vector representation of words and their corresponding latent concepts in a reduced multidimensional space as well as similarity calculation are presented.

  11. Automatic Amharic text news classification: Aneural networks ...

    African Journals Online (AJOL)

    The study is on classification of Amharic news automatically using neural networks approach. Learning Vector Quantization (LVQ) algorithm is employed to classify new instance of Amharic news based on classifier developed using training dataset. Two weighting schemes, Term Frequency (TF) and Term Frequency by ...

  12. 重新發現教科「書」的歷程:從物質文化看教科書的潛在課程 The Trajectory of Rediscovering the Text-BOOK: Approaching the Hidden Curriculum of the Textbook from a Material Culturist Perspective

    Directory of Open Access Journals (Sweden)

    彭秉權 Ping-Chuan Peng

    2018-03-01

    Full Text Available 2005年筆者在北部某大學講授「潛在課程」,一份期末報告敘說了學生在成長過程中對教科書的愛恨情仇,這些大家熟悉的情感與經驗揭露了既有教科書研究方法的不足,促使筆者重新思考這本「書」的存在,不只是個傳統科技的文字載具,也是青少年日常生活裡不可或缺的物件。之後10年,筆者嘗試從物質文化的角度重新檢視這本書對學習與社會化的影響。本文以倒敘的方式先分享筆者尋找物質之理論意涵的歷程,放眼教育社會學的批判傳統,從古典理論,到繼起的文化研究、後現代、後結構,乃至近期的後人文思想,儘管處理物質的方式殊異,但皆無損其重要性。之後再引用部分理論來敘說、反芻當年的情事,完成延宕多年的回應。本文希望能為教育研究者與工作者開啟物質文化取向在教科書、潛在課程與青少年次文化,乃至學習、教育科技、政策及課程與教學等領域的應用。 The author has taught a course on hidden curriculum at a university in the northern Taiwan in 2005. A term project on students’ normal but forgotten affection to textbooks unwittingly revealed the limit of the critical approaches in textbook studies. New theories were desperately needed. The author, therefore, has begun reconsidering the existence of the “book” as more than a vehicle of words made by outdated printing technology, but also an everyday necessity for students’ social practice and learning. After years of searching, the author was convinced that material cultural studies are helpful in exploring the effect of the book for researchers and educators interested in studying textbook, hidden curriculum, and youth cultures, and issues of learning, educational technologies and policy, as well as curriculum and pedagogy. This article is a flashback. It begins with the author’s exploration of a long lost

  13. [Formula: see text] excited states within a [Formula: see text] HQSS model.

    Science.gov (United States)

    Nieves, J; Pavao, R; Tolos, L

    2018-01-01

    We have reviewed the renormalization procedure used in the unitarized coupled-channel model of Romanets et al. (Phys Rev D 85:114032, 2012), and its impact in the [Formula: see text], [Formula: see text], and [Formula: see text] sector, where five [Formula: see text] states have been recently observed by the LHCb Collaboration. The meson-baryon interactions used in the model are consistent with both chiral and heavy-quark spin symmetries, and lead to a successful description of the observed lowest-lying odd parity resonances [Formula: see text] and [Formula: see text], and [Formula: see text] and [Formula: see text] resonances. We show that some (probably at least three) of the states observed by LHCb will also have odd parity and [Formula: see text] or [Formula: see text], belonging two of them to the same [Formula: see text] HQSS multiplets as the latter charmed and beauty [Formula: see text] baryons.

  14. Letter detection in very familiar texts.

    Science.gov (United States)

    Greenberg, S N; Tai, J

    2001-12-01

    In the present study, we investigated whether patterns of letter detection for function and content words in texts are affected by the familiarity of the material being read. In Experiment 1, subjects searched for target letters in sentences that had been rehearsed prior to performing the letter detection on them as well as on unfamiliar sentences. In Experiment 2, subjects searched for target letters in highly familiar verses (e.g., nursery rhymes) and in unfamiliar sentences that were matched to the familiar verses. A disadvantage in letter detection for function as compared with content words consistently found with unfamiliar passages was reduced significantly with the familiar material in both experiments. Specifically, letter detection for content words grew worse in familiar text, but letter detection for function words showed a contrasting modest, though nonsignificant, improvement. The results are consistent with the proposition that in very familiar texts, parafoveal analysis permits the identification of generally less familiar content words. Simultaneously, the normal pattern of weighing the structure and content elements of text changes so that more fixations on function words occur than when one is reading unfamiliar texts.

  15. Benchmarking infrastructure for mutation text mining.

    Science.gov (United States)

    Klein, Artjom; Riazanov, Alexandre; Hindle, Matthew M; Baker, Christopher Jo

    2014-02-25

    Experimental research on the automatic extraction of information about mutations from texts is greatly hindered by the lack of consensus evaluation infrastructure for the testing and benchmarking of mutation text mining systems. We propose a community-oriented annotation and benchmarking infrastructure to support development, testing, benchmarking, and comparison of mutation text mining systems. The design is based on semantic standards, where RDF is used to represent annotations, an OWL ontology provides an extensible schema for the data and SPARQL is used to compute various performance metrics, so that in many cases no programming is needed to analyze results from a text mining system. While large benchmark corpora for biological entity and relation extraction are focused mostly on genes, proteins, diseases, and species, our benchmarking infrastructure fills the gap for mutation information. The core infrastructure comprises (1) an ontology for modelling annotations, (2) SPARQL queries for computing performance metrics, and (3) a sizeable collection of manually curated documents, that can support mutation grounding and mutation impact extraction experiments. We have developed the principal infrastructure for the benchmarking of mutation text mining tasks. The use of RDF and OWL as the representation for corpora ensures extensibility. The infrastructure is suitable for out-of-the-box use in several important scenarios and is ready, in its current state, for initial community adoption.

  16. La dimension diachronique des textes beckettiens

    Directory of Open Access Journals (Sweden)

    Carla Taban

    2007-07-01

    Full Text Available La présente discussion se propose de montrer que les aspects diachroniques du français et de l’anglais – entendues restrictivement comme évolutions sémantiques des lexèmes des deux idiomes et non pas comme évolutions syntaxiques ou phonétiques de ceux-ci – opèrent dans les textes de Beckett en tant que modalités po(ïétiques de différenciation de sens. Autrement dit, la manière dont les unités lexicales sont inscrites dans leurs environnements intra-textuel (d’un texte donné et intra-inter-textuel (d’une paire bilingue de textes correspondants permet, voire requiert de les actualiser simultanément avec plusieurs significations, dont certaines sont originaires ou historiques. La dimension diachronique dans les deux langues offre ainsi à Beckett un outil d’accroissement du potentiel signifiant de ses textes.

  17. Benchmarking infrastructure for mutation text mining

    Science.gov (United States)

    2014-01-01

    Background Experimental research on the automatic extraction of information about mutations from texts is greatly hindered by the lack of consensus evaluation infrastructure for the testing and benchmarking of mutation text mining systems. Results We propose a community-oriented annotation and benchmarking infrastructure to support development, testing, benchmarking, and comparison of mutation text mining systems. The design is based on semantic standards, where RDF is used to represent annotations, an OWL ontology provides an extensible schema for the data and SPARQL is used to compute various performance metrics, so that in many cases no programming is needed to analyze results from a text mining system. While large benchmark corpora for biological entity and relation extraction are focused mostly on genes, proteins, diseases, and species, our benchmarking infrastructure fills the gap for mutation information. The core infrastructure comprises (1) an ontology for modelling annotations, (2) SPARQL queries for computing performance metrics, and (3) a sizeable collection of manually curated documents, that can support mutation grounding and mutation impact extraction experiments. Conclusion We have developed the principal infrastructure for the benchmarking of mutation text mining tasks. The use of RDF and OWL as the representation for corpora ensures extensibility. The infrastructure is suitable for out-of-the-box use in several important scenarios and is ready, in its current state, for initial community adoption. PMID:24568600

  18. Database citation in full text biomedical articles.

    Directory of Open Access Journals (Sweden)

    Şenay Kafkas

    Full Text Available Molecular biology and literature databases represent essential infrastructure for life science research. Effective integration of these data resources requires that there are structured cross-references at the level of individual articles and biological records. Here, we describe the current patterns of how database entries are cited in research articles, based on analysis of the full text Open Access articles available from Europe PMC. Focusing on citation of entries in the European Nucleotide Archive (ENA, UniProt and Protein Data Bank, Europe (PDBe, we demonstrate that text mining doubles the number of structured annotations of database record citations supplied in journal articles by publishers. Many thousands of new literature-database relationships are found by text mining, since these relationships are also not present in the set of articles cited by database records. We recommend that structured annotation of database records in articles is extended to other databases, such as ArrayExpress and Pfam, entries from which are also cited widely in the literature. The very high precision and high-throughput of this text-mining pipeline makes this activity possible both accurately and at low cost, which will allow the development of new integrated data services.

  19. Managing Legal Texts in Requirements Engineering

    Science.gov (United States)

    Otto, Paul N.; Antón, Annie I.

    Laws and regulations are playing an increasingly important role in requirements engineering and systems development. Monitoring systems for requirements and policy compliance has been recognized in the requirements engineering community as a key area for research. Similarly, legal compliance is critical in systems development, especially given that non-compliance can result in both financial and criminal penalties. Working with legal texts can be very challenging, however, because they contain numerous ambiguities, cross-references, domain-specific definitions, and acronyms, and are frequently amended via new statutes, regulations, and case law. Requirements engineers and compliance auditors must be able to identify relevant legal texts, extract requirements and other key concepts, and monitor compliance. This chapter surveys research efforts over the past 50 years in handling legal texts for systems development. This survey can aid requirements engineers and auditors to better specify, test, and monitor systems for compliance.

  20. WYLBUR reference manual. [For interactive text editing

    Energy Technology Data Exchange (ETDEWEB)

    Krupp, R.F.; Messina, P.C.; Peavler, J.M.; Schustack, S.; Starai, T.

    1977-04-01

    WYLBUR is a system for manipulating various kinds of text, such as computer programs, manuscripts, letters, forms, articles, or reports. Its on-line interactive text-editing capabilities allow the user to create, change, and correct text, and to search and display it. WYLBUR also has facilities for job submission and retrieval from remote terminals that make it possible for a user to inquire about the status of any job in the system, cancel jobs that are executing or awaiting execution, reroute output, raise job priority, or get information on the backlog of batch jobs. WYLBUR also has excellent recovery capabilities and a fast response time. This manual describes the WYLBUR version currently used at ANL. It is intended primarily as a reference manual; thus, examples of WYLBUR commands are kept to a minimum. (RWR)

  1. Review network for scene text recognition

    Science.gov (United States)

    Li, Shuohao; Han, Anqi; Chen, Xu; Yin, Xiaoqing; Zhang, Jun

    2017-09-01

    Recognizing text in images captured in the wild is a fundamental preprocessing task for many computer vision and machine learning applications and has gained significant attention in recent years. This paper proposes an end-to-end trainable deep review neural network for scene text recognition, which is a combination of feature extraction, feature reviewing, feature attention, and sequence recognition. Our model can generate the predicted text without any segmentation or grouping algorithm. Because the attention model in the feature attention stage lacks global modeling ability, a review network is applied to extract the global context of sequence data in the feature reviewing stage. We perform rigorous experiments across a number of standard benchmarks, including IIIT5K, SVT, ICDAR03, and ICDAR13 datasets. Experimental results show that our model is comparable to or outperforms state-of-the-art techniques.

  2. Monolingual accounting dictionaries for EFL text production

    DEFF Research Database (Denmark)

    Nielsen, Sandro

    2006-01-01

    Monolingual accounting dictionaries are important for producing financial reporting texts in English in an international setting, because of the lack of specialised bilingual dictionaries. As the intended user groups have different factual and linguistic competences, they require specific types...... of information. By identifying and analysing the users' factual and linguistic competences, user needs, use-situations and the stages involved in producing accounting texts in English as a foreign language, lexicographers will have a sound basis for designing the optimal English accounting dictionary for EFL...... text production. The monolingual accounting dictionary needs to include information about UK, US and international accounting terms, their grammatical properties, their potential for being combined with other words in collocations, phrases and sentences in order to meet user requirements. Data items...

  3. Monolingual Accounting Dictionaries for EFL Text Production

    DEFF Research Database (Denmark)

    Nielsen, Sandro

    2009-01-01

    Monolingual accounting dictionaries are important for producing financial reporting texts in English in an international setting, because of the lack of specialised bilingual dictionaries. As the intended user groups have different factual and linguistic competences, they require specific types...... of information. By identifying and analysing the users' factual and linguistic competences, user needs, use-situations and the stages involved in producing accounting texts in English as a foreign language, lexicographers will have a sound basis for designing the optimal English accounting dictionary for EFL...... text production. The monolingual accounting dictionary needs to include information about UK, US and international accounting terms, their grammatical properties, their potential for being combined with other words in collocations, phrases and sentences in order to meet user requirements. Data items...

  4. Multilingual access to full text databases

    International Nuclear Information System (INIS)

    Fluhr, C.; Radwan, K.

    1990-05-01

    Many full text databases are available in only one language, or more, they may contain documents in different languages. Even if the user is able to understand the language of the documents in the database, it could be easier for him to express his need in his own language. For the case of databases containing documents in different languages, it is more simple to formulate the query in one language only and to retrieve documents in different languages. This paper present the developments and the first experiments of multilingual search, applied to french-english pair, for text data in nuclear field, based on the system SPIRIT. After reminding the general problems of full text databases search by queries formulated in natural language, we present the methods used to reformulate the queries and show how they can be expanded for multilingual search. The first results on data in nuclear field are presented (AFCEN norms and INIS abstracts). 4 refs

  5. Text mining patents for biomedical knowledge.

    Science.gov (United States)

    Rodriguez-Esteban, Raul; Bundschus, Markus

    2016-06-01

    Biomedical text mining of scientific knowledge bases, such as Medline, has received much attention in recent years. Given that text mining is able to automatically extract biomedical facts that revolve around entities such as genes, proteins, and drugs, from unstructured text sources, it is seen as a major enabler to foster biomedical research and drug discovery. In contrast to the biomedical literature, research into the mining of biomedical patents has not reached the same level of maturity. Here, we review existing work and highlight the associated technical challenges that emerge from automatically extracting facts from patents. We conclude by outlining potential future directions in this domain that could help drive biomedical research and drug discovery. Copyright © 2016 Elsevier Ltd. All rights reserved.

  6. The Interplay of Text, Meaning and Practice

    DEFF Research Database (Denmark)

    Kärreman, Dan; Levay, Charlotta

    2017-01-01

    Context: The study of discourses (i.e. verbal interactions or written accounts) is increasingly used in social sciences to gain insight into issues connected to discourse, such as meanings, behaviours and actions. This paper situates discourse analysis in medical education, based on a framework...... of the links between text, practices and meaning. Conclusions: Discourse analysis provides a more strongly supported argument when it is possible to defend claims on three levels: practice, using observational data; meaning, using ethnographic data, and text, using conversational and textual data....

  7. Lidový text a grafika

    OpenAIRE

    Lukš, Jiří

    2015-01-01

    The dissertation "The Folk Text and Graphic Art" studies a song as a topic for graphic and book production. Within the praktical part of the dissertation the author works up a graphic design of a original song-book, which represent his former music band's texts. He surveys the clash of today's fashionable music trends with folk traditions in his region and asks a question about the character of the contemporary folk song. The author's song-book is one of answers. On the base of this effort he...

  8. Quantum mechanics a comprehensive text for chemistry

    CERN Document Server

    Arora, Kishor

    2010-01-01

    This book contains 14 chapters. The text includes the inadequacy of classical mechanics and covers basic and fundamental concepts of quantum mechanics including concepts of transitional, vibration rotation and electronic energies, introduction to concepts of angular momenta, approximatemethods and their application concepts related to electron spin, symmetery concepts and quantum mechanics and ultimately the book features the theories of chemical bonding and use of softwares in quantum mechanics. the text of the book is presented in a lucid manner with ample examples and illustrations wherever

  9. Radioprotection and radiotherapy: new regulatory texts

    International Nuclear Information System (INIS)

    Cosset, J.M.

    1998-01-01

    This article reviews about radiation protection of the workers in the radiotherapy centers. The different texts are explained. These texts (international and european ones) have to aim to reinforce the protection of personnel working in radiotherapy services, to reduce as it is possible the determinists an stochastic effects to organs out of the irradiated volumes, to avoid severe accidents. The radiotherapists have to keep in their mind that treatments must be justified in a clear way and optimized as reasonably achievable. (N.C.)

  10. Learners misperceive the benefits of redundant text in multimedia learning.

    Science.gov (United States)

    Fenesi, Barbara; Kim, Joseph A

    2014-01-01

    Research on metacognition has consistently demonstrated that learners fail to endorse instructional designs that produce benefits to memory, and often prefer designs that actually impair comprehension. Unlike previous studies in which learners were only exposed to a single multimedia design, the current study used a within-subjects approach to examine whether exposure to both redundant text and non-redundant text multimedia presentations improved learners' metacognitive judgments about presentation styles that promote better understanding. A redundant text multimedia presentation containing narration paired with verbatim on-screen text (Redundant) was contrasted with two non-redundant text multimedia presentations: (1) narration paired with images and minimal text (Complementary) or (2) narration paired with minimal text (Sparse). Learners watched presentation pairs of either Redundant + Complementary, or Redundant + Sparse. Results demonstrate that Complementary and Sparse presentations produced highest overall performance on the final comprehension assessment, but the Redundant presentation produced highest perceived understanding and engagement ratings. These findings suggest that learners misperceive the benefits of redundant text, even after direct exposure to a non-redundant, effective presentation.

  11. Selecting Full-Text Undergraduate Periodicals Databases.

    Science.gov (United States)

    Still, Julie M.; Kassabian, Vibiana

    1999-01-01

    Examines how libraries and librarians can compare full-text general periodical indices, using ProQuest Direct, Periodical Abstracts (via Ovid), and EBSCOhost as examples. Explores breadth and depth of coverage; manipulation of results (email/download/print); ease of use (searching); and indexing quirks. (AEF)

  12. Modeling statistical properties of written text.

    Directory of Open Access Journals (Sweden)

    M Angeles Serrano

    Full Text Available Written text is one of the fundamental manifestations of human language, and the study of its universal regularities can give clues about how our brains process information and how we, as a society, organize and share it. Among these regularities, only Zipf's law has been explored in depth. Other basic properties, such as the existence of bursts of rare words in specific documents, have only been studied independently of each other and mainly by descriptive models. As a consequence, there is a lack of understanding of linguistic processes as complex emergent phenomena. Beyond Zipf's law for word frequencies, here we focus on burstiness, Heaps' law describing the sublinear growth of vocabulary size with the length of a document, and the topicality of document collections, which encode correlations within and across documents absent in random null models. We introduce and validate a generative model that explains the simultaneous emergence of all these patterns from simple rules. As a result, we find a connection between the bursty nature of rare words and the topical organization of texts and identify dynamic word ranking and memory across documents as key mechanisms explaining the non trivial organization of written text. Our research can have broad implications and practical applications in computer science, cognitive science and linguistics.

  13. Mofolo's Chaka revisited via the original text

    African Journals Online (AJOL)

    oi, oi! … you must go by the right path': Mofolo's Chaka revisited via the original text. Thomas Mofolo never defended himself against accusations that his novel Chaka distorts historical facts to express anti-Nguni sentiments under the guise of Christianity. But in a way he foreshadowed the possibility of it, by including as part ...

  14. "The Politics of Location": Text as Opposition.

    Science.gov (United States)

    Moreno, Renee

    Eduardo Galeano's "Memory of Fire: Genesis" raises a number of questions concerning the "politics of location," a term that may be defined as the intersections, tensions, and complications that people of color bring to space and what space means in terms of hierarchies and power, racial and gender stratifications. Text can also…

  15. The Impact of Texting on Comprehension

    Directory of Open Access Journals (Sweden)

    Jamal K. M. Ali

    2015-07-01

    Full Text Available This paper presents a study of the effects of texting on English language comprehension. The authors believe that English used in texting causes a lack of comprehension for English speakers, learners, and texters. Wei, Xian-hai and Jiang (2008:3 declare “In Netspeak, there are some newly-created vocabularies, which people cannot comprehend them either from their partial pronunciation or from their figures.” Crystal (2007:23 claims; “variation causes problems of comprehension and acceptability. If you speak or write differently from the way I do, we may fail to understand each other.”  In this paper, the authors conducted a questionnaire at Aligarh Muslim University to ninety respondents from five different Faculties and four different levels. To measure respondents’ comprehension of English texting, the authors gave the respondents abbreviations used by texters and asked them to write the full forms of the abbreviations. The authors found that many abbreviations were not understood, which suggested that most of the respondents did not understand and did not use these abbreviations.

  16. Handwriting segmentation of unconstrained Oriya text

    Indian Academy of Sciences (India)

    Based on vertical projection profiles and structural features of Oriya characters, text lines are segmented into words. For character segmentation, at first, the isolated and connected (touching) characters in a word are detected. Using structural, topological and water reservoir concept-based features, characters of the word ...

  17. Prayer in Qumran texts. A brief introduction

    Directory of Open Access Journals (Sweden)

    Zdzisław J. Kapera

    2011-03-01

    Full Text Available Of some three hundred literary texts found in the caves of the Judaean Desert and those close to Khirbet Qumran, 56 are various pieces of poetry and liturgy. Seven specific groups have been distinguished among them: 1. Liturgy on sunshine and sunset and on specific days; 2. Liturgy on specific ceremonies of the community; 3. Eschatological prayers; 4. Magic texts; 5. Collections of psalms (including pseudepigrapha; 6. Thanksgiving hymns; 7. Prose prayers. The issue of how the Qumranians were praying is here briefly touched upon. Then there is a description of morning and evening prayers, Sabbath prayers, specific liturgy of the annual ceremony of entering the New Covenant, the Hodayot (Thanksgiving Hymns, pseudepigraphic Psalms (like Ps 151, and the eschatological prayers. The introduction ends with a summary evaluation of the role of the texts in reconstructing the historical development of the Jewish prayer of the late Second Temple period. The need to study the relationship of the Qumran prayers with the early Christian prayers is also briefly discussed.

  18. Sleep Habits and Nighttime Texting among Adolescents

    Science.gov (United States)

    Garmy, Pernilla; Ward, Teresa M.

    2018-01-01

    The aim of this study was to examine sleep habits (i.e., bedtimes and rising times) and their association with nighttime text messaging in 15- to 17-year-old adolescents. This cross-sectional study analyzed data from a web-based survey of adolescent students attending secondary schools in southern Sweden (N = 278, 50% female). Less than 8 hr of…

  19. Full Text Journal Subscriptions: An Evolutionary Process.

    Science.gov (United States)

    Luther, Judy

    1997-01-01

    Provides an overview of companies offering Web accessible subscriptions to full text electronic versions of scientific, technical, and medical journals (Academic Press, Blackwell, EBSCO, Elsevier, Highwire Press, Information Quest, Institute of Physics, Johns Hopkins University Press, OCLC, OVID, Springer, and SWETS). Also lists guidelines for…

  20. Exploring Academic Voice in Multimodal Quantitative Texts

    Directory of Open Access Journals (Sweden)

    Robert Prince

    2014-10-01

    Full Text Available Research on students’ academic literacies practices has tended to focus on the written mode in order to understand the academic conventions necessary to access Higher Education. However, the representation of quantitative information can be a challenge to many students. Quantitative information can be represented through a range of modes (such as writing, visuals and numbers and different information graphics (such as tables, charts, graphs. This paper focuses on the semiotic aspects of graphic representation in academic work, using student and published data from the Health Science, and an information graphic from the social domain as a counterpoint to explore aspects about agency and choice in academic voice in multimodal texts. It explores voice in terms of three aspects which work across modes, namely authorial engagement, citation and modality. The work of different modes and their inter-relations in quantitative texts is established, as is the use of sources in citation. We also look at the ways in which credibility and validity are established through modality. This exploration reveals that there is a complex interplay of modes in the construction of academic voice, which are largely tacit. This has implications for the way we think about and teach writing and text-making in quantitative disciplines in Higher Education.

  1. Examining Response Confidence in Multiple Text Tasks

    Science.gov (United States)

    List, Alexandra; Alexander, Patricia A.

    2015-01-01

    Students' confidence in their responses to a multiple text-processing task and their justifications for those confidence ratings were investigated. Specifically, 215 undergraduates responded to two academic questions, differing by type (i.e., discrete and open-ended) and by domain (i.e., developmental psychology and astrophysics), using a digital…

  2. AUTHENTIC TEXTS FOR CRITICAL READING ACTIVITIES

    Directory of Open Access Journals (Sweden)

    Ila Amalia

    2016-03-01

    Full Text Available This research takes an action research aimed at promoting critical reading (“thinking” while reading skills using authentic materials among the students. This research also aims to reveal the students perception on using critical reading skills in reading activities. Nineteen English Education Department students who took Reading IV class, participated in this project. There were three cycles with three different critical reading strategies were applied. Meanwhile, the authentic materials were taken from newspaper and internet articles. The result revealed that the use of critical reading strategies along with the use of authentic materials has improved students’ critical reading skills as seen from the improvement of each cycle - the students critical reading skill was 54% (fair in the cycle 1 improved to 68% (average in cycle 2, and 82% (good in cycle 3.. In addition, based on the critical reading skill criteria, the students’ critical reading skill has improved from 40% (nearly meet to 80% (exceed. Meanwhile, from the students’ perception questionnaire, it was shown that 63% students agreed the critical reading activity using authentic text could improve critical thinking and 58% students agreed that doing critical reading activity could improve reading comprehension. The result had the implication that the use of authentic texts could improve students’ critical reading skills if it was taught by performing not lecturing them. Selectively choosing various strategies and materials can trigger students’ activeness in responding to a text, that eventually shape their critical reading skills.

  3. Task-Driven Dynamic Text Summarization

    Science.gov (United States)

    Workman, Terri Elizabeth

    2011-01-01

    The objective of this work is to examine the efficacy of natural language processing (NLP) in summarizing bibliographic text for multiple purposes. Researchers have noted the accelerating growth of bibliographic databases. Information seekers using traditional information retrieval techniques when searching large bibliographic databases are often…

  4. Acts of Reading: Teachers, Text and Childhood

    Science.gov (United States)

    Styles, Morag, Ed.; Arizpe, Evelyn, Ed.

    2009-01-01

    "Acts of Reading" is an enchanting and scholarly review of the history of reading and texts for children, from the 18th century to the digital age and beyond. They are examined through the eyes of their various audiences: the children, writers, teachers and parents, so as to explore the act of reading itself, whether oral, silent or performative,…

  5. Project Physics Text 4, Light and Electromagnetism.

    Science.gov (United States)

    Harvard Univ., Cambridge, MA. Harvard Project Physics.

    Optical and electromagnetic fundamentals are presented in this fourth unit of the Project Physics text for use by senior high students. Development of the wave theory in the first half of the 19th Century is described to deal with optical problems at the early stage. Following explanations of electric charges and forces, field concepts are…

  6. Database citation in full text biomedical articles.

    Science.gov (United States)

    Kafkas, Şenay; Kim, Jee-Hyub; McEntyre, Johanna R

    2013-01-01

    Molecular biology and literature databases represent essential infrastructure for life science research. Effective integration of these data resources requires that there are structured cross-references at the level of individual articles and biological records. Here, we describe the current patterns of how database entries are cited in research articles, based on analysis of the full text Open Access articles available from Europe PMC. Focusing on citation of entries in the European Nucleotide Archive (ENA), UniProt and Protein Data Bank, Europe (PDBe), we demonstrate that text mining doubles the number of structured annotations of database record citations supplied in journal articles by publishers. Many thousands of new literature-database relationships are found by text mining, since these relationships are also not present in the set of articles cited by database records. We recommend that structured annotation of database records in articles is extended to other databases, such as ArrayExpress and Pfam, entries from which are also cited widely in the literature. The very high precision and high-throughput of this text-mining pipeline makes this activity possible both accurately and at low cost, which will allow the development of new integrated data services.

  7. Assessing Literary Reasoning: Text and Task Complexities

    Science.gov (United States)

    Lee, Carol D.; Goldman, Susan R.

    2015-01-01

    This article addresses 3 broad challenges of assessment in reading comprehension: (a) explicitly articulating the knowledge and skills students need to recognize and be able to use in comprehending complex texts; (b) understanding how knowledge and skills progress and successively deepen and develop over repeated opportunities to engage in tasks…

  8. Computation of term dominance in text documents

    Science.gov (United States)

    Bauer, Travis L [Albuquerque, NM; Benz, Zachary O [Albuquerque, NM; Verzi, Stephen J [Albuquerque, NM

    2012-04-24

    An improved entropy-based term dominance metric useful for characterizing a corpus of text documents, and is useful for comparing the term dominance metrics of a first corpus of documents to a second corpus having a different number of documents.

  9. Fieldwork, Heritage and Engaging Landscape Texts

    Science.gov (United States)

    Mains, Susan P.

    2014-01-01

    This paper outlines and analyses efforts to critically engage with "heritage" through the development and responses to a series of undergraduate residential fieldwork trips held in the North Coast of Jamaica. The ways in which we read heritage through varied "texts"--specifically, material landscapes, guided heritage tours,…

  10. IDENTITY CLAIMS, TEXTS, ROME AND GALATIANS

    African Journals Online (AJOL)

    in which memory and texts figured prominently, situated in contexts of unequal relations of power. Through a ... Memory and identity theories attempt to explain both how and why traditions formed and changed, as well as .... makes much sense when read in the Roman imperial context. The imperial setting constituted and ...

  11. Rhetorical Structure Theory and Text Analysis

    Science.gov (United States)

    1989-11-01

    NO. NO. ACCESSION NO. 11. TITLE (include Security Clasification ) Rhetorical Structure Theory and Text Analysis (Unclassified) 12. PERSONAL AUTHOR(S...34Antithesis: A Study in Clause Combining and Discourse Structure ," in Ross Steele and Terry Threadgold (eds.), Language Topics: Essays in Honour of M

  12. CONAN : Text Mining in the Biomedical Domain

    NARCIS (Netherlands)

    Malik, R.

    2006-01-01

    This thesis is about Text Mining. Extracting important information from literature. In the last years, the number of biomedical articles and journals is growing exponentially. Scientists might not find the information they want because of the large number of publications. Therefore a system was

  13. The Challenges of Qualitatively Coding Ancient Texts

    Science.gov (United States)

    Slingerland, Edward; Chudek, Maciej

    2012-01-01

    We respond to several important and valid concerns about our study ("The Prevalence of Folk Dualism in Early China," "Cognitive Science" 35: 997-1007) by Klein and Klein, defending our interpretation of our data. We also argue that, despite the undeniable challenges involved in qualitatively coding texts from ancient cultures,…

  14. Knowledge Revision Processes in Refutation Texts

    Science.gov (United States)

    Kendeou, Panayiota; Walsh, Erinn K.; Smith, Emily R.; O'Brien, Edward J.

    2014-01-01

    In the present set of experiments, we systematically examined the processes that occur while reading texts designed to refute and explain commonsense beliefs that reside in readers' long-term memory. In Experiment 1 (n = 36), providing readers with a refutation-plus-explanation of a commonsense belief was sufficient to significantly reduce…

  15. Unpublished Texts - Quei sussurri meridiani | Maggiari | Italian ...

    African Journals Online (AJOL)

    Italian Studies in Southern Africa/Studi d'Italianistica nell'Africa Australe. Journal Home · ABOUT THIS JOURNAL · Advanced Search · Current Issue · Archives · Journal Home > Vol 9, No 2 (1996) >. Log in or Register to get access to full text downloads.

  16. The psycholinguistics of developing text construction.

    Science.gov (United States)

    Berman, Ruth A

    2008-11-01

    This paper outlines functionally motivated quantifiable criteria for characterizing different facets of discourse--global-level principles, categories of referential content, clause-linking complex syntax, local linguistic expression and overall discourse stance--in relation to the variables of development, genre and modality. Concern is with later, school-age language development, in the conviction that the long developmental route of language acquisition can profitably be examined in the context of extended discourse. Findings are reviewed from a cross-linguistic project that elicited narrative and expository texts in both speech and writing at four age groups: (9-10 years, 12-13, 16-17 and adults). Clear developmental patterns emerge from middle childhood to adulthood, with significant shifts in adolescence; global-level text organization is mastered earlier in narratives than in expository essays, but the latter promote more advanced use of local-level lexicon and syntax; and spoken texts are more spread out than their denser written counterparts in clause-linkage, referential content and lexical usage. These and other findings are discussed in terms of the growth and reorganization of knowledge about types of discourse and text-embedded language use.

  17. Mining biological networks from full-text articles.

    Science.gov (United States)

    Czarnecki, Jan; Shepherd, Adrian J

    2014-01-01

    The study of biological networks is playing an increasingly important role in the life sciences. Many different kinds of biological system can be modelled as networks; perhaps the most important examples are protein-protein interaction (PPI) networks, metabolic pathways, gene regulatory networks, and signalling networks. Although much useful information is easily accessible in publicly databases, a lot of extra relevant data lies scattered in numerous published papers. Hence there is a pressing need for automated text-mining methods capable of extracting such information from full-text articles. Here we present practical guidelines for constructing a text-mining pipeline from existing code and software components capable of extracting PPI networks from full-text articles. This approach can be adapted to tackle other types of biological network.

  18. Intertext: On Connecting Text in the Building Process

    DEFF Research Database (Denmark)

    Christensen, Lars Rune

    2015-01-01

    Actors in the building process are critically dependent on a corpus of written text that draws the distributed work tasks together. This paper introduces, on the basis of a field study, the concepts of corpus, intertext and intertextuality to the analysis of text in cooperative work practice....... This paper shows that actors in the building process create intertext (connections) between complementary texts, in a particular situation and for a particular task. This has an integrating effect on the building process. Several types of intertextuality, including the complementary type, the intratextual...... type and the mediated type, may constitute the intertext of a particular task. By employing the concepts of corpus, intertext and intertextuality with respect to the study of the building process, this paper outlines an approach to the investigation of text in cooperative work....

  19. Text Mining the History of Medicine.

    Science.gov (United States)

    Thompson, Paul; Batista-Navarro, Riza Theresa; Kontonatsios, Georgios; Carter, Jacob; Toon, Elizabeth; McNaught, John; Timmermann, Carsten; Worboys, Michael; Ananiadou, Sophia

    2016-01-01

    Historical text archives constitute a rich and diverse source of information, which is becoming increasingly readily accessible, due to large-scale digitisation efforts. However, it can be difficult for researchers to explore and search such large volumes of data in an efficient manner. Text mining (TM) methods can help, through their ability to recognise various types of semantic information automatically, e.g., instances of concepts (places, medical conditions, drugs, etc.), synonyms/variant forms of concepts, and relationships holding between concepts (which drugs are used to treat which medical conditions, etc.). TM analysis allows search systems to incorporate functionality such as automatic suggestions of synonyms of user-entered query terms, exploration of different concepts mentioned within search results or isolation of documents in which concepts are related in specific ways. However, applying TM methods to historical text can be challenging, according to differences and evolutions in vocabulary, terminology, language structure and style, compared to more modern text. In this article, we present our efforts to overcome the various challenges faced in the semantic analysis of published historical medical text dating back to the mid 19th century. Firstly, we used evidence from diverse historical medical documents from different periods to develop new resources that provide accounts of the multiple, evolving ways in which concepts, their variants and relationships amongst them may be expressed. These resources were employed to support the development of a modular processing pipeline of TM tools for the robust detection of semantic information in historical medical documents with varying characteristics. We applied the pipeline to two large-scale medical document archives covering wide temporal ranges as the basis for the development of a publicly accessible semantically-oriented search system. The novel resources are available for research purposes, while

  20. Aesthetic Analysis of Media Texts in the Classroom at the Student Audience

    Science.gov (United States)

    Fedorov, Alexander

    2015-01-01

    Aesthetic analysis of media texts, ie the analysis of art concept of the media texts of different types and genres, is closely related to the aesthetic (artistic) theory of media (Aesthetical Approach, Media as Popular Arts Approach, Discriminatory Approach). Aesthetic theory of media literacy education has been very popular in the 1960s…

  1. Measurement of the [Formula: see text] meson lifetime using [Formula: see text] decays.

    Science.gov (United States)

    Aaij, R; Adeva, B; Adinolfi, M; Affolder, A; Ajaltouni, Z; Albrecht, J; Alessio, F; Alexander, M; Ali, S; Alkhazov, G; Cartelle, P Alvarez; Alves, A A; Amato, S; Amerio, S; Amhis, Y; Anderlini, L; Anderson, J; Andreassen, R; Andreotti, M; Andrews, J E; Appleby, R B; Gutierrez, O Aquines; Archilli, F; Artamonov, A; Artuso, M; Aslanides, E; Auriemma, G; Baalouch, M; Bachmann, S; Back, J J; Badalov, A; Balagura, V; Baldini, W; Barlow, R J; Barschel, C; Barsuk, S; Barter, W; Batozskaya, V; Bauer, Th; Bay, A; Beddow, J; Bedeschi, F; Bediaga, I; Belogurov, S; Belous, K; Belyaev, I; Ben-Haim, E; Bencivenni, G; Benson, S; Benton, J; Berezhnoy, A; Bernet, R; Bettler, M-O; van Beuzekom, M; Bien, A; Bifani, S; Bird, T; Bizzeti, A; Bjørnstad, P M; Blake, T; Blanc, F; Blouw, J; Blusk, S; Bocci, V; Bondar, A; Bondar, N; Bonivento, W; Borghi, S; Borgia, A; Borsato, M; Bowcock, T J V; Bowen, E; Bozzi, C; Brambach, T; van den Brand, J; Bressieux, J; Brett, D; Britsch, M; Britton, T; Brook, N H; Brown, H; Bursche, A; Busetto, G; Buytaert, J; Cadeddu, S; Calabrese, R; Callot, O; Calvi, M; Calvo Gomez, M; Camboni, A; Campana, P; Campora Perez, D; Carbone, A; Carboni, G; Cardinale, R; Cardini, A; Carranza-Mejia, H; Carson, L; Carvalho Akiba, K; Casse, G; Castillo Garcia, L; Cattaneo, M; Cauet, Ch; Cenci, R; Charles, M; Charpentier, Ph; Cheung, S-F; Chiapolini, N; Chrzaszcz, M; Ciba, K; Cid Vidal, X; Ciezarek, G; Clarke, P E L; Clemencic, M; Cliff, H V; Closier, J; Coca, C; Coco, V; Cogan, J; Cogneras, E; Collins, P; Comerma-Montells, A; Contu, A; Cook, A; Coombes, M; Coquereau, S; Corti, G; Counts, I; Couturier, B; Cowan, G A; Craik, D C; Cruz Torres, M; Cunliffe, S; Currie, R; D'Ambrosio, C; Dalseno, J; David, P; David, P N Y; Davis, A; De Bonis, I; De Bruyn, K; De Capua, S; De Cian, M; De Miranda, J M; De Paula, L; De Silva, W; De Simone, P; Decamp, D; Deckenhoff, M; Del Buono, L; Déléage, N; Derkach, D; Deschamps, O; Dettori, F; Di Canto, A; Dijkstra, H; Donleavy, S; Dordei, F; Dorigo, M; Dorosz, P; Dosil Suárez, A; Dossett, D; Dovbnya, A; Dupertuis, F; Durante, P; Dzhelyadin, R; Dziurda, A; Dzyuba, A; Easo, S; Egede, U; Egorychev, V; Eidelman, S; Eisenhardt, S; Eitschberger, U; Ekelhof, R; Eklund, L; El Rifai, I; Elsasser, Ch; Falabella, A; Färber, C; Farinelli, C; Farry, S; Ferguson, D; Fernandez Albor, V; Ferreira Rodrigues, F; Ferro-Luzzi, M; Filippov, S; Fiore, M; Fiorini, M; Fitzpatrick, C; Fontana, M; Fontanelli, F; Forty, R; Francisco, O; Frank, M; Frei, C; Frosini, M; Furfaro, E; Gallas Torreira, A; Galli, D; Gandelman, M; Gandini, P; Gao, Y; Garofoli, J; Garra Tico, J; Garrido, L; Gaspar, C; Gauld, R; Gersabeck, E; Gersabeck, M; Gershon, T; Ghez, Ph; Gianelle, A; Gibson, V; Giubega, L; Gligorov, V V; Göbel, C; Golubkov, D; Golutvin, A; Gomes, A; Gordon, H; Grabalosa Gándara, M; Graciani Diaz, R; Granado Cardoso, L A; Graugés, E; Graziani, G; Grecu, A; Greening, E; Gregson, S; Griffith, P; Grillo, L; Grünberg, O; Gui, B; Gushchin, E; Guz, Yu; Gys, T; Hadjivasiliou, C; Haefeli, G; Haen, C; Hafkenscheid, T W; Haines, S C; Hall, S; Hamilton, B; Hampson, T; Hansmann-Menzemer, S; Harnew, N; Harnew, S T; Harrison, J; Hartmann, T; He, J; Head, T; Heijne, V; Hennessy, K; Henrard, P; Hernando Morata, J A; van Herwijnen, E; Heß, M; Hicheur, A; Hill, D; Hoballah, M; Hombach, C; Hulsbergen, W; Hunt, P; Huse, T; Hussain, N; Hutchcroft, D; Hynds, D; Iakovenko, V; Idzik, M; Ilten, P; Jacobsson, R; Jaeger, A; Jans, E; Jaton, P; Jawahery, A; Jing, F; John, M; Johnson, D; Jones, C R; Joram, C; Jost, B; Jurik, N; Kaballo, M; Kandybei, S; Kanso, W; Karacson, M; Karbach, T M; Kenyon, I R; Ketel, T; Khanji, B; Khurewathanakul, C; Klaver, S; Kochebina, O; Komarov, I; Koopman, R F; Koppenburg, P; Korolev, M; Kozlinskiy, A; Kravchuk, L; Kreplin, K; Kreps, M; Krocker, G; Krokovny, P; Kruse, F; Kucharczyk, M; Kudryavtsev, V; Kurek, K; Kvaratskheliya, T; La Thi, V N; Lacarrere, D; Lafferty, G; Lai, A; Lambert, D; Lambert, R W; Lanciotti, E; Lanfranchi, G; Langenbruch, C; Latham, T; Lazzeroni, C; Le Gac, R; van Leerdam, J; Lees, J-P; Lefèvre, R; Leflat, A; Lefrançois, J; Leo, S; Leroy, O; Lesiak, T; Leverington, B; Li, Y; Liles, M; Lindner, R; Linn, C; Lionetto, F; Liu, B; Liu, G; Lohn, S; Longstaff, I; Lopes, J H; Lopez-March, N; Lowdon, P; Lu, H; Lucchesi, D; Luisier, J; Luo, H; Luppi, E; Lupton, O; Machefert, F; Machikhiliyan, I V; Maciuc, F; Maev, O; Malde, S; Manca, G; Mancinelli, G; Manzali, M; Maratas, J; Marconi, U; Marino, P; Märki, R; Marks, J; Martellotti, G; Martens, A; Martín Sánchez, A; Martinelli, M; Martinez Santos, D; Martins Tostes, D; Massafferri, A; Matev, R; Mathe, Z; Matteuzzi, C; Mazurov, A; McCann, M; McCarthy, J; McNab, A; McNulty, R; McSkelly, B; Meadows, B; Meier, F; Meissner, M; Merk, M; Milanes, D A; Minard, M-N; Molina Rodriguez, J; Monteil, S; Moran, D; Morandin, M; Morawski, P; Mordà, A; Morello, M J; Mountain, R; Mous, I; Muheim, F; Müller, K; Muresan, R; Muryn, B; Muster, B; Naik, P; Nakada, T; Nandakumar, R; Nasteva, I; Needham, M; Neubert, S; Neufeld, N; Nguyen, A D; Nguyen, T D; Nguyen-Mau, C; Nicol, M; Niess, V; Niet, R; Nikitin, N; Nikodem, T; Novoselov, A; Oblakowska-Mucha, A; Obraztsov, V; Oggero, S; Ogilvy, S; Okhrimenko, O; Oldeman, R; Onderwater, G; Orlandea, M; Otalora Goicochea, J M; Owen, P; Oyanguren, A; Pal, B K; Palano, A; Palutan, M; Panman, J; Papanestis, A; Pappagallo, M; Pappalardo, L; Parkes, C; Parkinson, C J; Passaleva, G; Patel, G D; Patel, M; Patrignani, C; Pavel-Nicorescu, C; Pazos Alvarez, A; Pearce, A; Pellegrino, A; Penso, G; Pepe Altarelli, M; Perazzini, S; Perez Trigo, E; Perret, P; Perrin-Terrin, M; Pescatore, L; Pesen, E; Pessina, G; Petridis, K; Petrolini, A; Picatoste Olloqui, E; Pietrzyk, B; Pilař, T; Pinci, D; Pistone, A; Playfer, S; Plo Casasus, M; Polci, F; Polok, G; Poluektov, A; Polycarpo, E; Popov, A; Popov, D; Popovici, B; Potterat, C; Powell, A; Prisciandaro, J; Pritchard, A; Prouve, C; Pugatch, V; Puig Navarro, A; Punzi, G; Qian, W; Rachwal, B; Rademacker, J H; Rakotomiaramanana, B; Rama, M; Rangel, M S; Raniuk, I; Rauschmayr, N; Raven, G; Redford, S; Reichert, S; Reid, M M; Dos Reis, A C; Ricciardi, S; Richards, A; Rinnert, K; Rives Molina, V; Roa Romero, D A; Robbe, P; Roberts, D A; Rodrigues, A B; Rodrigues, E; Rodriguez Perez, P; Roiser, S; Romanovsky, V; Romero Vidal, A; Rotondo, M; Rouvinet, J; Ruf, T; Ruffini, F; Ruiz, H; Ruiz Valls, P; Sabatino, G; Saborido Silva, J J; Sagidova, N; Sail, P; Saitta, B; Salustino Guimaraes, V; Sanmartin Sedes, B; Santacesaria, R; Santamarina Rios, C; Santovetti, E; Sapunov, M; Sarti, A; Satriano, C; Satta, A; Savrie, M; Savrina, D; Schiller, M; Schindler, H; Schlupp, M; Schmelling, M; Schmidt, B; Schneider, O; Schopper, A; Schune, M-H; Schwemmer, R; Sciascia, B; Sciubba, A; Seco, M; Semennikov, A; Senderowska, K; Sepp, I; Serra, N; Serrano, J; Seyfert, P; Shapkin, M; Shapoval, I; Shcheglov, Y; Shears, T; Shekhtman, L; Shevchenko, O; Shevchenko, V; Shires, A; Silva Coutinho, R; Simi, G; Sirendi, M; Skidmore, N; Skwarnicki, T; Smith, N A; Smith, E; Smith, E; Smith, J; Smith, M; Snoek, H; Sokoloff, M D; Soler, F J P; Soomro, F; Souza, D; Souza De Paula, B; Spaan, B; Sparkes, A; Spinella, F; Spradlin, P; Stagni, F; Stahl, S; Steinkamp, O; Stevenson, S; Stoica, S; Stone, S; Storaci, B; Stracka, S; Straticiuc, M; Straumann, U; Stroili, R; Subbiah, V K; Sun, L; Sutcliffe, W; Swientek, S; Syropoulos, V; Szczekowski, M; Szczypka, P; Szilard, D; Szumlak, T; T'Jampens, S; Teklishyn, M; Tellarini, G; Teodorescu, E; Teubert, F; Thomas, C; Thomas, E; van Tilburg, J; Tisserand, V; Tobin, M; Tolk, S; Tomassetti, L; Tonelli, D; Topp-Joergensen, S; Torr, N; Tournefier, E; Tourneur, S; Tran, M T; Tresch, M; Tsaregorodtsev, A; Tsopelas, P; Tuning, N; Ubeda Garcia, M; Ukleja, A; Ustyuzhanin, A; Uwer, U; Vagnoni, V; Valenti, G; Vallier, A; Vazquez Gomez, R; Vazquez Regueiro, P; Vázquez Sierra, C; Vecchi, S; Velthuis, J J; Veltri, M; Veneziano, G; Vesterinen, M; Viaud, B; Vieira, D; Vilasis-Cardona, X; Vollhardt, A; Volyanskyy, D; Voong, D; Vorobyev, A; Vorobyev, V; Voß, C; Voss, H; de Vries, J A; Waldi, R; Wallace, C; Wallace, R; Wandernoth, S; Wang, J; Ward, D R; Watson, N K; Webber, A D; Websdale, D; Whitehead, M; Wicht, J; Wiechczynski, J; Wiedner, D; Wiggers, L; Wilkinson, G; Williams, M P; Williams, M; Wilson, F F; Wimberley, J; Wishahi, J; Wislicki, W; Witek, M; Wormser, G; Wotton, S A; Wright, S; Wu, S; Wyllie, K; Xie, Y; Xing, Z; Yang, Z; Yuan, X; Yushchenko, O; Zangoli, M; Zavertyaev, M; Zhang, F; Zhang, L; Zhang, W C; Zhang, Y; Zhelezov, A; Zhokhov, A; Zhong, L; Zvyagin, A

    The lifetime of the [Formula: see text] meson is measured using semileptonic decays having a [Formula: see text] meson and a muon in the final state. The data, corresponding to an integrated luminosity of [Formula: see text], are collected by the LHCb detector in [Formula: see text] collisions at a centre-of-mass energy of 8 TeV. The measured lifetime is [Formula: see text]where the first uncertainty is statistical and the second is systematic.

  2. Working with Data: Discovering Knowledge through Mining and Analysis; Systematic Knowledge Management and Knowledge Discovery; Text Mining; Methodological Approach in Discovering User Search Patterns through Web Log Analysis; Knowledge Discovery in Databases Using Formal Concept Analysis; Knowledge Discovery with a Little Perspective.

    Science.gov (United States)

    Qin, Jian; Jurisica, Igor; Liddy, Elizabeth D.; Jansen, Bernard J; Spink, Amanda; Priss, Uta; Norton, Melanie J.

    2000-01-01

    These six articles discuss knowledge discovery in databases (KDD). Topics include data mining; knowledge management systems; applications of knowledge discovery; text and Web mining; text mining and information retrieval; user search patterns through Web log analysis; concept analysis; data collection; and data structure inconsistency. (LRW)

  3. Morpheme matching based text tokenization for a scarce resourced language.

    Science.gov (United States)

    Rehman, Zobia; Anwar, Waqas; Bajwa, Usama Ijaz; Xuan, Wang; Chaoying, Zhou

    2013-01-01

    Text tokenization is a fundamental pre-processing step for almost all the information processing applications. This task is nontrivial for the scarce resourced languages such as Urdu, as there is inconsistent use of space between words. In this paper a morpheme matching based approach has been proposed for Urdu text tokenization, along with some other algorithms to solve the additional issues of boundary detection of compound words, affixation, reduplication, names and abbreviations. This study resulted into 97.28% precision, 93.71% recall, and 95.46% F1-measure; while tokenizing a corpus of 57000 words by using a morpheme list with 6400 entries.

  4. Text-based CAPTCHAs over the years

    Science.gov (United States)

    Chow, Y. W.; Susilo, W.

    2017-11-01

    The notion of CAPTCHAs has been around for more than two decades. Since its introduction, CAPTCHAs have now become a ubiquitous part of the Internet. Over the years, research on various aspects of CAPTCHAs has evolved and different design principles have emerged. This article discusses text-based CAPTCHAs in terms of their fundamental requirements, namely, security and usability. Practicality necessitates that humans must be able to correctly solve CAPTCHA challenges, while at the same time automated computer programs should have difficulty solving the challenges. This article also presents alternative paradigms to text-based CAPTCHA design that have been examined in previous work. With the advances in techniques to defeat CAPTCHAs, the future of auto- mated Turing tests is an open question.

  5. Aligning Greek-English parallel texts

    Science.gov (United States)

    Galiotou, Eleni; Koronakis, George; Lazari, Vassiliki

    2015-02-01

    In this paper, we discuss issues concerning the alignment of parallel texts written in languages with different alphabets based on an experiment of aligning texts from the proceedings of the European Parliament in Greek and English. First, we describe our implementation of the k-vec algorithm and its application to the bilingual corpus. Then the output of the algorithm is used as a starting point for an alignment procedure at a sentence level which also takes into account mark-ups of meta-information. The results of the implementation are compared to those of the application of the Church and Gale alignment algorithm on the Europarl corpus. The conclusions of this comparison can give useful insights as for the efficiency of alignment algorithms when applied to the particular bilingual corpus.

  6. Ordinary differential equations a graduate text

    CERN Document Server

    Bhamra, K S

    2015-01-01

    ORDINARY DIFFERENTIAL EQUATIONS: A Graduate Text presents a systematic and comprehensive introduction to ODEs for graduate and postgraduate students. The systematic organized text on differential inequalities, Gronwall's inequality, Nagumo's theorems, Osgood's criteria and applications of different equations of first order is dealt with in a greater depth. The book discusses qualitative and quantitative aspects of the Strum - Liouville problems, Green's function, integral equations, Laplace transform and is supported by a number of worked-out examples in each lesson to make the concepts clear. A lot of stress on stability theory is laid down, especially on Lyapunov and Poincare stability theory. A numerous figures in various lessons (in particular lessons dealing with stability theory) have been added to clarify the key concepts in DE theory. Nonlinear oscillation in conservative systems and Hamiltonian systems highlights basic nature of the systems considered. Perturbation techniques lesson deals in fairly d...

  7. Can An Evolutionary Process Create English Text?

    Energy Technology Data Exchange (ETDEWEB)

    Bailey, David H.

    2008-10-29

    Critics of the conventional theory of biological evolution have asserted that while natural processes might result in some limited diversity, nothing fundamentally new can arise from 'random' evolution. In response, biologists such as Richard Dawkins have demonstrated that a computer program can generate a specific short phrase via evolution-like iterations starting with random gibberish. While such demonstrations are intriguing, they are flawed in that they have a fixed, pre-specified future target, whereas in real biological evolution there is no fixed future target, but only a complicated 'fitness landscape'. In this study, a significantly more sophisticated evolutionary scheme is employed to produce text segments reminiscent of a Charles Dickens novel. The aggregate size of these segments is larger than the computer program and the input Dickens text, even when comparing compressed data (as a measure of information content).

  8. Cell Phoning and Texting While Driving

    Directory of Open Access Journals (Sweden)

    Judy Honoria Rosaire Telemaque

    2015-07-01

    Full Text Available A qualitative phenomenological study was conducted on the consequences of cell phone use while operating a vehicle. We discussed why talking and texting on cell phones are so popular through the analysis of our interviews with police officers, driving instructors, and parents of teens and young adults. The participants came from central, northeastern, northwestern, and southeastern Connecticut. All had exposure with respect to the effects of cell phone usage problem. The study reached a point of theoretical saturation or redundancy by which the analysis no longer resulted in new themes. We concluded that the discoveries revealed the necessity for education, expansion of technology, and additional driver education preparation, which may provide a path for leadership to help solve the problem.

  9. Resonant island divertor experiments on text

    International Nuclear Information System (INIS)

    deGrassie, J.S.; Evans, T.E.; Jackson, G.L.

    1988-09-01

    The first experimental tests of the resonant island divertor (RID) concept have been carried out on the Texas Experimental Tokamak (TEXT). Modular perturbation coils produce static resonant magnetic fields at the tokamak boundary. The resulting magnetic islands are used to guide heat and particle fluxes around a small scoop limiter head. An enhancement in the limiter collection efficiency over the nonisland operation, as evidenced by enhanced neutral density within the limiter head, of up to a factor of 4 is obtained. This enhancement is larger than one would expect given the measured magnitude of the cross-field particle transport in TEXT. It is proposed that electrostatic perturbations occur which enhance the ion convection rate around the islands. Preliminary experiments utilizing electron cyclotron heating (ECH) in conjunction with RID operation have also have been performed. 6 refs., 3 figs

  10. Stochastic text models for music categorization

    OpenAIRE

    Pérez Sancho, Carlos; Rizo Valero, David; Iñesta Quereda, José Manuel

    2008-01-01

    Music genre meta-data is of paramount importance for the organization of music repositories. People use genre in a natural way when entering a music store or looking into music collections. Automatic genre classification has become a popular topic in music information retrieval research. This work brings to symbolic music recognition some technologies, like the stochastic language models, already successfully applied to text categorization. In this work we model chord progressions and melodie...

  11. Text summarization as a decision support aid

    OpenAIRE

    Workman, T Elizabeth; Fiszman, Marcelo; Hurdle, John F

    2012-01-01

    Abstract Background PubMed data potentially can provide decision support information, but PubMed was not exclusively designed to be a point-of-care tool. Natural language processing applications that summarize PubMed citations hold promise for extracting decision support information. The objective of this study was to evaluate the efficiency of a text summarization application called Semantic MEDLINE, enhanced with a novel dynamic summarization method, in identifying decision support data. Me...

  12. Stemming of Slovenian library science texts

    Directory of Open Access Journals (Sweden)

    Polona Vilar

    2002-01-01

    Full Text Available The theme of the article is the preparation of a stemming algorithm for Slovenian library science texts. The procedure consisted of three phases: learning, testing and evaluation.The preparation of the optimal stemmer for Slovenian texts from the field of library science is presented, its testing and comparison with two other stemmers for the Slovenian language: the Popovič stemmer and the Generic stemmer. A corpus of 790.000 words from the field of library science was used for learning. Lists of stems, word endings and stop-words were built. In the testing phase, the component parts of the algorithm were tested on an additional corpus of 167.000 words. In the evaluation phase, a comparison of the three stemmers processing the same word corpus was made. The results of each stemmer were compared with an intellectually prepared control result of the stemming of the corpus. It consisted of groups of semantically connected words with no errors. Understemming was especially monitored – the number of stems for semantically connected words, produced by an algorithm. The results were statistically processed with the Kruskal-Wallis test. The Optimal stemmer produced the best results.It matched best with the reference results and also gave the smallest number of stems for one semantic meaning. The Popovič stemmer followed closely. The Generic stemmer proved to be the least accurate. The procedures described in the thesis can represent a platform for the development of the tools for automatic indexing and retrieval for library science texts in Slovenian language.

  13. Intertextuality in Text-based Discussions

    Directory of Open Access Journals (Sweden)

    Hamidah Mohd Ismail

    2011-01-01

    Full Text Available One  of  the  main  issues  often  discussed  among  academics  is  how  to  encourage  active participation by students during classroom discussions. This applies particularly to students at the tertiary level who are expected to possess creative and critical thinking skills. Hence, this paper reports on a study that examined how these skills were demonstrated by a group of university students  who  employed  intertextual  links  during  a  follow-up  reading  activity involving  small-group  text  discussions.  Thirty  undergraduates  who  were  in  their  fifth semester of a TESL degree programme were prescribed reading texts consisting of two chapters taken  from  a  book.  Findings  reveal  that  intertextual  links  made  during  text discussions created successfully a “collaborative environment” where beliefs and values were shared judicially among participants. Pedagogical implications for ESL classroom practice include  heightening  the  awareness  amongst  academics  and  students  of  the  role  of intertextuality in order to promote students’ use of their critical and creative thinking skills in a supportive classroom environment.

  14. Choices of texts for literary education

    DEFF Research Database (Denmark)

    Skyggebjerg, Anna Karlskov

    literature studies at universities, where criteria concerning language and form are often more valued than criteria concerning character and content. This tendency to celebrate the formal aspects and the literariness of literature is recognized in governmental documents, teaching materials...... readers with literary interests, competences, possibilities, needs, etc. Generally speaking the criteria for the choice of texts for teaching literature in Danish schools have been dominated by considerations for the subject and Literature in itself. The predominant view of literature comes from...

  15. Ontology Learning - Suggesting Associations from Text

    OpenAIRE

    Kvarv, Gøran Sveia

    2007-01-01

    In many applications, large-scale ontologies have to be constructed and maintained. A manual construction of an ontology is a time consuming and resource demanding process, often involving some domain experts. It would therefore be beneficial to support this process with tools that automates the construction of an ontology. This master thesis has examined the use of association rules for suggesting associations between words in text. In ontology learning, concepts are often extracted from d...

  16. Cohesion in Computer Text Generation: Lexical Substitution.

    Science.gov (United States)

    1983-05-01

    can contain any information desired, the rules need not be strictly syntactic, but can reflect semantic and pragmatic information as well. A subset of...its antecedent. Otherwise, unintelligible text may be generated. Investigation into anaphora resolution has been performed in the pursuit of natural...syntactic, semantic, and pragmatic acceptance. The first item in the ranked list that passes these criteria is assumed to be the antecedent for the

  17. Logistic regression a self-learning text

    CERN Document Server

    Kleinbaum, David G

    1994-01-01

    This textbook provides students and professionals in the health sciences with a presentation of the use of logistic regression in research. The text is self-contained, and designed to be used both in class or as a tool for self-study. It arises from the author's many years of experience teaching this material and the notes on which it is based have been extensively used throughout the world.

  18. HPTA: High-Performance Text Analytics

    OpenAIRE

    Vandierendonck, Hans; Murphy, Karen; Arif, Mahwish; Nikolopoulos, Dimitrios S.

    2017-01-01

    One of the main targets of data analytics is unstructured data, which primarily involves textual data. High-performance processing of textual data is non-trivial. We present the HPTA library for high-performance text analytics. The library helps programmers to map textual data to a dense numeric representation, which can be handled more efficiently. HPTA encapsulates three performance optimizations: (i) efficient memory management for textual data, (ii) parallel computation on associative dat...

  19. Reading an ESL Writer’s Text

    Directory of Open Access Journals (Sweden)

    Paul Kei Matsuda

    2011-03-01

    Full Text Available This paper focuses on reading as a central act of communication in the tutorial session. Writing center tutors without extensive experience reading writing by second language writers may have difficulty getting past the many differences in surface-level features, organization, and rhetorical moves. After exploring some of the sources of these differences in writing, the authors present strategies that writing tutors can use to work effectively with second language writers.

  20. Domain-independent information extraction in unstructured text

    Energy Technology Data Exchange (ETDEWEB)

    Irwin, N.H. [Sandia National Labs., Albuquerque, NM (United States). Software Surety Dept.

    1996-09-01

    Extracting information from unstructured text has become an important research area in recent years due to the large amount of text now electronically available. This status report describes the findings and work done during the second year of a two-year Laboratory Directed Research and Development Project. Building on the first-year`s work of identifying important entities, this report details techniques used to group words into semantic categories and to output templates containing selective document content. Using word profiles and category clustering derived during a training run, the time-consuming knowledge-building task can be avoided. Though the output still lacks in completeness when compared to systems with domain-specific knowledge bases, the results do look promising. The two approaches are compatible and could complement each other within the same system. Domain-independent approaches retain appeal as a system that adapts and learns will soon outpace a system with any amount of a priori knowledge.

  1. DEEP LEARNING MODEL FOR BILINGUAL SENTIMENT CLASSIFICATION OF SHORT TEXTS

    Directory of Open Access Journals (Sweden)

    Y. B. Abdullin

    2017-01-01

    Full Text Available Sentiment analysis of short texts such as Twitter messages and comments in news portals is challenging due to the lack of contextual information. We propose a deep neural network model that uses bilingual word embeddings to effectively solve sentiment classification problem for a given pair of languages. We apply our approach to two corpora of two different language pairs: English-Russian and Russian-Kazakh. We show how to train a classifier in one language and predict in another. Our approach achieves 73% accuracy for English and 74% accuracy for Russian. For Kazakh sentiment analysis, we propose a baseline method, that achieves 60% accuracy; and a method to learn bilingual embeddings from a large unlabeled corpus using a bilingual word pairs.

  2. Torpedo: topic periodicity discovery from text data

    Science.gov (United States)

    Wang, Jingjing; Deng, Hongbo; Han, Jiawei

    2015-05-01

    Although history may not repeat itself, many human activities are inherently periodic, recurring daily, weekly, monthly, yearly or following some other periods. Such recurring activities may not repeat the same set of keywords, but they do share similar topics. Thus it is interesting to mine topic periodicity from text data instead of just looking at the temporal behavior of a single keyword/phrase. Some previous preliminary studies in this direction prespecify a periodic temporal template for each topic. In this paper, we remove this restriction and propose a simple yet effective framework Torpedo to mine periodic/recurrent patterns from text, such as news articles, search query logs, research papers, and web blogs. We first transform text data into topic-specific time series by a time dependent topic modeling module, where each of the time series characterizes the temporal behavior of a topic. Then we use time series techniques to detect periodicity. Hence we both obtain a clear view of how topics distribute over time and enable the automatic discovery of periods that are inherent in each topic. Theoretical and experimental analyses demonstrate the advantage of Torpedo over existing work.

  3. Text Mining for Drug–Drug Interaction

    Science.gov (United States)

    Wu, Heng-Yi; Chiang, Chien-Wei; Li, Lang

    2015-01-01

    In order to understand the mechanisms of drug–drug interaction (DDI), the study of pharmacokinetics (PK), pharmacodynamics (PD), and pharmacogenetics (PG) data are significant. In recent years, drug PK parameters, drug interaction parameters, and PG data have been unevenly collected in different databases and published extensively in literature. Also the lack of an appropriate PK ontology and a well-annotated PK corpus, which provide the background knowledge and the criteria of determining DDI, respectively, lead to the difficulty of developing DDI text mining tools for PK data collection from the literature and data integration from multiple databases. To conquer the issues, we constructed a comprehensive pharmacokinetics ontology. It includes all aspects of in vitro pharmacokinetics experiments, in vivo pharmacokinetics studies, as well as drug metabolism and transportation enzymes. Using our pharmacokinetics ontology, a PK corpus was constructed to present four classes of pharmacokinetics abstracts: in vivo pharmacokinetics studies, in vivo pharmacogenetic studies, in vivo drug interaction studies, and in vitro drug interaction studies. A novel hierarchical three-level annotation scheme was proposed and implemented to tag key terms, drug interaction sentences, and drug interaction pairs. The utility of the pharmacokinetics ontology was demonstrated by annotating three pharmacokinetics studies; and the utility of the PK corpus was demonstrated by a drug interaction extraction text mining analysis. The pharmacokinetics ontology annotates both in vitro pharmacokinetics experiments and in vivo pharmacokinetics studies. The PK corpus is a highly valuable resource for the text mining of pharmacokinetics parameters and drug interactions. PMID:24788261

  4. Text mining for drug-drug interaction.

    Science.gov (United States)

    Wu, Heng-Yi; Chiang, Chien-Wei; Li, Lang

    2014-01-01

    In order to understand the mechanisms of drug-drug interaction (DDI), the study of pharmacokinetics (PK), pharmacodynamics (PD), and pharmacogenetics (PG) data are significant. In recent years, drug PK parameters, drug interaction parameters, and PG data have been unevenly collected in different databases and published extensively in literature. Also the lack of an appropriate PK ontology and a well-annotated PK corpus, which provide the background knowledge and the criteria of determining DDI, respectively, lead to the difficulty of developing DDI text mining tools for PK data collection from the literature and data integration from multiple databases.To conquer the issues, we constructed a comprehensive pharmacokinetics ontology. It includes all aspects of in vitro pharmacokinetics experiments, in vivo pharmacokinetics studies, as well as drug metabolism and transportation enzymes. Using our pharmacokinetics ontology, a PK corpus was constructed to present four classes of pharmacokinetics abstracts: in vivo pharmacokinetics studies, in vivo pharmacogenetic studies, in vivo drug interaction studies, and in vitro drug interaction studies. A novel hierarchical three-level annotation scheme was proposed and implemented to tag key terms, drug interaction sentences, and drug interaction pairs. The utility of the pharmacokinetics ontology was demonstrated by annotating three pharmacokinetics studies; and the utility of the PK corpus was demonstrated by a drug interaction extraction text mining analysis.The pharmacokinetics ontology annotates both in vitro pharmacokinetics experiments and in vivo pharmacokinetics studies. The PK corpus is a highly valuable resource for the text mining of pharmacokinetics parameters and drug interactions.

  5. Effects of music on memory for text.

    Science.gov (United States)

    Purnell-Webb, Patricia; Speelman, Craig P

    2008-06-01

    Previous research has suggested that the use of song can facilitate recall of text. This study examined the effect of repetition of a melody across verses, familiarity with the melody, rhythm, and other structural processing hypotheses to explain this phenomenon. Two experiments were conducted, each with 100 participants recruited from undergraduate Psychology programs (44 men, 156 women, M age = 28.5 yr., SD = 9.4). In Exp. 1, participants learned a four-verse ballad in one of five encoding conditions (familiar melody, unfamiliar melody, unknown rhythm, known rhythm, and spoken). Exp. 2 assessed the effect of familiarity in rhythm-only conditions and of pre-exposure with a previously unfamiliar melody. Measures taken were number of verbatim words recalled and number of lines produced with correct syllabic structure. Analysis indicated that rhythm, with or without musical accompaniment, can facilitate recall of text, suggesting that rhythm may provide a schematic frame to which text can be attached. Similarly, familiarity with the rhythm or melody facilitated recall. Findings are discussed in terms of integration and dual-processing theories.

  6. Speech Act Classification of German Advertising Texts

    Directory of Open Access Journals (Sweden)

    Артур Нарманович Мамедов

    2015-12-01

    Full Text Available This paper uses the theory of speech acts and the underlying concept of pragmalinguistics to determine the types of speech acts and their classification in the German advertising printed texts. We ascertain that the advertising of cars and accessories, household appliances and computer equipment, watches, fancy goods, food, pharmaceuticals, and financial, insurance, legal services and also airline advertising is dominated by a pragmatic principle, which is based on demonstrating information about the benefits of a product / service. This influences the frequent usage of certain speech acts. The dominant form of exposure is to inform the recipient-user about the characteristics of the advertised product. This information is fore-grounded by means of stylistic and syntactic constructions specific to the advertisement (participial constructions, appositional constructions which contribute to emphasize certain notional components within the framework of the advertising text. Stylistic and syntactic devices of reduction (parceling constructions convey the author's idea. Other means like repetitions, enumerations etc are used by the advertiser to strengthen his selling power. The advertiser focuses the attention of the consumer on the characteristics of the product seeking to convince him of the utility of the product and to influence his/ her buying behavior.

  7. Facilitating text reading in posterior cortical atrophy.

    Science.gov (United States)

    Yong, Keir X X; Rajdev, Kishan; Shakespeare, Timothy J; Leff, Alexander P; Crutch, Sebastian J

    2015-07-28

    We report (1) the quantitative investigation of text reading in posterior cortical atrophy (PCA), and (2) the effects of 2 novel software-based reading aids that result in dramatic improvements in the reading ability of patients with PCA. Reading performance, eye movements, and fixations were assessed in patients with PCA and typical Alzheimer disease and in healthy controls (experiment 1). Two reading aids (single- and double-word) were evaluated based on the notion that reducing the spatial and oculomotor demands of text reading might support reading in PCA (experiment 2). Mean reading accuracy in patients with PCA was significantly worse (57%) compared with both patients with typical Alzheimer disease (98%) and healthy controls (99%); spatial aspects of passages were the primary determinants of text reading ability in PCA. Both aids led to considerable gains in reading accuracy (PCA mean reading accuracy: single-word reading aid = 96%; individual patient improvement range: 6%-270%) and self-rated measures of reading. Data suggest a greater efficiency of fixations and eye movements under the single-word reading aid in patients with PCA. These findings demonstrate how neurologic characterization of a neurodegenerative syndrome (PCA) and detailed cognitive analysis of an important everyday skill (reading) can combine to yield aids capable of supporting important everyday functional abilities. This study provides Class III evidence that for patients with PCA, 2 software-based reading aids (single-word and double-word) improve reading accuracy. © 2015 American Academy of Neurology.

  8. PEDANT: Parallel Texts in Göteborg

    Directory of Open Access Journals (Sweden)

    Daniel Ridings

    2012-09-01

    Full Text Available

    The article presents the status of the PEDANT project with parallel corpora at the Language Bank at Göteborg University. The solutions for access to the corpus data are presented. Access is provided by way of the internet and standard applications and SGML-aware programming tools. The SGML format for encoding translation pairs is outlined together. The methods allow working with everything from plain text to texts densely encoded with linguistic information.

     

    In hierdie artikel word 'n beskrywing gegee van die stand van die PEDANT-projek met parallelle korpora by die Taalbank by die Universiteit van Göteborg. Oplossings vir die verkryging van toegang tot die korpusdata word aangedui. Toegang word verskaf deur middel van die Internet en standaardtoepassings en SGML-sensitiewe programmeringshulpmiddels. Die SGML-formaat vir die enkodering van vertaalpare word gesamentlik geskets. Hierdie metodes laat toe dat gewerk kan word met enigiets vanaf suiwer teks tot tekste wat taalkundig dig geëtiketteer is.

     

  9. DeTEXT: A Database for Evaluating Text Extraction from Biomedical Literature Figures.

    Directory of Open Access Journals (Sweden)

    Xu-Cheng Yin

    Full Text Available Hundreds of millions of figures are available in biomedical literature, representing important biomedical experimental evidence. Since text is a rich source of information in figures, automatically extracting such text may assist in the task of mining figure information. A high-quality ground truth standard can greatly facilitate the development of an automated system. This article describes DeTEXT: A database for evaluating text extraction from biomedical literature figures. It is the first publicly available, human-annotated, high quality, and large-scale figure-text dataset with 288 full-text articles, 500 biomedical figures, and 9308 text regions. This article describes how figures were selected from open-access full-text biomedical articles and how annotation guidelines and annotation tools were developed. We also discuss the inter-annotator agreement and the reliability of the annotations. We summarize the statistics of the DeTEXT data and make available evaluation protocols for DeTEXT. Finally we lay out challenges we observed in the automated detection and recognition of figure text and discuss research directions in this area. DeTEXT is publicly available for downloading at http://prir.ustb.edu.cn/DeTEXT/.

  10. Text mining improves prediction of protein functional sites.

    Directory of Open Access Journals (Sweden)

    Karin M Verspoor

    Full Text Available We present an approach that integrates protein structure analysis and text mining for protein functional site prediction, called LEAP-FS (Literature Enhanced Automated Prediction of Functional Sites. The structure analysis was carried out using Dynamics Perturbation Analysis (DPA, which predicts functional sites at control points where interactions greatly perturb protein vibrations. The text mining extracts mentions of residues in the literature, and predicts that residues mentioned are functionally important. We assessed the significance of each of these methods by analyzing their performance in finding known functional sites (specifically, small-molecule binding sites and catalytic sites in about 100,000 publicly available protein structures. The DPA predictions recapitulated many of the functional site annotations and preferentially recovered binding sites annotated as biologically relevant vs. those annotated as potentially spurious. The text-based predictions were also substantially supported by the functional site annotations: compared to other residues, residues mentioned in text were roughly six times more likely to be found in a functional site. The overlap of predictions with annotations improved when the text-based and structure-based methods agreed. Our analysis also yielded new high-quality predictions of many functional site residues that were not catalogued in the curated data sources we inspected. We conclude that both DPA and text mining independently provide valuable high-throughput protein functional site predictions, and that integrating the two methods using LEAP-FS further improves the quality of these predictions.

  11. Text Detection and Pose Estimation for a Reading Robot

    OpenAIRE

    Bulacu, Marius; Ezaki, Nobuo; Schomaker, Lambert

    2008-01-01

    One very important advantage of using CoCos for text detection is that they naturally allow the analysis to take place across scales. In this approach, scale does not represent such a problematic issue because the CoCo extraction process is scale independent. CoCos give a prompt, but rather imperfect, hold to the structures present in the image and CoCo selection

  12. Extracting BI-RADS Features from Portuguese Clinical Texts

    OpenAIRE

    Nassif, Houssam; Cunha, Filipe; Moreira, Inês C.; Cruz-Correia, Ricardo; Sousa, Eliana; Page, David; Burnside, Elizabeth; Dutra, Inês

    2012-01-01

    In this work we build the first BI-RADS parser for Portuguese free texts, modeled after existing approaches to extract BI-RADS features from English medical records. Our concept finder uses a semantic grammar based on the BIRADS lexicon and on iterative transferred expert knowledge. We compare the performance of our algorithm to manual annotation by a specialist in mammography. Our results show that our parser’s performance is comparable to the manual method.

  13. Sentiment topic mining based on comment tags

    Science.gov (United States)

    Zhang, Daohai; Liu, Xue; Li, Juan; Fan, Mingyue

    2018-03-01

    With the development of e-commerce, various comments based on tags are generated, how to extract valuable information from these comment tags has become an important content of business management decisions. This study takes HUAWEI mobile phone tags as an example using the sentiment analysis and topic LDA mining method. The first step is data preprocessing and classification of comment tag topic mining. And then make the sentiment classification for comment tags. Finally, mine the comments again and analyze the emotional theme distribution under different sentiment classification. The results show that HUAWEI mobile phone has a good user experience in terms of fluency, cost performance, appearance, etc. Meanwhile, it should pay more attention to independent research and development, product design and development. In addition, battery and speed performance should be enhanced.

  14. DeTEXT: A Database for Evaluating Text Extraction from Biomedical Literature Figures.

    Science.gov (United States)

    Yin, Xu-Cheng; Yang, Chun; Pei, Wei-Yi; Man, Haixia; Zhang, Jun; Learned-Miller, Erik; Yu, Hong

    2015-01-01

    Hundreds of millions of figures are available in biomedical literature, representing important biomedical experimental evidence. Since text is a rich source of information in figures, automatically extracting such text may assist in the task of mining figure information. A high-quality ground truth standard can greatly facilitate the development of an automated system. This article describes DeTEXT: A database for evaluating text extraction from biomedical literature figures. It is the first publicly available, human-annotated, high quality, and large-scale figure-text dataset with 288 full-text articles, 500 biomedical figures, and 9308 text regions. This article describes how figures were selected from open-access full-text biomedical articles and how annotation guidelines and annotation tools were developed. We also discuss the inter-annotator agreement and the reliability of the annotations. We summarize the statistics of the DeTEXT data and make available evaluation protocols for DeTEXT. Finally we lay out challenges we observed in the automated detection and recognition of figure text and discuss research directions in this area. DeTEXT is publicly available for downloading at http://prir.ustb.edu.cn/DeTEXT/.

  15. Music as a Component of Biographic Text

    Directory of Open Access Journals (Sweden)

    Svitlana Macenka

    2014-11-01

    Full Text Available The role of music in the biography of a musician is analyzed. The focus is on the life of famous composers interpretation via their work and the role of music in structuring, aestheticization, additional semantization of biography. It is indicated that the artist stands out in the context of his time as the center of productive gravity, thus a musician’s biography demonstrates communication of the life and creative ways, distinguishing its major milestones and at the same time emphasizing the individual changes of the artist and those his innovations that defined the image of the era. It goes about the semantization of music by an author-biographer, the desire to articulate the principles of an individual artist (also by reference to the listening to musical works, comprehended as his musical text, with its help. For example, the genre of musician biography is viewed as based on the corresponding biographies of W. A. Mozart, Ludwig van Beethoven and Richard Strauss. It is noted that W. A. Mozart is considered a catalyst for musical biographical writing, for certain events of his life were such that he started to be generally recognized as the prototype of the artist. Biographers of both Mozart and Beethoven are convinced of the efficiency of the biographical method in relation to musical analysis. Richard Strauss’s musical-biographical text is also structured according to the logic of musical thinking. A tendency towards correlation between life and work inherent in the biographies of musicians is also peculiar to the novel which tends to music.

  16. Interview als Text vs. Interview als Interaktion

    Directory of Open Access Journals (Sweden)

    Arnulf Deppermann

    2013-09-01

    Full Text Available Das Interview ist nach wie vor das beliebteste sozialwissenschaftliche Verfahren des Datengewinns. Ökonomie der Erhebung, Vergleichbarkeit und die Möglichkeit, Einsicht in Praxisbereiche und historisch-biografische Dimensionen zu erhalten, die der direkten Beobachtung kaum zugänglich sind, machen seine Attraktivität aus. Zugleich mehren sich Kritiken, die seine Leistungsfähigkeit problematisieren, indem sie auf die begrenzte Reichweite der Explikationsfähigkeiten der Befragten, die Reaktivität der Erhebung oder die Differenz zwischen Handeln und dem Bericht über Handeln verweisen. Im Beitrag wird zwischen Ansätzen, die das Interview als Text, und solchen, die es als Interaktion verstehen, unterschieden. Nach dem Text-Verständnis werden Interviews unter inhaltlichen Gesichtspunkten analysiert und als Zugang zu einer vorgängigen sozialen oder psychischen Wirklichkeit angesehen. Das Interaktions-Verständnis versteht Interviews dagegen als situierte Praxis, in welcher im Hier und Jetzt von InterviewerInnen und Befragten gemeinsam soziale Sinnstrukturen hergestellt werden. Anhand ubiquitärer Phänomene der Interviewinteraktion – Fragen, Antworten und die Selbstpositionierung von InterviewerInnen und Befragten – werden Praktiken des interaktiv-performativen Handelns im Interview dargestellt. Ihre Relevanz für die Interviewkonstitution und ihre Erkenntnispotenziale für die Interviewauswertung werden aufgezeigt. Es wird dafür plädiert, die interaktive Konstitutionsweise von Interviews empirisch zu erforschen und methodisch konsequent zu berücksichtigen. URN: http://nbn-resolving.de/urn:nbn:de:0114-fqs1303131

  17. Science and Technology Text Mining: Nonlinear Dynamics

    Science.gov (United States)

    2004-02-01

    BUCHNER--J UCLA USA 5 CASATI--G UNIV MILAN ITALY 5 ELNASCHIE--MS CORNELL UNIV USA 5 EPSTEIN--IR BRANDEIS UNIV USA 5 ERTL--G MAX PLANCK GESELL GERMANY 5 The...BRISTOL ENGLAND 235 ARNOLD VI RUSSIAN ACADEMY OF SCIENCE RUSSIA 230 TAKENS F UNIV GRONINGEN NETHERLANDS 212 GASPARD P FREE UNIV BRUSSELS BELGIUM 199...IR BRANDEIS UNIV USA 5 ERTL--G MAX PLANCK GESELL GERMANY 5 Nonlinear Dynamics Text Mining References Page 11 The regional mix of authors has some major

  18. Methods for Mining and Summarizing Text Conversations

    CERN Document Server

    Carenini, Giuseppe; Murray, Gabriel

    2011-01-01

    Due to the Internet Revolution, human conversational data -- in written forms -- are accumulating at a phenomenal rate. At the same time, improvements in speech technology enable many spoken conversations to be transcribed. Individuals and organizations engage in email exchanges, face-to-face meetings, blogging, texting and other social media activities. The advances in natural language processing provide ample opportunities for these "informal documents" to be analyzed and mined, thus creating numerous new and valuable applications. This book presents a set of computational methods

  19. Sleep Habits and Nighttime Texting Among Adolescents.

    Science.gov (United States)

    Garmy, Pernilla; Ward, Teresa M

    2018-04-01

    The aim of this study was to examine sleep habits (i.e., bedtimes and rising times) and their association with nighttime text messaging in 15- to 17-year-old adolescents. This cross-sectional study analyzed data from a web-based survey of adolescent students attending secondary schools in southern Sweden ( N = 278, 50% female). Less than 8 hr of time in bed during school nights was significantly associated with more sleep difficulties, wake time variability on school days and weekends, daytime tiredness, and less enjoyment at school (all ps sleep habits ( p sleep habits and the problems associated with sleeping with a cell phone in the bedroom.

  20. Reduced Text Structure at Two Text Levels: Impacts on the Performance of Technical Readers.

    Science.gov (United States)

    Wenger, Michael J.; Spyridakis, Jan H.

    1993-01-01

    Studies empirically the effects on reader performance of reduced text structure in technical writing texts. Reveals that removal of cues to local coherence produced reliable decrements in reader performance. Discusses results with regard to questions of information design. (HB)