WorldWideScience

Sample records for circulaires textes generaux

  1. Circulaire economie en behoud van natuurlijk kapitaal - Paper

    NARCIS (Netherlands)

    Smits, M.J.W.; Burg, van den S.W.K.; Verburg, R.W.

    2013-01-01

    This paper describes and analyses four inspiring examples of circular production processes in the Netherlands. The central question is how the circular economy can contribute to maintaining natural capital. Natural capital comprises both raw materials and bio

  2. Aspects generaux sur la decentralisation et l'autonomie locale

    Directory of Open Access Journals (Sweden)

    Călin Sabin GHIOLŢAN

    2000-10-01

    Full Text Available Romania, like other countries in transition towards a democratic system of government, faces the challenge posed by a lack of citizen participation in political and administrative decisions. This paper intends to show that similar difficulties exist in the communication process between citizens and officials in traditional democracies, and to underline the importance of the framework of central-local relations in modern administrative systems.

  3. Chances for a circular economy in the Netherlands; Kansen voor de circulaire economie in Nederland

    Energy Technology Data Exchange (ETDEWEB)

    Bastein, T.; Roelofs, E.; Rietveld, E.; Hoogendoorn, A.

    2013-06-15

    The circular economy is an economic and industrial system that focuses on the reusability of products and raw materials, minimises value destruction in the overall system and aims at value creation within each tier of the system. In this report the (economic) opportunities are quantified as far as possible, and impacts on employment and the environment are addressed. The study focuses specifically on the Dutch economy as a whole. The analysis starts with two detailed case studies: the use of biomass waste streams and the circular economy that may arise for products of the metal and electrical sector.

  4. Tonsure circulaire dans l’église Orthodoxe

    Directory of Open Access Journals (Sweden)

    Miljković Bojan

    2013-01-01

    Full Text Available There were two forms of clerical tonsure in the Orthodox Church during the Middle Ages: the cutting of four locks of hair in the shape of a cross, and the circular tonsure. The wreath of hair around the shaved top of the head symbolized Christ’s crown of thorns. The archpriests of the Serbian Orthodox Church practised the circular tonsure from the time of its founder, Sava, until the middle of the 17th century. [Project of the Ministry of Science of the Republic of Serbia, no. 177032: Tradicija, inovacija i identitet u vizantijskom svetu]

  5. The interaction of circularly polarised electromagnetic waves with a plasma; Interaction d'ondes electromagnetiques a polarisation circulaire avec un plasma

    Energy Technology Data Exchange (ETDEWEB)

    Consoli, T.; Legardeur, R.; Slama, L. [Commissariat a l'Energie Atomique, Saclay (France). Centre d'Etudes Nucleaires]

    1961-07-01

    The interaction of left- and right-handed circularly polarised waves with a plasma is studied. The individual trajectories of charges of both signs are traced with an analogue simulator. Applications to plasma heating and to the measurement of the characteristic parameters of a plasma are deduced. (author)

  6. LES PRINCIPES GENERAUX APPLICABLES À LA DÉVOLUTION SUCCESSORALE LEGALE ET LES EXCEPTIONS DE CES PRINCIPES

    Directory of Open Access Journals (Sweden)

    Dumitru MACOVEI

    2006-09-01

    Full Text Available Starting from the fact that, in the Romanian Civil Code, inheritance or succession is seen as a way of obtaining and transmitting the right of property, the author examines the general principles applicable to legal successional devolution: (a) the principle of priority of the class of heirs; (b) the principle of proximity of the degree of kinship; (c) the principle of dividing the succession in equal parts between relatives of the same degree.

  7. Boucliers Circulaires de l'Orient Musulman (Évolution et utilisation)

    Directory of Open Access Journals (Sweden)

    Kalus, Ludvik

    1974-12-01

    Full Text Available The shield, the simplest and oldest of the defensive arms of lone warriors and of infantry or cavalry soldiers, was used by almost all peoples at some stage of their development and disappeared only with the introduction of modern weapons (firearms, chemical and nuclear weapons). Following the history of the arms of all the peoples of the world, we find shields of very different shapes and sizes, determined by the mobility of the army in which they were used, by the nature of the weapons against which they were to serve as a means of defence, by the weight of the material of which they were made, and no doubt by the traditions of the milieu in which they were used.

  8. Le shunt circulaire dans la forme néonatale sévère de la maladie d’Ebstein : effet bénéfique ou délétère des prostaglandines?

    Science.gov (United States)

    Hakim, Kaouthar; Boussaada, Rafik; Ayari, Jihen; Imen, Hamdi; Msaad, Hela; Ouarda, Fatma; Chaker, Lilia

    2014-01-01

    Abstract Ebstein's anomaly with functional pulmonary atresia is a severe neonatal presentation of Ebstein's anomaly in which management is classically based on the prescription of prostaglandins. Circular shunt is a serious and often unrecognised haemodynamic complication that calls for prostaglandins to be stopped. We report a severe neonatal form of Ebstein's anomaly with haemodynamic deterioration attributed to a circular shunt. The diagnosis of Ebstein's anomaly with functional pulmonary atresia was made antenatally at 36 weeks of amenorrhoea. The patient was born at 38 weeks of amenorrhoea by caesarean section. A postnatal ultrasound examination confirmed the diagnosis. Prostaglandin treatment was initially started to keep the compensatory ductus arteriosus open. Despite this treatment, haemodynamic deterioration was observed. The follow-up ultrasound examination showed images suggestive of a circular shunt: blood reaching the pulmonary artery through the wide ductus arteriosus was "sucked" into the right ventricle, then into the right atrium because of the tricuspid regurgitation, and from there into the left heart via the foramen ovale, shunting right to left; it was then ejected into the aorta and the ductus arteriosus. In view of this circular shunt, prostaglandin treatment was stopped and a treatment aimed instead at reducing pulmonary resistance was prescribed. However, the patient died before this treatment could be started. The neonatal form of Ebstein's anomaly is a severe form that can be complicated by a circular shunt. This haemodynamic phenomenon favours early closure of the ductus arteriosus and thus contraindicates the prescription of prostaglandins. PMID:25642457

  9. Text Mining.

    Science.gov (United States)

    Trybula, Walter J.

    1999-01-01

    Reviews the state of research in text mining, focusing on newer developments. The intent is to describe the disparate investigations currently included under the term text mining and provide a cohesive structure for these efforts. A summary of research identifies key organizations responsible for pushing the development of text mining. A section…

  10. Calcul de l'impédance d'un inducteur circulaire plat par conjonction des méthodes intégrales de frontière et éléments finis

    Science.gov (United States)

    Fouladgar, J.; Develey, G.

    1992-11-01

    Knowledge of the impedance of the complete inductor-load assembly is of great importance for the design of the generators used in induction heating. In this paper, we study the case of a planar circular inductor fed by a fixed voltage u of variable frequency up to 30 kHz. To calculate the impedance, one must solve the diffusion equation in an axisymmetric configuration. Here we propose a mixed method, using the finite element method in the volume of the inductor and the load and a boundary integral method at their surfaces.

  11. Administrative memo relative to the delivery of energy conservation certificates; Circulaire relative a la delivrance des certificats d'economies d'energie

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    2006-07-15

    This memo details the principles of energy conservation certificates, the implementing legal texts, the role of the Government and of the ADEME services, the procedure for applying for certificates, and the registration of certificates in the national registry. (A.L.B.)

  12. A Customizable Text Classifier for Text Mining

    Directory of Open Access Journals (Sweden)

    Yun-liang Zhang

    2007-12-01

    Full Text Available Text mining deals with complex and unstructured texts. Usually a particular collection of texts specific to one or more domains is necessary. We have developed a customizable text classifier that lets users mine the collection automatically. It derives from the sentence category of the HNC theory and corresponding techniques. It can start with a few texts, and it can adjust automatically or be adjusted by the user. The user can also control the number of domains chosen and decide the standard by which texts are chosen, based on demand and the abundance of materials. The performance of the classifier varies with the user's choice.

  13. Text-Fabric

    NARCIS (Netherlands)

    Roorda, Dirk

    2016-01-01

    Text-Fabric is a Python3 package for Text plus Annotations. It provides a data model, a text file format, and a binary format for (ancient) text plus (linguistic) annotations. The emphasis of this all is on: data processing; sharing data; and contributing modules. A defining characteristic is that T

  14. Contextual Text Mining

    Science.gov (United States)

    Mei, Qiaozhu

    2009-01-01

    With the dramatic growth of text information, there is an increasing need for powerful text mining systems that can automatically discover useful knowledge from text. Text is generally associated with all kinds of contextual information. Those contexts can be explicit, such as the time and the location where a blog article is written, and the…

  15. Quality text editing

    Directory of Open Access Journals (Sweden)

    Gyöngyi Bujdosó

    2009-10-01

    Full Text Available Text editing is more than the knowledge of word processing techniques. Originally, typographers, printers and text editors were the ones qualified to edit texts, which were well structured, legible, easily understandable and clear, and were able to emphasize the core of the text. Times have changed: nowadays everyone has access to computers and to text editing software, and most users believe that having these tools is enough to edit texts. However, text editing requires more skills. Texts appearing in either printed or electronic form reveal that most users do not realize that they are not qualified to edit and publish their works. Analyzing the ‘text-products’ of the last decade, a clear tendency can be seen. More and more documents appear which, instead of emphasizing the subject matter, are lost in a maze of unstructured text slices. Different font types, colors and sizes, strange arrangements of objects, etc. are applied without further thought. We present examples of the most common typographic and text editing errors. Our aim is to call attention to these mistakes and to persuade users to spend time educating themselves in text editing. They have to realize that a well-structured text strengthens the effect on the reader, so that the original message reaches the target group.

  16. Semantic Text Indexing

    Directory of Open Access Journals (Sweden)

    Zbigniew Kaleta

    2014-01-01

    Full Text Available This article presents a specific issue in the semantic analysis of natural-language texts, namely text indexing, and describes one field of its application (web browsing). The main part of the article describes a computer system that assigns a set of semantic indexes (similar to keywords) to a particular text. The indexing algorithm employs a semantic dictionary to find specific words in a text that represent its content. Furthermore, it compares two given sets of semantic indexes to determine the texts’ similarity (assigning a numerical value). The article describes the semantic dictionary, a tool essential to accomplishing this task, and its usefulness, the main concepts of the algorithm, and test results.

  17. Text Mining: (Asynchronous Sequences)

    Directory of Open Access Journals (Sweden)

    Sheema Khan

    2014-12-01

    Full Text Available In this paper we try to correlate text sequences that share common topics, which serve as semantic clues. We propose a two-step method for asynchronous text mining. Step one checks for the common topics in the sequences and isolates them together with their timestamps. Step two takes a topic and tries to give the timestamp of the text document. After multiple repetitions of step two, an optimum result can be obtained.

  18. Planning Argumentative Texts

    CERN Document Server

    Huang, X

    1994-01-01

    This paper presents PROVERB, a text planner for argumentative texts. PROVERB's main feature is that it combines global hierarchical planning and unplanned organization of text with respect to local derivation relations in a complementary way. The former splits the task of presenting a particular proof into subtasks of presenting subproofs. The latter simulates how the next intermediate conclusion to be presented is chosen under the guidance of the local focus.

  19. Mining text data

    CERN Document Server

    Aggarwal, Charu C

    2012-01-01

    Text mining applications have experienced tremendous advances because of web 2.0 and social networking applications. Recent advances in hardware and software technology have led to a number of unique scenarios where text mining algorithms are learned. "Mining Text Data" introduces an important niche in the text analytics field, and is an edited volume contributed by leading international researchers and practitioners focused on social networks & data mining. This book covers a wide swath of topics across social networks & data mining. Each chapter contains a comprehensive survey including

  20. Instant Sublime Text starter

    CERN Document Server

    Haughee, Eric

    2013-01-01

    A starter which teaches the basic tasks to be performed with Sublime Text with the necessary practical examples and screenshots. This book requires only basic knowledge of the Internet and basic familiarity with any one of the three major operating systems, Windows, Linux, or Mac OS X. However, as Sublime Text 2 is primarily a text editor for writing software, many of the topics discussed will be specifically relevant to software development. That being said, the Sublime Text 2 Starter is also suitable for someone without a programming background who may be looking to learn one of the tools of

  1. Clustering Text Data Streams

    Institute of Scientific and Technical Information of China (English)

    Yu-Bao Liu; Jia-Rong Cai; Jian Yin; Ada Wai-Chee Fu

    2008-01-01

    Clustering text data streams is an important issue in the data mining community and has a number of applications, such as news group filtering, text crawling, document organization, and topic detection and tracing. However, most methods are similarity-based approaches that use only the TF*IDF scheme to represent the semantics of text data, which often leads to poor clustering quality. Recently, researchers have argued that the semantic smoothing model is more efficient than the existing TF*IDF scheme for improving text clustering quality. However, the existing semantic smoothing model is not suitable for a dynamic text data context. In this paper, we first extend the semantic smoothing model to the text data stream context. Based on the extended model, we then present two online clustering algorithms, OCTS and OCTSM, for clustering massive text data streams. In both algorithms, we also present a new cluster statistics structure, named the cluster profile, which can capture the semantics of text data streams dynamically and at the same time speed up the clustering process. Some efficient implementations of our algorithms are also given. Finally, we present a series of experimental results illustrating the effectiveness of our technique.

  2. Making Sense of Texts

    Science.gov (United States)

    Harper, Rebecca G.

    2014-01-01

    This article addresses the triadic nature regarding meaning construction of texts. Grounded in Rosenblatt's (1995; 1998; 2004) Transactional Theory, research conducted in an undergraduate Language Arts curriculum course revealed that when presented with unfamiliar texts, students used prior experiences, social interactions, and literary strategies…

  3. Systematic text condensation

    DEFF Research Database (Denmark)

    Malterud, Kirsti

    2012-01-01

    To present background, principles, and procedures for a strategy for qualitative analysis called systematic text condensation and discuss this approach compared with related strategies.

  4. Linguistics in Text Interpretation

    DEFF Research Database (Denmark)

    Togeby, Ole

    2011-01-01

    A model for how text interpretation proceeds from what is pronounced, through what is said, to what is communicated, and a definition of the concepts 'presupposition' and 'implicature'.

  5. Extracting Text from Video

    Directory of Open Access Journals (Sweden)

    Jayshree Ghorpade

    2011-09-01

    Full Text Available The text data present in images and video contain useful information for automatic annotation, indexing, and structuring of images. However, variations in the text due to differences in style, font, size, orientation and alignment, as well as low image contrast and complex backgrounds, make automatic text extraction an extremely difficult and challenging problem. A large number of techniques have been proposed to address this problem, and the purpose of this paper is to design algorithms for each phase of extracting text from a video using Java libraries and classes. We first split the input video into a stream of images using the Java Media Framework (JMF), the input being either real-time video or a video from the database. We then apply preprocessing algorithms to convert each image to gray scale and remove disturbances, such as lines superimposed over the text, discontinuities, and dots. We continue with the algorithms for localization, segmentation and recognition, for which we use a neural network pattern matching technique. The performance of our approach is demonstrated by presenting experimental results for a set of static images.

  6. EXTRACTING TEXT FROM VIDEO

    Directory of Open Access Journals (Sweden)

    Jayshree Ghorpade

    2011-06-01

    Full Text Available The text data present in images and video contain useful information for automatic annotation, indexing, and structuring of images. However, variations in the text due to differences in style, font, size, orientation and alignment, as well as low image contrast and complex backgrounds, make automatic text extraction an extremely difficult and challenging problem. A large number of techniques have been proposed to address this problem, and the purpose of this paper is to design algorithms for each phase of extracting text from a video using Java libraries and classes. We first split the input video into a stream of images using the Java Media Framework (JMF), the input being either real-time video or a video from the database. We then apply preprocessing algorithms to convert each image to gray scale and remove disturbances, such as lines superimposed over the text, discontinuities, and dots. We continue with the algorithms for localization, segmentation and recognition, for which we use a neural network pattern matching technique. The performance of our approach is demonstrated by presenting experimental results for a set of static images.

  7. About CABI Full Text

    Institute of Scientific and Technical Information of China (English)

    2012-01-01

    Centre for Agriculture and Bioscience International (CABI) is a not-for-profit international Agricultural Information Institute with headquarters in Britain. It aims to improve people’s lives by providing information and applying scientific expertise to solve problems in agriculture and the environment. CABI Full-text is one of the publishing products of CABI. CABI’s full text repository is growing rapidly and has now been integrated into all our databases including CAB Abstracts, Global Health, our Internet Resources and Abstract Journals. There are currently over 60,000 full text articles available to access. These documents, made possible by agreement with third

  8. About CABI Full Text

    Institute of Scientific and Technical Information of China (English)

    2014-01-01

    Centre for Agriculture and Bioscience International (CABI) is a not-for-profit international Agricultural Information Institute with headquarters in Britain. It aims to improve people’s lives by providing information and applying scientific expertise to solve problems in agriculture and the environment. CABI Full-text is one of the publishing products of CABI. CABI’s full text repository is growing rapidly

  9. Emotion Detection from Text

    CERN Document Server

    Shivhare, Shiv Naresh

    2012-01-01

    Emotion can be expressed in many observable ways, such as facial expressions and gestures, speech, and written text. Emotion detection in text documents is essentially a content-based classification problem involving concepts from the domains of Natural Language Processing and Machine Learning. In this paper, emotion recognition based on textual data and the techniques used in emotion detection are discussed.

  10. About CABI Full Text

    Institute of Scientific and Technical Information of China (English)

    2013-01-01

    Centre for Agriculture and Bioscience International (CABI) is a not-for-profit international Agricultural Information Institute with headquarters in Britain. It aims to improve people’s lives by providing information and applying scientific expertise to solve problems in agriculture and the environment. CABI Full-text is one of the publishing products of CABI. CABI’s full text repository is growing rapidly and has now been integrated into all our databases including CAB Abstracts, Global Health

  11. About CABI Full Text

    Institute of Scientific and Technical Information of China (English)

    2013-01-01

    Centre for Agriculture and Bioscience International (CABI) is a not-for-profit international Agricultural Information Institute with headquarters in Britain. It aims to improve people’s lives by providing information and applying scientific expertise to solve problems in agriculture and the environment. CABI Full-text is one of the publishing products of CABI. CABI’s full text repository is growing rapidly and has now been integrated into all our databases including CAB Abstracts, Global Health, our Internet Resources and Jour-

  12. Reading Authentic Texts

    DEFF Research Database (Denmark)

    Balling, Laura Winther

    2013-01-01

    Most research on cognates has focused on words presented in isolation that are easily defined as cognate between L1 and L2. In contrast, this study investigates what counts as cognate in authentic texts and how such cognates are read. Participants with L1 Danish read news articles in their highly...

  13. Texts On-Line.

    Science.gov (United States)

    Thomas, Jean-Jacques

    1993-01-01

    Maintains that the study of signs is divided between those scholars who use the Saussurian binary sign (semiology) and those who prefer the Peirce tripartite sign (semiotics). Concludes that neither the Saussurian nor Peircian analysis methods can produce a semiotic interpretation based on a hierarchy of the text's various components. (CFR)

  14. About CABI Full Text

    Institute of Scientific and Technical Information of China (English)

    2013-01-01

    Centre for Agriculture and Bioscience International (CABI) is a not-for-profit international Agricultural Information Institute with headquarters in Britain. It aims to improve people’s lives by providing information and applying scientific expertise to solve problems in agriculture and the environment. CABI Full-text is one of the publishing products of CABI.

  15. About CABI Full Text

    Institute of Scientific and Technical Information of China (English)

    2013-01-01

    Centre for Agriculture and Bioscience International (CABI) is a not-for-profit international Agricultural Information Institute with headquarters in Britain. It aims to improve people’s lives by providing information and applying scientific expertise to solve problems in agriculture and the environment. CABI Full-text is one of the publishing products of CABI.

  16. About CABI Full Text

    Institute of Scientific and Technical Information of China (English)

    2011-01-01

    Centre for Agriculture and Bioscience International (CABI) is a not-for-profit international Agricultural Information Institute with headquarters in Britain. It aims to improve people's lives by providing information and applying scientific expertise to solve problems in agriculture and the environment. CABI Full-text is one of the publishing products of CABI.

  17. Summarizing Expository Texts

    Science.gov (United States)

    Westby, Carol; Culatta, Barbara; Lawrence, Barbara; Hall-Kenyon, Kendra

    2010-01-01

    Purpose: This article reviews the literature on students' developing skills in summarizing expository texts and describes strategies for evaluating students' expository summaries. Evaluation outcomes are presented for a professional development project aimed at helping teachers develop new techniques for teaching summarization. Methods: Strategies…

  18. Text analysis and computers

    OpenAIRE

    1995-01-01

    Content: Erhard Mergenthaler: Computer-assisted content analysis (3-32); Udo Kelle: Computer-aided qualitative data analysis: an overview (33-63); Christian Mair: Machine-readable text corpora and the linguistic description of languages (64-75); Jürgen Krause: Principles of content analysis for information retrieval systems (76-99); Conference Abstracts (100-131).

  19. New mathematical cuneiform texts

    CERN Document Server

    Friberg, Jöran

    2016-01-01

    This monograph presents in great detail a large number of both unpublished and previously published Babylonian mathematical texts in the cuneiform script. It is a continuation of the work A Remarkable Collection of Babylonian Mathematical Texts (Springer 2007) written by Jöran Friberg, the leading expert on Babylonian mathematics. Focussing on the big picture, Friberg explores in this book several Late Babylonian arithmetical and metro-mathematical table texts from the sites of Babylon, Uruk and Sippar, collections of mathematical exercises from four Old Babylonian sites, as well as a new text from Early Dynastic/Early Sargonic Umma, which is the oldest known collection of mathematical exercises. A table of reciprocals from the end of the third millennium BC, differing radically from well-documented but younger tables of reciprocals from the Neo-Sumerian and Old-Babylonian periods, as well as a fragment of a Neo-Sumerian clay tablet showing a new type of a labyrinth are also discussed. The material is presen...

  20. Polymorphous Perversity in Texts

    Science.gov (United States)

    Johnson-Eilola, Johndan

    2012-01-01

    Here's the tricky part: If we teach ourselves and our students that texts are made to be broken apart, remixed, remade, do we lose the polymorphous perversity that brought us pleasure in the first place? Does the pleasure of transgression evaporate when the borders are opened?

  1. Text as Image.

    Science.gov (United States)

    Woal, Michael; Corn, Marcia Lynn

    As electronically mediated communication becomes more prevalent, print is regaining the original pictorial qualities which graphemes (written signs) lost when primitive pictographs (or picture writing) and ideographs (simplified graphemes used to communicate ideas as well as to represent objects) evolved into first written, then printed, texts of…

  2. The Emar Lexical Texts

    NARCIS (Netherlands)

    Gantzert, Merijn

    2011-01-01

    This four-part work provides a philological analysis and a theoretical interpretation of the cuneiform lexical texts found in the Late Bronze Age city of Emar, in present-day Syria. These word and sign lists, commonly dated to around 1100 BC, were almost all found in the archive of a single school.

  3. Weaving with text

    DEFF Research Database (Denmark)

    Hagedorn-Rasmussen, Peter

    This paper explores how a school principal by means of practical authorship creates reservoirs of language that provide a possible context for collective sensemaking. The paper draws upon a field study in which a school principal, and his managerial team, was shadowed in a period of intensive...... changes. The paper explores how the manager weaves with text, extracted from stakeholders, administration, politicians, employees, public discourse etc., as a means of creating a new fabric, a texture, of diverse perspectives that aims for collective sensemaking....

  4. Metacomprehension of text material.

    Science.gov (United States)

    Maki, R H; Berry, S L

    1984-10-01

    Subjects' abilities to predict future multiple-choice test performance after reading sections of text were investigated in two experiments. In Experiment 1, subjects who scored above median test performance showed some accuracy in their predictions of that test performance. They gave higher mean ratings to material related to correct than to incorrect test answers. Subjects who scored below median test performance did not show this prediction accuracy. The retention interval between reading and the test was manipulated in Experiment 2. Subjects who were tested after at least a 24-hr delay showed results identical to those of Experiment 1. However, when subjects were tested immediately after reading, subjects above and below median test performance gave accurate predictions for the first immediate test. In contrast, both types of subjects gave inaccurate predictions for the second immediate test. Structural variables, such as length, serial position, and hierarchical level of the sections of text were related to subjects' predictions. These variables, in general, were not related to test performance, although the predictions were related to test performance in the conditions described above.

  5. Interconnectedness und digitale Texte

    Directory of Open Access Journals (Sweden)

    Detlev Doherr

    2013-04-01

    Full Text Available Abstract Multimedia information services on the Internet are becoming ever more extensive and comprehensive, and libraries are also digitizing documents that exist only in print and putting them online. These documents can be found via online document management systems or search engines and then provided in common formats such as PDF. This article examines how the Humboldt Digital Library (HDL) works; for more than ten years it has made documents by Alexander von Humboldt available on the Web, free of charge, in English translation. Unlike a digital library, however, it does not merely provide digitized documents as scans or PDFs but makes the text itself available in interlinked form. The system therefore resembles an information system more than a digital library, which is also reflected in the functions available for finding texts in different versions and translations, comparing paragraphs of different documents, or displaying images in their context. The development of dynamic hyperlinks based on the individual text paragraphs of Humboldt's works, in the form of media assets, makes it possible to use the Google Maps programming interface for navigation by geography as well as by text content. Going beyond the service of a digital library, the HDL offers the prototype of a multidimensional information system that works with dynamic structures and enables extensive thematic analyses and comparisons. Summary The multimedia information services on the Internet are becoming more and more comprehensive, and even printed documents are digitized and republished as digital Web documents by libraries. These digital files can be found by search engines or management tools and provided as files in usual formats as

  6. Teaching Text Structure: Examining the Affordances of Children's Informational Texts

    Science.gov (United States)

    Jones, Cindy D.; Clark, Sarah K.; Reutzel, D. Ray

    2016-01-01

    This study investigated the affordances of informational texts to serve as model texts for teaching text structure to elementary school children. Content analysis of a random sampling of children's informational texts from top publishers was conducted on text structure organization and on the inclusion of text features as signals of text…

  7. Circular letter from January 22, 2004 to the presidents of companies having the status of chartered storage facility; Lettre circulaire du 22 janvier 2004 a Messieurs les presidents de societes titulaires du statut d'entrepositaire agree

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    2004-07-01

    This circular letter is intended for owners of storage facilities for petroleum products benefiting from the obligation of strategic storage under article 2 of law no. 92-1443 of December 31, 1992. The attached document recalls the reasons for and content of this obligation, the strategic storage rules in force in France (reference texts, products concerned, operators, stockpile locations, product substitution possibilities, etc.), the monthly declarations, the controls and sanctions, the annual stock location plan, the obligation of information, and the loss or renouncement of chartered status. A schematic summary of the stockpile constitution system is presented in an appendix, for France and for the French overseas départements. The other appendixes concern: the list of petroleum products covered by the legal obligation of strategic storage, the relations between the professional committee of strategic stockpiles (CPSSP) and the limited company for security stocks management (SAGESS), and some examples of monthly and annual declaration forms. (J.S.)

  8. Important Text Characteristics for Early-Grades Text Complexity

    Science.gov (United States)

    Fitzgerald, Jill; Elmore, Jeff; Koons, Heather; Hiebert, Elfrieda H.; Bowen, Kimberly; Sanford-Moore, Eleanor E.; Stenner, A. Jackson

    2015-01-01

    The Common Core set a standard for all children to read increasingly complex texts throughout schooling. The purpose of the present study was to explore text characteristics specifically in relation to early-grades text complexity. Three hundred fifty primary-grades texts were selected and digitized. Twenty-two text characteristics were identified…

  9. Retinal locus for scanning text.

    Science.gov (United States)

    Timberlake, George T; Sharma, Manoj K; Grose, Susan A; Maino, Joseph H

    2006-01-01

    A method of mapping the retinal location of text during reading is described in which text position is plotted cumulatively on scanning laser ophthalmoscope retinal images. Retinal locations that contain text most often are the brightest in the cumulative plot, and locations that contain text least often are the darkest. In this way, the retinal area that most often contains text is determined. Text maps were plotted for eight control subjects without vision loss and eight subjects with central scotomas from macular degeneration. Control subjects' text maps showed that the fovea contained text most often. Text maps of five of the subjects with scotomas showed that they used the same peripheral retinal area to scan text and fixate. Text maps of the other three subjects with scotomas showed that they used separate areas to scan text and fixate. Retinal text maps may help evaluate rehabilitative strategies for training individuals with central scotomas to use a particular retinal area to scan text.
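The cumulative plotting idea in this abstract can be illustrated with a minimal sketch. It assumes each video frame has already been reduced to a binary grid marking where text falls on the retinal image; the grid representation and the function names `cumulative_text_map` and `most_used_cell` are hypothetical and are not taken from the paper.

```python
def cumulative_text_map(frames):
    """Sum binary text masks cell by cell and scale the result to 0.0-1.0.

    frames: list of equally sized 2-D lists; 1 marks a cell covered by text.
    (Hypothetical input format, assumed for illustration only.)
    """
    rows, cols = len(frames[0]), len(frames[0][0])
    counts = [[0] * cols for _ in range(rows)]
    for frame in frames:
        for r in range(rows):
            for c in range(cols):
                counts[r][c] += frame[r][c]
    peak = max(max(row) for row in counts)
    if peak == 0:
        # No frame contained text anywhere: return an all-dark map.
        return [[0.0] * cols for _ in range(rows)]
    return [[count / peak for count in row] for row in counts]


def most_used_cell(text_map):
    """Return (row, col) of the brightest cell, the one most often covered."""
    return max(
        ((r, c) for r in range(len(text_map)) for c in range(len(text_map[0]))),
        key=lambda rc: text_map[rc[0]][rc[1]],
    )
```

Cells that contain text in many frames end up close to 1.0 (brightest) and cells that rarely contain text stay close to 0.0 (darkest), which mirrors the brightness convention the abstract describes.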

  10. Automatic Text Decomposition and Structuring.

    Science.gov (United States)

    Salton, Gerard; And Others

    1996-01-01

    Text similarity measurements are used to determine relationships between natural-language texts and text excerpts. The resulting linked hypertext maps can be broken down into text segments and themes used to identify different text types and structures, leading to improved information access and utilization. Examples are provided for text…
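As a rough illustration of linking texts by similarity, the sketch below uses cosine similarity over raw term counts. The abstract does not say which similarity measure, term weighting, or threshold the authors use, so the measure, the whitespace tokenizer, the 0.5 threshold, and all function names here are assumptions.

```python
import math
from collections import Counter


def term_counts(text):
    """Lowercase the text, split on whitespace, and count each token."""
    return Counter(text.lower().split())


def cosine_similarity(text_a, text_b):
    """Cosine of the angle between two term-count vectors (0.0 to 1.0)."""
    a, b = term_counts(text_a), term_counts(text_b)
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty text is treated as similar to nothing
    return dot / (norm_a * norm_b)


def link_similar(texts, threshold=0.5):
    """Return index pairs (i, j), i < j, whose similarity meets the threshold."""
    return [
        (i, j)
        for i in range(len(texts))
        for j in range(i + 1, len(texts))
        if cosine_similarity(texts[i], texts[j]) >= threshold
    ]
```

The pairs returned by `link_similar` can be read as the edges of a link map between texts or excerpts; segmenting that map into themes, as the abstract mentions, is beyond this sketch.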

  11. Temporal Adverbials in Text Structuring: On Temporal Text Strategy.

    Science.gov (United States)

    Virtanen, Tuija

    This paper discusses clause-initial adverbials of time functioning as signals of the temporal text strategy. A chain of such markers creates cohesion and coherence by forming continuity in the text and also signals textual boundaries that occur on different hierarchic levels. The temporal text strategy is closely associated with narrative text.…

  12. Text analysis methods, text analysis apparatuses, and articles of manufacture

    Science.gov (United States)

    Whitney, Paul D; Willse, Alan R; Lopresti, Charles A; White, Amanda M

    2014-10-28

    Text analysis methods, text analysis apparatuses, and articles of manufacture are described according to some aspects. In one aspect, a text analysis method includes accessing information indicative of data content of a collection of text comprising a plurality of different topics, using a computing device, analyzing the information indicative of the data content, and using results of the analysis, identifying a presence of a new topic in the collection of text.

  13. Short Text Classification: A Survey

    Directory of Open Access Journals (Sweden)

    Ge Song

    2014-05-01

    Full Text Available With the recent explosive growth of e-commerce and online communication, a new genre of text, short text, has come into wide use in many areas, and much research now focuses on short text mining. Classifying short text is challenging because of its inherent characteristics, such as sparseness, large scale, immediacy and non-standardization. Traditional methods struggle with short text classification mainly because the few words in a short text cannot represent the feature space or the relationship between words and documents. Several studies and reviews on text classification have appeared recently, but only a few focus on short text classification. This paper discusses the characteristics of short text and the difficulty of short text classification. We then introduce existing popular work on short text classifiers and models, including short text classification using semantic analysis, semi-supervised short text classification, ensemble short text classification, and real-time classification. The evaluation of short text classification is also analyzed. Finally, we summarize existing classification technology and the prospects for the development of short text classification.

  14. Mining the Text: 34 Text Features that Can Ease or Obstruct Text Comprehension and Use

    Science.gov (United States)

    White, Sheida

    2012-01-01

    This article presents 34 characteristics of texts and tasks ("text features") that can make continuous (prose), noncontinuous (document), and quantitative texts easier or more difficult for adolescents and adults to comprehend and use. The text features were identified by examining the assessment tasks and associated texts in the national…

  15. Text-Attentional Convolutional Neural Network for Scene Text Detection.

    Science.gov (United States)

    He, Tong; Huang, Weilin; Qiao, Yu; Yao, Jian

    2016-06-01

    Recent deep learning models have demonstrated strong capabilities for classifying text and non-text components in natural images. They extract a high-level feature globally computed from a whole image component (patch), where the cluttered background information may dominate true text features in the deep representation. This leads to less discriminative power and poorer robustness. In this paper, we present a new system for scene text detection by proposing a novel text-attentional convolutional neural network (Text-CNN) that particularly focuses on extracting text-related regions and features from the image components. We develop a new learning mechanism to train the Text-CNN with multi-level and rich supervised information, including text region mask, character label, and binary text/non-text information. The rich supervision information enables the Text-CNN with a strong capability for discriminating ambiguous texts, and also increases its robustness against complicated background components. The training process is formulated as a multi-task learning problem, where low-level supervised information greatly facilitates the main task of text/non-text classification. In addition, a powerful low-level detector called contrast-enhancement maximally stable extremal regions (MSERs) is developed, which extends the widely used MSERs by enhancing intensity contrast between text patterns and background. This allows it to detect highly challenging text patterns, resulting in a higher recall. Our approach achieved promising results on the ICDAR 2013 data set, with an F-measure of 0.82, substantially improving the state-of-the-art results.
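For readers unfamiliar with the reported metric, the F-measure is conventionally the harmonic mean of precision and recall. The abstract gives only the combined value of 0.82, not the underlying precision and recall, and the ICDAR evaluation protocol defines its own rules for matching detections to ground truth, so the symbols below (TP, FP, FN for true positives, false positives and false negatives) are the generic textbook definition rather than the authors' exact procedure.

```latex
P = \frac{TP}{TP + FP}, \qquad
R = \frac{TP}{TP + FN}, \qquad
F = \frac{2\,P\,R}{P + R}
```

Because the harmonic mean is pulled toward the smaller of its two arguments, a detector cannot reach a high F-measure by trading away recall for precision or the reverse.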

  16. The Challenge of Challenging Text

    Science.gov (United States)

    Shanahan, Timothy; Fisher, Douglas; Frey, Nancy

    2012-01-01

    The Common Core State Standards emphasize the value of teaching students to engage with complex text. But what exactly makes a text complex, and how can teachers help students develop their ability to learn from such texts? The authors of this article discuss five factors that determine text complexity: vocabulary, sentence structure, coherence,…

  17. Text Classification using Artificial Intelligence

    CERN Document Server

    Kamruzzaman, S M

    2010-01-01

    Text classification is the process of classifying documents into predefined categories based on their content. It is the automated assignment of natural language texts to predefined categories. Text classification is the primary requirement of text retrieval systems, which retrieve texts in response to a user query, and text understanding systems, which transform text in some way such as producing summaries, answering questions or extracting data. Existing supervised learning algorithms for classifying text need sufficient documents to learn accurately. This paper presents a new algorithm for text classification using an artificial intelligence technique that requires fewer documents for training. Instead of using words, word relations, i.e. association rules derived from these words, are used to derive the feature set from pre-classified text documents. The concept of the naïve Bayes classifier is then used on the derived features, and finally a single concept from genetic algorithms is added for the final classification. A syste...

  18. Text Classification using Data Mining

    CERN Document Server

    Kamruzzaman, S M; Hasan, Ahmed Ryadh

    2010-01-01

    Text classification is the process of classifying documents into predefined categories based on their content. It is the automated assignment of natural language texts to predefined categories. Text classification is the primary requirement of text retrieval systems, which retrieve texts in response to a user query, and text understanding systems, which transform text in some way such as producing summaries, answering questions or extracting data. Existing supervised learning algorithms to automatically classify text need sufficient documents to learn accurately. This paper presents a new algorithm for text classification using data mining that requires fewer documents for training. Instead of using words, word relations, i.e. association rules derived from these words, are used to derive the feature set from pre-classified text documents. The concept of the Naive Bayes classifier is then used on the derived features, and finally a single concept from genetic algorithms is added for the final classification. A system based on the...

  19. Text analysis devices, articles of manufacture, and text analysis methods

    Science.gov (United States)

    Turner, Alan E; Hetzler, Elizabeth G; Nakamura, Grant C

    2013-05-28

    Text analysis devices, articles of manufacture, and text analysis methods are described according to some aspects. In one aspect, a text analysis device includes processing circuitry configured to analyze initial text to generate a measurement basis usable in analysis of subsequent text, wherein the measurement basis comprises a plurality of measurement features from the initial text, a plurality of dimension anchors from the initial text and a plurality of associations of the measurement features with the dimension anchors, and wherein the processing circuitry is configured to access a viewpoint indicative of a perspective of interest of a user with respect to the analysis of the subsequent text, and wherein the processing circuitry is configured to use the viewpoint to generate the measurement basis.

  20. Text-Attentional Convolutional Neural Networks for Scene Text Detection.

    Science.gov (United States)

    He, Tong; Huang, Weilin; Qiao, Yu; Yao, Jian

    2016-03-28

    Recent deep learning models have demonstrated strong capabilities for classifying text and non-text components in natural images. They extract a high-level feature computed globally from a whole image component (patch), where the cluttered background information may dominate true text features in the deep representation. This leads to less discriminative power and poorer robustness. In this work, we present a new system for scene text detection by proposing a novel Text-Attentional Convolutional Neural Network (Text-CNN) that particularly focuses on extracting text-related regions and features from the image components. We develop a new learning mechanism to train the Text-CNN with multi-level and rich supervised information, including text region mask, character label, and binary text/nontext information. The rich supervision information enables the Text-CNN with a strong capability for discriminating ambiguous texts, and also increases its robustness against complicated background components. The training process is formulated as a multi-task learning problem, where low-level supervised information greatly facilitates the main task of text/non-text classification. In addition, a powerful low-level detector called Contrast-Enhancement Maximally Stable Extremal Regions (CE-MSERs) is developed, which extends the widely-used MSERs by enhancing intensity contrast between text patterns and background. This allows it to detect highly challenging text patterns, resulting in a higher recall. Our approach achieved promising results on the ICDAR 2013 dataset, with an F-measure of 0.82, improving the state-of-the-art results substantially.

  1. Text-Attentional Convolutional Neural Network for Scene Text Detection

    Science.gov (United States)

    He, Tong; Huang, Weilin; Qiao, Yu; Yao, Jian

    2016-06-01

    Recent deep learning models have demonstrated strong capabilities for classifying text and non-text components in natural images. They extract a high-level feature computed globally from a whole image component (patch), where the cluttered background information may dominate true text features in the deep representation. This leads to less discriminative power and poorer robustness. In this work, we present a new system for scene text detection by proposing a novel Text-Attentional Convolutional Neural Network (Text-CNN) that particularly focuses on extracting text-related regions and features from the image components. We develop a new learning mechanism to train the Text-CNN with multi-level and rich supervised information, including text region mask, character label, and binary text/nontext information. The rich supervision information enables the Text-CNN with a strong capability for discriminating ambiguous texts, and also increases its robustness against complicated background components. The training process is formulated as a multi-task learning problem, where low-level supervised information greatly facilitates the main task of text/non-text classification. In addition, a powerful low-level detector called Contrast-Enhancement Maximally Stable Extremal Regions (CE-MSERs) is developed, which extends the widely-used MSERs by enhancing intensity contrast between text patterns and background. This allows it to detect highly challenging text patterns, resulting in a higher recall. Our approach achieved promising results on the ICDAR 2013 dataset, with an F-measure of 0.82, improving the state-of-the-art results substantially.

  2. Contrastive Study of Coherence in Chinese Text and English Text

    Institute of Scientific and Technical Information of China (English)

    王婷

    2013-01-01

    The paper presents the text-linguistic concepts on which the analysis of textual structure is based, including text and discourse, and coherence and cohesion. In addition, it seeks to identify how English text (ET) and Chinese text (CT) differ, including in their coherence structures.

  3. Test of Picture-Text Amalgams in Procedural Texts.

    Science.gov (United States)

    Stone, David Edey

    Designed to assess how people read and comprehend information presented in picture-text amalgams in procedural texts, this instrument presents various combinations of text information and illustrative information on slides. Subjects are assigned to one of four conditions and directed to follow the instructions presented on the slides. Videotapes…

  4. Text mining from ontology learning to automated text processing applications

    CERN Document Server

    Biemann, Chris

    2014-01-01

    This book comprises a set of articles that specify the methodology of text mining, describe the creation of lexical resources in the framework of text mining and use text mining for various tasks in natural language processing (NLP). The analysis of large amounts of textual data is a prerequisite to build lexical resources such as dictionaries and ontologies and also has direct applications in automated text processing in fields such as history, healthcare and mobile applications, just to name a few. This volume gives an update in terms of the recent gains in text mining methods and reflects

  5. Working with text tools, techniques and approaches for text mining

    CERN Document Server

    Tourte, Gregory J L

    2016-01-01

    Text mining tools and technologies have long been a part of the repository world, where they have been applied to a variety of purposes, from pragmatic aims to support tools. Research areas as diverse as biology, chemistry, sociology and criminology have seen effective use made of text mining technologies. Working With Text collects a subset of the best contributions from the 'Working with text: Tools, techniques and approaches for text mining' workshop, alongside contributions from experts in the area. Text mining tools and technologies in support of academic research include supporting research on the basis of a large body of documents, facilitating access to and reuse of extant work, and bridging between the formal academic world and areas such as traditional and social media. Jisc have funded a number of projects, including NaCTem (the National Centre for Text Mining) and the ResDis programme. Contents are developed from workshop submissions and invited contributions, including: Legal considerations in te...

  6. Cluster Based Text Classification Model

    DEFF Research Database (Denmark)

    Nizamani, Sarwat; Memon, Nasrullah; Wiil, Uffe Kock

    2011-01-01

    We propose a cluster based classification model for suspicious email detection and other text classification tasks. The text classification tasks comprise many training examples that require a complex classification model. Using clusters for classification makes the model simpler and increases...

  7. Text Association Analysis and Ambiguity in Text Mining

    Science.gov (United States)

    Bhonde, S. B.; Paikrao, R. L.; Rahane, K. U.

    2010-11-01

    Text mining (TM) is the process of analyzing a semantically rich document or set of documents to understand the content and meaning of the information they contain. Research in text mining will enhance humans' ability to process massive quantities of information, and it has high commercial value. The paper first introduces and defines TM, then gives an overview of the text mining process and its applications. Up to now, little research in text mining, especially in concept/entity extraction, has focused on the ambiguity problem. This paper addresses ambiguity issues in natural language texts and presents a new technique for resolving ambiguity when extracting concepts/entities from texts. Finally, it shows the importance of TM in knowledge discovery and highlights the upcoming challenges of document mining and the opportunities it offers.

  8. Text Signals Influence Team Artifacts

    Science.gov (United States)

    Clariana, Roy B.; Rysavy, Monica D.; Taricani, Ellen

    2015-01-01

    This exploratory quasi-experimental investigation describes the influence of text signals on team visual map artifacts. In two course sections, four-member teams were given one of two print-based text passage versions on the course-related topic "Social influence in groups" downloaded from Wikipedia; this text had two paragraphs, each…

  9. Too Dumb for Complex Texts?

    Science.gov (United States)

    Bauerlein, Mark

    2011-01-01

    High school students' lack of experience and practice with reading complex texts is a primary cause of their difficulties with college-level reading. Filling the syllabus with digital texts does little to address this deficiency. Complex texts demand three dispositions from readers: a willingness to probe works characterized by dense meanings, the…

  10. Multilingual Text Analysis for Text-to-Speech Synthesis

    CERN Document Server

    Sproat, R

    1996-01-01

    We present a model of text analysis for text-to-speech (TTS) synthesis based on (weighted) finite-state transducers, which serves as the text-analysis module of the multilingual Bell Labs TTS system. The transducers are constructed using a lexical toolkit that allows declarative descriptions of lexicons, morphological rules, numeral-expansion rules, and phonological rules, inter alia. To date, the model has been applied to eight languages: Spanish, Italian, Romanian, French, German, Russian, Mandarin and Japanese.

  11. A Survey on Web Text Information Retrieval in Text Mining

    Directory of Open Access Journals (Sweden)

    Tapaswini Nayak

    2015-08-01

    Full Text Available In this study we analyze different techniques for information retrieval in text mining, with the aim of identifying approaches to web text information retrieval. Text mining, roughly equivalent to text analytics, is the process of deriving high-quality information from text. High-quality information is typically derived by devising patterns and trends through means such as statistical pattern learning. Typical text mining tasks include text categorization, text clustering, concept/entity extraction, creation of coarse taxonomies, sentiment analysis, document summarization and entity relation modeling. Text mining is used to mine hidden information from unstructured or semi-structured data. This capability is necessary because a large amount of Web information is semi-structured due to the nested structure of HTML code, is linked, and is redundant. Web content categorization with a content database is the most important tool for the efficient use of search engines. A customer requesting information on a particular subject or item would otherwise have to search through hundreds of results to find the information most relevant to the query; text mining reduces these hundreds of results. This eliminates the aggravation and improves the navigation of information on the Web.

  12. Predicting Prosody from Text for Text-to-Speech Synthesis

    CERN Document Server

    Rao, K Sreenivasa

    2012-01-01

    Predicting Prosody from Text for Text-to-Speech Synthesis covers the specific aspects of prosody, mainly focusing on how to predict the prosodic information from linguistic text, and then how to exploit the predicted prosodic knowledge for various speech applications. Author K. Sreenivasa Rao discusses proposed methods along with state-of-the-art techniques for the acquisition and incorporation of prosodic knowledge for developing speech systems. Positional, contextual and phonological features are proposed for representing the linguistic and production constraints of the sound units present in the text. This book is intended for graduate students and researchers working in the area of speech processing.

  13. Text comprehension practice in school

    Directory of Open Access Journals (Sweden)

    Hernández, José Emilio

    2010-01-01

    Full Text Available The starting point of the study is the existence of relations between the two dimensions of text comprehension: the instrumental dimension and the cognitive dimension. The first comprises the system of actions, the second the system of knowledge. Identifying, describing, inferring, appraising and creating actions are described for each type of text. Likewise, the importance of implementing text comprehension is outlined on the assumption that the text is a tool for preserving and communicating culture, one that allows human beings to widen their cultural horizons and develop cognitive and affective processes that allow them to acquire universal morals.

  14. Text mining: A Brief survey

    Directory of Open Access Journals (Sweden)

    Falguni N. Patel, Neha R. Soni

    2012-12-01

    Full Text Available Unstructured texts, which contain massive amounts of information, cannot simply be used for further processing by computers. Specific processing methods and algorithms are therefore required to extract useful patterns. Text mining is the process of extracting interesting information and knowledge from unstructured text. In this paper, we discuss text mining as a recent and interesting field, detailing the steps involved in the overall process. We also discuss different technologies that teach computers natural language so that they may analyze, understand, and even generate text. In addition, we briefly discuss a number of successful applications of text mining, both current and prospective.

  15. TEXT DEIXIS IN NARRATIVE SEQUENCES

    Directory of Open Access Journals (Sweden)

    Josep Rivera

    2007-06-01

    Full Text Available This study looks at demonstrative descriptions, regarding them as text-deictic procedures which contribute to weaving discourse reference. Text deixis is thought of as a metaphorical referential device which maps the ground of utterance onto the text itself. Demonstrative expressions with textual antecedent-triggers, considered as the most important text-deictic units, are identified in a narrative corpus consisting of J. M. Barrie’s Peter Pan and its translation into Catalan. Some linguistic and discourse variables related to DemNPs are analysed to characterise text deixis adequately. It is shown that this referential device is usually combined with abstract nouns, thus categorising and encapsulating (non-nominal) complex discourse entities as nouns, while performing a referential cohesive function by means of the text deixis + general noun type of lexical cohesion.

  16. Knowledge Representation in Travelling Texts

    DEFF Research Database (Denmark)

    Mousten, Birthe; Locmele, Gunta

    2014-01-01

    Today, information travels fast. Texts travel, too. In a corporate context, the question is how to manage which knowledge elements should travel to a new language area or market and in which form? The decision to let knowledge elements travel or not travel highly depends on the limitation...... and the purpose of the text in a new context as well as on predefined parameters for text travel. For texts used in marketing and in technology, the question is whether culture-bound knowledge representation should be domesticated or kept as foreign elements, or should be mirrored or moulded—or should not travel...... at all! When should semantic and pragmatic elements in a text be replaced and by which other elements? The empirical basis of our work is marketing and technical texts in English, which travel into the Latvian and Danish markets, respectively....

  17. Texting while driving: is speech-based text entry less risky than handheld text entry?

    Science.gov (United States)

    He, J; Chaparro, A; Nguyen, B; Burge, R J; Crandall, J; Chaparro, B; Ni, R; Cao, S

    2014-11-01

    Research indicates that using a cell phone to talk or text while maneuvering a vehicle impairs driving performance. However, few published studies directly compare the distracting effects of texting using a hands-free (i.e., speech-based interface) versus handheld cell phone, which is an important issue for legislation, automotive interface design and driving safety training. This study compared the effect of speech-based versus handheld text entries on simulated driving performance by asking participants to perform a car following task while controlling the duration of a secondary text-entry task. Results showed that both speech-based and handheld text entries impaired driving performance relative to the drive-only condition by causing more variation in speed and lane position. Handheld text entry also increased the brake response time and increased variation in headway distance. Text entry using a speech-based cell phone was less detrimental to driving performance than handheld text entry. Nevertheless, the speech-based text entry task still significantly impaired driving compared to the drive-only condition. These results suggest that speech-based text entry disrupts driving, but reduces the level of performance interference compared to text entry with a handheld device. In addition, the difference in the distraction effect caused by speech-based and handheld text entry is not simply due to the difference in task duration.

  18. Text Type and Translation Strategy

    Institute of Scientific and Technical Information of China (English)

    刘福娟

    2015-01-01

    Translation strategy and translation standards are undoubtedly the core problems translators are confronted with in translation. There have arisen many kinds of translation strategies in translation history, among which the text type theory is considered an important breakthrough and a significant complement of traditional translation standards. This essay attempts to demonstrate the value of text typology (informative, expressive, and operative) to translation strategy, emphasizing the importance of text types and their communicative functions.

  19. Typesafe Modeling in Text Mining

    CERN Document Server

    Steeg, Fabian

    2011-01-01

    Based on the concept of annotation-based agents, this report introduces tools and a formal notation for defining and running text mining experiments using a statically typed domain-specific language embedded in Scala. Using machine learning for classification as an example, the framework is used to develop and document text mining experiments, and to show how the concept of generic, typesafe annotation corresponds to a general information model that goes beyond text processing.

  20. Text Mining Applications and Theory

    CERN Document Server

    Berry, Michael W

    2010-01-01

    Text Mining: Applications and Theory presents the state-of-the-art algorithms for text mining from both the academic and industrial perspectives.  The contributors span several countries and scientific domains: universities, industrial corporations, and government laboratories, and demonstrate the use of techniques from machine learning, knowledge discovery, natural language processing and information retrieval to design computational models for automated text analysis and mining. This volume demonstrates how advancements in the fields of applied mathematics, computer science, machine learning

  1. Hermeneutic reading of classic texts.

    Science.gov (United States)

    Koskinen, Camilla A-L; Lindström, Unni Å

    2013-09-01

    The purpose of this article is to broaden the understanding of the hermeneutic reading of classic texts. The aim is to show how the choice of a specific scientific tradition in conjunction with a methodological approach creates the foundation that clarifies the actual realization of the reading. This hermeneutic reading of classic texts is inspired by Gadamer's notion that it is the researcher's own research tradition and a clearly formulated theoretical fundamental order that shape the researcher's attitude towards texts and create the starting point that guides all reading, uncovering and interpretation. The researcher's ethical position originates in a will to openness towards what is different in the text and which constantly sets the researcher's preunderstanding and research tradition in movement. It is the researcher's attitude towards the text that allows the text to address, touch and arouse wonder. Through a flexible, lingering and repeated reading of classic texts, what is different emerges with a timeless value. The reading of classic texts is an act that may rediscover and create understanding for essential dimensions and of human beings' reality on a deeper level. The hermeneutic reading of classic texts thus brings to light constantly new possibilities of uncovering for a new envisioning and interpretation for a new understanding of the essential concepts and phenomena within caring science.

  2. Improve Reading with Complex Texts

    Science.gov (United States)

    Fisher, Douglas; Frey, Nancy

    2015-01-01

    The Common Core State Standards have cast a renewed light on reading instruction, presenting teachers with the new requirements to teach close reading of complex texts. Teachers and administrators should consider a number of essential features of close reading: They are short, complex texts; rich discussions based on worthy questions; revisiting…

  3. Strategies for Translating Vocative Texts

    Directory of Open Access Journals (Sweden)

    Olga COJOCARU

    2014-12-01

    The paper deals with the linguistic and cultural elements of vocative texts and the techniques used in translating them by giving some examples of texts that are typically vocative (i.e. advertisements and instructions for use). Semantic and communicative strategies are popular in translation studies and each of them has its own advantages and disadvantages in translating vocative texts. The advantage of semantic translation is that it takes more account of the aesthetic value of the SL text, while communicative translation attempts to render the exact contextual meaning of the original text in such a way that both content and language are readily acceptable and comprehensible to the readership. Focus is laid on the strategies used in translating vocative texts, strategies that highlight and introduce a cultural context to the target audience, in order to achieve their overall purpose, that is to sell or persuade the reader to behave in a certain way. To that end, a number of advertisements from the cosmetics industry and the field of electronic gadgets were selected for analysis. The aim is to gather insights into vocative text translation and to create new perspectives on this field of research, now considered a process of innovation and diversion, especially in areas as important as economy and marketing.

  4. Text Retrieval on a Microcomputer.

    Science.gov (United States)

    Giordano, Richard; And Others

    1988-01-01

    Presents description of the Generalized Automatic Text Organization and Retrieval system (GATOR), a database system that indexes and retrieves information from machine-readable texts such as interviews and case histories. Qualitative and quantitative analyses are discussed, and integrating GATOR with standard statistical packages is described.…

  5. Dangers of Texting While Driving

    Science.gov (United States)

    ... nhtsa.gov/risky-driving/distracted-driving . Print Out Texting While Driving Guide (pdf) File a Complaint with the FCC ... Office: Consumer and Governmental Affairs Tags: Consumers - Distracted Driving - Health and Safety - Texting Federal Communications Commission 445 12th Street SW, Washington, ...

  6. Text analysis for knowledge graphs

    NARCIS (Netherlands)

    Popping, Roel

    2007-01-01

    The concept of knowledge graphs is introduced as a method to represent the state of the art in a specific scientific discipline. Next the text analysis part in the construction of such graphs is considered. Here the 'translation' from text to graph takes place. The method that is used here is compar

  7. Linguistic Dating of Biblical Texts

    DEFF Research Database (Denmark)

    Ehrensvärd, Martin Gustaf

    2003-01-01

    For two centuries, scholars have pointed to consistent differences in the Hebrew of certain biblical texts and interpreted these differences as reflecting the date of composition of the texts. Until the 1980s, this was quite uncontroversial as the linguistic findings largely confirmed...... the chronology of the texts established by other means: the Hebrew of Genesis-2 Kings was judged to be early and that of Esther, Daniel, Ezra, Nehemiah, and Chronicles to be late. In the current debate where revisionists have questioned the traditional dating, linguistic arguments in the dating of texts have...... come more into focus. The study critically examines some linguistic arguments adduced to support the traditional position, and reviewing the arguments it points to weaknesses in the linguistic dating of EBH texts to pre-exilic times. When viewing the linguistic evidence in isolation it will be clear...

  8. Text mining for systems biology.

    Science.gov (United States)

    Fluck, Juliane; Hofmann-Apitius, Martin

    2014-02-01

    Scientific communication in biomedicine is, by and large, still text based. Text mining technologies for the automated extraction of useful biomedical information from unstructured text that can be directly used for systems biology modelling have been substantially improved over the past few years. In this review, we underline the importance of named entity recognition and relationship extraction as fundamental approaches that are relevant to systems biology. Furthermore, we emphasize the role of publicly organized scientific benchmarking challenges that reflect the current status of text-mining technology and are important in moving the entire field forward. Given further interdisciplinary development of systems biology-orientated ontologies and training corpora, we expect a steadily increasing impact of text-mining technology on systems biology in the future.

  9. Text Analytics to Data Warehousing

    Directory of Open Access Journals (Sweden)

    Kalli Srinivasa Nageswara Prasad

    2010-09-01

    Information hidden or stored in unstructured data can play a critical role in making decisions, understanding and conducting other business functions. Integrating data stored in both structured and unstructured formats can add significant value to an organization. With the extent of development happening in text mining and in technologies that deal with unstructured and semi-structured data, such as XML and MML (Mining Markup Language), to extract and analyze data, text analytics has evolved to handle unstructured data and helps unlock and predict business results via Business Intelligence and Data Warehousing. Text mining involves dealing with texts in documents and discovering hidden patterns, whereas text analytics enhances information retrieval in the form of search and clustering of results; moreover, text analytics is text mining plus visualization. In this paper we discuss handling unstructured data in documents so that they fit into business applications such as data warehouses for further analysis, and how this helps in the framework we have used for the solution.

  10. Biomarker Identification Using Text Mining

    Directory of Open Access Journals (Sweden)

    Hui Li

    2012-01-01

    Identifying molecular biomarkers has become one of the important tasks for scientists to assess the different phenotypic states of cells or organisms correlated to the genotypes of diseases from large-scale biological data. In this paper, we propose a text-mining-based method to discover biomarkers from PubMed. First, we construct a database based on a dictionary, and then we use a finite state machine to identify the biomarkers. Our method of text mining provides a highly reliable approach to discover the biomarkers in the PubMed database.

  11. Anomaly Detection with Text Mining

    Data.gov (United States)

    National Aeronautics and Space Administration — Many existing complex space systems have a significant amount of historical maintenance and problem data bases that are stored in unstructured text forms. The...

  12. Text Steganographic Approaches: A Comparison

    Directory of Open Access Journals (Sweden)

    Monika Agarwal

    2013-02-01

    This paper presents three novel approaches to text steganography. The first approach uses the theme of a missing-letter puzzle, where each character of the message is hidden by omitting one or more letters in a word of the cover. The average Jaro score was found to be 0.95, indicating close similarity between the cover and stego files. The second approach hides a message in a wordlist, where the ASCII value of the embedded character determines the length and starting letter of a word. The third approach conceals a message, without degrading the cover, by using the start and end letters of words of the cover. To enhance the security of the secret message, the message is scrambled using a one-time pad scheme before being concealed, and the cipher text is then concealed in the cover. We also present an empirical comparison of the proposed approaches with some of the popular text steganographic approaches and show that our approaches outperform the existing approaches.
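
    The abstract above mentions scrambling the secret message with a one-time pad before hiding it in the cover. The record does not give the authors' implementation, so the following is only a minimal sketch of a generic XOR one-time pad; the function names `new_pad` and `otp_scramble` and the sample message are illustrative assumptions, not taken from the paper.

```python
import secrets


def new_pad(length: int) -> bytes:
    """Return a fresh random pad of `length` bytes (hypothetical helper)."""
    return secrets.token_bytes(length)


def otp_scramble(message: bytes, pad: bytes) -> bytes:
    """XOR each message byte with the matching pad byte.

    XOR is its own inverse, so applying the same pad a second time
    recovers the original message.
    """
    if len(pad) != len(message):
        raise ValueError("pad must be exactly as long as the message")
    return bytes(m ^ p for m, p in zip(message, pad))


# Illustrative round trip: scramble, then unscramble with the same pad.
secret = b"meet at dawn"
pad = new_pad(len(secret))
cipher = otp_scramble(secret, pad)
recovered = otp_scramble(cipher, pad)
```

    In a scheme like the one described, the scrambled bytes (not the plain message) would be what gets embedded in the cover text; a pad must never be reused for a second message.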

  13. Functional Stylistics and Peripeteic Texts

    DEFF Research Database (Denmark)

    Borchmann, Simon

    2008-01-01

    Using a pragmatically based linguistic description apparatus on literary use of language is not unproblematic. Observations show that literary use of language violates the norms contained by this apparatus. With this paper I suggest how we can deal with this problem by setting up a frame for the ...... for the use of a functional linguistic description apparatus on literary texts. As an extension of this suggestion I present a model for describing a specific type of literary texts....

  14. Adaptive Personality Recognition from Text

    OpenAIRE

    Celli, Fabio

    2012-01-01

    We address the issue of domain adaptation for automatic Personality Recognition from Text (PRT). The PRT task consists in the classification of the personality traits of some authors, given some pieces of text they wrote. The purpose of our work is to improve current approaches to PRT in order to extract personality information from social network sites, which is a really challenging task. We argue that current approaches, based on supervised learning, have several limitations for th...

  15. Analysing ESP Texts, but How?

    Directory of Open Access Journals (Sweden)

    Borza Natalia

    2015-03-01

    English as a second language (ESL) teachers instructing general English and English for specific purposes (ESP) in bilingual secondary schools face various challenges when it comes to choosing the main linguistic foci of language preparatory courses enabling non-native students to study academic subjects in English. ESL teachers intending to analyse English language subject textbooks written for secondary school students with the aim of gaining information about what bilingual secondary school students need to know in terms of language to process academic textbooks cannot avoid dealing with a dilemma. It needs to be decided which way it is most appropriate to analyse the texts in question. Handbooks of English applied linguistics are not immensely helpful with regard to this problem as they tend not to give recommendations as to which major text analytical approaches are advisable to follow in a pre-college setting. The present theoretical research aims to address this lacuna. Accordingly, the purpose of this pedagogically motivated theoretical paper is to investigate two major approaches of ESP text analysis, the register and the genre analysis, in order to find the more suitable one for exploring the language use of secondary school subject texts from the point of view of an English as a second language teacher. Comparing and contrasting the merits and limitations of the two contrastive approaches allows for a better understanding of the nature of the two different perspectives of text analysis. The study examines the goals, the scope of analysis, and the achievements of the register perspective and those of the genre approach alike. The paper also investigates and reviews in detail the starkly different methods of ESP text analysis applied by the two perspectives. Discovering text analysis from a theoretical and methodological angle supports a practical aspect of English teaching, namely making an informed choice when setting out to analyse

  16. Princess Brambilla - images/text

    Directory of Open Access Journals (Sweden)

    Maria Aparecida Barbosa

    2016-01-01

    To read an illustrated literary text is to think pictures and words simultaneously. This articulation between the written text and pictures adds potential, expands and becomes complex. It coincides with present-day discussions of Giorgio Agamben's "contemporary", which add to adherence to one's own time the displacement and distance needed to understand it, and which shake linear notions of historical chronology. The coincidence is in some way related to the current interest in the concept of "Nachleben" (survival), which presupposes the recovery of images of the past, postulated by the art historian Aby Warburg in his research on the characteristics of motion from ancient art in Botticelli's Renaissance pictures. For the translation into Portuguese of Princesa Brambilla – um capriccio segundo Jakob Callot, de E. T. A. Hoffmann, com 8 gravuras cunhadas a partir de moldes originais de Callot (1820), such discussions were fundamental, as I try to show in this article.

  17. A Guide Text or Many Texts? "That is the Question"

    Directory of Open Access Journals (Sweden)

    Delgado de Valencia Sonia

    2001-08-01

    The use of supplementary materials in the classroom has always been an essential part of the teaching and learning process. To restrict our teaching to the scope of one single textbook means to fall behind the advances of knowledge, in any area and context. Young learners appreciate any new and varied support that expands their knowledge of the world: diaries, letters, panels, free texts, magazines, short stories, poems or literary excerpts, and articles taken from the Internet are materials that will allow learners to share more and work more collaboratively. In this article we deal with some of these materials and with the criteria for selecting, adapting, and creating materials that may be of interest to the learner and that may promote reading and writing processes. Since no text can entirely satisfy the needs of students and teachers, the creativity of both parties will be necessary to improve the quality of teaching through the adequate use and adaptation of supplementary materials.

  18. Identifying issue frames in text.

    Directory of Open Access Journals (Sweden)

    Eyal Sagi

    Framing, the effect of context on cognitive processes, is a prominent topic of research in psychology and public opinion research. Research on framing has traditionally relied on controlled experiments and manually annotated document collections. In this paper we present a method that allows for quantifying the relative strengths of competing linguistic frames based on corpus analysis. This method requires little human intervention and can therefore be efficiently applied to large bodies of text. We demonstrate its effectiveness by tracking changes in the framing of terror over time and comparing the framing of abortion by Democrats and Republicans in the U.S.

  19. Quality Inspection of Printed Texts

    DEFF Research Database (Denmark)

    Pedersen, Jesper Ballisager; Nasrollahi, Kamal; Moeslund, Thomas B.

    2016-01-01

    Inspecting the quality of printed texts has its own importance in many industrial applications. To do so, this paper proposes a grading system which evaluates the performance of the printing task using some quality measures for each character and symbol. The purpose of this grading system is two......-folded: for customers of the printing and verification system, the overall grade is used to verify if the text is of sufficient quality, while for the printer's manufacturer, the detailed character/symbol grades and quality measurements are used for the improvement and optimization of the printing task. The proposed system...

  20. Seductive Texts with Serious Intentions.

    Science.gov (United States)

    Nielsen, Harriet Bjerrum

    1995-01-01

    Debates whether a text claiming to have scientific value is using seduction irresponsibly at the expense of the truth, and discusses who is the subject and who is the object of such seduction. It argues that, rather than being an assault against scientific ethics, seduction is a necessary premise for a sensible conversation to take place. (GR)

  1. Multilingual text induced spelling correction

    NARCIS (Netherlands)

    Reynaert, M.W.C.

    2004-01-01

    We present TISC, a multilingual, language-independent and context-sensitive spelling checking and correction system designed to facilitate the automatic removal of non-word spelling errors in large corpora. Its lexicon is derived from raw text corpora, without supervision, and contains word unigrams

  2. Text as an Autopoietic System

    DEFF Research Database (Denmark)

    Nicolaisen, Maria Skou

    2016-01-01

    The aim of the present research article is to discuss the possibilities and limitations in addressing text as an autopoietic system. The theory of autopoiesis originated in the field of biology in order to explain the dynamic processes entailed in sustaining living organisms at cellular level...

  3. Basic Chad Arabic: Comprehension Texts.

    Science.gov (United States)

    Absi, Samir Abu; Sinaud, Andre

    This text, principally designed for use in a three-volume course on Chad Arabic, complements the pre-speech and active phases of the course in that it provides the answers to comprehension exercises students are required to complete during the course. The comprehension exercises require that students listen to an instructor or tape and write…

  4. Comparison of Text Categorization Algorithms

    Institute of Scientific and Technical Information of China (English)

    SHI Yong-feng; ZHAO Yan-ping

    2004-01-01

    This paper summarizes several automatic text categorization algorithms in common use recently, and analyzes and compares their advantages and disadvantages. It provides clues for making use of appropriate automatic classifying algorithms in different fields. Finally, some evaluations and summaries of these algorithms are discussed, and directions for further research are pointed out.

  5. Reviving "Walden": Mining the Text.

    Science.gov (United States)

    Hewitt, Julia

    2000-01-01

    Describes how the author and her high school English students begin their study of Thoreau's "Walden" by mining the text for quotations to inspire their own writing and discussion on the topic, "How does Thoreau speak to you or how could he speak to someone you know?" (SR)

  6. COMPENDEX/TEXT-PAC: CIS.

    Science.gov (United States)

    Standera, Oldrich

    This report evaluates the engineering information services provided by the University of Calgary since implementation of the COMPENDEX (tape service of Engineering Index, Inc.) service using the IBM TEXT-PAC system. Evaluation was made by a survey of the users of the Current Information Selection (CIS) service, the interaction between the system…

  7. Locative inferences in medical texts.

    Science.gov (United States)

    Mayer, P S; Bailey, G H; Mayer, R J; Hillis, A; Dvoracek, J E

    1987-06-01

    Medical research relies on epidemiological studies conducted on a large set of clinical records that have been collected from physicians recording individual patient observations. These clinical records are recorded for the purpose of individual care of the patient with little consideration for their use by a biostatistician interested in studying a disease over a large population. Natural language processing of clinical records for epidemiological studies must deal with temporal, locative, and conceptual issues. This makes text understanding and data extraction of clinical records an excellent area for applied research. While much has been done in making temporal or conceptual inferences in medical texts, parallel work in locative inferences has not been done. This paper examines the locative inferences as well as the integration of temporal, locative, and conceptual issues in the clinical record understanding domain by presenting an application that utilizes two key concepts in its parsing strategy--a knowledge-based parsing strategy and a minimal lexicon.

  8. Text Segmentation Using Exponential Models

    CERN Document Server

    Beeferman, D; Lafferty, G D; Beeferman, Doug; Berger, Adam; Lafferty, John

    1997-01-01

    This paper introduces a new statistical approach to partitioning text automatically into coherent segments. Our approach enlists both short-range and long-range language models to help it sniff out likely sites of topic changes in text. To aid its search, the system consults a set of simple lexical hints it has learned to associate with the presence of boundaries through inspection of a large corpus of annotated data. We also propose a new probabilistically motivated error metric for use by the natural language processing and information retrieval communities, intended to supersede precision and recall for appraising segmentation algorithms. Qualitative assessment of our algorithm as well as evaluation using this new metric demonstrate the effectiveness of our approach in two very different domains, Wall Street Journal articles and the TDT Corpus, a collection of newswire articles and broadcast news transcripts.

  9. Learning Context for Text Categorization

    CERN Document Server

    Haribhakta, Y V

    2011-01-01

    This paper describes our work, which is based on discovering context for text document categorization. The document categorization approach is derived from a combination of a learning paradigm known as relation extraction and a technique known as context discovery. We demonstrate the effectiveness of our categorization approach using the Reuters-21578 dataset and synthetic real world data from the sports domain. Our experimental results indicate that the learned context greatly improves the categorization performance as compared to traditional categorization approaches.

  10. Text as an Autopoietic System

    DEFF Research Database (Denmark)

    Nicolaisen, Maria Skou

    2016-01-01

    The aim of the present research article is to discuss the possibilities and limitations in addressing text as an autopoietic system. The theory of autopoiesis originated in the field of biology in order to explain the dynamic processes entailed in sustaining living organisms at cellular level. Th....... By comparing the biological with the textual account of autopoietic agency, the end conclusion is that a newly derived concept of sociopoiesis might be better suited for discussing the architecture of textual systems....

  11. Text Mining for Protein Docking.

    Directory of Open Access Journals (Sweden)

    Varsha D Badal

    2015-12-01

    The rapidly growing amount of publicly available information from biomedical research is readily accessible on the Internet, providing a powerful resource for predictive biomolecular modeling. The accumulated data on experimentally determined structures transformed structure prediction of proteins and protein complexes. Instead of exploring the enormous search space, predictive tools can simply proceed to the solution based on similarity to the existing, previously determined structures. A similar major paradigm shift is emerging due to the rapidly expanding amount of information, other than experimentally determined structures, which still can be used as constraints in biomolecular structure prediction. Automated text mining has been widely used in recreating protein interaction networks, as well as in detecting small ligand binding sites on protein structures. Combining and expanding these two well-developed areas of research, we applied text mining to structural modeling of protein-protein complexes (protein docking). Protein docking can be significantly improved when constraints on the docking mode are available. We developed a procedure that retrieves published abstracts on a specific protein-protein interaction and extracts information relevant to docking. The procedure was assessed on protein complexes from Dockground (http://dockground.compbio.ku.edu). The results show that correct information on binding residues can be extracted for about half of the complexes. The amount of irrelevant information was reduced by conceptual analysis of a subset of the retrieved abstracts, based on the bag-of-words (features) approach. Support Vector Machine models were trained and validated on the subset. The remaining abstracts were filtered by the best-performing models, which decreased the irrelevant information for ~25% of the complexes in the dataset. The extracted constraints were incorporated in the docking protocol and tested on the Dockground unbound

  12. New Historicism: Text and Context

    Directory of Open Access Journals (Sweden)

    Violeta M. Vesić

    2016-02-01

    During most of the twentieth century, history was seen as a phenomenon outside of literature that guaranteed the veracity of literary interpretation. History was unique and it functioned as a basis for reading literary works. During the 1970s there occurred a change of attitude towards history in American literary theory, and there appeared a new theoretical approach which soon became known as New Historicism. Since its inception, New Historicism has been identified with the study of the Renaissance and Romanticism, but nowadays it has been increasingly involved in other literary trends. Although there are great differences in the arguments and practices of the various representatives of this school, New Historicism has clearly recognizable features, and many new historicists will agree with the statement of Walter Cohen that New Historicism, when it appeared in the eighties, represented something quite new in reference to the studies of theory, criticism and history (Cohen 1987, 33). The theoretical connection with Bakhtin, Foucault and Marx is clear, as well as a kind of uneasy tie with deconstruction and the work of Paul de Man. At the center of this approach is a renewed interest in the study of literary works in the light of the historical and political circumstances in which they were created. Foucault encouraged readers to begin to move literary texts and to link them with discourses and representations that are not literary, as well as to examine the sociological aspects of the texts in order to take part in the social struggles of today. The study of literary works using New Historicism is the study of the politics, history, culture and circumstances in which these works were created. With regard to one of the main tenets at the center of this criticism, that history cannot be viewed objectively and that reality can only be understood through a cultural context that reveals the work, re-reading and interpretation of

  13. Succincter Text Indexing with Wildcards

    CERN Document Server

    Thachuk, Chris

    2011-01-01

    We study the problem of indexing text with wildcard positions, motivated by the challenge of aligning sequencing data to large genomes that contain millions of single nucleotide polymorphisms (SNPs)---positions known to differ between individuals. SNPs modeled as wildcards can lead to more informed and biologically relevant alignments. We improve the space complexity of previous approaches by giving a succinct index requiring $(2 + o(1))n \log \sigma + O(n) + O(d \log n) + O(k \log k)$ bits for a text of length $n$ over an alphabet of size $\sigma$ containing $d$ groups of $k$ wildcards. A key to the space reduction is a result we give showing how any compressed suffix array can be supplemented with auxiliary data structures occupying $O(n) + O(d \log \frac{n}{d})$ bits to also support efficient dictionary matching queries. The query algorithm for our wildcard index is faster than previous approaches using reasonable working space. More importantly our new algorithm greatly reduces the query working space to ...

  14. Segmental Rescoring in Text Recognition

    Science.gov (United States)

    2014-02-04

    ...applying a Hidden Markov Model (HMM) recognition approach. Generating the plurality text hypotheses for the image forming includes generating a first...image. Applying segmental analysis to a segmentation determined by a first OCR engine, such as a segmentation determined by a Hidden Markov Model (HMM

  15. Linguistic dating of biblical texts

    DEFF Research Database (Denmark)

    Young, Ian; Rezetko, Robert; Ehrensvärd, Martin Gustaf

    and diglossia and textual criticism (Chapters 7, 13), and the significance of extra-biblical sources, including Amarna Canaanite, Ugaritic, Aramaic, Hebrew inscriptions of the monarchic period, Qumran and Mishnaic Hebrew, the Hebrew language of Ben Sira and Bar Kochba, and also Egyptian, Akkadian, Persian....... This is followed by a detailed synthesis of the topics introduced in the first volume, a series of detailed case studies on various linguistic issues, extensive tables of grammatical and lexical features, and a comprehensive bibliography. The authors argue that the scholarly use of language in dating biblical...... texts, and even the traditional standpoint on the chronological development of biblical Hebrew, require a thorough re-evaluation, and propose a new perspective on linguistic variety in biblical Hebrew. Early Biblical Hebrew and Late Biblical Hebrew do not represent different chronological periods...

  16. Everyday Life as a Text

    Directory of Open Access Journals (Sweden)

    Michael Lahey

    2016-02-01

    Full Text Available This article explores how audience data are utilized in the tentative partnerships created between television and social media companies. Specifically, it looks at the mutually beneficial relationship formed between the social media platform Twitter and television. It calls attention to how audience data are utilized as a way for the television industry to map itself onto the everyday lives of digital media audiences. I argue that the data-intensive monitoring of everyday life offers some measure of soft control over audiences in a digital media landscape. To do this, I explore “Social TV” (the relationships created between social media technologies and television) before explaining how Twitter leverages user data into partnerships with various television companies. Finally, the article explains what is fruitful about understanding the Twitter–television relationship as a form of soft control.

  17. A programmed text in statistics

    CERN Document Server

    Hine, J

    1975-01-01

    Contents (excerpt): Exercises for Section 2 (physical sciences and engineering; biological sciences; social sciences). Solutions to Exercises, Section 1 (same three subject areas). Solutions to Exercises, Section 2 (same three subject areas). Tables: χ² tests involving variances; χ² one-tailed tests; χ² two-tailed tests; F-distribution. Preface: This project started some years ago when the Nuffield Foundation kindly gave a grant for writing a programmed text to use with service courses in statistics. The work was carried out by Mrs. Joan Hine and Professor G. B. Wetherill at Bath University, together with some other help from time to time by colleagues at Bath University and elsewhere. Testing was done at various colleges and universities, and some helpful comments were received, but we particularly mention King Edwards School, Bath, who provided some sixth formers as 'guinea pigs' for the fir...

  18. Orientalist discourse in media texts

    Directory of Open Access Journals (Sweden)

    Necla Mora

    2009-10-01

    Full Text Available By placing itself at the center of the world with a Eurocentric point of view, the West exploits other countries and communities through inflicting cultural change and transformation on them either from within via colonialist movements or from outside via “Orientalist” discourses in line with its imperialist objectives. The West has fictionalized the “image of the Orient” in terms of science by making use of social sciences like anthropology, history and philology and launched an intensive propaganda which covers literature, painting, cinema and other fields of art in order to actualize this fiction. Accordingly, the image of the Orient – which has been built firstly in terms of science then socially – has been engraved into the collective memory of both the Westerner and the Easterner. The internalized “Orientalist” point of view and discourse cause the Westerner to see and perceive the Easterner with the image formed in his/her memory while looking at them. The Easterner represents and expresses himself/herself from the eyes of the Westerner and with the image which the Westerner fictionalized for him/her. Hence, in order to gain acceptance from the West, the East tries to shape itself into the “Orientalist” mold which the Westerner fictionalized for it. Artists, intellectuals, writers and media professionals, who embrace and internalize the stereotypical hegemonic-driven “Orientalist” discourse of the Westerner and who rank among the elite group, reflect their internalized “Orientalist” discourse on their own actions. This condition causes the “Orientalist” clichés to be engraved in the memory of the society; causes the society to view itself with an “Orientalist” point of view and perceive itself with the clichés of the Westerner. Consequently, the second ring of the hegemony is reproduced by the symbolic elites who represent the power/authority within the country. The “Orientalist” discourse, which is

  19. What's so Simple about Simplified Texts? A Computational and Psycholinguistic Investigation of Text Comprehension and Text Processing

    Science.gov (United States)

    Crossley, Scott A.; Yang, Hae Sung; McNamara, Danielle S.

    2014-01-01

    This study uses a moving windows self-paced reading task to assess both text comprehension and processing time of authentic texts and these same texts simplified to beginning and intermediate levels. Forty-eight second language learners each read 9 texts (3 different authentic, beginning, and intermediate level texts). Repeated measures ANOVAs…

  20. The socio-demographics of texting

    DEFF Research Database (Denmark)

    Ling, Richard; Bertel, Troels Fibæk; Sundsøy, Pål

    2012-01-01

    Who texts, and with whom do they text? This article examines the use of texting using metered traffic data from a large dataset (nearly 400 million anonymous text messages). We ask 1) How much do different age groups use mobile phone based texting (SMS)? 2) How wide is the circle of texting...... partners for different age groups? 3) To what degree are texting relationships characterized by age and gender homophily? We find that texting is hugely popular among teens compared to other age groups. Further, the number of persons with whom people text is quite small. About half of all text...

  1. SIAM 2007 Text Mining Competition dataset

    Data.gov (United States)

    National Aeronautics and Space Administration — Subject Area: Text Mining Description: This is the dataset used for the SIAM 2007 Text Mining competition. This competition focused on developing text mining...

  2. Text Anomalies Detection Using Histograms of Words

    Directory of Open Access Journals (Sweden)

    Abdulwahed Faraj Almarimi

    2016-01-01

    Full Text Available Authors of written texts can mainly be characterized by a collection of attributes obtained from their texts. Texts by the same author are very similar in style. We can assume that the attributes of a full text are very similar to the attributes of its parts, and in the same way different parts of the same text can be compared. In this paper, we describe an algorithm based on histograms of a text mapped to an interval. The mapping keeps the word order of the text. Histograms are analyzed from a clustering point of view. If the cluster dispersion is not large, the text was probably written by a single author. If the cluster dispersion is large, the text is split into two or more parts and the same analysis is repeated for the parts. The experiments were done on English and Arabic texts. For combined English texts, our algorithm detects that the texts were not written by one author. We obtained similar results for combined Arabic texts. Our algorithm can be used for a basic analysis of whether a text was written by one author.
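
The split-and-compare idea in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not specify the text-to-interval mapping, so a normalized word-length histogram stands in for it, and the names `length_histogram` and `dispersion` are hypothetical.

```python
from collections import Counter
from math import sqrt


def length_histogram(words, max_len=10):
    """Normalized histogram of word lengths.

    ASSUMPTION: word length is a stand-in feature; the paper's actual
    mapping of text to an interval is not given in the abstract.
    Lengths above max_len share the last bin.
    """
    counts = Counter(min(len(w), max_len) for w in words)
    total = sum(counts.values()) or 1
    return [counts.get(k, 0) / total for k in range(1, max_len + 1)]


def dispersion(text, parts=4):
    """Mean Euclidean distance of each part's histogram from the centroid."""
    words = text.split()
    if not words:
        return 0.0
    size = max(1, len(words) // parts)
    chunks = [words[i:i + size] for i in range(0, len(words), size)]
    hists = [length_histogram(chunk) for chunk in chunks]
    centroid = [sum(col) / len(hists) for col in zip(*hists)]
    distances = [
        sqrt(sum((a - b) ** 2 for a, b in zip(hist, centroid)))
        for hist in hists
    ]
    return sum(distances) / len(distances)
```

A low value suggests stylistically homogeneous parts; a high value would be the cue to split the text and repeat the analysis on each part. Any threshold between "low" and "high" would have to be chosen empirically.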

  3. TEXT CLASSIFICATION TOWARD A SCIENTIFIC FORUM

    Institute of Scientific and Technical Information of China (English)

    2007-01-01

    Text mining, also known as discovering knowledge from text, has emerged as a possible solution to the current information explosion and refers to the process of extracting non-trivial and useful patterns from unstructured text. Among the general tasks of text mining, such as text clustering and summarization, text classification is a subtask of intelligent information processing that employs supervised learning to construct a classifier from training text, which is then used to predict the class of unlabeled text. Because of its simplicity and objectivity in performance evaluation, text classification is often used as a standard tool to determine the strengths or weaknesses of a text processing method, such as text representation or text feature selection. In this paper, text classification is carried out to classify the Web documents collected from the XSSC Website (http://www.xssc.ac.cn). The performance of a support vector machine (SVM) and a back propagation neural network (BPNN) is compared on this task. Specifically, binary text classification and multi-class text classification were conducted on the XSSC documents. Moreover, the classification results of both methods are combined to improve the accuracy of classification. The experiments show that BPNN can compete with SVM in binary text classification, but for multi-class text classification SVM performs much better. Furthermore, classification is improved in both the binary and multi-class settings with the combined method.

  4. Noticeable Focuses in Reading a Text

    Institute of Scientific and Technical Information of China (English)

    李明

    2007-01-01

    This paper discusses the relationship between commanding the basic information contained in a text and the final purpose of comprehension in the text-reading process. Using the main topic and the central meaning that all texts have as two main examples, the author illustrates what a reader should pay attention to when reading a text.

  5. What makes a written text written

    Institute of Scientific and Technical Information of China (English)

    赵亦倩

    2008-01-01

    Text can refer to both written and spoken language, and the different features of spoken and written texts make it possible to form a general idea of the division into two main categories, spoken English and written English. In this article, a sample text is examined in order to discuss the general features of written texts.

  6. Text To Speech System for Telugu Language

    OpenAIRE

    Siva kumar, M; E. Prakash Babu

    2014-01-01

    Telugu is one of the oldest languages in India. This paper describes the development of a Telugu Text-to-Speech System (TTS). In Telugu TTS the input is Telugu text in Unicode. The voices are sampled from real recorded speech. The objective of a text to speech system is to convert an arbitrary text into its corresponding spoken waveform. Speech synthesis is a process of building machinery that can generate human-like speech from any text input to imitate human speakers. Text proc...

  7. A Proposed Arabic Handwritten Text Normalization Method

    Directory of Open Access Journals (Sweden)

    Tarik Abu-Ain

    2014-11-01

    Full Text Available Text normalization is an important technique in document image analysis and recognition. It consists of many preprocessing stages, which include slope correction, text padding, skew correction, and straightening of the writing line. In this respect, text normalization plays an important role in many procedures such as text segmentation, feature extraction and character recognition. In the present article, a new method for text baseline detection, straightening, and slant correction for Arabic handwritten texts is proposed. The method comprises a set of sequential steps: first, component segmentation is done, followed by thinning of the text components; then, the direction features of the skeletons are extracted, and the candidate baseline regions are determined. After that, the correct baseline region is selected, and finally, the baselines of all components are aligned with the writing line. The experiments are conducted on the IFN/ENIT benchmark Arabic dataset. The results show that the proposed method has a promising and encouraging performance.

  8. Comparative Discourse Analysis of Parallel Texts

    CERN Document Server

    Van der Eijk, P

    1994-01-01

    A quantitative representation of discourse structure can be computed by measuring lexical cohesion relations among adjacent blocks of text. These representations have been proposed to deal with sub-topic text segmentation. In a parallel corpus, similar representations can be derived for versions of a text in various languages. These can be used for parallel segmentation and as an alternative measure of text-translation similarity.
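
The "lexical cohesion among adjacent blocks" measure can be sketched as a cosine similarity between the word-count vectors of neighbouring blocks. This is a minimal illustration, not the paper's exact procedure: the fixed block size, whitespace tokenization and the names `cosine` and `cohesion_profile` are assumptions.

```python
from collections import Counter
from math import sqrt


def cosine(a, b):
    """Cosine similarity between two word-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def cohesion_profile(tokens, block_size=20):
    """Similarity of each block of tokens to the block that follows it.

    ASSUMPTION: fixed-size, non-overlapping blocks; the abstract does not
    say how blocks are formed.
    """
    blocks = [
        Counter(tokens[i:i + block_size])
        for i in range(0, len(tokens), block_size)
    ]
    return [cosine(x, y) for x, y in zip(blocks, blocks[1:])]
```

Dips in the resulting profile are candidate sub-topic boundaries. For a parallel corpus, one profile would be computed per language version and the profiles compared, for example to align segment boundaries.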

  9. Multimodal Diversity of Postmodernist Fiction Text

    Directory of Open Access Journals (Sweden)

    U. I. Tykha

    2016-12-01

    Full Text Available The article is devoted to the analysis of structural and functional manifestations of multimodal diversity in postmodernist fiction texts. Multimodality is defined as the coexistence of more than one semiotic mode within a certain context. Multimodal texts feature a diversity of semiotic modes in the communication and development of their narrative. Such experimental texts subvert conventional patterns by introducing various semiotic resources – verbal or non-verbal.

  10. Evaluation Methods of The Text Entities

    Science.gov (United States)

    Popa, Marius

    2006-01-01

    The paper highlights some evaluation methods to assess the quality characteristics of the text entities. The main concepts used in building and evaluation processes of the text entities are presented. Also, some aggregated metrics for orthogonality measurements are presented. The evaluation process for automatic evaluation of the text entities is…

  11. Role of Terms in Popular Science Text

    Directory of Open Access Journals (Sweden)

    Zhabbarova F. U.

    2013-01-01

    Full Text Available The article examines and determines the specifics of terminological vocabulary used in a popular science text. It differentiates the notions of cohesion and coherence. The article reveals the main terminological means realizing cohesion in the text of a popular science article.

  12. The Ecological Approach to Text Visualization.

    Science.gov (United States)

    Wise, James A.

    1999-01-01

    Presents both theoretical and technical bases on which to build a "science of text visualization." The Spatial Paradigm for Information Retrieval and Exploration (SPIRE) text-visualization system, which images information from free-text documents as natural terrains, serves as an example of the "ecological approach" in its visual metaphor, its…

  13. Teacher Modeling Using Complex Informational Texts

    Science.gov (United States)

    Fisher, Douglas; Frey, Nancy

    2015-01-01

    Modeling in complex texts requires that teachers analyze the text for factors of qualitative complexity and then design lessons that introduce students to that complexity. In addition, teachers can model the disciplinary nature of content area texts as well as word solving and comprehension strategies. Included is a planning guide for think aloud.

  14. Creating and Using Culturally Sustaining Informational Texts

    Science.gov (United States)

    Watanabe Kganetso, Lynne M.

    2017-01-01

    Current standards and assessments emphasize the importance of a variety of genres in students' literacy diets, which has placed increased attention on informational texts. Unfortunately, young students' current exposure to and experiences with informational texts are often limited by the texts' availability, quality, and relevance to children's…

  15. Text Complexity: Primary Teachers' Views

    Science.gov (United States)

    Fitzgerald, Jill; Hiebert, Elfrieda H.; Bowen, Kimberly; Relyea-Kim, E. Jackie; Kung, Melody; Elmore, Jeff

    2015-01-01

    The research question was, "What text characteristics do primary teachers think are most important for early grades text complexity?" Teachers from across the United States accomplished a two-part task. First, to stimulate teachers' thinking about important text characteristics, primary teachers completed an online paired-text…

  16. Applying statistical methods to text steganography

    CERN Document Server

    Nechta, Ivan

    2011-01-01

    This paper presents a survey of text steganography methods used for hiding secret information inside some covertext. Widely known hiding techniques (such as translation based steganography, text generating and syntactic embedding) and detection are considered. It is shown that statistical analysis has an important role in text steganalysis.

  17. Text-Picture Relations in Cooking Instructions

    NARCIS (Netherlands)

    van der Sluis, Ielka; Leito, Shadira; Redeker, Gisela; Bunt, Harry

    2016-01-01

    Like many other instructions, recipes on packages with ready-to-use ingredients for a dish combine a series of pictures with short text paragraphs. The information presentation in such multimodal instructions can be compact (either text or picture) and/or cohesive (text and picture). In an explorato

  18. Object reading: text recognition for object recognition

    NARCIS (Netherlands)

    Karaoglu, S.; van Gemert, J.C.; Gevers, T.

    2012-01-01

    We propose to use text recognition to aid in visual object class recognition. To this end we first propose a new algorithm for text detection in natural images. The proposed text detection is based on saliency cues and a context fusion step. The algorithm does not need any parameter tuning and can d

  19. Interdisciplinary Approach to Understanding Literary Texts

    Science.gov (United States)

    Dossanova, Altynay Zh.; Ismakova, Bibissara S.; Tapanova, Saule E.; Ayupova, Gulbagira K.; Gotting, Valentina V.; Kaltayeva, Gulnar K.

    2016-01-01

    The primary purpose is the implementation of the interdisciplinary approach to understanding and the construction of integrative models of understanding literary texts. The interdisciplinary methodological paradigm of studying text understanding, based on the principles of various sciences facilitating the identification of the text understanding…

  20. About Reformulation in Full-Text IRS.

    Science.gov (United States)

    Debili, Fathi; And Others

    1989-01-01

    Analyzes different kinds of reformulations used in information retrieval systems where full text databases are accessed through natural language queries. Tests of these reformulations on large full text databases managed by the Syntactic and Probabilistic Indexing and Retrieval of Information in Texts (SPIRIT) system are described, and an expert…

  1. Mathematical Texts as Narrative: Rethinking Curriculum

    Science.gov (United States)

    Dietiker, Leslie

    2013-01-01

    This paper proposes a framework for reading mathematics texts as narratives. Building from a narrative framework of Meike Bal, a reader's experience with the mathematical content as it unfolds in the text (the "mathematical story") is distinguished from his or her logical reconstruction of the content beyond the text (the…

  2. An Embedded Application for Degraded Text Recognition

    Directory of Open Access Journals (Sweden)

    Thillou Céline

    2005-01-01

    Full Text Available This paper describes a mobile device which tries to give the blind or visually impaired access to text information. Three key technologies are required for this system: text detection, optical character recognition, and speech synthesis. Blind users and the mobile environment imply two strong constraints. First, pictures will be taken without control of camera settings and without a priori information on the text (font or size) and background. The second issue is to link several techniques together with an optimal compromise between computational constraints and recognition efficiency. We will present the overall description of the system from text detection to OCR error correction.

  3. Text line Segmentation of Curved Document Images

    Directory of Open Access Journals (Sweden)

    Anusree.M

    2014-05-01

    Full Text Available Document image analysis has been widely used in historical and heritage studies, education and digital libraries. Document image analysis techniques are mainly used for improving the human readability and the OCR quality of the document. During digitization, camera-captured images contain warped documents due to perspective and geometric distortions. The main difficulty is text line detection in the document. Many algorithms have been proposed to address the problem of text line detection in printed documents, but they fail to extract text lines in curved documents. This paper describes a segmentation technique that detects curled text lines in camera-captured document images.

  4. Text-speak processing impairs tactile location.

    Science.gov (United States)

    Head, James; Helton, William; Russell, Paul; Neumann, Ewald

    2012-09-01

    Dual task experiments have highlighted that driving while having a conversation on a cell phone can have negative impacts on driving (Strayer & Drews, 2007). It has also been noted that this negative impact is greater when reading a text-message (Lee, 2007). Commonly used in text-messaging are shortening devices collectively known as text-speak (e.g., "Ys I wll ttyl 2nite" for "Yes I will talk to you later tonight"). To the authors' knowledge, there has been no investigation into the potential negative impacts of reading text-speak on concurrent performance on other tasks. Forty participants read a correctly spelled story and a story presented in text-speak while concurrently monitoring for a vibration around their waist. Slower reaction times and fewer correct vibration detections occurred while reading text-speak than while reading a correctly spelled story. The results suggest that reading text-speak imposes greater cognitive load than reading correctly spelled text. These findings suggest that the negative impact of text messaging on driving may be compounded by the messages being in text-speak, instead of orthographically correct text.

  5. CRIE: An automated analyzer for Chinese texts.

    Science.gov (United States)

    Sung, Yao-Ting; Chang, Tao-Hsing; Lin, Wei-Chun; Hsieh, Kuan-Sheng; Chang, Kuo-En

    2016-12-01

    Textual analysis has been applied to various fields, such as discourse analysis, corpus studies, text leveling, and automated essay evaluation. Several tools have been developed for analyzing texts written in alphabetic languages such as English and Spanish. However, currently there is no tool available for analyzing Chinese-language texts. This article introduces a tool for the automated analysis of simplified and traditional Chinese texts, called the Chinese Readability Index Explorer (CRIE). Composed of four subsystems and incorporating 82 multilevel linguistic features, CRIE is able to conduct the major tasks of segmentation, syntactic parsing, and feature extraction. Furthermore, the integration of linguistic features with machine learning models enables CRIE to provide leveling and diagnostic information for texts in language arts, texts for learning Chinese as a foreign language, and texts with domain knowledge. The usage and validation of the functions provided by CRIE are also introduced.

  6. The nuclear modification of charged particles in Pb-Pb at $\sqrt{s_\text{NN}} = 5.02\,\text{TeV}$ measured with ALICE

    CERN Document Server

    INSPIRE-00537113

    2016-01-01

    The study of inclusive charged-particle production in heavy-ion collisions provides insights into the density of the medium and the energy-loss mechanisms. The observed suppression of high-$p_\text{T}$ yield is generally attributed to energy loss of partons as they propagate through a deconfined state of quarks and gluons, the Quark-Gluon Plasma (QGP), predicted by QCD. Such measurements allow the characterization of the QGP by comparison with models. In these proceedings, results on high-$p_\text{T}$ particle production measured by ALICE in Pb-Pb collisions at $\sqrt{s_\text{NN}} = 5.02\,\text{TeV}$ as well as in pp at $\sqrt{s} = 5.02\,\text{TeV}$ are presented for the first time. The nuclear modification factors ($R_\text{AA}$) in Pb-Pb collisions are presented and compared with model calculations.
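
For orientation, the nuclear modification factor is conventionally defined as the per-event yield in nucleus-nucleus collisions divided by the pp cross section scaled with the average nuclear overlap function. The abstract does not spell this out, so the notation below ($\langle T_\text{AA} \rangle$, $N_\text{AA}$, $\sigma_\text{pp}$) follows the standard convention and is not quoted from the proceedings:

```latex
R_\text{AA}(p_\text{T}) = \frac{1}{\langle T_\text{AA} \rangle}\,
  \frac{\mathrm{d}N_\text{AA}/\mathrm{d}p_\text{T}}
       {\mathrm{d}\sigma_\text{pp}/\mathrm{d}p_\text{T}}
```

With this normalization, $R_\text{AA} = 1$ corresponds to no nuclear modification, and $R_\text{AA} < 1$ at high $p_\text{T}$ indicates suppression.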

  7. Text To Speech System for Telugu Language

    Directory of Open Access Journals (Sweden)

    M. Siva Kumar

    2014-03-01

    Full Text Available Telugu is one of the oldest languages in India. This paper describes the development of a Telugu Text-to-Speech System (TTS). In Telugu TTS the input is Telugu text in Unicode. The voices are sampled from real recorded speech. The objective of a text to speech system is to convert an arbitrary text into its corresponding spoken waveform. Speech synthesis is a process of building machinery that can generate human-like speech from any text input to imitate human speakers. Text processing and speech generation are the two main components of a text to speech system. To build a natural sounding speech synthesis system, it is essential that the text processing component produce an appropriate sequence of phonemic units. Generation of a sequence of phonetic units for a given standard word is referred to as a letter to phoneme rule or text to phoneme rule. The complexity of these rules and their derivation depends upon the nature of the language. The quality of a speech synthesizer is judged by its closeness to the natural human voice and its understandability. In this paper we describe an approach to build a Telugu TTS system using the concatenative synthesis method with the syllable as the basic unit of concatenation.

  8. Colored-sketch of Text Information

    Directory of Open Access Journals (Sweden)

    Beomjin Kim

    2002-01-01

    Full Text Available This paper presents an information visualization method, which transforms text into abstracted visual representations. The proposed color-coding algorithm converts text into a sequence of colored icons that inform users about the distributional patterns of given queries, as well as the structural overview of a document simultaneously. By presenting the compact, but instructive visual abstraction of texts concurrently, users can compare multiple documents intuitively while alleviating the need to reference the underlying text. The system provides interactive navigation tools to support users' decision-making processes - including multi-level viewing, a tree hierarchy recording previous search activities, and suggestive words for refinement of the search scope. An experimental study evaluating this visual approach for delivering search results has been conducted on text corpora in comparison with a traditional information retrieval system. By informing search results to clientele in a perceptive form, the users' performance in obtaining desired information has been improved, while maintaining the accuracy.

  9. The Research of Chinese Text Proofreading Algorithm

    Institute of Scientific and Technical Information of China (English)

    2000-01-01

    Generally, text proofreading consists of two procedures, finding the wrongly used words and then presenting the correct forms. At present, most of the Chinese text proofreading focuses on finding the wrongly used words, but pays less attention to correcting these errors. In this paper, the Chinese text features are interpreted first and then a Chinese text proofreading method and its algorithm are introduced. In this algorithm, text features, including text statistical feature and language structure feature, are properly used. Here, correcting errors goes on at the same time with finding errors. Experimental results show that this method has a performance of detecting 75% of wrongly used Chinese words and correcting about 60% of them with the first candidates.

  10. Algorithm for Generating Train Calendar Texts

    Directory of Open Access Journals (Sweden)

    Karel Greiner

    2013-04-01

    Full Text Available The article describes a possibility of generating train calendar text for the needs of compiling the annual timetable in the conditions of the Czech Republic. Based on the analysis of the types of texts of calendars that appear in various print outputs, a heuristic algorithm was designed to generate a text from a set of calendar days. The algorithm is a part of an application that also provides a tool to define the text of the calendar by using a mask of sub-periods and calendars to be displayed in them. The algorithm was tested on real data of the timetable. In most cases, the algorithm shows the same or better results than the previously used tools. In several cases, however, a better result can be obtained by the user. The described algorithm to generate the text of the calendar is a part of a program that is used for compiling the timetable for trains in the Czech Republic.
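
One basic building block of turning a set of calendar days into text is collapsing consecutive days into ranges. The sketch below shows only that step and is not the article's heuristic, which also handles sub-period masks and is not detailed in the abstract. The function names and the output wording ("runs on", day.month. notation) are assumptions made for illustration.

```python
from datetime import date, timedelta


def to_ranges(days):
    """Group dates into maximal runs of consecutive days."""
    ranges = []
    for d in sorted(set(days)):
        if ranges and d - ranges[-1][1] == timedelta(days=1):
            ranges[-1][1] = d          # extend the current run
        else:
            ranges.append([d, d])      # start a new run
    return [(start, end) for start, end in ranges]


def calendar_text(days):
    """Render the runs as a short text (hypothetical wording)."""
    parts = []
    for start, end in to_ranges(days):
        if start == end:
            parts.append(f"{start.day}.{start.month}.")
        else:
            parts.append(f"{start.day}.{start.month}.-{end.day}.{end.month}.")
    if not parts:
        return "does not run"
    return "runs on " + ", ".join(parts)
```

For example, the days 1, 2, 3 and 10 April 2013 would be rendered as "runs on 1.4.-3.4., 10.4.". A production heuristic would additionally choose between positive ("runs on") and negative ("does not run on") formulations, whichever yields the shorter text.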

  11. Choices of texts for literary education

    DEFF Research Database (Denmark)

    Skyggebjerg, Anna Karlskov

    the possibility for positioning pupils/young adults ? What does the choice of texts mean for pupils’/young adults’ possibilities as readers and individual interpreters? How are the pupils’ potentials for envisioning and engaging in literature with certain choices of texts?......, and in the registration of texts for examinations. Genres such as poetry and short stories, periods such as avant-garde and modernism, and acknowledged and well-known authorships are often included, whereas, representations of popular fiction and such genres as fantasy, sci-fi, and biography are rare. Often, pupils......This paper charts the general implications of the choice of texts for literature teaching in the Danish school system, especially in Grades 8 and 9. It will analyze and discuss the premises of the choice of texts, and the possibilities of a certain choice of text in a concrete classroom situation...

  12. NEW TECHNIQUES USED IN AUTOMATED TEXT ANALYSIS

    Directory of Open Access Journals (Sweden)

    M. Istrate

    2010-12-01

    Full Text Available Automated analysis of natural language texts is one of the most important knowledge discovery tasks for any organization. According to Gartner Group, almost 90% of knowledge available at an organization today is dispersed throughout piles of documents buried within unstructured text. Analyzing huge volumes of textual information is often involved in making informed and correct business decisions. Traditional analysis methods based on statistics fail to help processing unstructured texts and the society is in search of new technologies for text analysis. There exist a variety of approaches to the analysis of natural language texts, but most of them do not provide results that could be successfully applied in practice. This article concentrates on recent ideas and practical implementations in this area.

  13. Legal English and Adapted Legal Texts

    Directory of Open Access Journals (Sweden)

    Alvyda Liuolienė

    2012-06-01

    Full Text Available The article aims at analysing the significance of authentic legal English texts and adapted legal texts in ESP classes. The authors point out the advantages and disadvantages of legal texts and analyse the possibilities of their efficient application in the teaching process. At the initial stage of teaching English legalese, materials prepared specially for teaching purposes in textbooks seem to be more appropriate, as they are adapted to a particular level for law students, whereas at more advanced levels, authentic texts in a legal English classroom can contribute more considerably to the learning experience. The use of both authentic legal materials and adapted legal texts has a tangible impact on mastering legal English.

  14. Biomechanical patterns of text-message distraction.

    Science.gov (United States)

    Le, Peter; Hwang, Jaejin; Grawe, Sarah; Li, Jing; Snyder, Alison; Lee, Christina; Marras, William S

    2015-01-01

    The objective of this study was to identify biomechanical measures that can distinguish texting distraction in a laboratory-simulated driving environment. The goal would be to use this information to provide an intervention for risky driving behaviour. Sixteen subjects participated in this study. Three independent variables were tested: task (texting, visual targeting, weighted and non-weighted movements), task direction (front and side) and task distance (close and far). Dependent variables consisted of biomechanical moments, head displacement and the length of time to complete each task. Results revealed that the time to complete each task was higher for texting compared to other tasks. Peak moments during texting were only distinguishable from visual targeting. Peak head displacement and cumulative biomechanical exposure measures indicated that texting can be distinguished from other tasks. Therefore, it may be useful to take into account both temporal and biomechanical measures when considering warning systems to detect texting distraction.

  15. Adaptive Text Entry for Mobile Devices

    DEFF Research Database (Denmark)

    Proschowsky, Morten Smidt

    The reduced size of many mobile devices makes it difficult to enter text with them. The text entry methods are often slow or complicated to use. This affects the performance and user experience of all applications and services on the device. This work introduces new easy-to-use text entry methods...... for mobile devices and a framework for adaptive context-aware language models. Based on analysis of current text entry methods, the requirements to the new text entry methods are established. Transparent User guided Prediction (TUP) is a text entry method for devices with one dimensional touch input. It can...... be touch sensitive wheels, sliders or similar input devices. The interaction design of TUP is done with a combination of high level task models and low level models of human motor behaviour. Three prototypes of TUP are designed and evaluated by more than 30 users. Observations from the evaluations are used...

  16. Colored-sketch of Text Information

    OpenAIRE

    Beomjin Kim; Philip Johnson; Adam S. Huarng

    2002-01-01

    This paper presents an information visualization method, which transforms text into abstracted visual representations. The proposed color-coding algorithm converts text into a sequence of colored icons that inform users about the distributional patterns of given queries, as well as the structural overview of a document simultaneously. By presenting the compact, but instructive visual abstraction of texts concurrently, users can compare multiple documents intuitively while alleviating the need...

  17. Beyond Text Theory: Understanding Literary Response

    OpenAIRE

    Miall, David S.; Kuiken, Don

    1994-01-01

    Approaches to text comprehension that focus on propositional, inferential, and elaborative processes have often been considered capable of extension in principle to literary texts, such as stories or poems. However, we argue that literary response is influenced by stylistic features that result in defamiliarization; that defamiliarization invokes feeling which calls on personal perspectives and meanings; and that these aspects of literary response are not addressed by current text theories. T...

  18. Translation Strategies of Non-literary Texts

    Institute of Scientific and Technical Information of China (English)

    杨静

    2015-01-01

    Translator's subjectivity is closely related to the choice of the style of the translated texts and translation strategies. This paper presents an analytical study of translation strategies for non-literary texts. It introduces different non-literary texts and then generalizes some factors influencing the selection of translation strategies. Taking these influencing factors into account, translators should adopt different translation strategies

  19. Handwritten text line segmentation by spectral clustering

    Science.gov (United States)

    Han, Xuecheng; Yao, Hui; Zhong, Guoqiang

    2017-02-01

    Since handwritten text lines are generally skewed and not obviously separated, text line segmentation of handwritten document images is still a challenging problem. In this paper, we propose a novel text line segmentation algorithm based on spectral clustering. Given a handwritten document image, we convert it to a binary image first, and then compute the adjacency matrix of the pixel points. We apply spectral clustering on this similarity metric and use the orthogonal k-means clustering algorithm to group the text lines. Experiments on the Chinese handwritten documents database (HIT-MW) demonstrate the effectiveness of the proposed method.

  20. Financial Statement Fraud Detection using Text Mining

    Directory of Open Access Journals (Sweden)

    Rajan Gupta

    2013-01-01

    Full Text Available Data mining techniques have been used enormously by the researchers’ community in detecting financial statement fraud. Most of the research in this direction has used the numbers (quantitative information i.e. financial ratios present in the financial statements for detecting fraud. There is very little or no research on the analysis of text such as auditor’s comments or notes present in published reports. In this study we propose a text mining approach for detecting financial statement fraud by analyzing the hidden clues in the qualitative information (text present in financial statements.

  1. Multilingual Text Detection with Nonlinear Neural Network

    Directory of Open Access Journals (Sweden)

    Lin Li

    2015-01-01

    Full Text Available Multilingual text detection in natural scenes is still a challenging task in computer vision. In this paper, we apply an unsupervised learning algorithm to learn language-independent stroke features and combine unsupervised stroke feature learning with automatic multilayer feature extraction to improve the representational power of text features. We also develop a novel nonlinear network, based on the traditional Convolutional Neural Network, that is able to detect multilingual text regions in images. The proposed method is evaluated on standard benchmarks and a multilingual dataset and demonstrates improvement over previous work.

  2. Absolute frequency measurement of the $^{1}\text{S}_{0}$ – $^{3}\text{P}_{0}$ transition of $^{171}$Yb

    Science.gov (United States)

    Pizzocaro, Marco; Thoumany, Pierre; Rauf, Benjamin; Bregolin, Filippo; Milani, Gianmaria; Clivati, Cecilia; Costanzo, Giovanni A.; Levi, Filippo; Calonico, Davide

    2017-02-01

    We report the absolute frequency measurement of the unperturbed transition $^{1}\text{S}_{0}$ – $^{3}\text{P}_{0}$ at 578 nm in $^{171}$Yb realized in an optical lattice frequency standard relative to a cryogenic caesium fountain. The measurement result is 518 295 836 590 863.59(31) Hz with a relative standard uncertainty of $5.9\times 10^{-16}$. This value is in agreement with the ytterbium frequency recommended as a secondary representation of the second in the International System of Units.

  3. [Text comprehension, cognitive resources and aging].

    Science.gov (United States)

    Chesneau, Sophie; Jbabdi, Saad; Champagne-Lavau, Maud; Giroux, Francine; Ska, Bernadette

    2007-03-01

    Aging brings cognitive changes. Language is not immune to these changes. The use of compensation strategies may permit older adults to achieve a performance level identical to the one obtained by younger adults. This research aims to study text comprehension in aging and the reading strategies used by older and younger adults. Kintsch's cognitive model (1988) allows the identification of different levels of representation within text treatment (linguistic form, macrostructure, microstructure and situation model) and predicts the underlying cognitive components. Eye-tracking analyses during reading permit inference about the moments of reading treatment and detection of reading strategies. Sixty highly educated participants were assessed. They were divided into two age groups (20-40 and 60-80 years old). Participants were asked to read and understand three texts constructed to highlight the features of text comprehension within each one of the different levels of text representation. The amount of detail and the necessity of updating the situation model varied for each text. Eye movements were registered by an eye-tracker (Cambridge research) during the reading process. Specific complementary tasks were administered to evaluate working memory, long-term memory, and executive functions. Analyses of variance showed significantly lower performance by older adults regarding: 1) recall of the microstructure of the two texts with a high degree of detail, 2) macrostructure of the text with fewer details, and 3) performance on all tasks that evaluated cognitive components. Aging influenced treatment of levels of text representation depending on text characteristics. However, cluster analysis of the text comprehension and eye-tracker data revealed a group of older adults whose performance in reading comprehension was identical to the performance of younger adults, with the same reading profile. This result seems to show that use of compensation strategies by older adults at

  4. Searches for ttH and tH with $\text{H}\rightarrow\text{b}\bar{\text{b}}$

    CERN Document Server

    Schroeder, Matthias

    2016-01-01

    The associated production of a Higgs boson with a top quark-antiquark pair (ttH production) or with a single top quark (tH production) allows a direct measurement of the top-Higgs-Yukawa coupling with minimal model dependence. In this article, recent results of searches for ttH and tH production in the $\text{H}\rightarrow\text{b}\bar{\text{b}}$ channel performed by the ATLAS and CMS experiments are reviewed. The analyses use pp collision data collected at a centre-of-mass energy of $13\,$TeV corresponding to an integrated luminosity of up to 13.2$\,\text{fb}^{-1}$.

  5. Classifying Written Texts Through Rhythmic Features

    NARCIS (Netherlands)

    Balint, Mihaela; Dascalu, Mihai; Trausan-Matu, Stefan

    2016-01-01

    Rhythm analysis of written texts focuses on literary analysis and it mainly considers poetry. In this paper we investigate the relevance of rhythmic features for categorizing texts in prosaic form pertaining to different genres. Our contribution is threefold. First, we define a set of rhythmic featu

  6. Opening Mathematics Texts: Resisting the Seduction

    Science.gov (United States)

    Wagner, David

    2012-01-01

    This analysis of the writing in a grade 7 mathematics textbook distinguishes between closed texts and open texts, which acknowledge multiple possibilities. I use tools that have recently been applied in mathematics contexts, focussing on grammatical features that include personal pronouns, modality, and types of imperatives, as well as on…

  7. Texts in multiple versions: histories of editions

    NARCIS (Netherlands)

    L. Giuliani; H. Brinkman; G. Lernout; M. Mathijsen

    2006-01-01

    Texts in multiple versions constitute the core problem of textual scholarship. For texts from antiquity and the medieval period, the many versions may be the result of manuscript transmission, requiring editors and readers to discriminate between levels of authority in variant readings produced alon

  8. Ontology Assisted Formal Specification Extraction from Text

    Directory of Open Access Journals (Sweden)

    Andreea Mihis

    2010-12-01

    Full Text Available In the field of knowledge processing, ontologies are the most important means. They make it possible for the computer to better understand natural language and to make judgments. In this paper, a method which uses ontologies in the semi-automatic extraction of formal specifications from a natural language text is proposed.

  9. On the Techniques of Journalistic Text Translation

    Institute of Scientific and Technical Information of China (English)

    林燕

    2015-01-01

    With the development of economic globalization, the translation of journalistic text has become increasingly important to cultural exchange and economic communication among different countries. This paper briefly introduces the characteristics of news text and, based on a case study, provides some feasible techniques for translation from English to Chinese or Chinese to English.

  10. Learning with Text in the Primary Grades.

    Science.gov (United States)

    Guillaume, Andrea M.

    1998-01-01

    Provides a rationale for learning-with-text experiences for primary-grade children; lists 10 general approaches to foster primary-grade content area reading; and offers a sample lesson incorporating these approaches that promotes comprehension of text and content matter. Suggests that trade books, textbooks, realistic fiction, and other print…

  11. The Patchwork Text in Teaching Greek Tragedy.

    Science.gov (United States)

    Parker, Jan

    2003-01-01

    Describes the rewards and challenges of using the Patchwork Text to teach Greek Tragedy to Cambridge University English final-year students. The article uses close reading of the students' texts, analysis and reflection to discuss both the products and the process of Patchwork writing. (Author/AEF)

  12. Text mining and visualization using VOSviewer

    CERN Document Server

    van Eck, Nees Jan

    2011-01-01

    VOSviewer is a computer program for creating, visualizing, and exploring bibliometric maps of science. In this report, the new text mining functionality of VOSviewer is presented. A number of examples are given of applications in which VOSviewer is used for analyzing large amounts of text data.

  13. The Managed Text: Prose and Qualms.

    Science.gov (United States)

    Kadushin, Charles

    1979-01-01

    Managed texts are written and designed by a team of writers and researchers under the direction and control of a publishing house. How these books got started, what needs they meet, their advantages and disadvantages, and the consequences they are having on college text publishing are addressed. (JMD)

  14. The Limited Benefits of Rereading Educational Texts

    Science.gov (United States)

    Callender, Aimee A.; McDaniel, Mark A.

    2009-01-01

    Though rereading is a study method commonly used by students, theoretical disagreement exists regarding whether rereading a text significantly enhances the representation and retention of the text's contents. In four experiments, we evaluated the effectiveness of rereading relative to a single reading in a context paralleling that faced by…

  15. Texts, Troubled Teens, and Troubling Times

    Science.gov (United States)

    Tatum, Alfred W., Ed.

    2009-01-01

    Seeking ways to effectively mediate texts with troubled teens in troubling times is worth the investment. Text is a powerful tool for shaping positive life trajectories, especially for those teens being affected by vulnerable-producing conditions that interrupt positive human development. These conditions, coupled with poor literacy skills…

  16. Modeling text with generalizable Gaussian mixtures

    DEFF Research Database (Denmark)

    Hansen, Lars Kai; Sigurdsson, Sigurdur; Kolenda, Thomas

    2000-01-01

    We apply and discuss generalizable Gaussian mixture (GGM) models for text mining. The model automatically adapts model complexity for a given text representation. We show that the generalizability of these models depends on the dimensionality of the representation and the sample size. We discuss...

  17. Extracting Conceptual Feature Structures from Text

    DEFF Research Database (Denmark)

    Andreasen, Troels; Bulskov, Henrik; Lassen, Tine;

    2011-01-01

    This paper describes an approach to indexing texts by their conceptual content using ontologies along with lexico-syntactic information and semantic role assignment provided by lexical resources. The conceptual content of meaningful chunks of text is transformed into conceptual feature structures...

  18. A text in Romani from 1622

    DEFF Research Database (Denmark)

    Bakker, Peter

    2015-01-01

    this is a reprint of a 2012 article: A new old text in Romani: Lord's Prayer, 1622. International Journal of Romani Language and Culture 2 (2011): 193-212.

  19. Text comprehension strategy instruction with poor readers

    NARCIS (Netherlands)

    Van den Bos, K.P.; Aarnoudse, C.C.; Brand-Gruwel, S.

    1998-01-01

    The goal of this study was to investigate the effects of teaching text comprehension strategies to children with decoding and reading comprehension problems and with a poor or normal listening ability. Two experiments are reported. Four text comprehension strategies, viz., question generation, summa

  20. Touchstone Texts: Fertile Ground for Creativity

    Science.gov (United States)

    Sturgell, Irma

    2008-01-01

    When state and local standards drive instruction, teachers often worry about compromising their creativity for a prescriptive curriculum with predictable outcomes. It is possible for creative teaching to flourish by using touchstone texts, or mentor texts, that engage both teacher and student in exploratory yet purposeful learning. (Contains 1…

  1. Texts, Transmissions, Receptions. Modern Approaches to Narratives

    NARCIS (Netherlands)

    Lardinois, A.P.M.H.; Levie, S.A.; Hoeken, H.; Lüthy, C.H.

    2015-01-01

    The papers collected in this volume study the function and meaning of narrative texts from a variety of perspectives. The word 'text' is used here in the broadest sense of the term: it denotes literary books, but also oral tales, speeches, newspaper articles and comics. One of the purposes of this v

  2. Text mining for the biocuration workflow.

    Science.gov (United States)

    Hirschman, Lynette; Burns, Gully A P C; Krallinger, Martin; Arighi, Cecilia; Cohen, K Bretonnel; Valencia, Alfonso; Wu, Cathy H; Chatr-Aryamontri, Andrew; Dowell, Karen G; Huala, Eva; Lourenço, Anália; Nash, Robert; Veuthey, Anne-Lise; Wiegers, Thomas; Winter, Andrew G

    2012-01-01

    Molecular biology has become heavily dependent on biological knowledge encoded in expert curated biological databases. As the volume of biological literature increases, biocurators need help in keeping up with the literature; (semi-) automated aids for biocuration would seem to be an ideal application for natural language processing and text mining. However, to date, there have been few documented successes for improving biocuration throughput using text mining. Our initial investigations took place for the workshop on 'Text Mining for the BioCuration Workflow' at the third International Biocuration Conference (Berlin, 2009). We interviewed biocurators to obtain workflows from eight biological databases. This initial study revealed high-level commonalities, including (i) selection of documents for curation; (ii) indexing of documents with biologically relevant entities (e.g. genes); and (iii) detailed curation of specific relations (e.g. interactions); however, the detailed workflows also showed considerable variability. Following the workshop, we conducted a survey of biocurators. The survey identified biocurator priorities, including the handling of full text indexed with biological entities and support for the identification and prioritization of documents for curation. It also indicated that two-thirds of the biocuration teams had experimented with text mining and almost half were using text mining at that time. Analysis of our interviews and survey provides a set of requirements for the integration of text mining into the biocuration workflow. These can guide the identification of common needs across curated databases and encourage joint experimentation involving biocurators, text mining developers and the larger biomedical research community.

  3. Rapid and effective synthesis of $^{40}\text{Ca}$-$^{27}\text{Al}$ ion pair towards quantum logic optical clock

    CERN Document Server

    Shang, Junjuan; Cao, Jian; Wang, Shaomao; Shu, Hualin; Huang, Xueren

    2016-01-01

    High-precision atomic clocks have been applied not only to very important technological problems such as synchronization and global navigation systems, but also to fundamental precision-measurement physics. A single $^{27}\text{Al}^+$ ion is one of the most attractive candidate systems because of its very low blackbody radiation effect, which dominates frequency shifts in other optical clock systems. Up to now, $^{27}\text{Al}^+$ still could not be laser-cooled directly because of the absence of a 167 nm laser. Sympathetic cooling is a viable method to solve this problem. In this work, we used a single laser-cooled $^{40}\text{Ca}^+$ ion to sympathetically cool one $^{27}\text{Al}^+$ ion in a linear Paul trap. Compared with the laser ablation method, we obtained much lower-velocity atoms sprayed from a home-made atom oven, which makes loading the aluminum ion more efficient and sympathetic cooling much easier. By the method of precisely measuring the secular frequency of the ion pair, finally we prove...

  4. Text Writing at an Undergraduate College.

    Science.gov (United States)

    Myers, David G.

    Strategies for writing a text are offered by a college professor on the basis of his own experience of writing a text on social psychology. Suggestions are given on creating an efficient office environment, researching the topic, and drafting the manuscript. One way to improve efficiency is to compress teaching into a few days, leaving the…

  5. Student Performance in an Electronic Text Environment.

    Science.gov (United States)

    Friedman, Edward A.; And Others

    1989-01-01

    Describes a project conducted at Stevens Institute of Technology to develop and test the applicability of full-text electronic databases and full-text retrieval technology for use in undergraduate humanities education. The creation of a machine-readable database on Galileo is described, student reactions are discussed, and further work is…

  6. Integrating Text Plans for Conciseness and Coherence

    CERN Document Server

    Harvey, Terrence; Carberry, Sandra

    1998-01-01

    Our experience with a critiquing system shows that when the system detects problems with the user's performance, multiple critiques are often produced. Analysis of a corpus of actual critiques revealed that even though each individual critique is concise and coherent, the set of critiques as a whole may exhibit several problems that detract from conciseness and coherence, and consequently assimilation. Thus a text planner was needed that could integrate the text plans for individual communicative goals to produce an overall text plan representing a concise, coherent message. This paper presents our general rule-based system for accomplishing this task. The system takes as input a set of individual text plans represented as RST-style trees, and produces a smaller set of more complex trees representing integrated messages that still achieve the multiple communicative goals of the individual text plans. Domain-independent rules are used to capture strategies across domains, while the facility for addition...

  7. Chapter 16: text mining for translational bioinformatics.

    Directory of Open Access Journals (Sweden)

    K Bretonnel Cohen

    2013-04-01

    Full Text Available Text mining for translational bioinformatics is a new field with tremendous research potential. It is a subfield of biomedical natural language processing that concerns itself directly with the problem of relating basic biomedical research to clinical practice, and vice versa. Applications of text mining fall both into the category of T1 translational research (translating basic science results into new interventions) and T2 translational research, or translational research for public health. Potential use cases include better phenotyping of research subjects, and pharmacogenomic research. A variety of methods for evaluating text mining applications exist, including corpora, structured test suites, and post hoc judging. Two basic principles of linguistic structure are relevant for building text mining applications. One is that linguistic structure consists of multiple levels. The other is that every level of linguistic structure is characterized by ambiguity. There are two basic approaches to text mining: rule-based, also known as knowledge-based; and machine-learning-based, also known as statistical. Many systems are hybrids of the two approaches. Shared tasks have had a strong effect on the direction of the field. Like all translational bioinformatics software, text mining software for translational bioinformatics can be considered health-critical and should be subject to the strictest standards of quality assurance and software testing.

  8. Integrating image data into biomedical text categorization.

    Science.gov (United States)

    Shatkay, Hagit; Chen, Nawei; Blostein, Dorothea

    2006-07-15

    Categorization of biomedical articles is a central task for supporting various curation efforts. It can also form the basis for effective biomedical text mining. Automatic text classification in the biomedical domain is thus an active research area. Contests organized by the KDD Cup (2002) and the TREC Genomics track (since 2003) defined several annotation tasks that involved document classification, and provided training and test data sets. So far, these efforts focused on analyzing only the text content of documents. However, as was noted in the KDD'02 text mining contest, where figure captions proved to be an invaluable feature for identifying documents of interest, images often provide curators with critical information. We examine the possibility of using information derived directly from image data, and of integrating it with text-based classification, for biomedical document categorization. We present a method for obtaining features from images and for using them, both alone and in combination with text, to perform the triage task introduced in the TREC Genomics track 2004. The task was to determine which documents are relevant to a given annotation task performed by the Mouse Genome Database curators. We show preliminary results, demonstrating that the method has a strong potential to enhance and complement traditional text-based categorization methods.

  9. The network of concepts in written texts

    CERN Document Server

    Caldeira, S M G; Andrade, R F S; Neme, A; Miranda, J G V; Caldeira, Silvia M. G.; Lobao, Thierry C. Petit; Neme, Alexis

    2005-01-01

    Complex network theory is used to investigate the structure of meaningful concepts in written texts of individual authors. Networks have been constructed after a two-phase filtering, in which words with less meaningful content are eliminated and all remaining words are set to their canonical form, without any number, gender or tense inflection. Each sentence in the text is added to the network as a clique. A large number of written texts have been scrutinized, and it is found that texts have small-world as well as scale-free structures. The growth process of these networks has also been investigated, and a universal evolution of network quantifiers has been found among the set of texts written by distinct authors. Further analyses, based on shuffling procedures applied either to the texts or to the constructed networks, provide hints on the role played by the word frequency and sentence length distributions in the network structure. Since the meaningful words are related to concepts in the author's mind, results for text...
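
    The construction described in this abstract (filter out low-content words, reduce the rest to a canonical form, then add each sentence as a clique) can be sketched roughly as follows. This is only an illustrative sketch and not the authors' implementation: the stop-word list, the tiny lemma table, the naive sentence splitter, and the name `concept_network` are all assumptions made for the example.

```python
import re
from itertools import combinations

# Hypothetical stop-word list and lemma table, for illustration only;
# a real study would use proper linguistic resources for both phases.
STOP_WORDS = {"the", "a", "an", "of", "and", "is", "are", "in", "to", "on"}
LEMMAS = {"cats": "cat", "dogs": "dog", "chased": "chase", "chases": "chase"}


def concept_network(text):
    """Return an adjacency dict mapping each concept to the set of its neighbours."""
    graph = {}
    # Naive sentence splitting on terminal punctuation (an assumption).
    for sentence in re.split(r"[.!?]+", text):
        words = re.findall(r"[a-z]+", sentence.lower())
        # Phase 1: drop low-content words. Phase 2: map to canonical form.
        concepts = {LEMMAS.get(w, w) for w in words if w not in STOP_WORDS}
        for w in concepts:
            graph.setdefault(w, set())
        # Add the sentence as a clique: link every pair of its concepts.
        for u, v in combinations(sorted(concepts), 2):
            graph[u].add(v)
            graph[v].add(u)
    return graph


sample = "The cats chased the dogs. A dog chases the ball."
net = concept_network(sample)
edge_count = sum(len(nbrs) for nbrs in net.values()) // 2
print(sorted(net), edge_count)
```

    In this toy input the two sentences share the concepts "dog" and "chase", so the two cliques overlap on one edge; quantities such as degree distribution or clustering could then be computed on the resulting graph.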

  10. Rhetorical structure theory and text analysis

    Science.gov (United States)

    Mann, William C.; Matthiessen, Christian M. I. M.; Thompson, Sandra A.

    1989-11-01

    Recent research on text generation has shown that there is a need for stronger linguistic theories that tell in detail how texts communicate. The prevailing theories are very difficult to compare, and it is also very difficult to see how they might be combined into stronger theories. To make comparison and combination a bit more approachable, we have created a book which is designed to encourage comparison. A dozen different authors or teams, all experienced in discourse research, are given exactly the same text to analyze. The text is an appeal for money by a lobbying organization in Washington, DC. It informs, stimulates and manipulates the reader in a fascinating way. The joint analysis is far more insightful than any one team's analysis alone. This paper is our contribution to the book. Rhetorical Structure Theory (RST), the focus of this paper, is a way to account for the functional potential of text, its capacity to achieve the purposes of speakers and produce effects in hearers. It also shows a way to distinguish coherent texts from incoherent ones, and identifies consequences of text structure.

  11. A New Text Location Approach Based Wavelet

    Institute of Scientific and Technical Information of China (English)

    Weihua Li; Zhen Fang; Shuozhong Wang

    2002-01-01

    With the advancement of content-based retrieval technology, the semantics of text information contained in images has attracted many researchers. An algorithm that automatically locates the textual regions in the input image will facilitate the retrieval task, and the optical character recognizer can then be applied to only those regions of the image which contain text. In this paper a new text location method is described, which can be used to locate textual regions in complex images and video frames. Experimental results show that the textual regions in an image can be located effectively and quickly.

  12. A New Text Location Approach Based Wavelet

    Institute of Scientific and Technical Information of China (English)

    Weihua Li; Zhen Fang; Shuozhong Wang

    2002-01-01

    With the advancement of content-based retrieval technology, the semantics of text information contained in images has attracted many researchers. An algorithm that automatically locates the textual regions in the input image will facilitate the retrieval task, and the optical character recognizer can then be applied to only those regions of the image which contain text. In this paper a new wavelet-based text location method is described, which can be used to locate textual regions in complex images and video frames. Experimental results show that the textual regions in an image can be located effectively and quickly.

  13. NOTICING AND TEXT-BASED CHAT

    Directory of Open Access Journals (Sweden)

    Chun Lai

    2006-09-01

    Full Text Available This study examined the capacity of text-based online chat to promote learners’ noticing of their problematic language productions and of the interactional feedback from their interlocutors. In this study, twelve ESL learners formed six mixed-proficiency dyads. The same dyads worked on two spot-the-difference tasks, one via online chat and the other through face-to-face conversation. Stimulated recall sessions were held subsequently to identify instances of noticing. It was found that text-based online chat promotes noticing more than face-to-face conversations, especially in terms of learners’ noticing of their own linguistic mistakes.

  14. Extracting and Sharing Knowledge from Medical Texts

    Institute of Scientific and Technical Information of China (English)

    曹存根

    2002-01-01

    In recent years, we have been developing a new framework for acquiring medical knowledge from encyclopedic texts. This framework consists of three major parts. The first part is an extended high-level conceptual language (called HLCL 1.1) for use by knowledge engineers to formalize knowledge texts in an encyclopedia. The second part is an HLCL 1.1 compiler for parsing and analyzing the formalized texts into knowledge models. The third part is a set of domain-specific ontologies for sharing knowledge.

  15. Cohesive Function of Lexical Repetition in Text

    Institute of Scientific and Technical Information of China (English)

    张莉; 卢沛沛

    2013-01-01

    Lexical repetition is the most direct form of lexical cohesion, which is the central device for making texts hang together. Although repetition is the most direct way to emphasize, it performs the cohesive effect more apparently.

  16. AUTOMATIC TEXT SUMMARIZATION BASED ON TEXTUAL COHESION

    Institute of Scientific and Technical Information of China (English)

    Chen Yanmin; Liu Bingquan; Wang Xiaolong

    2007-01-01

    This paper presents two different algorithms that derive the cohesion structure in the form of lexical chains from two kinds of language resources, HowNet and TongYiCiCiLin. The research that connects the cohesion structure of a text to the derivation of its summary is displayed. A novel model of automatic text summarization is devised, based on the data provided by lexical chains from original texts. Moreover, the construction rules of lexical chains are modified according to characteristics of the knowledge database in order to be more suitable for Chinese summarization. Evaluation results show that high quality indicative summaries are produced from Chinese texts.

  17. Figures of thought mathematics and mathematical texts

    CERN Document Server

    Reed, David

    2003-01-01

    Examines the ways in which mathematical works can be read as texts, examines their textual strategies and demonstrates that such readings provide a rich source of philosophical debate regarding mathematics.

  18. Voice to Text Language Translation (VTLT) Project

    Data.gov (United States)

    National Aeronautics and Space Administration — A feasibility analysis of adding a second modality to pilot/Air Traffic Control (ATC) communications. The real time availability of text in Air Traffic Control...

  19. Text-Filled Stacked Area Graphs

    DEFF Research Database (Denmark)

    Kraus, Martin

    2011-01-01

    Text can add a significant amount of detail and value to an information visualization. In particular, it can integrate more of the data that a visualization is based on, and it can also integrate information that is personally relevant to readers of a visualization. This may influence readers to consider a visualization a detailed enrichment of their personal experience instead of an abstract representation of anonymous numbers. However, the integration of textual detail into a visualization is often very challenging. This work discusses one particular approach to this problem, namely text-filled stacked area graphs; i.e., graphs that feature stacked areas that are filled with small-typed text. Since these graphs allow for computing the text layout automatically, it is possible to include large amounts of textual detail with very little effort. We discuss the most important challenges and some...

  20. The Educational Objectives of International Relations Texts

    Science.gov (United States)

    Pearson, Frederic S.

    1974-01-01

    Certain educational objectives are proposed for undergraduate and graduate international studies education, investigating the interrelationship of these objectives and evaluating the appropriateness and adequacy of current texts for such objectives. (Author/KM)

  1. The Relationship between Paraphrasing and Text Analysis

    Directory of Open Access Journals (Sweden)

    María Luisa Cepeda Islas

    2013-04-01

    Given the importance of paraphrasing in the process of comprehension for college students, this study assessed the level of text analysis and paraphrasing in the responses of a sample of psychology students. We selected a group of freshmen in the Psychology course, who were asked to answer a questionnaire and to summarize an empirical article. The results showed that participants had a low level of text analysis and, at the same time, low levels of paraphrasing. Verbatim copying of the text predominated. The results suggest some possibilities for structuring a training workshop that covers not only paraphrasing but also the analysis of text.

  2. QuitNowTXT Text Messaging Library

    Data.gov (United States)

    U.S. Department of Health & Human Services — Overview: The QuitNowTXT text messaging program is designed as a resource that can be adapted to specific contexts including those outside the United States and in...

  3. Executive Decision:Text or Talk?

    Institute of Scientific and Technical Information of China (English)

    Rahma; Karam

    2011-01-01

    Nielsen Media, a global market research company, reported in March that spending on voice calls has gone down significantly over the last five years, while customers’ text spending is increasing. It’s anticipated that texting will eclipse voice calls totally in three years.

  4. Being Brave: Writing Environmental Education Research Texts.

    Science.gov (United States)

    Lotz-Sisitka, Heila; Burt, Jane

    2002-01-01

    Explores some of the headwork that goes into textwork in environmental education research. Reflects upon some of the institutional and epistemological issues associated with writing social science research texts. (Contains 26 references.) (Author/YDS)

  5. Strategies to Increase Accuracy in Text Classification

    NARCIS (Netherlands)

    Blommesteijn, D.

    2014-01-01

    Text classification via supervised learning involves various steps from processing raw data, features extraction to training and validating classifiers. Within these steps implementation decisions are critical to the resulting classifier accuracy. This paper contains a report of the study performed

  6. Discovery of Recurring Anomalies in Text Reports

    Data.gov (United States)

    National Aeronautics and Space Administration — This paper describes the results of a significant research and development effort conducted at NASA Ames Research Center to develop new text mining algorithms to...

  7. Text-Based Recall and Extra-Textual Generations Resulting from Simplified and Authentic Texts

    Science.gov (United States)

    Crossley, Scott A.; McNamara, Danielle S.

    2016-01-01

    This study uses a moving windows self-paced reading task to assess text comprehension of beginning and intermediate-level simplified texts and authentic texts by L2 learners engaged in a text-retelling task. Linear mixed effects (LME) models revealed statistically significant main effects for reading proficiency and text level on the number of…

  8. How Popular Culture Texts Inform and Shape Students' Discussions of Social Studies Texts

    Science.gov (United States)

    Hall, Leigh A.

    2012-01-01

    In this article, I examine how 6th-grade students used pop culture texts to inform their understandings about social studies texts and shape their discussions of it. Discussions showed that students used pop culture texts in three ways when talking about social studies texts. First, students applied comprehension strategies to pop culture texts to…

  9. Engaging Texts: Effects of Concreteness on Comprehensibility, Interest, and Recall in Four Text Types.

    Science.gov (United States)

    Sadoski, Mark; Goetz, Ernest T.; Rodriguez, Maximo

    2000-01-01

    Investigates concreteness as a text feature that engaged undergraduate readers' comprehension, interest, and learning in four text types: persuasion, exposition, literary stories, and narratives. Results show that concrete texts were recalled better than abstract texts, although the magnitude of the advantage varied across text types. Concreteness…

  10. Cohesion in Computer Text Generation: Lexical Substitution.

    Science.gov (United States)

    1983-05-01

    ...substitutions. Paul is able to generate a cohesive text which exhibits the binding of sentences through presupposition dependencies, the marking of old... problem in using these cohesive devices is that it is necessary to guarantee that they are understandable. That is, since these items refer anaphorically...

  11. Comparison between Two Text Digital Watermarking Algorithms

    Institute of Scientific and Technical Information of China (English)

    TANG Sheng; XUE Xu-ce

    2011-01-01

    In this paper, two text digital watermarking methods are compared in the context of their robustness performances. A nonlinear watermarking algorithm embeds the watermark into the reordered DCT coefficients of a text image, and utilizes a nonlinear detector to detect the watermark in some attacks. Compared with the classical watermarking algorithm, experimental results show that this nonlinear watermarking algorithm has some potential merits.

  12. Reading Comprehension Assessment : From Text Perspectives

    OpenAIRE

    小林, 美代子; コバヤシ, ミヨコ; MIYOKO, KOBAYASHI

    2004-01-01

    This paper investigates the nature of reading comprehension questions. Very few studies have so far examined comprehension questions in relation to text features. Kintsch and Yarbrough (1982) and Shohamy and Inbar (1991) are among the few studies, and their results suggest that there is an interaction between text features and the focus of questions. The present study builds on these findings and examines how Meyer's (1975, 1985) model of content structure analysis can help identify what exac...

  13. Text and Voice: Complements, Substitutes or Both?

    OpenAIRE

    Andersson, Kjetil; Foros, Øystein; Steen, Frode

    2006-01-01

    Text messaging has become an important revenue component for European and Asian mobile operators. We develop a simple model of demand for mobile services incorporating the existence of call externalities and network effects. We show that when incoming messages and calls stimulate outgoing communications, services that are perceived as substitutes, such as mobile text and voice, may evolve into complements in terms of the price effect when the network size becomes large. We esti...

  14. Dress and Identity in Old Babylonian Texts

    OpenAIRE

    Tanaka, Terri-lynn Wai Ping Hong

    2013-01-01

    The present study argues that using dress theory is a productive means of reading cuneiform texts from ancient Mesopotamia. Although anthropological studies on dress have flourished in recent years, and despite the economic and social importance of dress in ancient Mesopotamia, previous research has focused on either archaeological remains or pictorial representations of dress; however, anthropological theories on dress have not yet been applied to ancient Mesopotamian cuneiform texts writte...

  15. Position index preserving compression of text data

    OpenAIRE

    Akhtar, Nasim; Rashid, Mamunur; Islam, Shafiqul; Kashem, Mohammod Abul; Kolybanov, Cyrll Y.

    2011-01-01

    Data compression offers an attractive approach to reducing communication cost by using available bandwidth effectively. It also secures data during transmission because the data is sent in encoded form. In this paper an index-based, position-oriented lossless text compression method called PIPC (Position Index Preserving Compression) is developed. In PIPC the position of the input word is denoted by ASCII code. The basic philosophy of the secure compression is to preprocess the text and transform it into some intermedia...

  16. Figure-associated text summarization and evaluation.

    Science.gov (United States)

    Polepalli Ramesh, Balaji; Sethi, Ricky J; Yu, Hong

    2015-01-01

    Biomedical literature incorporates millions of figures, which are a rich and important knowledge resource for biomedical researchers. Scientists need access to the figures and the knowledge they represent in order to validate research findings and to generate new hypotheses. By themselves, these figures are nearly always incomprehensible to both humans and machines and their associated texts are therefore essential for full comprehension. The associated text of a figure, however, is scattered throughout its full-text article and contains redundant information content. In this paper, we report the continued development and evaluation of several figure summarization systems, the FigSum+ systems, that automatically identify associated texts, remove redundant information, and generate a text summary for every figure in an article. Using a set of 94 annotated figures selected from 19 different journals, we conducted an intrinsic evaluation of FigSum+. We evaluate the performance by precision, recall, F1, and ROUGE scores. The best FigSum+ system is based on an unsupervised method, achieving F1 score of 0.66 and ROUGE-1 score of 0.97. The annotated data is available at figshare.com (http://figshare.com/articles/Figure_Associated_Text_Summarization_and_Evaluation/858903).
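
    The precision, recall and F1 scores named in the abstract above can be illustrated with a minimal sketch. It assumes that a summary is scored as a set of selected sentence ids against a gold-standard set; the function name and the sample ids are hypothetical and are not taken from the FigSum+ paper.

```python
def precision_recall_f1(selected, reference):
    """Score a set of selected sentence ids against a gold-standard set."""
    selected, reference = set(selected), set(reference)
    true_positives = len(selected & reference)
    # Guard against empty sets so the ratios are always defined.
    precision = true_positives / len(selected) if selected else 0.0
    recall = true_positives / len(reference) if reference else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    # F1 is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical sentence ids, not data from the paper.
p, r, f1 = precision_recall_f1(selected=[1, 2, 3, 4], reference=[2, 3, 5])
print(p, r, f1)
```

    Treating sentences as set members ignores partial overlap between sentences, which is what n-gram measures such as ROUGE are designed to capture.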

  17. Text Entry by Gazing and Smiling

    Directory of Open Access Journals (Sweden)

    Outi Tuisku

    2013-01-01

    Face Interface is a wearable prototype that combines the use of voluntary gaze direction and facial activations, for pointing and selecting objects on a computer screen, respectively. The aim was to investigate the functionality of the prototype for entering text. First, three on-screen keyboard layout designs were developed and tested (n=10) to find a layout that would be more suitable for text entry with the prototype than the traditional QWERTY layout. The task was to enter one word ten times with each of the layouts by pointing at letters with gaze and selecting them by smiling. Subjective ratings showed that a layout with large keys on the edge and small keys near the center of the keyboard was rated as the most enjoyable, clearest, and most functional. Second, using this layout, the aim of the second experiment (n=12) was to compare entering text with Face Interface to entering text with a mouse. The results showed that the text entry rate was 20 characters per minute (cpm) for Face Interface and 27 cpm for the mouse. For Face Interface, the keystrokes per character (KSPC) value was 1.1 and the minimum string distance (MSD) error rate was 0.12. These values compare especially well with other similar techniques.
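
    The three text entry measures reported above can be sketched as follows. The sketch assumes common definitions: characters per minute as transcribed length over elapsed minutes, KSPC as keystrokes over transcribed length, and MSD error rate as the Levenshtein distance between presented and transcribed text divided by the longer of the two lengths. The abstract does not state which exact formulas the authors used, and all function names and sample values here are hypothetical.

```python
def levenshtein(a, b):
    """Minimum number of insertions, deletions and substitutions turning a into b."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution or match
        previous = current
    return previous[-1]


def msd_error_rate(presented, transcribed):
    """MSD divided by the longer string length (assumed definition)."""
    longest = max(len(presented), len(transcribed))
    return levenshtein(presented, transcribed) / longest if longest else 0.0


def kspc(keystrokes, transcribed):
    """Keystrokes per character of the final transcribed text."""
    return keystrokes / len(transcribed)


def chars_per_minute(transcribed, seconds):
    """Text entry rate in characters per minute."""
    return len(transcribed) / (seconds / 60.0)


# Hypothetical trial: the word "smile" entered as "smale" in 15 s with 6 selections.
print(msd_error_rate("smile", "smale"), kspc(6, "smale"), chars_per_minute("smale", 15))
```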

  18. Monolingual accounting dictionaries for EFL text production

    Directory of Open Access Journals (Sweden)

    Sandro Nielsen

    2006-10-01

    Monolingual accounting dictionaries are important for producing financial reporting texts in English in an international setting, because of the lack of specialised bilingual dictionaries. As the intended user groups have different factual and linguistic competences, they require specific types of information. By identifying and analysing the users' factual and linguistic competences, user needs, use-situations and the stages involved in producing accounting texts in English as a foreign language, lexicographers will have a sound basis for designing the optimal English accounting dictionary for EFL text production. The monolingual accounting dictionary needs to include information about UK, US and international accounting terms, their grammatical properties, their potential for being combined with other words in collocations, phrases and sentences in order to meet user requirements. Data items that deal with these aspects are necessary for the international user group as they produce subject-field specific and register-specific texts in a foreign language, and the data items are relevant for the various stages in text production: draft writing, copyediting, stylistic editing and proofreading.

  19. Inspiration and the Texts of the Bible

    Directory of Open Access Journals (Sweden)

    Dirk Buchner

    1997-01-01

    This article seeks to explore what the inspired text of the Old Testament was as it existed for the New Testament authors, particularly for the author of the book of Hebrews. A quick look at the facts makes it clear that there was, at the time, more than one 'inspired' text; among these were the Septuagint and the Masoretic Text, 'to name but two'. The latter eventually gained ascendancy, which is why it forms the basis of our translated Old Testament today. Yet we have to ask: what do we make of that other text that was the inspired Bible to the early Church, especially to the writer of the book of Hebrews, who ignored the Masoretic text? This article will take a brief look at some suggestions for a doctrine of inspiration that keeps up with the facts of Scripture. Allied to this, the article is something of a bibliographical study of recent developments in textual research following the discovery of the Dead Sea scrolls.

  20. A method of text watermarking using presuppositions

    Science.gov (United States)

    Vybornova, O.; Macq, B.

    2007-02-01

    We propose a method for watermarking texts of arbitrary length using natural-language semantic structures. As the key to our approach we use the linguistic semantic phenomenon of presuppositions. Presupposition is the implicit information considered as well-known or which readers of the text are supposed to treat as well-known; this information is a semantic component of certain linguistic expressions (lexical items and syntactical constructions called presupposition triggers). The same sentence can be used with or without presupposition, or with a different presupposition trigger, provided that all the relations between subjects, objects and other discourse referents are preserved; such transformations will not change the meaning of the sentence. We define the distinct rules for presupposition identification for each trigger and regular transformation rules for using or not using the presupposition in a given sentence (one bit per sentence in this case). Isolated sentences can carry the proposed watermarks. However, the longer the text, the more efficient the watermark. The proposed approach is resilient to the main types of random transformations, like passivization, topicalization, extraposition, preposing, etc. The web of resolved presupposed information in the text will hold the watermark of the text (e.g. integrity watermark, or proof of ownership), introducing "secret ordering" into the text structure to make it resilient to "data loss" attacks and "data altering" attacks.

  1. A Survey of Unstructured Text Summarization Techniques

    Directory of Open Access Journals (Sweden)

    Sherif Elfayoumy

    2014-05-01

    Due to the explosive amounts of text data being created and organizations' increased desire to leverage their data corpora, especially with the availability of Big Data platforms, there is not usually enough time to read and understand each document and make decisions based on document contents. Hence, there is a great demand for summarizing text documents to provide a representative substitute for the original documents. By improving summarizing techniques, precision of document retrieval through search queries against summarized documents is expected to improve in comparison to querying against the full spectrum of original documents. Several generic text summarization algorithms have been developed, each with its own advantages and disadvantages. For example, some algorithms are particularly good for summarizing short documents but not for long ones. Others perform well in identifying and summarizing single-topic documents but their precision degrades sharply with multi-topic documents. In this article we present a survey of the literature in text summarization. We also survey some of the most common evaluation methods for the quality of automated text summarization techniques. Last, we identify some of the challenging problems that are still open, in particular the need for a universal approach that yields good results for mixed types of documents.

  2. Chapter 16: text mining for translational bioinformatics.

    Science.gov (United States)

    Cohen, K Bretonnel; Hunter, Lawrence E

    2013-04-01

    Text mining for translational bioinformatics is a new field with tremendous research potential. It is a subfield of biomedical natural language processing that concerns itself directly with the problem of relating basic biomedical research to clinical practice, and vice versa. Applications of text mining fall both into the category of T1 translational research (translating basic science results into new interventions) and T2 translational research, or translational research for public health. Potential use cases include better phenotyping of research subjects, and pharmacogenomic research. A variety of methods for evaluating text mining applications exist, including corpora, structured test suites, and post hoc judging. Two basic principles of linguistic structure are relevant for building text mining applications. One is that linguistic structure consists of multiple levels. The other is that every level of linguistic structure is characterized by ambiguity. There are two basic approaches to text mining: rule-based, also known as knowledge-based; and machine-learning-based, also known as statistical. Many systems are hybrids of the two approaches. Shared tasks have had a strong effect on the direction of the field. Like all translational bioinformatics software, text mining software for translational bioinformatics can be considered health-critical and should be subject to the strictest standards of quality assurance and software testing.

  3. Practical vision based degraded text recognition system

    Science.gov (United States)

    Mohammad, Khader; Agaian, Sos; Saleh, Hani

    2011-02-01

    Rapid growth and progress in the medical, industrial, security and technology fields mean more and more consideration for the use of camera-based optical character recognition (OCR). Applying OCR to scanned documents is quite mature, and there are many commercial and research products available on this topic. These products achieve acceptable recognition accuracy and reasonable processing times, especially with trained software and constrained text characteristics. Even though the application space for OCR is huge, it is quite challenging to design a single system that is capable of performing automatic OCR for text embedded in an image irrespective of the application. Challenges for OCR systems include images taken under natural real-world conditions, surface curvature, text orientation, font, size, lighting conditions, and noise. These and many other conditions make it extremely difficult to achieve reasonable character recognition. Performance for conventional OCR systems drops dramatically as the degradation level of the text image quality increases. In this paper, a new recognition method is proposed to recognize solid or dotted line degraded characters. The degraded text string is localized and segmented using a new algorithm. The new method was implemented and tested using a development framework system that is capable of performing OCR on camera-captured images. The framework allows parameter tuning of the image-processing algorithm based on a training set of camera-captured text images. Novel methods were used for enhancement, text localization and the segmentation algorithm, which enables building a custom system that is capable of performing automatic OCR and can be used for different applications. The developed framework system includes new image enhancement, filtering, and segmentation techniques which enabled higher recognition accuracies, faster processing time, and lower energy consumption, compared with the best state of the art published

  4. Native Language Processing using Exegy Text Miner

    Energy Technology Data Exchange (ETDEWEB)

    Compton, J

    2007-10-18

    Lawrence Livermore National Laboratory's New Architectures Testbed recently evaluated Exegy's Text Miner appliance to assess its applicability to high-performance, automated native language analysis. The evaluation was performed with support from the Computing Applications and Research Department in close collaboration with Global Security programs, and institutional activities in native language analysis. The Exegy Text Miner is a special-purpose device for detecting and flagging user-supplied patterns of characters, whether in streaming text or in collections of documents at very high rates. Patterns may consist of simple lists of words or complex expressions with sub-patterns linked by logical operators. These searches are accomplished through a combination of specialized hardware (i.e., one or more field-programmable gate arrays in addition to general-purpose processors) and proprietary software that exploits these individual components in an optimal manner (through parallelism and pipelining). For this application the Text Miner has performed accurately and reproducibly at high speeds approaching those documented by Exegy in its technical specifications. The Exegy Text Miner is primarily intended for the single-byte ASCII characters used in English, but at a technical level its capabilities are language-neutral and can be applied to multi-byte character sets such as those found in Arabic and Chinese. The system is used for searching databases or tracking streaming text with respect to one or more lexicons. In a real operational environment it is likely that data would need to be processed separately for each lexicon or search technique. However, the searches would be so fast that multiple passes should not be considered as a limitation a priori. Indeed, it is conceivable that large databases could be searched as often as necessary if new queries were deemed worthwhile. This project is concerned with evaluating the Exegy Text Miner installed in the

  5. A NOVEL MULTIDICTIONARY BASED TEXT COMPRESSION

    Directory of Open Access Journals (Sweden)

    Y. Venkataramani

    2012-01-01

    The volume of digital content is growing quickly, and so is the demand for communicating it, while storage and bandwidth increase at a slower rate. Powerful and efficient compression methods are therefore required. The repetition of words and phrases makes the reordered text much more compressible than the original text. On the whole, the system is fast and achieves close to the best result on the test files. This study proposes a novel, fast dictionary-based text compression technique, MBRH (Multidictionary with Burrows-Wheeler transform, Run-length coding and Huffman coding), to obtain improved performance on various document sizes. The MBRH algorithm comprises two stages: the first converts the input text into a dictionary-based compressed form, and the second reduces the redundancy in the multidictionary-based compression by using BWT, RLE and Huffman coding. On the bib test file, with an input size of 111,261 bytes, MBRH achieves a compression ratio of 0.192, a bit rate of 1.538 and high speed. The algorithm attains a good compression ratio, a reduced bit rate and increased execution speed.
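
    Two of the building blocks named in the abstract above, the Burrows-Wheeler transform (BWT) and run-length encoding (RLE), can be sketched in a few lines. This is a textbook illustration, not the MBRH implementation: the dictionary stage and the Huffman stage are omitted, the rotation sort is the naive quadratic-space version, and the sentinel character is an assumption (it must not occur in the input and must sort before every input character).

```python
def bwt(text, sentinel="$"):
    """Naive Burrows-Wheeler transform: last column of the sorted rotations."""
    s = text + sentinel  # the sentinel marks the end so the transform is invertible
    rotations = sorted(s[i:] + s[:i] for i in range(len(s)))
    return "".join(rotation[-1] for rotation in rotations)


def run_length_encode(s):
    """Collapse runs of equal characters into (character, count) pairs."""
    runs = []
    for ch in s:
        if runs and runs[-1][0] == ch:
            runs[-1][1] += 1
        else:
            runs.append([ch, 1])
    return [(ch, count) for ch, count in runs]


transformed = bwt("banana")
print(transformed)                     # annb$aa
print(run_length_encode(transformed))  # [('a', 1), ('n', 2), ('b', 1), ('$', 1), ('a', 2)]
```

    The BWT groups equal characters into runs, which is what makes the following run-length and entropy coding stages effective. If the compression ratio is read as compressed size over original size, 0.192 corresponds to about 0.192 × 8 ≈ 1.54 bits per input byte, which is close to the reported bit rate of 1.538.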

  6. La dimension diachronique des textes beckettiens

    Directory of Open Access Journals (Sweden)

    Carla Taban

    2007-07-01

    This discussion sets out to show that the diachronic aspects of French and English, understood restrictively as the semantic evolution of the lexemes of the two languages and not as their syntactic or phonetic evolution, operate in Beckett's texts as po(i)etic modalities of meaning differentiation. In other words, the way in which lexical units are inscribed in their intra-textual environment (that of a given text) and their intra-inter-textual environment (that of a bilingual pair of corresponding texts) allows, indeed requires, that they be actualized simultaneously with several meanings, some of which are original or historical. The diachronic dimension in both languages thus offers Beckett a tool for increasing the signifying potential of his texts.

  7. The Impact of Texting on Comprehension

    Directory of Open Access Journals (Sweden)

    Jamal K. M. Ali

    2015-07-01

    This paper presents a study of the effects of texting on English language comprehension. The authors believe that English used in texting causes a lack of comprehension for English speakers, learners, and texters. Wei, Xian-hai and Jiang (2008:3) declare, “In Netspeak, there are some newly-created vocabularies, which people cannot comprehend them either from their partial pronunciation or from their figures.” Crystal (2007:23) claims, “variation causes problems of comprehension and acceptability. If you speak or write differently from the way I do, we may fail to understand each other.” In this paper, the authors administered a questionnaire at Aligarh Muslim University to ninety respondents from five different Faculties and four different levels. To measure respondents’ comprehension of English texting, the authors gave the respondents abbreviations used by texters and asked them to write the full forms of the abbreviations. The authors found that many abbreviations were not understood, which suggested that most of the respondents did not understand and did not use these abbreviations. Keywords: abbreviation, comprehension, texting, texters, variation

  8. ERRORS AND DIFFICULTIES IN TRANSLATING LEGAL TEXTS

    Directory of Open Access Journals (Sweden)

    Camelia, CHIRILA

    2014-11-01

    Nowadays the accurate translation of legal texts has become highly important as the mistranslation of a passage in a contract, for example, could lead to lawsuits and loss of money. Consequently, the translation of legal texts to other languages faces many difficulties and only professional translators specialised in legal translation should deal with the translation of legal documents and scholarly writings. The purpose of this paper is to analyze translation from three perspectives: translation quality, errors and difficulties encountered in translating legal texts and consequences of such errors in professional translation. First of all, the paper points out the importance of performing a good and correct translation, which is one of the most important elements to be considered when discussing translation. Furthermore, the paper presents an overview of the errors and difficulties in translating texts and of the consequences of errors in professional translation, with applications to the field of law. The paper is also an approach to the differences between languages (English and Romanian that can hinder comprehension for those who have embarked upon the difficult task of translation. The research method that I have used to achieve the objectives of the paper was the content analysis of various Romanian and foreign authors' works.

  9. Handwriting segmentation of unconstrained Oriya text

    Indian Academy of Sciences (India)

    N Tripathy; U Pal

    2006-12-01

    Segmentation of handwritten text into lines, words and characters is one of the important steps in the handwritten text recognition process. In this paper we propose a water reservoir concept-based scheme for segmentation of unconstrained Oriya handwritten text into individual characters. Here, at first, the text image is segmented into lines, and the lines are then segmented into individual words. For line segmentation, the document is divided into vertical stripes. Analysing the heights of the water reservoirs obtained from different components of the document, the width of a stripe is calculated. Stripe-wise horizontal histograms are then computed and the relationship of the peak–valley points of the histograms is used for line segmentation. Based on vertical projection profiles and structural features of Oriya characters, text lines are segmented into words. For character segmentation, at first, the isolated and connected (touching) characters in a word are detected. Using structural, topological and water reservoir concept-based features, characters of the word that touch are then segmented. From experiments we have observed that the proposed “touching character” segmentation module has 96·7% accuracy for two-character touching strings.
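
    The line segmentation step described above rests on horizontal histograms (row-wise ink counts). A simplified sketch of that idea follows. It assumes a binary image stored as a list of rows with 1 for ink, and it profiles the whole page at once; the stripe-wise histograms, peak-valley analysis and water reservoir features of the paper are not reproduced. The function names and the threshold parameter are hypothetical.

```python
def row_profile(image):
    """Horizontal histogram: number of ink pixels (value 1) in each row."""
    return [sum(row) for row in image]


def segment_lines(image, min_ink=1):
    """Return (start_row, end_row) pairs, end exclusive, for runs of inked rows."""
    lines, start = [], None
    for y, ink in enumerate(row_profile(image)):
        if ink >= min_ink and start is None:
            start = y                      # a text line begins
        elif ink < min_ink and start is not None:
            lines.append((start, y))       # a valley in the profile ends the line
            start = None
    if start is not None:                  # the last line touches the bottom edge
        lines.append((start, len(image)))
    return lines


# Toy 6x4 binary image with two text lines separated by an empty row.
page = [
    [0, 0, 0, 0],
    [1, 1, 0, 1],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
    [1, 0, 1, 1],
    [0, 0, 0, 0],
]
print(segment_lines(page))  # [(1, 3), (4, 5)]
```

    A whole-page profile breaks down when handwritten lines are skewed or overlap vertically, which is presumably why the paper computes the histograms per vertical stripe.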

  10. Introduction, Critical Textology and Textual Criticism

    Directory of Open Access Journals (Sweden)

    فرزاد قائمی

    2013-06-01

    Asadi’s Shahnameh is a great epic consisting of twenty-four thousand distiches and is attributed to Asadi or another poet of the same nickname. This work was created in the same line of development as Ferdowsi’s Shahnameh. The main theme is the old campaign of Soleymān to Iran to confront Rostam and Keykhosrow and to repeat the pattern of Rostam’s battles with his children in a state of anonymity. The text structure is episodic with numerous central characters. The narratives are for the most part derived from oral literature. Textual evidence demonstrates that the poet is Shiite. The narrative content, chronogram as well as the literary and linguistic style of one of the manuscripts reveal that the text was written in the ninth century (probably 809 A.H.). The article first introduces the text and the origin of its narratives in oral literature; it then proceeds with the study of the narrative structure of the epic using three available manuscripts dating back to the thirteenth and fourteenth centuries (A.H.). Textology and Textual Criticism have been employed as the research methodology. The literary and linguistic features of the text have also been examined at three levels: lexical, syntactic and rhetorical.

  11. Word-Sized Graphics for Scientific Texts.

    Science.gov (United States)

    Beck, Fabian; Weiskopf, Daniel

    2017-02-24

    Generating visualizations at the size of a word creates dense information representations often called sparklines. The integration of word-sized graphics into text could avoid additional cognitive load caused by splitting the readers' attention between figures and text. In scientific publications, these graphics make statements easier to understand and verify because additional quantitative information is available where needed. In this work, we perform a literature review to find out how researchers have already applied such word-sized representations. Illustrating the versatility of the approach, we leverage these representations for reporting empirical and bibliographic data in three application examples. For interactive Web-based publications, we explore levels of interactivity and discuss interaction patterns to link visualization and text. Finally, we call on the visualization community to be a pioneer in exploring new visualization-enriched and interactive publication formats.

  12. Monolingual accounting dictionaries for EFL text production

    DEFF Research Database (Denmark)

    Nielsen, Sandro

    2006-01-01

    Monolingual accounting dictionaries are important for producing financial reporting texts in English in an international setting, because of the lack of specialised bilingual dictionaries. As the intended user groups have different factual and linguistic competences, they require specific types of information. By identifying and analysing the users' factual and linguistic competences, user needs, use-situations and the stages involved in producing accounting texts in English as a foreign language, lexicographers will have a sound basis for designing the optimal English accounting dictionary for EFL text production. The monolingual accounting dictionary needs to include information about UK, US and international accounting terms, their grammatical properties, their potential for being combined with other words in collocations, phrases and sentences in order to meet user requirements. Data items...

  13. Monolingual Accounting Dictionaries for EFL Text Production

    DEFF Research Database (Denmark)

    Nielsen, Sandro

    2009-01-01

    Monolingual accounting dictionaries are important for producing financial reporting texts in English in an international setting, because of the lack of specialised bilingual dictionaries. As the intended user groups have different factual and linguistic competences, they require specific types of information. By identifying and analysing the users' factual and linguistic competences, user needs, use-situations and the stages involved in producing accounting texts in English as a foreign language, lexicographers will have a sound basis for designing the optimal English accounting dictionary for EFL text production. The monolingual accounting dictionary needs to include information about UK, US and international accounting terms, their grammatical properties, their potential for being combined with other words in collocations, phrases and sentences in order to meet user requirements. Data items...

  14. Text Classification Using Sentential Frequent Itemsets

    Institute of Scientific and Technical Information of China (English)

    Shi-Zhu Liu; He-Ping Hu

    2007-01-01

    Text classification techniques mostly rely on single-term analysis of the document data set, while many concepts, especially the specific ones, are usually conveyed by sets of terms. To achieve a more accurate text classifier, more informative features, including frequently co-occurring words in the same sentence and their weights, are particularly important in such scenarios. In this paper, we propose a novel approach to text classification using the sentential frequent itemset, a concept that comes from association rule mining. The approach views a sentence rather than a document as a transaction, and uses a variable precision rough set based method to evaluate each sentential frequent itemset's contribution to the classification. Experiments over the Reuters and newsgroup corpora are carried out, which validate the practicability of the proposed system.
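
    To make the "sentence as a transaction" idea concrete, here is a minimal sketch that counts word sets co-occurring within a sentence. It enumerates candidate itemsets by brute force instead of using an association-rule miner, and it omits the variable precision rough set weighting entirely; the function name, the `min_support` and `max_size` parameters and the sample text are assumptions for illustration, not the authors' system.

```python
import re
from collections import Counter
from itertools import combinations


def sentential_frequent_itemsets(text, min_support=2, max_size=2):
    """Map each word set to the number of sentences containing it,
    keeping only sets found in at least `min_support` sentences."""
    # Each sentence is one transaction: the set of its lowercase words.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    transactions = [set(re.findall(r"[a-z]+", s.lower())) for s in sentences]

    counts = Counter()
    for words in transactions:
        for size in range(1, max_size + 1):
            # Sorting gives every itemset one canonical tuple form.
            for itemset in combinations(sorted(words), size):
                counts[itemset] += 1
    return {itemset: n for itemset, n in counts.items() if n >= min_support}


doc = "Stocks fell sharply. Oil stocks fell again. Oil rose."
frequent = sentential_frequent_itemsets(doc)
for itemset, support in sorted(frequent.items()):
    print(itemset, support)
```

    Here the pair `('fell', 'stocks')` survives because both words share a sentence twice, whereas `('oil', 'stocks')` appears together only once; a document-level transaction would not make that distinction.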

  15. Tagging and Morphological Disambiguation of Turkish Text

    CERN Document Server

    Oflazer, K; Oflazer, Kemal; Kuruoz, Ilker

    1994-01-01

    Automatic text tagging is an important component in higher level analysis of text corpora, and its output can be used in many natural language processing applications. In languages like Turkish or Finnish, with agglutinative morphology, morphological disambiguation is a very crucial process in tagging, as the structures of many lexical forms are morphologically ambiguous. This paper describes a POS tagger for Turkish text based on a full-scale two-level specification of Turkish morphology that is based on a lexicon of about 24,000 root words. This is augmented with a multi-word and idiomatic construct recognizer, and most importantly morphological disambiguator based on local neighborhood constraints, heuristics and limited amount of statistical information. The tagger also has functionality for statistics compilation and fine tuning of the morphological analyzer, such as logging erroneous morphological parses, commonly used roots, etc. Preliminary results indicate that the tagger can tag about 98-99% of the...

  16. WYLBUR reference manual. [For interactive text editing]

    Energy Technology Data Exchange (ETDEWEB)

    Krupp, R.F.; Messina, P.C.; Peavler, J.M.; Schustack, S.; Starai, T.

    1977-04-01

    WYLBUR is a system for manipulating various kinds of text, such as computer programs, manuscripts, letters, forms, articles, or reports. Its on-line interactive text-editing capabilities allow the user to create, change, and correct text, and to search and display it. WYLBUR also has facilities for job submission and retrieval from remote terminals that make it possible for a user to inquire about the status of any job in the system, cancel jobs that are executing or awaiting execution, reroute output, raise job priority, or get information on the backlog of batch jobs. WYLBUR also has excellent recovery capabilities and a fast response time. This manual describes the WYLBUR version currently used at ANL. It is intended primarily as a reference manual; thus, examples of WYLBUR commands are kept to a minimum. (RWR)

  17. Urdu Text Classification using Majority Voting

    Directory of Open Access Journals (Sweden)

    Muhammad Usman

    2016-08-01

    Text classification is a tool for assigning predefined categories to text documents using supervised machine learning algorithms. It has various practical applications, such as spam detection, sentiment detection, and natural language detection. Based on this idea, we applied five well-known classification techniques to an Urdu language corpus and assigned a class to each document using majority voting. The corpus contains 21769 news documents in seven categories (Business, Entertainment, Culture, Health, Sports, and Weird). The algorithms were not able to work directly on the data, so we applied preprocessing techniques such as tokenization, stop-word removal and a rule-based stemmer. After preprocessing, 93400 features were extracted from the data for the machine learning algorithms. Furthermore, we achieved up to 94% precision and recall using majority voting.
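
    The voting step itself is simple to illustrate. The sketch below assumes each classifier has already produced one label per document; the three prediction lists are invented for the example (the abstract uses five classifiers and does not name them), and `majority_vote` is a hypothetical helper.

```python
from collections import Counter


def majority_vote(predictions):
    """Combine per-classifier label lists into one label per document.

    predictions: list of lists, one inner list per classifier, all of the
    same length (one label per document).
    """
    voted = []
    # zip(*predictions) groups the labels that every classifier gave
    # to the same document.
    for labels in zip(*predictions):
        label, _count = Counter(labels).most_common(1)[0]
        voted.append(label)
    return voted


# Hypothetical outputs of three classifiers on three documents.
clf_a = ["Sports", "Health", "Business"]
clf_b = ["Sports", "Business", "Business"]
clf_c = ["Health", "Health", "Culture"]

print(majority_vote([clf_a, clf_b, clf_c]))
```

    With an odd number of voters and only two candidate labels a tie cannot occur; with more labels it can, and this sketch makes no attempt at a principled tie-break.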

  18. Text mining patents for biomedical knowledge.

    Science.gov (United States)

    Rodriguez-Esteban, Raul; Bundschus, Markus

    2016-06-01

    Biomedical text mining of scientific knowledge bases, such as Medline, has received much attention in recent years. Given that text mining is able to automatically extract biomedical facts that revolve around entities such as genes, proteins, and drugs, from unstructured text sources, it is seen as a major enabler to foster biomedical research and drug discovery. In contrast to the biomedical literature, research into the mining of biomedical patents has not reached the same level of maturity. Here, we review existing work and highlight the associated technical challenges that emerge from automatically extracting facts from patents. We conclude by outlining potential future directions in this domain that could help drive biomedical research and drug discovery.

  19. Tenosynovitis caused by texting: an emerging disease.

    Science.gov (United States)

    Ashurst, John V; Turco, Domenic A; Lieb, Brian E

    2010-05-01

    De Quervain tenosynovitis is characterized by pain that overlies the radial aspect of the wrist and that is aggravated by ulnar deviation of the hand. The most common cause of de Quervain tenosynovitis is overuse of the thumb musculature. The authors report a case of bilateral de Quervain tenosynovitis observed in a woman aged 48 years at a rural outpatient primary care office. The condition was induced by the patient's excessive use of the text messaging feature on her cellular telephone. Treatment, including naproxen, cock-up wrist splints, and limitation of texting, resulted in complete recovery of the patient. The authors urge physicians to be aware of the potential association between a patient's tenosynovitis symptoms and excessive texting.

  20. Preprocessing and Morphological Analysis in Text Mining

    Directory of Open Access Journals (Sweden)

    Krishna Kumar Mohbey; Sachin Tiwari

    2011-12-01

    This paper concerns the preprocessing activities that are performed by software or language translators before mining algorithms are applied to large volumes of data. Text mining is an important area of data mining and plays a vital role in extracting useful information from a large database or data warehouse. Before the text mining or information extraction process is applied, however, preprocessing is a must, because the given data or dataset contains noisy, incomplete, inconsistent, dirty and unformatted data. In this paper we try to collect the necessary requirements for preprocessing. Once the preprocessing task is complete, knowledge can be extracted easily using a mining strategy. The paper also provides information about the analysis of data, such as tokenization and stemming, and semantic analysis, such as phrase recognition and parsing. Finally, it collects the procedures for preprocessing data, i.e. it describes how stemming, tokenization and parsing are applied.
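
    As a rough illustration of the preprocessing steps named above, the following sketch chains tokenization, stop-word removal and stemming. The stop-word list and the suffix-stripping rules are deliberately tiny assumptions made for the example; they are not taken from the paper and are far cruder than an established stemmer such as Porter's.

```python
import re

# Illustrative assumptions: a real pipeline would use much larger resources.
STOP_WORDS = {"the", "is", "a", "an", "of", "and", "are"}
SUFFIXES = ("ing", "ed", "s")   # checked in this order


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def stem(token):
    """Strip the first matching suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def preprocess(text):
    """Tokenize, drop stop words, then stem what remains."""
    return [stem(t) for t in tokenize(text) if t not in STOP_WORDS]


print(preprocess("The miners are mining noisy texts"))
```

    The output shows the usual weakness of naive suffix stripping: "miners" and "mining" reduce to different stems ("miner" and "min"), so related forms are not always conflated.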

  1. Text Mining the History of Medicine.

    Directory of Open Access Journals (Sweden)

    Paul Thompson

    Historical text archives constitute a rich and diverse source of information, which is becoming increasingly readily accessible, due to large-scale digitisation efforts. However, it can be difficult for researchers to explore and search such large volumes of data in an efficient manner. Text mining (TM) methods can help, through their ability to recognise various types of semantic information automatically, e.g., instances of concepts (places, medical conditions, drugs, etc.), synonyms/variant forms of concepts, and relationships holding between concepts (which drugs are used to treat which medical conditions, etc.). TM analysis allows search systems to incorporate functionality such as automatic suggestions of synonyms of user-entered query terms, exploration of different concepts mentioned within search results or isolation of documents in which concepts are related in specific ways. However, applying TM methods to historical text can be challenging, owing to differences and evolutions in vocabulary, terminology, language structure and style, compared to more modern text. In this article, we present our efforts to overcome the various challenges faced in the semantic analysis of published historical medical text dating back to the mid 19th century. Firstly, we used evidence from diverse historical medical documents from different periods to develop new resources that provide accounts of the multiple, evolving ways in which concepts, their variants and relationships amongst them may be expressed. These resources were employed to support the development of a modular processing pipeline of TM tools for the robust detection of semantic information in historical medical documents with varying characteristics. We applied the pipeline to two large-scale medical document archives covering wide temporal ranges as the basis for the development of a publicly accessible semantically-oriented search system. The novel resources are available for research

  2. Multilingual Topic Models for Unaligned Text

    CERN Document Server

    Boyd-Graber, Jordan

    2012-01-01

    We develop the multilingual topic model for unaligned text (MuTo), a probabilistic model of text that is designed to analyze corpora composed of documents in two languages. From these documents, MuTo uses stochastic EM to simultaneously discover both a matching between the languages and multilingual latent topics. We demonstrate that MuTo is able to find shared topics on real-world multilingual corpora, successfully pairing related documents across languages. MuTo provides a new framework for creating multilingual topic models without needing carefully curated parallel corpora and allows applications built using the topic model formalism to be applied to a much wider class of corpora.

  3. Spatial Text Visualization Using Automatic Typographic Maps.

    Science.gov (United States)

    Afzal, S; Maciejewski, R; Jang, Yun; Elmqvist, N; Ebert, D S

    2012-12-01

    We present a method for automatically building typographic maps that merge text and spatial data into a visual representation where text alone forms the graphical features. We further show how to use this approach to visualize spatial data such as traffic density, crime rate, or demographic data. The technique accepts a vector representation of a geographic map and spatializes the textual labels in the space onto polylines and polygons based on user-defined visual attributes and constraints. Our sample implementation runs as a Web service, spatializing shape files from the OpenStreetMap project into typographic maps for any region.

  4. There is a Text in 'The Balloon'

    DEFF Research Database (Denmark)

    Elias, Camelia

    2009-01-01

    From the Introduction: Camelia Elias' "There is a Text in 'The Balloon': Donald Barthelme's Allegorical Flights" provides its reader with a much-needed and useful distinction between fantasy and the fantastic: "whereas fantasy in critical discourse can be aligned with allegory, in which a supernatu...

  5. Presentation of the texts

    OpenAIRE

    Freitag, Michel

    2015-01-01

    The selected texts are not intended to reconstruct or survey a career, but to highlight the salient stages of a double emergence: that of Elizabeth Cady Stanton as a feminist and, with her, that of the women's rights movement in the United States. Such an objective therefore implies time limits before and after the founding event that was the Seneca Falls Convention of 1848, the origin of the no less foundational text of the Declaration of Sentiments r...

  6. A Sequential Algorithm for Training Text Classifiers

    CERN Document Server

    Lewis, D D; Lewis, David D.; Gale, William A.

    1994-01-01

    The ability to cheaply train text classifiers is critical to their use in information retrieval, content analysis, natural language processing, and other tasks involving data which is partly or fully textual. An algorithm for sequential sampling during machine learning of statistical classifiers was developed and tested on a newswire text categorization task. This method, which we call uncertainty sampling, reduced by as much as 500-fold the amount of training data that would have to be manually classified to achieve a given level of effectiveness.
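
    The selection rule behind uncertainty sampling can be sketched in a few lines. The example assumes a binary classifier that outputs an estimated probability of the positive class for every unlabeled document; the probabilities, the function name and the `batch_size` parameter are illustrative assumptions, and the surrounding loop (manual labeling of the selected items, then retraining) is left out.

```python
def uncertainty_sample(probabilities, batch_size=1):
    """Return indices of the unlabeled items the classifier is least sure of.

    probabilities[i] is the current classifier's estimate of P(positive)
    for unlabeled item i; values nearest 0.5 are the most uncertain.
    """
    ranked = sorted(
        range(len(probabilities)),
        key=lambda i: abs(probabilities[i] - 0.5),
    )
    return ranked[:batch_size]


# Hypothetical classifier outputs for five unlabeled documents.
probs = [0.95, 0.52, 0.10, 0.45, 0.80]
print(uncertainty_sample(probs, batch_size=2))
```

    Documents 1 and 3 are chosen for manual labeling because their scores sit closest to the decision boundary, while confidently classified documents such as 0 and 2 are skipped. Spending labeling effort only where the classifier is unsure is the intuition behind the reduction in training data that the abstract reports.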

  7. Text Classification: A Sequential Reading Approach

    CERN Document Server

    Dulac-Arnold, Gabriel; Gallinari, Patrick

    2011-01-01

    We propose to model the text classification process as a sequential decision process. In this process, an agent learns to classify documents into topics while reading the document sentences sequentially and learns to stop as soon as enough information has been read to decide. The proposed algorithm is based on modeling text classification as a Markov Decision Process and learns by using Reinforcement Learning. Experiments on four different classical mono-label corpora show that the proposed approach performs comparably to classical SVM approaches for large training sets, and better for small training sets. In addition, the model automatically adapts its reading process to the quantity of training information provided.

  8. The general principles of radiation protection and regulation; Les principes generaux de la radioprotection et la reglementation

    Energy Technology Data Exchange (ETDEWEB)

    Aurengo, A. [Societe Francaise de Radioprotection, 34 - Montpellier (France); Cesarini, J.P. [Societe Francaise de Radioprotection, Section Rayonnements non ionisants, 75 - Paris (France); Lecomte, J.F.; Barbier, G.; Crescini, D.; Biau, A. [CEA Fontenay aux Roses, Institut de Radioprotection et de Surete Nucleaire IRSN, 92 (France); Blain, A. [FRAMATOME, Dir. Combustible Nucleaire, Dept. Radioprotection Securite, 69 - Lyon (France); Bailloeuil, C.; Gonin, M. [Electricite de France, EDF-SCAST, 75 - Paris (France); Bergot, D. [Ministere des Affaires Sociales, du Travail et de la Solidarite, Dir. des Relations du Travail, 75 - Paris (France)

    2003-07-01

    Seven articles make up this chapter on radiation protection and regulation: radiological risk; reduction of public exposure to ultraviolet radiation; regulation of radon; evolution of the French legislation against the dangers of ionizing radiation; medical follow-up after working life; the information system to reproduce the dosimetric data of workers; and a proposed scale for classifying radiation incidents according to their seriousness. (N.C.)

  9. Calculating emissions into the air. General methodological principles; Calcul des emissions dans l'air. Principes methodologiques generaux

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    2000-05-01

    Knowing the quantities of certain substances discharged into the atmosphere is a necessary and fundamental stage in any environmental protection policy to tackle today's problems such as acid rain, the degradation of air quality, global warming and climate change, the depletion of the ozone layer, etc. This quantification, usually known as an 'emission inventory', is built on a set of specific rules which may vary from one inventory to another. This state of affairs presents the enormous disadvantage that the data available are not comparable. At the international level, an attempt at harmonization has been going on for some years between the various international bodies. This work is being pursued in parallel with the improvement of methodologies to estimate discharges from various types of source. To take account of changes in specifications and of improvements in our understanding of phenomena giving rise to atmospheric pollution, the results of inventories of emissions need to be regularly revised, even retrospectively, to maintain a consistent series. CITEPA, which acts as a National Reference Centre, has developed a system of inventories as part of the CORALIE programme with financial help from the French Ministry for Planning and the Environment. (author)

  10. Putting Text Complexity in Context: Refocusing on Comprehension of Complex Text

    Science.gov (United States)

    Valencia, Sheila W.; Wixson, Karen K.; Pearson, P. David

    2014-01-01

    The Common Core State Standards for English Language Arts have prompted enormous attention to issues of text complexity. The purpose of this article is to put text complexity in perspective by moving from a primary focus on the text itself to a focus on the comprehension of complex text. We argue that a focus on comprehension is at the heart of…

  11. OMG! Texting in Class = U Fail :( Empirical Evidence That Text Messaging During Class Disrupts Comprehension

    Science.gov (United States)

    Gingerich, Amanda C.; Lineweaver, Tara T.

    2014-01-01

    In two experiments, we examined the effects of text messaging during lecture on comprehension of lecture material. Students (in Experiment 1) and randomly assigned participants (in Experiment 2) in a text message condition texted a prescribed conversation while listening to a brief lecture. Students and participants in the no-text condition…

  12. Exploring the Effect of Background Knowledge and Text Cohesion on Learning from Texts in Computer Science

    Science.gov (United States)

    Gasparinatou, Alexandra; Grigoriadou, Maria

    2013-01-01

    In this study, we examine the effect of background knowledge and local cohesion on learning from texts. The study is based on the construction-integration model. Participants were 176 undergraduate students who read a Computer Science text. Half of the participants read a text of maximum local cohesion and the other half a text of minimum local cohesion.…

  13. Evaluating Text-to-Speech Synthesizers

    Science.gov (United States)

    Cardoso, Walcir; Smith, George; Fuentes, Cesar Garcia

    2015-01-01

    Text-To-Speech (TTS) synthesizers have piqued the interest of researchers for their potential to enhance the L2 acquisition of writing (Kirstein, 2006), vocabulary and reading (Proctor, Dalton, & Grisham, 2007) and pronunciation (Cardoso, Collins, & White, 2012; Soler-Urzua, 2011). Despite their proven effectiveness, there is a need for…

  14. Fieldwork, Heritage and Engaging Landscape Texts

    Science.gov (United States)

    Mains, Susan P.

    2014-01-01

    This paper outlines and analyses efforts to critically engage with "heritage" through the development of, and responses to, a series of undergraduate residential fieldwork trips held in the North Coast of Jamaica. The ways in which we read heritage through varied "texts"--specifically, material landscapes, guided heritage tours,…

  15. Digital Texts and the New Literacies

    Science.gov (United States)

    Webb, Allen

    2007-01-01

    When the literature anthologies did not arrive, Allen Webb turned to the Internet, where he found a wealth of classic and contemporary e-texts. Using these online resources opened up possibilities for new ways of teaching and learning traditional skills of close reading and critical analysis. Students created blogs of poems and commentary,…

  16. Electric Circuit Theory--Computer Illustrated Text.

    Science.gov (United States)

    Riches, Brian

    1990-01-01

    Discusses the use of a computer-illustrated text (CIT) with integrated software to teach electric circuit theory to college students. Examples of software use are given, including simple animation, graphical displays, and problem-solving programs. Issues affecting electric circuit theory instruction are also addressed, including mathematical…

  17. Selecting Full-Text Undergraduate Periodicals Databases.

    Science.gov (United States)

    Still, Julie M.; Kassabian, Vibiana

    1999-01-01

    Examines how libraries and librarians can compare full-text general periodical indices, using ProQuest Direct, Periodical Abstracts (via Ovid), and EBSCOhost as examples. Explores breadth and depth of coverage; manipulation of results (email/download/print); ease of use (searching); and indexing quirks. (AEF)

  18. A Graphophonic Investigation of Beginning Level Texts

    Science.gov (United States)

    Walker, Kevin Clark

    2010-01-01

    This study attempted to provide a systematic framework for phonics instruction for beginning readers in literature-based classrooms based on relative frequency of phoneme-grapheme occurrences found in three distinct corpora. The first corpus contained an academic word list. The second corpus contained the running text from 363 books identified as…

  19. "The Politics of Location": Text as Opposition.

    Science.gov (United States)

    Moreno, Renee

    Eduardo Galeano's "Memory of Fire: Genesis" raises a number of questions concerning the "politics of location," a term that may be defined as the intersections, tensions, and complications that people of color bring to space and what space means in terms of hierarchies and power, racial and gender stratifications. Text can also…

  20. Improving text recall with multiple summaries

    NARCIS (Netherlands)

    Meij, van der Hans; Meij, van der Jan

    2012-01-01

    Background. QuikScan (QS) is an innovative design that aims to improve accessibility, comprehensibility, and subsequent recall of expository text by means of frequent within-document summaries that are formatted as numbered list items. The numbers in the QS summaries correspond to numbers placed in

  1. Texts, Languages & Information Technology in Egyptology. Introduction

    OpenAIRE

    2013-01-01

    A short introduction to the volume "Texts, Languages & Information Technology in Egyptology. Selected papers from the meeting of the Computer Working Group of the International Association of Egyptologists (Informatique & Égyptologie), Liège, 6-8 July 2010".

  2. Examining Response Confidence in Multiple Text Tasks

    Science.gov (United States)

    List, Alexandra; Alexander, Patricia A.

    2015-01-01

    Students' confidence in their responses to a multiple text-processing task and their justifications for those confidence ratings were investigated. Specifically, 215 undergraduates responded to two academic questions, differing by type (i.e., discrete and open-ended) and by domain (i.e., developmental psychology and astrophysics), using a digital…

  3. Full Text Journal Subscriptions: An Evolutionary Process.

    Science.gov (United States)

    Luther, Judy

    1997-01-01

    Provides an overview of companies offering Web accessible subscriptions to full text electronic versions of scientific, technical, and medical journals (Academic Press, Blackwell, EBSCO, Elsevier, Highwire Press, Information Quest, Institute of Physics, Johns Hopkins University Press, OCLC, OVID, Springer, and SWETS). Also lists guidelines for…

  4. First Look: VU/TEXT Databases.

    Science.gov (United States)

    Willmann, Donna

    1985-01-01

    Profiles of online services provided by VU/TEXT, which maintains market access to electronic newspaper databases, highlights scope (newspapers, business information, wire services and nonnewspaper regional information, encyclopedia); search techniques; strengths; and upcoming enhancements. Descriptions of 17 databases and sample searches are…

  5. Understanding Curriculum as a Racial Text.

    Science.gov (United States)

    Pinar, William F.

    1991-01-01

    Discusses curriculum as a racial text, focusing on European Americans as a major part of the racial dilemma. The Eurocentric curriculum denies nonwhite students role models and denies white students self-understanding. African Americans' presence informs every element of U.S. life, and the absence of African-American knowledge in curriculum…

  6. CONAN : Text Mining in the Biomedical Domain

    NARCIS (Netherlands)

    Malik, R.

    2006-01-01

    This thesis is about text mining: extracting important information from literature. In recent years, the number of biomedical articles and journals has been growing exponentially. Scientists might not find the information they want because of the large number of publications. Therefore a system was cons

  7. Automatic Syntactic Analysis of Free Text.

    Science.gov (United States)

    Schwarz, Christoph

    1990-01-01

    Discusses problems encountered with the syntactic analysis of free text documents in indexing. Postcoordination and precoordination of terms are discussed, an automatic indexing system called COPSY (context operator syntax) that uses natural language processing techniques is described, and future developments are explained. (60 references) (LRW)

  8. Project Physics Text 4, Light and Electromagnetism.

    Science.gov (United States)

    Harvard Univ., Cambridge, MA. Harvard Project Physics.

    Optical and electromagnetic fundamentals are presented in this fourth unit of the Project Physics text for use by senior high students. Development of the wave theory in the first half of the 19th Century is described to deal with optical problems at the early stage. Following explanations of electric charges and forces, field concepts are…

  9. Polarity Analysis of Texts using Discourse Structure

    NARCIS (Netherlands)

    Heerschop, Bas; Goosen, Frank; Hogenboom, Alexander; Frasincar, Flavius; Kaymak, Uzay; Jong, de Franciska

    2011-01-01

    Sentiment analysis has applications in many areas and the exploration of its potential has only just begun. We propose Pathos, a framework which performs document sentiment analysis (partly) based on a document’s discourse structure. We hypothesize that by splitting a text into important and less im

  10. Assessing Assessment Texts: Where Is Planning?

    Science.gov (United States)

    Fives, Helenrose; Barnes, Nicole; Dacey, Charity; Gillis, Anna

    2016-01-01

    We conducted a content analysis of 27 assessment textbooks to determine how assessment planning was framed in texts for preservice teachers. We identified eight assessment planning themes: alignment, assessment purpose and types, reliability and validity, writing goals and objectives, planning specific assessments, unpacking, overall assessment…

  11. Leveled Reading and Engagement with Complex Texts

    Science.gov (United States)

    Hastings, Kathryn

    2016-01-01

    The benefits of engaging with age-appropriate reading materials in classroom settings are numerous. For example, students' comprehension is developed as they acquire new vocabulary and concepts. The Common Core requires all students have daily opportunities to engage with "complex text" regardless of students' decoding levels. However,…

  12. Elementary Functions, Student's Text, Unit 21.

    Science.gov (United States)

    Allen, Frank B.; And Others

    Unit 21 in the SMSG secondary school mathematics series is a student text covering the following topics in elementary functions: functions, polynomial functions, tangents to graphs of polynomial functions, exponential and logarithmic functions, and circular functions. Appendices discuss set notation, mathematical induction, significance of…

  13. Texts and Literacies of the Shi Jinrui

    Science.gov (United States)

    Carrington, Victoria

    2004-01-01

    In post-industrial societies saturated with the multimodal texts of consumer culture--film, computer games, interactive toys, SMS, email, the internet, television, DVDs--young people are developing literacy skills and knowledge in and for a world significantly changed from that of their parents and educators. Given this context, this paper seeks…

  14. Texts, Transmissions, Receptions. Modern Approaches to Narratives

    NARCIS (Netherlands)

    Lardinois, A.P.M.H.; Levie, S.A.; Hoeken, H.; Lüthy, C.H.

    2015-01-01

    The papers collected in this volume study the function and meaning of narrative texts from a variety of perspectives. The word “text” is used here in the broadest sense of the term: it denotes literary books, but also oral tales, speeches, newspaper articles and comics. One of the purposes of this v

  15. A Scheme for Text Analysis Using Fortran.

    Science.gov (United States)

    Koether, Mary E.; Coke, Esther U.

    Using string-manipulation algorithms, FORTRAN computer programs were designed for analysis of written material. The programs measure length of a text and its complexity in terms of the average length of words and sentences, map the occurrences of keywords or phrases, calculate word frequency distribution and certain indicators of style. Trials of…
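
    The measures named in this abstract (text length, average word and sentence length, keyword occurrences, word frequency distribution) can be illustrated with a short sketch. It is written in Python rather than FORTRAN, it is not the authors' program, and the function names and the crude tokenization rules (sentences end at ".", "!" or "?"; words are runs of letters and apostrophes) are assumptions made for illustration only.

```python
import re
from collections import Counter


def tokenize(text: str) -> list:
    # Assumption: a word is a run of letters or apostrophes, case-folded.
    return re.findall(r"[A-Za-z']+", text.lower())


def text_statistics(text: str) -> dict:
    # Assumption: sentences end at '.', '!' or '?'.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = tokenize(text)
    return {
        "word_count": len(words),
        "avg_word_length": sum(len(w) for w in words) / len(words) if words else 0.0,
        "avg_sentence_length": len(words) / len(sentences) if sentences else 0.0,
        "word_frequencies": Counter(words),
    }


def keyword_positions(text: str, keyword: str) -> list:
    # Map the occurrences of a keyword as word offsets into the text.
    return [i for i, w in enumerate(tokenize(text)) if w == keyword.lower()]
```

    Calling text_statistics on a passage returns the counts in one dictionary, and keyword_positions gives the word offsets at which a chosen keyword occurs.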

  16. The Challenges of Qualitatively Coding Ancient Texts

    Science.gov (United States)

    Slingerland, Edward; Chudek, Maciej

    2012-01-01

    We respond to several important and valid concerns about our study ("The Prevalence of Folk Dualism in Early China," "Cognitive Science" 35: 997-1007) by Klein and Klein, defending our interpretation of our data. We also argue that, despite the undeniable challenges involved in qualitatively coding texts from ancient cultures, the standard tools…

  17. Mining Texts in Reading to Write.

    Science.gov (United States)

    Greene, Stuart

    1992-01-01

    Proposes a set of strategies for connecting reading and writing, placing the discussion in the context of other pedagogical approaches designed to exploit the relationship between reading and writing. Explores ways in which students employ the strategies involved in "mining" a text--reconstructing context, inferring or imposing structure, and…

  18. Automatic Induction of Rule Based Text Categorization

    Directory of Open Access Journals (Sweden)

    D.Maghesh Kumar

    2010-12-01

    Full Text Available The automated categorization of texts into predefined categories has witnessed a booming interest in the last 10 years, due to the increased availability of documents in digital form and the ensuing need to organize them. In the research community the dominant approach to this problem is based on machine learning techniques: a general inductive process automatically builds a classifier by learning, from a set of preclassified documents, the characteristics of the categories. This paper describes a novel method for the automatic induction of rule-based text classifiers. This method supports a hypothesis language of the form "if T1, … or Tn occurs in document d, and none of Tn+1, … Tn+m occurs in d, then classify d under category c," where each Ti is a conjunction of terms. This survey discusses the main approaches to text categorization that fall within the machine learning paradigm. Issues pertaining to three different problems, namely document representation, classifier construction, and classifier evaluation, are discussed in detail.
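
    The hypothesis language quoted in this abstract can be made concrete with a small sketch. It only shows how one such rule would be applied to a document; it does not show how rules are induced, and the Rule class, the whitespace tokenization and the example terms are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    category: str
    positive: tuple  # T1..Tn: each a frozenset of terms that must all occur
    negative: tuple  # Tn+1..Tn+m: conjunctions that must not occur


def conjunction_occurs(conjunction: frozenset, doc_terms: set) -> bool:
    # A conjunction of terms occurs if every one of its terms is in the document.
    return conjunction <= doc_terms


def applies(rule: Rule, document: str) -> bool:
    # Assumption: terms are lower-cased, whitespace-separated tokens.
    terms = set(document.lower().split())
    any_positive = any(conjunction_occurs(t, terms) for t in rule.positive)
    any_negative = any(conjunction_occurs(t, terms) for t in rule.negative)
    return any_positive and not any_negative


# Hypothetical rule: classify under "grain" if "wheat" occurs, or both
# "corn" and "harvest" occur, and "stock" and "market" do not both occur.
grain_rule = Rule(
    category="grain",
    positive=(frozenset({"wheat"}), frozenset({"corn", "harvest"})),
    negative=(frozenset({"stock", "market"}),),
)
```

    With this rule, applies(grain_rule, "wheat prices rose") holds, while a document containing "wheat", "stock" and "market" is rejected by the negative conjunction.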

  19. Investigating Text Input Methods for Mobile Phones

    Directory of Open Access Journals (Sweden)

    Barry O’Riordan

    2005-01-01

    Full Text Available Human Computer Interaction is a primary factor in the success or failure of any device but if an objective view is taken of the current mobile phone market you would be forgiven for thinking usability was secondary to aesthetics. Many phone manufacturers modify the design of phones to be different than the competition and to target fashion trends, usually at the expense of usability and performance. There is a lack of awareness among many buyers of the usability of the device they are purchasing and the disposability of modern technology is an effect rather than a cause of this. Designing new text entry methods for mobile devices can be expensive and labour-intensive. The assessment and comparison of a new text entry method with current methods is a necessary part of the design process. The best way to do this is through an empirical evaluation. The aim of the study was to establish which mobile phone text input method best suits the requirements of a select group of target users. This study used a diverse range of users to compare devices that are in everyday use by most of the adult population. The proliferation of the devices is as yet unmatched by the study of their application and the consideration of their user friendliness.

  20. COMPENDEX/TEXT-PAC: RETROSPECTIVE SEARCH.

    Science.gov (United States)

    Standera, Oldrich

    The Text-Pac System is capable of generating indexes and bulletins to provide a current information service without the selectivity feature. Indexes of the accumulated data base may also be used as a basis for manual retrospective searching. The manual search involves searching computer-prepared indexes from a machine readable data base produced…

  1. The Cultural Content of Business Spanish Texts.

    Science.gov (United States)

    Grosse, Christine Uber; Uber, David

    A study examined eight business Spanish textbooks for cultural content by looking at commonly appearing cultural topics and themes, presentation of cultural information, activities and techniques used to promote cultural understanding, and incorporation of authentic materials. The texts were evenly divided among beginning, intermediate, and…

  2. Modified Approach to Transform Arc From Text to Linear Form Text : A Preprocessing Stage for OCR

    Directory of Open Access Journals (Sweden)

    Vijayashree C S

    2014-08-01

    Full Text Available Arc-form-text is an artistic text that is quite common in documents such as certificates, advertisements and history documents. OCRs fail to read such arc-form-text, so it must be transformed to linear-form-text at the preprocessing stage. In this paper, we present a modification to an existing transformation model for better readability by OCRs. The method takes the segmented arc-form-text as input. Initially two concentric ellipses are approximated to enclose the arc-form-text, and the modified transformation model then transforms the text from arc form to linear form. The proposed method is implemented on several upper semi-circular arc-form-text inputs and the readability of the transformed text is analyzed with an OCR.

  3. Making School Development Credible. Text, Context, Irony

    Directory of Open Access Journals (Sweden)

    Mats Börjesson

    2012-01-01

    Full Text Available The article argues for the importance of an open, reflexive-methodological approach when switching between studying text, context and researcher activity. Close linguistic analysis can benefit from being linked with the researcher’s contextualisation of his empirical material as well as with more distanced readings. The more specific starting point for this article is that school development, like other similar terms such as school improvement and the like, makes use of linguistic building blocks with which whole narratives about today’s and tomorrow’s schools can be constructed. The subject of the study is a short text issued by the Swedish Schools Inspectorate (Skolinspektionen). Government language changes according to the authorities’ role in society and their own definitions of their functions, and an important aspect here is the legitimacy of the authorities’ texts. By means of various kinds of close linguistic analysis, the above-mentioned text is studied with regard to choice of categories, hierarchies of modalisation and the rhetorical effects of different types of formulations in a broader political-social landscape. The article concludes with a reflective discussion on the relationship between government language and irony as a stylistic device – a device that is based on the results of the close empirical analysis.[i]



    [i] The article is part of the project “School Development as Narrative”, funded by the Swedish Research Council. The author would like to thank the two reviewers for very valuable comments.

  4. Text Mining the History of Medicine.

    Science.gov (United States)

    Thompson, Paul; Batista-Navarro, Riza Theresa; Kontonatsios, Georgios; Carter, Jacob; Toon, Elizabeth; McNaught, John; Timmermann, Carsten; Worboys, Michael; Ananiadou, Sophia

    2016-01-01

    Historical text archives constitute a rich and diverse source of information, which is becoming increasingly readily accessible, due to large-scale digitisation efforts. However, it can be difficult for researchers to explore and search such large volumes of data in an efficient manner. Text mining (TM) methods can help, through their ability to recognise various types of semantic information automatically, e.g., instances of concepts (places, medical conditions, drugs, etc.), synonyms/variant forms of concepts, and relationships holding between concepts (which drugs are used to treat which medical conditions, etc.). TM analysis allows search systems to incorporate functionality such as automatic suggestions of synonyms of user-entered query terms, exploration of different concepts mentioned within search results or isolation of documents in which concepts are related in specific ways. However, applying TM methods to historical text can be challenging, owing to differences and evolutions in vocabulary, terminology, language structure and style, compared to more modern text. In this article, we present our efforts to overcome the various challenges faced in the semantic analysis of published historical medical text dating back to the mid 19th century. Firstly, we used evidence from diverse historical medical documents from different periods to develop new resources that provide accounts of the multiple, evolving ways in which concepts, their variants and relationships amongst them may be expressed. These resources were employed to support the development of a modular processing pipeline of TM tools for the robust detection of semantic information in historical medical documents with varying characteristics. We applied the pipeline to two large-scale medical document archives covering wide temporal ranges as the basis for the development of a publicly accessible semantically-oriented search system. The novel resources are available for research purposes, while

  5. Basic Test Framework for the Evaluation of Text Line Segmentation and Text Parameter Extraction

    OpenAIRE

    Darko Brodić; Milivojević, Dragan R.; Zoran Milivojević

    2010-01-01

    Text line segmentation is an essential stage in off-line optical character recognition (OCR) systems. It is a key because inaccurately segmented text lines will lead to OCR failure. Text line segmentation of handwritten documents is a complex and diverse problem, complicated by the nature of handwriting. Hence, text line segmentation is a leading challenge in handwritten document image processing. Due to inconsistencies in measurement and evaluation of text segmentation algorithm quality, som...

  6. Text Character Extraction Implementation from Captured Handwritten Image to Text Conversion using Template Matching Technique

    Directory of Open Access Journals (Sweden)

    Barate Seema

    2016-01-01

    Full Text Available Images contain various types of useful information that should be extracted whenever required. Various algorithms and methods have been proposed to extract text from a given image so that the user can access the text in any image. Variations in text size, style, orientation and alignment, together with low image contrast and composite backgrounds, make text extraction difficult. An application that extracts and recognizes such text accurately in real time can be applied to many important tasks, such as document analysis, vehicle license plate extraction and text-based image indexing, and many such applications have become realities in recent years. To overcome the above problems, we develop an application that converts an image into text using algorithms such as bounding box, HSV model, blob analysis, template matching and template generation.

  7. Automated assessment of medical training evaluation text.

    Science.gov (United States)

    Zhang, Rui; Pakhomov, Serguei; Gladding, Sophia; Aylward, Michael; Borman-Shoap, Emily; Melton, Genevieve B

    2012-01-01

    Medical post-graduate residency training and medical student training increasingly utilize electronic systems to evaluate trainee performance based on defined training competencies with quantitative and qualitative data, the latter of which typically consists of text comments. Medical education is concomitantly becoming a growing area of clinical research. While electronic systems have proliferated in number, little work has been done to help manage and analyze qualitative data from these evaluations. We explored the use of text-mining techniques to assist medical education researchers in sentiment analysis and topic analysis of residency evaluations with a sample of 812 evaluation statements. While comments were predominantly positive, sentiment analysis improved the ability to discriminate statements with 93% accuracy. Similar to other domains, Latent Dirichlet Allocation and Information Gain revealed groups of core subjects and appear to be useful for identifying topics from these data.

  8. Can An Evolutionary Process Create English Text?

    Energy Technology Data Exchange (ETDEWEB)

    Bailey, David H.

    2008-10-29

    Critics of the conventional theory of biological evolution have asserted that while natural processes might result in some limited diversity, nothing fundamentally new can arise from 'random' evolution. In response, biologists such as Richard Dawkins have demonstrated that a computer program can generate a specific short phrase via evolution-like iterations starting with random gibberish. While such demonstrations are intriguing, they are flawed in that they have a fixed, pre-specified future target, whereas in real biological evolution there is no fixed future target, but only a complicated 'fitness landscape'. In this study, a significantly more sophisticated evolutionary scheme is employed to produce text segments reminiscent of a Charles Dickens novel. The aggregate size of these segments is larger than the computer program and the input Dickens text, even when comparing compressed data (as a measure of information content).
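
    The fixed-target demonstration that this abstract criticizes can be sketched in a few lines. This is an illustration of the Dawkins-style "weasel" idea only, not the more sophisticated scheme used in the study; the offspring count, mutation rate, alphabet and random seed are arbitrary assumptions.

```python
import random
import string

ALPHABET = string.ascii_uppercase + " "


def fitness(candidate: str, target: str) -> int:
    # Number of positions that already match the fixed, pre-specified target.
    return sum(a == b for a, b in zip(candidate, target))


def mutate(parent: str, rate: float, rng: random.Random) -> str:
    # Each character is independently replaced with probability `rate`.
    return "".join(
        rng.choice(ALPHABET) if rng.random() < rate else ch for ch in parent
    )


def evolve(target: str, offspring: int = 100, rate: float = 0.05,
           seed: int = 0, max_generations: int = 10000):
    rng = random.Random(seed)
    # Start from random gibberish of the same length as the target.
    parent = "".join(rng.choice(ALPHABET) for _ in target)
    for generation in range(max_generations):
        if parent == target:
            return parent, generation
        children = [mutate(parent, rate, rng) for _ in range(offspring)]
        # Keep the fittest string; the parent competes, so fitness never drops.
        parent = max(children + [parent], key=lambda c: fitness(c, target))
    return parent, max_generations
```

    Because fitness is measured against the target string itself, the search is steered toward a known answer, which is the flaw the abstract points out.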

  9. Online Visual Analytics of Text Streams.

    Science.gov (United States)

    Liu, Shixia; Yin, Jialun; Wang, Xiting; Cui, Weiwei; Cao, Kelei; Pei, Jian

    2016-11-01

    We present an online visual analytics approach to helping users explore and understand hierarchical topic evolution in high-volume text streams. The key idea behind this approach is to identify representative topics in incoming documents and align them with the existing representative topics that they immediately follow (in time). To this end, we learn a set of streaming tree cuts from topic trees based on user-selected focus nodes. A dynamic Bayesian network model has been developed to derive the tree cuts in the incoming topic trees to balance the fitness of each tree cut and the smoothness between adjacent tree cuts. By connecting the corresponding topics at different times, we are able to provide an overview of the evolving hierarchical topics. A sedimentation-based visualization has been designed to enable the interactive analysis of streaming text data from global patterns to local details. We evaluated our method on real-world datasets and the results are generally favorable.

  10. Segmentation of Ancient Telugu Text Documents

    Directory of Open Access Journals (Sweden)

    Srinivasa Rao A.V

    2012-07-01

    Full Text Available OCR of ancient document images remains a challenging task to date. The scanning process itself introduces deformation of document images. Cleaning these document images results in information loss. Segmentation contributes an invariance process in OCR. Complex scripts, like derivatives of Brahmi, encounter many problems in the segmentation process. Segmentation of meaningful units (instead of isolated patterns) revealed interesting trends. A technique for segmenting ancient Telugu document images into meaningful units is proposed. The topological features of the meaningful units within the script line are adopted as a basis for segmenting the text line. The horizontal profile pattern is convolved with a Gaussian kernel. The statistical properties of meaningful units are explored by extensively analyzing the geometrical patterns of the meaningful unit. The efficiency of the proposed segmentation algorithm is found to be 73.5% for uncleaned document images.
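
    The one step this abstract states explicitly, convolving the horizontal profile with a Gaussian kernel, can be sketched as follows. The binary-image representation (1 for ink, 0 for background), the edge clamping, the kernel parameters and the use of valleys in the smoothed profile as candidate line separators are all assumptions; the paper's actual segmentation into meaningful units is not reproduced here.

```python
import math


def horizontal_profile(image: list) -> list:
    # Ink count per row of a binary image (1 = ink, 0 = background).
    return [sum(row) for row in image]


def gaussian_kernel(sigma: float, radius: int) -> list:
    weights = [math.exp(-(x * x) / (2 * sigma * sigma))
               for x in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]  # normalized to sum to 1


def smooth(profile: list, kernel: list) -> list:
    # 1-D convolution; indices past the ends are clamped to the border rows.
    radius = len(kernel) // 2
    last = len(profile) - 1
    return [
        sum(w * profile[min(max(i + k - radius, 0), last)]
            for k, w in enumerate(kernel))
        for i in range(len(profile))
    ]


def valleys(smoothed: list) -> list:
    # Local minima of the smoothed profile: candidate gaps between text lines.
    return [i for i in range(1, len(smoothed) - 1)
            if smoothed[i] < smoothed[i - 1] and smoothed[i] <= smoothed[i + 1]]
```

    Smoothing suppresses small dips inside a text line, so that only the broader gaps between lines remain as minima.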

  11. Ordinary differential equations a graduate text

    CERN Document Server

    Bhamra, K S

    2015-01-01

    ORDINARY DIFFERENTIAL EQUATIONS: A Graduate Text presents a systematic and comprehensive introduction to ODEs for graduate and postgraduate students. Differential inequalities, Gronwall's inequality, Nagumo's theorems, Osgood's criteria and applications of differential equations of first order are treated systematically and in greater depth. The book discusses qualitative and quantitative aspects of Sturm-Liouville problems, Green's function, integral equations and the Laplace transform, and is supported by a number of worked-out examples in each lesson to make the concepts clear. Considerable stress is laid on stability theory, especially Lyapunov and Poincaré stability theory. Numerous figures in various lessons (in particular lessons dealing with stability theory) have been added to clarify the key concepts in DE theory. Nonlinear oscillation in conservative systems and Hamiltonian systems highlights basic nature of the systems considered. Perturbation techniques lesson deals in fairly d...

  12. Generalized entropies and the similarity of texts

    Science.gov (United States)

    Altmann, Eduardo G.; Dias, Laércio; Gerlach, Martin

    2017-01-01

    We show how generalized Gibbs–Shannon entropies can provide new insights on the statistical properties of texts. The universal distribution of word frequencies (Zipf’s law) implies that the generalized entropies, computed at the word level, are dominated by words in a specific range of frequencies. Here we show that this is the case not only for the generalized entropies but also for the generalized (Jensen–Shannon) divergences, used to compute the similarity between different texts. This finding allows us to identify the contribution of specific words (and word frequencies) for the different generalized entropies and also to estimate the size of the databases needed to obtain a reliable estimation of the divergences. We test our results in large databases of books (from the Google n-gram database) and scientific papers (indexed by Web of Science).
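
    A word-level generalized entropy and the corresponding Jensen–Shannon-type divergence can be sketched as below. The abstract does not give the formula, so the order-alpha form H_alpha(p) = (sum_i p_i^alpha - 1) / (1 - alpha), which reduces to the Gibbs–Shannon entropy as alpha approaches 1, is an assumption, as are the function names and the whitespace tokenization.

```python
import math
from collections import Counter


def word_distribution(text: str) -> dict:
    # Relative word frequencies; assumes whitespace-separated, case-folded words.
    counts = Counter(text.lower().split())
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()}


def generalized_entropy(p: dict, alpha: float) -> float:
    # Assumed form: H_alpha(p) = (sum_i p_i**alpha - 1) / (1 - alpha);
    # alpha = 1 is handled as the Shannon limit, -sum_i p_i * ln(p_i).
    if alpha == 1.0:
        return -sum(x * math.log(x) for x in p.values() if x > 0)
    return (sum(x ** alpha for x in p.values() if x > 0) - 1.0) / (1.0 - alpha)


def generalized_jsd(p: dict, q: dict, alpha: float) -> float:
    # Entropy of the equal-weight mixture minus the mean entropy of p and q.
    vocab = set(p) | set(q)
    mix = {w: 0.5 * (p.get(w, 0.0) + q.get(w, 0.0)) for w in vocab}
    return (generalized_entropy(mix, alpha)
            - 0.5 * generalized_entropy(p, alpha)
            - 0.5 * generalized_entropy(q, alpha))
```

    Each word contributes one term to every sum, which is what makes it possible to ask which frequency range dominates the result for a given alpha.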

  13. Extraction of information from unstructured text

    Energy Technology Data Exchange (ETDEWEB)

    Irwin, N.H.; DeLand, S.M.; Crowder, S.V.

    1995-11-01

    Extracting information from unstructured text has become an emphasis in recent years due to the large amount of text now electronically available. This status report describes the findings and work done by the end of the first year of a two-year LDRD. Requirements of the approach included that it model the information in a domain independent way. This means that it would differ from current systems by not relying on previously built domain knowledge and that it would do more than keyword identification. Three areas that are discussed and expected to contribute to a solution include (1) identifying key entities through document level profiling and preprocessing, (2) identifying relationships between entities through sentence level syntax, and (3) combining the first two with semantic knowledge about the terms.

  14. Generalized Entropies and the Similarity of Texts

    CERN Document Server

    Altmann, Eduardo G; Gerlach, Martin

    2016-01-01

    We show how generalized Gibbs-Shannon entropies can provide new insights on the statistical properties of texts. The universal distribution of word frequencies (Zipf's law) implies that the generalized entropies, computed at the word level, are dominated by words in a specific range of frequencies. Here we show that this is the case not only for the generalized entropies but also for the generalized (Jensen-Shannon) divergences, used to compute the similarity between different texts. This finding allows us to identify the contribution of specific words (and word frequencies) for the different generalized entropies and also to estimate the size of the databases needed to obtain a reliable estimation of the divergences. We test our results in large databases of books (from the Google n-gram database) and scientific papers (indexed by Web of Science).

  15. Cell Phoning and Texting While Driving

    Directory of Open Access Journals (Sweden)

    Judy Honoria Rosaire Telemaque

    2015-07-01

    Full Text Available A qualitative phenomenological study was conducted on the consequences of cell phone use while operating a vehicle. We discussed why talking and texting on cell phones are so popular through the analysis of our interviews with police officers, driving instructors, and parents of teens and young adults. The participants came from central, northeastern, northwestern, and southeastern Connecticut. All had exposure to the effects of the cell phone usage problem. The study reached a point of theoretical saturation or redundancy at which the analysis no longer resulted in new themes. We concluded that the discoveries revealed the necessity for education, expansion of technology, and additional driver education preparation, which may provide a path for leadership to help solve the problem.

  16. Bimodal Emotion Recognition from Speech and Text

    Directory of Open Access Journals (Sweden)

    Weilin Ye

    2014-01-01

    Full Text Available This paper presents an approach to emotion recognition from speech signals and textual content. In the analysis of speech signals, thirty-seven acoustic features are extracted from the speech input. Two different classifiers, Support Vector Machines (SVMs) and a BP neural network, are adopted to classify the emotional states. In text analysis, we use the two-step classification method to recognize the emotional states. The final emotional state is determined based on the emotion outputs from the acoustic and textual analyses. In this paper we have two parallel classifiers for acoustic information and two serial classifiers for textual information, and a final decision is made by combining these classifiers in decision-level fusion. Experimental results show that the emotion recognition accuracy of the integrated system is better than that of either of the two individual approaches.

  17. An unpublished text of Jovellanos about mineralogy

    Directory of Open Access Journals (Sweden)

    Jorge ORDAZ GARGALLO

    2012-02-01

    Full Text Available An unpublished manuscript of Gaspar Melchor de Jovellanos about the history of mineralogy, written during his captivity in Bellver Castle (Palma de Mallorca), is presented and analyzed. In this writing the importance of chemical knowledge as a source for other branches of science, and its applications in different fields of agriculture, mining and industry, is considered. The author made a historical synthesis reviewing the men of science who contributed to a great extent to the advance of chemistry and mineralogy. The text clearly supports the new contributions of Lavoisier and other supporters of experimentation as a scientific method, which agrees with Jovellanos’ ideas about the development of the «useful» sciences for the progress of the countries.

  18. Recognition of Text Image Using Multilayer Perceptron

    OpenAIRE

    Vijendra, Singh; Vasudeva, Nisha; Parashar, Hem Jyotsana

    2016-01-01

    The biggest challenge in the field of image processing is to recognize documents in both printed and handwritten format. Optical Character Recognition (OCR) is a type of document image analysis in which a scanned digital image containing either machine-printed or handwritten script is input into an OCR software engine and translated into an editable, machine-readable digital text format. A Neural network is designed to model the way in which the brain performs a particular task or function of int...

  19. Logistic regression a self-learning text

    CERN Document Server

    Kleinbaum, David G

    1994-01-01

    This textbook provides students and professionals in the health sciences with a presentation of the use of logistic regression in research. The text is self-contained, and designed to be used both in class or as a tool for self-study. It arises from the author's many years of experience teaching this material and the notes on which it is based have been extensively used throughout the world.

  20. Reading an ESL Writer’s Text

    Directory of Open Access Journals (Sweden)

    Paul Kei Matsuda

    2011-03-01

    Full Text Available This paper focuses on reading as a central act of communication in the tutorial session. Writing center tutors without extensive experience reading writing by second language writers may have difficulty getting past the many differences in surface-level features, organization, and rhetorical moves. After exploring some of the sources of these differences in writing, the authors present strategies that writing tutors can use to work effectively with second language writers.

  1. Le texte prophétique

    Directory of Open Access Journals (Sweden)

    Jacques Halbronn

    2000-05-01

    Full Text Available Literature and prophecy have a complex relationship: they are two different, even antagonistic, spheres that do not respond to the same stakes and do not look in the same direction. By contrast, the interface between prophetism and current political events is much clearer: the prophet addresses his contemporaries and seeks to influence events. To be properly understood, prophetic literature must therefore be placed back in the politico-religious context that gave rise to it; otherwise it risks being taken for nothing more than baseless raving. The semantic ambiguities and apparent inconsistencies of the prophetic text thus appear in a new light when considered against certain contextual realities, and also when one manages to situate them in relation to a whole corpus of more or less easily identifiable texts that served as their first and fertile source. The prophetic text must be considered as part of a network, as a variation within a vast textual and linguistic continuum.

  2. Stemming of Slovenian library science texts

    Directory of Open Access Journals (Sweden)

    Polona Vilar

    2002-01-01

    Full Text Available The theme of the article is the preparation of a stemming algorithm for Slovenian library science texts. The procedure consisted of three phases: learning, testing and evaluation. The preparation of the optimal stemmer for Slovenian texts from the field of library science is presented, along with its testing and comparison with two other stemmers for the Slovenian language: the Popovič stemmer and the Generic stemmer. A corpus of 790,000 words from the field of library science was used for learning. Lists of stems, word endings and stop-words were built. In the testing phase, the component parts of the algorithm were tested on an additional corpus of 167,000 words. In the evaluation phase, a comparison of the three stemmers processing the same word corpus was made. The results of each stemmer were compared with an intellectually prepared control result of the stemming of the corpus. It consisted of groups of semantically connected words with no errors. Understemming was especially monitored – the number of stems for semantically connected words, produced by an algorithm. The results were statistically processed with the Kruskal-Wallis test. The Optimal stemmer produced the best results. It matched best with the reference results and also gave the smallest number of stems for one semantic meaning. The Popovič stemmer followed closely. The Generic stemmer proved to be the least accurate. The procedures described in the thesis can represent a platform for the development of tools for automatic indexing and retrieval for library science texts in the Slovenian language.

  3. Database Citation in Full Text Biomedical Articles

    OpenAIRE

    Şenay Kafkas; Jee-Hyub Kim; Johanna R. McEntyre

    2013-01-01

    Molecular biology and literature databases represent essential infrastructure for life science research. Effective integration of these data resources requires that there are structured cross-references at the level of individual articles and biological records. Here, we describe the current patterns of how database entries are cited in research articles, based on analysis of the full text Open Access articles available from Europe PMC. Focusing on citation of entries in the European Nucleoti...

  4. On the Internet, with the Exploded Text

    Directory of Open Access Journals (Sweden)

    Jessamyn West

    2011-03-01

    Full Text Available Looking at print books from a writer’s first-person perspective. I wrote a book in 2009 and 2010. It’s getting published this year (2011) sometime. Let me tell you about what it’s like writing a print book for a large trade publisher during the long leisurely sunset of print. It was different from what I thought [...

  5. Helios: Understanding Solar Evolution Through Text Analytics

    Energy Technology Data Exchange (ETDEWEB)

    Randazzese, Lucien [SRI International, Menlo Park, CA (United States)

    2016-12-02

    This proof-of-concept project focused on developing, testing, and validating a range of bibliometric, text analytic, and machine-learning based methods to explore the evolution of three photovoltaic (PV) technologies: Cadmium Telluride (CdTe), Dye-Sensitized solar cells (DSSC), and Multi-junction solar cells. The analytical approach to the work was inspired by previous work by the same team to measure and predict the scientific prominence of terms and entities within specific research domains. The goal was to create tools that could assist domain-knowledgeable analysts in investigating the history and path of technological developments in general, with a focus on analyzing step-function changes in performance, or “breakthroughs,” in particular. The text-analytics platform developed during this project was dubbed Helios. The project relied on computational methods for analyzing large corpora of technical documents. For this project we ingested technical documents from the following sources into Helios: Thomson Scientific Web of Science (papers), the U.S. Patent & Trademark Office (patents), the U.S. Department of Energy (technical documents), the U.S. National Science Foundation (project funding summaries), and a hand curated set of full-text documents from Thomson Scientific and other sources.

  6. Statistical Language Model for Chinese Text Proofreading

    Institute of Scientific and Technical Information of China (English)

    张仰森; 曹元大

    2003-01-01

    Statistical language modeling techniques are investigated so as to construct a language model for Chinese text proofreading. After the defects of the n-gram model are analyzed, a novel statistical language model for Chinese text proofreading is proposed. This model takes full account of the information located before and after the target word wi, and the relationship between non-neighboring words wi and wj in the linguistic environment (LE). First, the word association degree between wi and wj is defined by using a distance-weighted factor, where wj is l words apart from wi in the LE; then the Bayes formula is used to calculate the LE related degree of word wi; and lastly, the LE related degree is taken as the criterion to predict the reasonability of word wi as it appears in context. Comparing the proposed model with the traditional n-gram model in a Chinese text automatic error detection system, the experimental results show that the error detection recall rate and precision rate of the system have been improved.
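
    The distance-weighted association step described above can be illustrated with a minimal sketch. The abstract does not give the actual weighting formula, so the 1/l weight, the window size, and the function name below are assumptions made purely for illustration; the Bayes step is not reproduced.

```python
from collections import defaultdict


def distance_weighted_association(sentences, window=3):
    """Accumulate a co-occurrence score for word pairs (w_i, w_j).

    A pair whose words are l positions apart (1 <= l <= window) adds 1/l
    to its score. The 1/l weight is an assumed stand-in for the paper's
    distance-weighted factor, which the abstract does not specify.
    """
    scores = defaultdict(float)
    for words in sentences:
        for i, w_i in enumerate(words):
            for l in range(1, window + 1):
                if i + l < len(words):
                    scores[(w_i, words[i + l])] += 1.0 / l
    return scores


# Hypothetical, already word-segmented input.
corpus = [["a", "b", "c"], ["a", "c"]]
association = distance_weighted_association(corpus)
```

    In this toy corpus the pair ("a", "c") is seen once at distance 2 and once at distance 1, so it accumulates a higher score than a pair seen only once.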

  7. TEXT SIGNAGE RECOGNITION IN ANDROID MOBILE DEVICES

    Directory of Open Access Journals (Sweden)

    Oi-Mean Foong

    2013-01-01

    Full Text Available This study presents a Text Signage Recognition (TSR) model in Android mobile devices for Visually Impaired People (VIP). Independent navigation is always a challenge for VIPs indoors in unfamiliar surroundings. Assistive technology such as Android smart devices has great potential to assist VIPs in indoor navigation using a built-in speech synthesizer. In contrast to previous TSR research, which was deployed on a standalone personal computer system using Otsu’s algorithm, we have developed an affordable Text Signage Recognition in Android Mobile Devices using the Tesseract OCR engine. The proposed TSR model used the input images from the International Conference on Document Analysis and Recognition (ICDAR) 2003 dataset for system training and testing. The TSR model was tested by four volunteers who were blindfolded. The system performance of the TSR model was assessed using different metrics (i.e., Precision, Recall, F-Score and Recognition Formulas) to determine its accuracy. Experimental results show that the proposed TSR model has achieved a satisfactory recognition rate.
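
    For readers unfamiliar with the metrics named above, the following sketch shows the standard definitions of precision, recall, and F-score computed from raw counts. The counts are hypothetical and are not taken from the study; the study's own "Recognition Formulas" are not specified in the abstract and are not reproduced here.

```python
def precision_recall_f1(true_positives, false_positives, false_negatives):
    """Return (precision, recall, F1) from raw counts.

    Each value falls back to 0.0 when its denominator is zero.
    """
    predicted = true_positives + false_positives
    actual = true_positives + false_negatives
    precision = true_positives / predicted if predicted else 0.0
    recall = true_positives / actual if actual else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Hypothetical counts for illustration only.
p, r, f = precision_recall_f1(true_positives=8, false_positives=2, false_negatives=2)
print(p, r, f)
```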

  8. Lances and javelins in administrative Ugaritic texts

    Directory of Open Access Journals (Sweden)

    Vidal, Jordi

    2007-12-01

    Full Text Available The main purpose of this paper is to analyse the various Ugaritic terms used to refer to lances and javelins. The data contained in the Ugaritic administrative texts point to the existence of different types of lances, part of the soldier’s offensive armament. Moreover, those very texts also attest the use of projectiles by the Ugaritic army, a type of weapon which some authors regard as javelins.

  9. DeTEXT: A Database for Evaluating Text Extraction from Biomedical Literature Figures.

    Science.gov (United States)

    Yin, Xu-Cheng; Yang, Chun; Pei, Wei-Yi; Man, Haixia; Zhang, Jun; Learned-Miller, Erik; Yu, Hong

    2015-01-01

    Hundreds of millions of figures are available in biomedical literature, representing important biomedical experimental evidence. Since text is a rich source of information in figures, automatically extracting such text may assist in the task of mining figure information. A high-quality ground truth standard can greatly facilitate the development of an automated system. This article describes DeTEXT: A database for evaluating text extraction from biomedical literature figures. It is the first publicly available, human-annotated, high quality, and large-scale figure-text dataset with 288 full-text articles, 500 biomedical figures, and 9308 text regions. This article describes how figures were selected from open-access full-text biomedical articles and how annotation guidelines and annotation tools were developed. We also discuss the inter-annotator agreement and the reliability of the annotations. We summarize the statistics of the DeTEXT data and make available evaluation protocols for DeTEXT. Finally we lay out challenges we observed in the automated detection and recognition of figure text and discuss research directions in this area. DeTEXT is publicly available for downloading at http://prir.ustb.edu.cn/DeTEXT/.

  10. Influence of text cohesion on the persuasive power of expository text.

    Science.gov (United States)

    Kaakinen, Johanna K; Salonen, Jonna; Venäläinen, Paula; Hyönä, Jukka

    2011-06-01

    The present study examined how global text cohesion affects persuasion and memory for message arguments presented in expository text. Sixty-nine participants who held a neutral prior attitude towards NATO read a persuasive text about NATO that was either high or low in global cohesion. After reading, participants voted whether Finland should seek NATO membership and filled in an attitude questionnaire. After a 1-week delay they returned for a surprise recall task. The results showed that the high cohesion text was more persuasive than the low cohesion text. Moreover, attitude after reading but not text cohesion predicted later recall of the message arguments. The results show that global text cohesion increases text's persuasive power and that readers who form a positive attitude have better memory of the persuasive arguments after a delay than readers who are less persuaded.

  11. Learning from text: The effect of adjunct questions and alignment on text comprehension

    NARCIS (Netherlands)

    Reijners, Pauline; Kester, Liesbeth; Wetzels, Sandra; Kirschner, Paul A.

    2012-01-01

    Reijners, P. B. G., Kester, L., Wetzels, S. A. J., & Kirschner, P. A. (2012, November). Learning from text: The effect of adjunct questions and alignment on text comprehension. Poster presented at the ICO International Fall School, Girona, Spain.

  12. How we draw texts: a review of approaches to text visualization and exploration

    OpenAIRE

    Nualart, Jaume; Pérez-Montoro Gutiérrez, Mario; Whitelan, M.

    2014-01-01

    This paper presents a review of approaches to text visualization and exploration. Text visualization and exploration, we argue, constitute a subfield of data visualization, and are fuelled by the advances being made in text analysis research and by the growing amount of accessible data in text format. We propose an original classification for a total of 49 cases based on the visual features of the approaches adopted, identified using an inductive process of analysis. We group the cases (publi...

  13. Visualizing the semantic content of large text databases using text maps

    Science.gov (United States)

    Combs, Nathan

    1993-01-01

    A methodology for generating text map representations of the semantic content of text databases is presented. Text maps provide a graphical metaphor for conceptualizing and visualizing the contents and data interrelationships of large text databases. Described are a set of experiments conducted against the TIPSTER corpora of Wall Street Journal articles. These experiments provide an introduction to current work in the representation and visualization of documents by way of their semantic content.

  14. Text summarization as a decision support aid

    Directory of Open Access Journals (Sweden)

    Workman T

    2012-05-01

    Full Text Available Background: PubMed data potentially can provide decision support information, but PubMed was not exclusively designed to be a point-of-care tool. Natural language processing applications that summarize PubMed citations hold promise for extracting decision support information. The objective of this study was to evaluate the efficiency of a text summarization application called Semantic MEDLINE, enhanced with a novel dynamic summarization method, in identifying decision support data. Methods: We downloaded PubMed citations addressing the prevention and drug treatment of four disease topics. We then processed the citations with Semantic MEDLINE, enhanced with the dynamic summarization method. We also processed the citations with a conventional summarization method, as well as with a baseline procedure. We evaluated the results using clinician-vetted reference standards built from recommendations in a commercial decision support product, DynaMed. Results: For the drug treatment data, Semantic MEDLINE enhanced with dynamic summarization achieved average recall and precision scores of 0.848 and 0.377, while conventional summarization produced 0.583 average recall and 0.712 average precision, and the baseline method yielded average recall and precision values of 0.252 and 0.277. For the prevention data, Semantic MEDLINE enhanced with dynamic summarization achieved average recall and precision scores of 0.655 and 0.329. The baseline technique resulted in recall and precision scores of 0.269 and 0.247. No conventional Semantic MEDLINE method accommodating summarization for prevention exists. Conclusion: Semantic MEDLINE with dynamic summarization outperformed conventional summarization in terms of recall, and outperformed the baseline method in both recall and precision. This new approach to text summarization demonstrates potential in identifying decision support data for multiple needs.

  15. Unsupervised information extraction by text segmentation

    CERN Document Server

    Cortez, Eli

    2013-01-01

    A new unsupervised approach to the problem of Information Extraction by Text Segmentation (IETS) is proposed, implemented and evaluated herein. The authors' approach relies on information available on pre-existing data to learn how to associate segments in the input string with attributes of a given domain relying on a very effective set of content-based features. The effectiveness of the content-based features is also exploited to directly learn from test data structure-based features, with no previous human-driven training, a feature unique to the presented approach. Based on the approach, a

  16. Methods for Mining and Summarizing Text Conversations

    CERN Document Server

    Carenini, Giuseppe; Murray, Gabriel

    2011-01-01

    Due to the Internet Revolution, human conversational data -- in written forms -- are accumulating at a phenomenal rate. At the same time, improvements in speech technology enable many spoken conversations to be transcribed. Individuals and organizations engage in email exchanges, face-to-face meetings, blogging, texting and other social media activities. The advances in natural language processing provide ample opportunities for these "informal documents" to be analyzed and mined, thus creating numerous new and valuable applications. This book presents a set of computational methods

  17. CCM: A Text Classification Method by Clustering

    DEFF Research Database (Denmark)

    Nizamani, Sarwat; Memon, Nasrullah; Wiil, Uffe Kock

    2011-01-01

    In this paper, a new Cluster based Classification Model (CCM) for suspicious email detection and other text classification tasks, is presented. Comparative experiments of the proposed model against traditional classification models and the boosting algorithm are also discussed. Experimental results...... show that the CCM outperforms traditional classification models as well as the boosting algorithm for the task of suspicious email detection on terrorism domain email dataset and topic categorization on the Reuters-21578 and 20 Newsgroups datasets. The overall finding is that applying a cluster based...

  18. Image, text and Observatio: the Codex Kentmanus.

    Science.gov (United States)

    Kusukawa, Sachiko

    2009-01-01

    This paper examines the inter-relationship between image, text and object in the Codex Kentmanus, which is one of the earliest records of the plants in the botanical garden at Padua, studied by Johannes Kentmann (1518-77). The manuscript shows that "observation" for Kentmann involved a gradual process of assimilating knowledge from other physicians, apothecaries, and books in order to make the plants which were originally encountered at a specific time and place into a more generalised object of study for learned physicians.

  19. Advances in text analytics for drug discovery.

    Science.gov (United States)

    Roberts, Phoebe M; Hayes, William S

    2005-05-01

    The automated extraction of biological and chemical information has improved over the past year, with advances in access to content, entity extraction of genes, chemicals, kinetic data and relationships, and algorithms for generating and testing hypotheses. As the systems for reading and understanding scientific literature grow more powerful, so must the infrastructure in which to assemble information. Advances in infrastructure systems are discussed in this review. Research efforts have flourished as a result of text analytics competitions that attract participants from various disciplines, from computer science to bioinformatics.

  20. Prior Knowledge, Reading Skill, and Text Cohesion in the Comprehension of Science Texts

    Science.gov (United States)

    Ozuru, Yasuhiro; Dempsey, Kyle; McNamara, Danielle S.

    2009-01-01

    This study examined how text features (i.e., cohesion) and individual differences (i.e., reading skill and prior knowledge) contribute to biology text comprehension. College students with low and high levels of biology knowledge read two biology texts, one of which was high in cohesion and the other low in cohesion. The two groups were similar in…

  1. Learning from Conflicting Texts: The Role of Intertextual Conflict Resolution in Between-Text Integration

    Science.gov (United States)

    Kobayashi, Keiichi

    2015-01-01

    The present study examined the effect of intertextual conflict resolution on learning from conflicting texts. In two experiments, participants read sets of two texts under the condition of being encouraged either to resolve a conflict between the texts' arguments (the resolution condition) or to comprehend the arguments (the comprehension…

  2. Learning from Texts: Activation of Information from Previous Texts during Reading

    Science.gov (United States)

    Beker, Katinka; Jolles, Dietsje; Lorch, Robert F., Jr.; van den Broek, Paul

    2016-01-01

    Learning often involves integration of information from multiple texts. The aim of the current study was to determine whether relevant information from previously read texts is spontaneously activated during reading, allowing for integration between texts (experiment 1 and 2), and whether this process is related to the representation of the texts…

  3. How Much Handwritten Text Is Needed for Text-Independent Writer Verification and Identification

    NARCIS (Netherlands)

    Brink, Axel; Bulacu, Marius; Schomaker, Lambert

    2008-01-01

    The performance of off-line text-independent writer verification and identification increases when the documents contain more text. This relation was examined by repeatedly conducting writer verification and identification performance tests while gradually increasing the amount of text on the pages.

  4. Text Skimming: The Process and Effectiveness of Foraging through Text under Time Pressure

    Science.gov (United States)

    Duggan, Geoffrey B.; Payne, Stephen J.

    2009-01-01

    Is skim reading effective? How do readers allocate their attention selectively? The authors report 3 experiments that use expository texts and allow readers only enough time to read half of each document. Experiment 1 found that, relative to reading half the text, skimming improved memory for important ideas from a text but did not improve memory…

  5. Towards Text Simplification for Poor Readers with Intellectual Disability: When Do Connectives Enhance Text Cohesion?

    Science.gov (United States)

    Fajardo, Inmaculada; Tavares, Gema; Avila, Vicenta; Ferrer, Antonio

    2013-01-01

    Cohesive elements of texts such as connectives (e.g., "but," "in contrast") are expected to facilitate inferential comprehension in poor readers. Two experiments tested this prediction in poor readers with intellectual disability (ID) by: (a) comparing literal and inferential text comprehension of texts with and without connectives and/or high…

  6. Layout-aware text extraction from full-text PDF of scientific articles

    Directory of Open Access Journals (Sweden)

    Ramakrishnan Cartic

    2012-05-01

    Full Text Available Background: The Portable Document Format (PDF) is the most commonly used file format for online scientific publications. The absence of effective means to extract text from these PDF files in a layout-aware manner presents a significant challenge for developers of biomedical text mining or biocuration informatics systems that use published literature as an information source. In this paper we introduce the ‘Layout-Aware PDF Text Extraction’ (LA-PDFText) system to facilitate accurate extraction of text from PDF files of research articles for use in text mining applications. Results: Our paper describes the construction and performance of an open source system that extracts text blocks from PDF-formatted full-text research articles and classifies them into logical units based on rules that characterize specific sections. The LA-PDFText system focuses only on the textual content of the research articles and is meant as a baseline for further experiments into more advanced extraction methods that handle multi-modal content, such as images and graphs. The system works in a three-stage process: (1) detecting contiguous text blocks using spatial layout processing to locate and identify blocks of contiguous text, (2) classifying text blocks into rhetorical categories using a rule-based method and (3) stitching classified text blocks together in the correct order, resulting in the extraction of text from section-wise grouped blocks. We show that our system can identify text blocks and classify them into rhetorical categories with Precision = 0.96%, Recall = 0.89% and F1 = 0.91%. We also present an evaluation of the accuracy of the block detection algorithm used in step 2. Additionally, we have compared the accuracy of the text extracted by LA-PDFText to the text from the Open Access subset of PubMed Central. We then compared this accuracy with that of the text extracted by the PDF2Text system, commonly used to extract text from PDF
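
    The classify-then-stitch part of the three-stage process can be sketched in a few lines. This is a toy illustration, not the LA-PDFText implementation: the `Block` type, the heading keywords, and the single rule used for classification are all hypothetical, and stage 1 (block detection from page layout) is assumed to have already produced the blocks.

```python
from dataclasses import dataclass


@dataclass
class Block:
    page: int
    top: float          # vertical position on the page; smaller means higher up
    text: str
    label: str = "body"


# Hypothetical rule set: a block consisting only of one of these words is a heading.
SECTION_KEYWORDS = {"abstract", "methods", "results", "references"}


def reading_order(blocks):
    """Order blocks by page, then top-to-bottom position."""
    return sorted(blocks, key=lambda b: (b.page, b.top))


def classify(blocks):
    """Stage 2 (toy rule): label each block with the most recent section heading."""
    current = "body"
    for block in reading_order(blocks):
        heading = block.text.strip().lower()
        if heading in SECTION_KEYWORDS:
            current = heading
            block.label = "heading"
        else:
            block.label = current
    return blocks


def stitch(blocks):
    """Stage 3: join classified blocks in reading order, grouped by section."""
    sections = {}
    for block in reading_order(blocks):
        if block.label != "heading":
            sections.setdefault(block.label, []).append(block.text)
    return {label: " ".join(parts) for label, parts in sections.items()}


# Blocks arrive out of order, as they might from a layout detector.
blocks = [
    Block(1, 300.0, "We parsed 10 files."),
    Block(1, 100.0, "Methods"),
    Block(1, 200.0, "Blocks were detected."),
    Block(2, 50.0, "Results"),
    Block(2, 120.0, "Accuracy was high."),
]
sections = stitch(classify(blocks))
```

    Keeping classification and stitching as separate functions mirrors the staged design the abstract describes, so that either rule set could be swapped without touching the other.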

  7. Visual Classifier Training for Text Document Retrieval.

    Science.gov (United States)

    Heimerl, F; Koch, S; Bosch, H; Ertl, T

    2012-12-01

    Performing exhaustive searches over a large number of text documents can be tedious, since it is very hard to formulate search queries or define filter criteria that capture an analyst's information need adequately. Classification through machine learning has the potential to improve search and filter tasks encompassing either complex or very specific information needs, individually. Unfortunately, analysts who are knowledgeable in their field are typically not machine learning specialists. Most classification methods, however, require a certain expertise regarding their parametrization to achieve good results. Supervised machine learning algorithms, in contrast, rely on labeled data, which can be provided by analysts. However, the effort for labeling can be very high, which shifts the problem from composing complex queries or defining accurate filters to another laborious task, in addition to the need for judging the trained classifier's quality. We therefore compare three approaches for interactive classifier training in a user study. All of the approaches are potential candidates for the integration into a larger retrieval system. They incorporate active learning to various degrees in order to reduce the labeling effort as well as to increase effectiveness. Two of them encompass interactive visualization for letting users explore the status of the classifier in context of the labeled documents, as well as for judging the quality of the classifier in iterative feedback loops. We see our work as a step towards introducing user controlled classification methods in addition to text search and filtering for increasing recall in analytics scenarios involving large corpora.
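
    One common way active learning reduces labeling effort is uncertainty sampling: the analyst is asked to label the documents the current classifier is least sure about. The abstract does not say which selection strategy the compared approaches use, so the sketch below is an assumption chosen for illustration, with hypothetical scores.

```python
def most_uncertain(probabilities, k=1):
    """Return the indices of the k documents whose predicted probability
    of belonging to the positive class is closest to 0.5."""
    ranked = sorted(range(len(probabilities)),
                    key=lambda i: abs(probabilities[i] - 0.5))
    return ranked[:k]


# Hypothetical classifier outputs for five unlabeled documents.
scores = [0.95, 0.52, 0.10, 0.40, 0.75]
to_label = most_uncertain(scores, k=2)
print(to_label)  # → [1, 3]
```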

  8. Pathology of Commentary in Persian Literary Texts

    Directory of Open Access Journals (Sweden)

    احمد رضی

    2011-10-01

    Full Text Available Today commentary work has a significant role and place among the readers of Persian literary texts and those interested in them. The growing importance and popularity of commentary works in helping readers understand these texts, notably in recent decades, has led commentators with different levels of knowledge and ability to write commentaries and feed this disorganized market. This study investigates the commentary works published in the past decades and analyzes their weak points. To do so, over 250 works written and published between 1300 AP (circa 1921 AD) and 1387 AP (circa 2008 AD) were examined, and an attempt has been made to classify, describe, and analyze their most important problems and weak points; at the end, the most important characteristics of the best commentaries and commentators are explained. The most important problems and weak points of commentary works can be summarized in seven broad categories: (1) content shortcomings; (2) inappropriate approach; (3) incongruence between the structure of the commentary work and the type of the work and the commentator's objective; (4) lack of attention towards the readership; (5) carelessness and incompetency of the commentator; (6) complex statement and insensible language; (7) inaudibility of introductions. Key words: research methodology, commentary works, pathology, literary works

  9. Chemical-text hybrid search engines.

    Science.gov (United States)

    Zhou, Yingyao; Zhou, Bin; Jiang, Shumei; King, Frederick J

    2010-01-01

    As the amount of chemical literature increases, it is critical that researchers be enabled to accurately locate documents related to a particular aspect of a given compound. Existing solutions, based on text and chemical search engines alone, suffer from the inclusion of "false negative" and "false positive" results, and cannot accommodate the diverse repertoire of formats currently available for chemical documents. To address these concerns, we developed an approach called Entity-Canonical Keyword Indexing (ECKI), which converts a chemical entity embedded in a data source into its canonical keyword representation prior to being indexed by text search engines. We implemented ECKI using Microsoft Office SharePoint Server Search, and the resultant hybrid search engine not only supported complex mixed chemical and keyword queries but also was applied to both intranet and Internet environments. We envision that the adoption of ECKI will empower researchers to pose more complex search questions that were not readily attainable previously and to obtain answers at much improved speed and accuracy.
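
    The idea of converting a chemical entity into a canonical keyword before text indexing can be illustrated with a toy inverted index. This is not the ECKI implementation: the synonym table, the `CHEMKEY_` keyword format, and all function names are hypothetical, and a real system would derive canonical forms from chemical structures rather than from a hand-written lookup.

```python
import re

# Hypothetical synonym table mapping chemical names to one canonical keyword.
CANONICAL = {
    "aspirin": "CHEMKEY_ASPIRIN",
    "acetylsalicylic acid": "CHEMKEY_ASPIRIN",
    "ibuprofen": "CHEMKEY_IBUPROFEN",
}


def canonical_keywords(text):
    """Return the canonical keywords for every known chemical name in the text."""
    found = set()
    for name, key in CANONICAL.items():
        if re.search(r"\b" + re.escape(name) + r"\b", text, flags=re.IGNORECASE):
            found.add(key)
    return found


def build_index(documents):
    """Build an inverted index over plain words plus canonical chemical keywords."""
    index = {}
    for doc_id, text in documents.items():
        tokens = set(re.findall(r"\w+", text.lower()))
        tokens |= canonical_keywords(text)
        for token in tokens:
            index.setdefault(token, set()).add(doc_id)
    return index


def search(index, chemical_name, keyword):
    """Mixed query: documents mentioning the chemical (under any synonym) and the keyword."""
    key = CANONICAL[chemical_name.lower()]
    return sorted(index.get(key, set()) & index.get(keyword.lower(), set()))


documents = {
    "d1": "Aspirin reduces fever.",
    "d2": "Acetylsalicylic acid and fever in children.",
    "d3": "Ibuprofen reduces fever.",
}
index = build_index(documents)
hits = search(index, "aspirin", "fever")
```

    Because both synonyms map to the same keyword at indexing time, an ordinary text index can answer the mixed chemical-and-keyword query without any chemistry-aware logic at query time.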

  10. Handwritten Text Image Authentication using Back Propagation

    CERN Document Server

    Chakravarthy, A S N; Avadhani, P S

    2011-01-01

    Authentication is the act of confirming the truth of an attribute of a datum or entity. This might involve confirming the identity of a person, tracing the origins of an artefact, ensuring that a product is what its packaging and labelling claims to be, or assuring that a computer program is a trusted one. The authentication of information can pose special problems (especially man-in-the-middle attacks), and is often wrapped up with authenticating identity. Literary forgery can involve imitating the style of a famous author. If an original manuscript, typewritten text, or recording is available, then the medium itself (or its packaging - anything from a box to e-mail headers) can help prove or disprove the authenticity of the document. The use of digital images of handwritten historical documents has become more popular in recent years. Volunteers around the world now read thousands of these images as part of their indexing process. Handwritten text images of old documents are sometimes difficult to read or noisy du...

  11. El manual como texto Schoolbook as text

    Directory of Open Access Journals (Sweden)

    Agustín Escolano Benito

    2012-12-01

    Full Text Available This paper discusses the question of identifying a coursebook as a specific text genre in the context of classical and modern manualistics, situating the analysis within the traditional school culture and the digital revolution era, under a historical and theoretical perspective. It also covers the birth and initial development of manualistics as an intellectual and academic field and its contributions to the definition of the schoolbook identity.

  12. Relative clauses in French children's narrative texts.

    Science.gov (United States)

    Jisa, H; Kern, S

    1998-10-01

    This study investigates the use of relative clauses in French children's narrative monologues. Narrative texts were collected from French-speaking monolinguals in four age groups (five, seven, ten years and adults). Twenty subjects from each group were asked to tell a story based on a picture book consisting of twenty-four images without text (Frog, Where are you?). Relative constructions were coded following the categories defined by Dasinger & Toupin (1994) into two main functional classes: general discourse and narrative functions. The results show that the use of relative clauses in general discourse functions precedes their use in more specific narrative functions. An analysis of textual connectivity (Berman & Slobin, 1994) in one episode reveals that children and adults differ in their choice of preferred structures. The results also show that children use fewer transitive predicates in relative clauses than do adults. Transitive verbs are essential for advancing the narrative plot (Hopper & Thompson, 1980). While subject relative clauses are acquired early and used frequently, the development of their multifunctional use in diverse narrative functions extends well beyond childhood.

  13. THE PSYCHOLOGICAL NATURE OF TEXT COMPREHENSION IN TERMS OF TEXT LEARNING PROCESSES

    Directory of Open Access Journals (Sweden)

    Ferhat ENSAR

    2013-06-01

    Full Text Available Texts are important tools for learning. Thus, the attempt to make texts more understandable reflects a purpose- and function-related necessity for learning from text. The idea of developing and improving informative texts via corrective teaching materials is frequently explored by contemporary researchers. It is evident that greater proficiency in the structure of texts is needed in the learning process, and that efforts to prepare educational materials should rest on more scientific ground. Therefore, in this study textual organization and a general theory of learning from texts are outlined; later, language processing in working memory and related phenomena concerning learning from texts and individual differences, including text development, text comprehension, and inferences from texts, are discussed. One reason for this is the idea that working memory is responsible not only for recalling stored information but also for storing the results of partial processes in successive processes such as language comprehension, as explained in the literature on modern memory theories. The other reason is the generalizations about the interaction between the processes of physical representation and the pattern of a text manifested in accordance with these ideas. Additionally, the different procedures used to develop informative texts, the differences among these procedures, including a learner’s view of the world and processing styles, the measurement of text comprehension, and the complex relations among them are covered in the current literature. Given the nature of the factors that affect a learner’s level of recall and understanding of a text, this study aims to discuss these assumptions.

  14. Computational text analysis and reading comprehension exam complexity towards automatic text classification

    CERN Document Server

    Liontou, Trisevgeni

    2014-01-01

    This book delineates a range of linguistic features that characterise the reading texts used at the B2 (Independent User) and C1 (Proficient User) levels of the Greek State Certificate of English Language Proficiency exams in order to help define text difficulty per level of competence. In addition, it examines whether specific reader variables influence test takers' perceptions of reading comprehension difficulty. The end product is a Text Classification Profile per level of competence and a formula for automatically estimating text difficulty and assigning levels to texts consistently and re

  15. MANAGING THE TRANSLATION OF ECONOMIC TEXTS

    Directory of Open Access Journals (Sweden)

    Pop Anamaria Mirabela

    2012-12-01

    Full Text Available Theoretically, translation may pass as science; practically, it seems closer to art. Translation is a challenging activity requiring a set of abilities and posing a few difficulties that appear during the translation process. This paper investigates the extent to which sub-technical vocabulary can constitute a problem to Romanian students of economics reading in English, by looking at the translations produced as independent or pair work during English classes and analyzing the various errors that may appear. The exigencies required by efficient business communication have increased in the past few decades because of rising international trade, increased migration, globalization, the recognition of linguistic minorities, and the expansion of the mass media and technology. All these led us to approach the topic of translation, which is actually a job that requires skills, stages of research necessary for disclosure of transfer characteristic into the target language, training, experience and a good sense of languages. The paper defines the theoretical issues and terminology: translation, types of translation, economic texts and then focuses on the presentation of the practical work carried out with second-year students throughout the academic year. Considering that only 28% of the entire European population can read English, and even fewer people in South America and Asia can, it is obvious that an effective communication of business matters relies on an accurate understanding of terminology. Economics is a field of knowledge in accelerated scientific and technological development. As there is a permanent and ever increasing need to quickly update their knowledge, economists read and learn directly in the original language of the publication and stick to it in daily usage, including conferences, scientific events and articles written in Romanian. Besides researching properly the markets, finding distribution channels, and dealing with legal

  16. Representativeness and significance factors in ESP texts

    Directory of Open Access Journals (Sweden)

    Alejandro Curado Fuentes

    2000-04-01

    Full Text Available The development of communicative approaches and strategies in specialized discourse has led to revising notions of representative and significant language. Particularly in the work with academic genres, in science and technology (EST) settings such as our own institution, the need for determining these factors is ever growing. The application of empirical resources such as specific language corpora, in fact, becomes convenient. In this paper, the aim is to specify the type of corpus linguistic representativeness and significance sought in the case of teaching English to our groups of Computer Science students. In that scope, we present data and samples on which to base our suggestions and claims regarding the exploitation of textual material.

  17. Word and text processing in developmental prosopagnosia.

    Science.gov (United States)

    Rubino, Cristina; Corrow, Sherryse L; Corrow, Jeffrey C; Duchaine, Brad; Barton, Jason J S

    2016-01-01

    The "many-to-many" hypothesis proposes that visual object processing is supported by distributed circuits that overlap for different object categories. For faces and words the hypothesis posits that both posterior fusiform regions contribute to both face and visual word perception and predicts that unilateral lesions impairing one will affect the other. However, studies testing this hypothesis have produced mixed results. We evaluated visual word processing in subjects with developmental prosopagnosia, a condition linked to right posterior fusiform abnormalities. Ten developmental prosopagnosic subjects performed a word-length effect task and a task evaluating the recognition of word content across variations in text style, and the recognition of style across variations in word content. All subjects had normal word-length effects. One had prolonged sorting time for word recognition in handwritten stimuli. These results suggest that the deficit in developmental prosopagnosia is unlikely to affect visual word processing, contrary to predictions of the many-to-many hypothesis.

  18. Exploiting Surrounding Text for Retrieving Web Images

    Directory of Open Access Journals (Sweden)

    S. A. Noah

    2008-01-01

    Full Text Available Web documents contain useful textual information that can be exploited for describing images. Research has focused on representing images by means of their content (low-level descriptions such as color, shape and texture); little research has been directed to exploiting such textual information. The aim of this research was to systematically exploit the textual content of HTML documents for automatically indexing and ranking images embedded in web documents. A heuristic approach for locating and weighting the text surrounding web images and a modified tf.idf weighting scheme were proposed. Precision-recall evaluation was conducted for ten queries and promising results were achieved. The proposed approach showed a slightly better precision measure than a popular search engine, with average relative precision measures of 0.63 and 0.55 respectively.
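    The abstract does not spell out how its tf.idf scheme was modified, so the following is only a minimal sketch of the standard, unmodified tf.idf weighting applied to text found near images. The sample strings and the function name `tf_idf` are illustrative assumptions, not taken from the paper.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per document using plain tf-idf.

    Assumption: standard tf * log(N / df); the paper's modified scheme
    is not described in the abstract and is NOT reproduced here.
    """
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    weights = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        weights.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


# Hypothetical snippets of text surrounding three web images.
surrounding_text = [
    "red sports car on road",
    "red apple on table",
    "mountain road at sunset",
]
scores = tf_idf(surrounding_text)
```

    In this toy collection, a term that occurs near only one image ("car") receives a higher weight than a term shared by several images ("red"), which is the property an image index built on surrounding text relies on.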

  19. Conversation Analysis and Orality in Written Texts

    Directory of Open Access Journals (Sweden)

    Luiz Antônio da Silva

    2015-02-01

    Full Text Available Marcuschi (1977) points out that orality is an important topic to be developed in the classroom. Lamentably, however, it has been left aside, because teachers and those responsible for education do not consider it an important feature to be emphasized in mother tongue teaching. The main reason is the focus given to language teaching in Brazilian schools: the school is supposed to teach writing, and how to write well. Despite the advances of linguistic studies on speaking and writing; despite the contributions of Sociolinguistics and Conversation Analysis; and despite the overcoming of prejudices, especially on the strict distinction between the two modes, there is still a long way to go. Thus, it is beneficial to bring up a discussion on speaking and writing. Several years after Marcuschi's findings (1977), textbook authors, teachers, researchers and those responsible for Portuguese language teaching have another theoretical approach. Nonetheless, in practice, there is still a lot to be accomplished, since writing continues to be the focus of Portuguese language teaching in Brazilian schools. It seems that most teachers know the theory, but they experience difficulties when it comes to the practices of everyday school life. This paper aims to analyze oral marks or effects of orality in written literary texts, more precisely in the dialogues produced. These analyses will help us provide support to a Portuguese teacher, so that he/she can work consistently and productively. To illustrate our observations, we have chosen fragments of chronicles written by Brazilian writer Luís Fernando Verissimo, published in three of his works: Comédias para se ler na escola, Sexo na cabeça and Amor Veríssimo.

  20. Counting OCR errors in typeset text

    Science.gov (United States)

    Sandberg, Jonathan S.

    1995-03-01

    Frequently object recognition accuracy is a key component in the performance analysis of pattern matching systems. In the past three years, the results of numerous excellent and rigorous studies of OCR system typeset-character accuracy (henceforth OCR accuracy) have been published, encouraging performance comparisons between a variety of OCR products and technologies. These published figures are important; OCR vendor advertisements in the popular trade magazines lead readers to believe that published OCR accuracy figures affect market share in the lucrative OCR market. Curiously, a detailed review of many of these OCR error occurrence counting results reveals that they are not reproducible as published and they are not strictly comparable due to larger variances in the counts than would be expected by the sampling variance. Naturally, since OCR accuracy is based on a ratio of the number of OCR errors over the size of the text searched for errors, imprecise OCR error accounting leads to similar imprecision in OCR accuracy. Some published papers use informal, non-automatic, or intuitively correct OCR error accounting. Still other published results present OCR error accounting methods based on string matching algorithms such as dynamic programming using Levenshtein (edit) distance but omit critical implementation details (such as the existence of suspect markers in the OCR generated output or the weights used in the dynamic programming minimization procedure). The problem with not specifically revealing the accounting method is that the numbers of errors found by different methods are significantly different. This paper identifies the basic accounting methods used to measure OCR errors in typeset text and offers an evaluation and comparison of the various accounting methods.
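    One of the accounting methods named above, dynamic programming with Levenshtein (edit) distance, can be sketched as follows. This is an illustration under stated assumptions rather than the paper's procedure: all edit weights are fixed at 1, suspect markers are ignored, and accuracy is taken as one minus errors divided by reference length. The function names are hypothetical.

```python
def levenshtein(reference, hypothesis):
    """Edit distance with unit weights for insert, delete and substitute.

    Assumption: unit weights. The abstract stresses that published
    studies often omit the weights they actually used.
    """
    # previous[j] holds the distance between the reference prefix
    # processed so far and the first j characters of the hypothesis.
    previous = list(range(len(hypothesis) + 1))
    for i, ref_char in enumerate(reference, start=1):
        current = [i]
        for j, hyp_char in enumerate(hypothesis, start=1):
            substitution = previous[j - 1] + (ref_char != hyp_char)
            deletion = previous[j] + 1
            insertion = current[j - 1] + 1
            current.append(min(substitution, deletion, insertion))
        previous = current
    return previous[-1]


def character_accuracy(reference, hypothesis):
    """Assumed definition: 1 - (edit errors / reference length)."""
    if not reference:
        raise ValueError("reference text must not be empty")
    return 1 - levenshtein(reference, hypothesis) / len(reference)


# Classic OCR confusion: "rn" read as "m".
errors = levenshtein("modern", "modem")
accuracy = character_accuracy("modern", "modem")
```

    Changing the weights, or counting the "rn" to "m" confusion as one error instead of two, changes the error count for the same output, which is the kind of discrepancy between accounting methods the abstract describes.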

  1. Dialogical surface text features in abstracts

    Directory of Open Access Journals (Sweden)

    Ingrid García-Østbye

    2008-04-01

    Full Text Available A sample-driven description of Research Article-Comment-Reply (RA-C-R) abstracts in terms of abstract sentence length, reference, possessive structures, modal verbs and word range was carried out to find out whether their surface text features showed some trace of a dialogical construction of knowledge within the psychology discourse community. The study served an exploratory purpose. A Boolean search was conducted in the PsycLIT database yielding a sample of 149 PsycLIT RA-C-R abstracts (13,978 words). Relative frequency percent distributions were calculated for all variables, including reported speech verbs. Specific comparisons with a Medline corpus were conducted and variations were accounted for in terms of scientific discourse characteristics, field, database policies, and dialogical nature; that is, in the framework provided by the strands of research of quantitative applied linguistics, social concerns in genre analysis and the model monopoly theory developed in the implementation in sociology of the systems theory. The results suggest: (i) a word range affected by both psychology as a discipline and the dialogical content on which PsycLIT RA-C-R abstracts report; (ii) a complementarity of reference and possessive structures characterised by features of scientific discourse, feedback genres and dialogical dimensions; (iii) the presence of both deontic and epistemic modality in the modal verbs of our sample; and (iv) that abstract length, sentence length and number of sentences per paragraph in our sample may not vary greatly in general terms from those of the social sciences.

  2. L1 and L2 Spoken Word Processing: Evidence from Divided Attention Paradigm.

    Science.gov (United States)

    Shafiee Nahrkhalaji, Saeedeh; Lotfi, Ahmad Reza; Koosha, Mansour

    2016-10-01

    The present study aims to reveal some facts concerning first language (L1) and second language (L2) spoken-word processing in unbalanced proficient bilinguals using behavioral measures. The intention here is to examine the effects of auditory repetition word priming and semantic priming in first and second languages of these bilinguals. The other goal is to explore the effects of attention manipulation on implicit retrieval of perceptual and conceptual properties of spoken L1 and L2 words. In so doing, the participants performed auditory word priming and semantic priming as memory tests in their L1 and L2. In half of the trials of each experiment, they carried out the memory test while simultaneously performing a secondary task in visual modality. The results revealed that effects of auditory word priming and semantic priming were present when participants processed L1 and L2 words in full attention condition. Attention manipulation could reduce priming magnitude in both experiments in [Formula: see text]. Moreover, [Formula: see text] word retrieval increases the reaction times and reduces accuracy on the simultaneous secondary task to protect its own accuracy and speed.

  3. Text Mining Approaches To Extract Interesting Association Rules from Text Documents

    Directory of Open Access Journals (Sweden)

    Vishwadeepak Singh Baghela

    2012-05-01

    Full Text Available A handful of text data mining approaches are available to extract potentially useful information and associations from large amounts of text data. The term data mining is used for methods that analyze data with the objective of finding rules and patterns describing the characteristic properties of the data. The mined information is typically represented as a model of the semantic structure of the dataset, where the model may be used on new data for prediction or classification. In general, data mining deals with structured data (for example, relational databases), whereas text presents special characteristics and is unstructured. Unstructured data is very different from databases, where mining techniques are usually applied and structured data is managed. Text mining can work with unstructured or semi-structured data sets. A brief review of some recent research related to mining associations from text documents is presented in this paper.
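    As a rough illustration of what mining association rules from text can look like, the sketch below treats each document as a set of terms and keeps term-to-term rules that meet support and confidence thresholds. The thresholds, the sample documents and the function name `association_rules` are assumptions chosen for the example; the reviewed approaches are not described in enough detail in the abstract to reproduce them.

```python
from itertools import combinations


def association_rules(documents, min_support=0.5, min_confidence=0.7):
    """Return (antecedent, consequent, support, confidence) tuples.

    Assumption: single-term rules over whitespace-tokenized documents,
    found by brute force rather than by an Apriori-style search.
    """
    transactions = [set(doc.lower().split()) for doc in documents]
    n = len(transactions)

    def support(terms):
        # Fraction of documents that contain every term in `terms`.
        return sum(terms <= t for t in transactions) / n

    vocabulary = sorted(set().union(*transactions))
    rules = []
    for a, b in combinations(vocabulary, 2):
        pair_support = support({a, b})
        if pair_support < min_support:
            continue
        for antecedent, consequent in ((a, b), (b, a)):
            confidence = pair_support / support({antecedent})
            if confidence >= min_confidence:
                rules.append((antecedent, consequent, pair_support, confidence))
    return rules


# Hypothetical mini-corpus.
docs = [
    "text mining rules",
    "text mining patterns",
    "data mining rules",
    "text classification",
]
rules = association_rules(docs)
```

    With these settings the only surviving rule is "rules" implies "mining": the pair occurs in half of the documents, and every document containing "rules" also contains "mining".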

  4. Similarities and Differences between the Advertising Text in Romanian and the Advertising Text in French

    OpenAIRE

    Cristina Gruber

    2014-01-01

    The present study, entirely original, covers several aspects referring to advertising texts published in Romania and France in the first decade of the 21st century: statements, intertextuality, figures (construction-related, sound-related and semantic), semantic links between advertising text sequences, lexis. The analyzed body contains more than 2000 texts published in the press from Romania and France. We can assert, based on the analysis performed, that all the means and methods available ...

  5. Basic test framework for the evaluation of text line segmentation and text parameter extraction.

    Science.gov (United States)

    Brodić, Darko; Milivojević, Dragan R; Milivojević, Zoran

    2010-01-01

    Text line segmentation is an essential stage in off-line optical character recognition (OCR) systems. It is key because inaccurately segmented text lines will lead to OCR failure. Text line segmentation of handwritten documents is a complex and diverse problem, complicated by the nature of handwriting. Hence, text line segmentation is a leading challenge in handwritten document image processing. Due to inconsistencies in the measurement and evaluation of text segmentation algorithm quality, a basic set of measurement methods is required. Currently, there is no commonly accepted one, and all algorithm evaluation is custom oriented. In this paper, a basic test framework for the evaluation of text feature extraction algorithms is proposed. This test framework consists of a few experiments primarily linked to text line segmentation, skew rate and reference text line evaluation. Although they are mutually independent, the results obtained are strongly cross linked. In the end, its suitability for different types of letters and languages as well as its adaptability are its main advantages. Thus, the paper presents an efficient evaluation method for text analysis algorithms.

  6. Text mining for improved exposure assessment

    Science.gov (United States)

    Baker, Simon; Silins, Ilona; Guo, Yufan; Stenius, Ulla; Korhonen, Anna; Berglund, Marika

    2017-01-01

    Chemical exposure assessments are based on information collected via different methods, such as biomonitoring, personal monitoring, environmental monitoring and questionnaires. The vast amount of chemical-specific exposure information available from web-based databases, such as PubMed, is undoubtedly a great asset to the scientific community. However, manual retrieval of relevant published information is an extremely time consuming task and overviewing the data is nearly impossible. Here, we present the development of an automatic classifier for chemical exposure information. First, nearly 3700 abstracts were manually annotated by an expert in exposure sciences according to a taxonomy exclusively created for exposure information. Natural Language Processing (NLP) techniques were used to extract semantic and syntactic features relevant to chemical exposure text. Using these features, we trained a supervised machine learning algorithm to automatically classify PubMed abstracts according to the exposure taxonomy. The resulting classifier demonstrates good performance in the intrinsic evaluation. We also show that the classifier improves information retrieval of chemical exposure data compared to keyword-based PubMed searches. Case studies demonstrate that the classifier can be used to assist researchers by facilitating information retrieval and classification, enabling data gap recognition and overviewing available scientific literature using chemical-specific publication profiles. Finally, we identify challenges to be addressed in future development of the system. PMID:28257498

  7. Automatically ordering events and times in text

    CERN Document Server

    Derczynski, Leon R A

    2017-01-01

    The book offers a detailed guide to temporal ordering, exploring open problems in the field and providing solutions and extensive analysis. It addresses the challenge of automatically ordering events and times in text. Aided by TimeML, it also describes and presents concepts relating to time in easy-to-compute terms. Working out the order that events and times happen has proven difficult for computers, since the language used to discuss time can be vague and complex. Mapping out these concepts for a computational system, which does not have its own inherent idea of time, is, unsurprisingly, tough. Solving this problem enables powerful systems that can plan, reason about events, and construct stories of their own accord, as well as understand the complex narratives that humans express and comprehend so naturally. This book presents a theory and data-driven analysis of temporal ordering, leading to the identification of exactly what is difficult about the task. It then proposes and evaluates machine-learning so...

  8. Linguistically informed digital fingerprints for text

    Science.gov (United States)

    Uzuner, Özlem

    2006-02-01

    Digital fingerprinting, watermarking, and tracking technologies have gained importance in recent years in response to growing problems such as digital copyright infringement. While fingerprints and watermarks can be generated in many different ways, use of natural language processing for these purposes has so far been limited. Measuring similarity of literary works for automatic copyright infringement detection requires identifying and comparing creative expression of content in documents. In this paper, we present a linguistic approach to automatically fingerprinting novels based on their expression of content. We use natural language processing techniques to generate "expression fingerprints". These fingerprints consist of both syntactic and semantic elements of language, i.e., syntactic and semantic elements of expression. Our experiments indicate that syntactic and semantic elements of expression enable accurate identification of novels and their paraphrases, providing a significant improvement over techniques used in text classification literature for automatic copy recognition. We show that these elements of expression can be used to fingerprint, label, or watermark works; they represent features that are essential to the character of works and that remain fairly consistent in the works even when works are paraphrased. These features can be directly extracted from the contents of the works on demand and can be used to recognize works that would not be correctly identified either in the absence of pre-existing labels or by verbatim-copy detectors.

  9. Creative texts (Textes de création)

    Directory of Open Access Journals (Sweden)

    Multiple authors

    2014-04-01

    Full Text Available From inside and from outside… this expression, rooted in current affairs (events, demonstrations, symposiums), refers to cultural initiatives carried out in prison and to prison productions disseminated in society. This iterative relationship is all the more marked by otherness: changing perceptions, redefinition of the people involved, transmission of knowledge, etc. Sylvie Frigon, together with Tina Charlebois, Éric Charlebois, Guy Thibodeau, Lise Careau, Alberte Villeneuve-Sinclair, Michèle Vinet and Martine Bisson Rodriguez, members of the AAOF, driven by a pedagogical approach of their own devising, have been entering these places since autumn 2011 and giving a voice to the confined participants. Of course, there is physical detention in the strict sense, but also confinement as stigma, confinement as emotion, and confinement as supervision.

  10. Mining Causality for Explanation Knowledge from Text

    Institute of Scientific and Technical Information of China (English)

    Chaveevan Pechsiri; Asanee Kawtrakul

    2007-01-01

    Mining causality is essential to provide a diagnosis. This research aims at extracting the causality existing within multiple sentences or EDUs (Elementary Discourse Units). The research emphasizes the use of causality verbs because they make explicit in a certain way the consequent events of a cause, e.g., "Aphids suck the sap from rice leaves. Then leaves will shrink. Later, they will become yellow and dry.". A verb can also be the causal-verb link between cause and effect within EDU(s), e.g., "Aphids suck the sap from rice leaves causing leaves to be shrunk" ("causing" is equivalent to a causal-verb link in Thai). The research confronts two main problems: identifying the interesting causality events from documents and identifying their boundaries. We then propose mining on verbs by using two different machine learning techniques, a Naive Bayes classifier and a Support Vector Machine. The resulting mining rules will be used for the identification and the causality extraction of the multiple EDUs from text. Our multiple-EDU extraction shows 0.88 precision with 0.75 recall from the Naive Bayes classifier and 0.89 precision with 0.76 recall from the Support Vector Machine.
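    For readers unfamiliar with the precision and recall figures quoted in this and other records, the sketch below shows how the two measures, and the F1 score that combines them, are conventionally computed from a set of extracted items and a gold-standard set. The identifiers and toy sets are hypothetical and do not come from the paper's data.

```python
def precision_recall(predicted, gold):
    """Precision and recall of a predicted set against a gold set."""
    predicted, gold = set(predicted), set(gold)
    true_positives = len(predicted & gold)
    # Precision: share of extracted items that are correct.
    precision = true_positives / len(predicted) if predicted else 0.0
    # Recall: share of correct items that were extracted.
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall


def f1_score(precision, recall):
    """Harmonic mean of precision and recall (0.0 when both are 0)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical causality-bearing EDU identifiers.
gold_edus = {"edu1", "edu2", "edu3", "edu4"}
extracted_edus = {"edu1", "edu2", "edu3", "edu9"}

precision, recall = precision_recall(extracted_edus, gold_edus)
f1 = f1_score(precision, recall)
```

    A system can trade one measure for the other, which is why records such as this one report precision and recall together.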

  11. TRANSLATING NEWS TEXTS FOR SPECIFIC LINGUISTIC AUDIENCES

    Directory of Open Access Journals (Sweden)

    Cătălina COMĂNECI

    2011-01-01

    Full Text Available In a world marked by communication and conflict, mass media tends to minimize the essential role of translation in facilitating linguistic and cultural exchanges on the international scene. This paper purports to present and explain the situations in which translators have to fill the gap existing between translation and media projects, as well as to examine the ways in which geographic, socio-cultural and linguistic coordinates may influence the process of editing (and sometimes transediting) of the global news. The methods used for highlighting cultural differences are both quantitative (based on a selection of articles from the British, French and Romanian press; for example, Romania's 2009 presidential elections and their echoes in the British and French press) and qualitative (particularly documentary, based on the latest research in the field). The inductive methods consist in identifying the textual and extra-textual strategies involved in the translation process and in exemplifying the editorial conventions applicable to the news coming from a different socio-cultural context. The expected outcomes of this paper are to highlight the causes of the refractions undergone by source information and to emphasize the translator's overlooked role as, most of the time, (s)he remains invisible in order to guarantee the quality of the translation and to respect the work and vision of the person producing the news. Last but not least, when it comes to news translation, the word translation itself gains new meanings, different from its traditional ones, as readers are totally unaware of the translational operations the articles they are reading have been through.

  12. Texts as Mirrors, Texts as Windows: Black Adolescent Boys and the Complexities of Textual Relevance

    Science.gov (United States)

    Sciurba, Katie

    2015-01-01

    Discussions of culturally relevant and "boy" literature stress the importance of offering readers occasions to see themselves in texts. However, young men of color have had few opportunities within this discourse to reveal their own experiences with literature. Rather than make presumptions about how texts serve as mirrors to them, as…

  13. Picture or Text First? Explaining Sequence Effects When Learning with Pictures and Text

    Science.gov (United States)

    Eitel, Alexander; Scheiter, Katharina

    2015-01-01

    The present article reviews 42 studies investigating the role of sequencing of text and pictures for learning outcomes. Whereas several of the reviewed studies revealed better learning outcomes from presenting the picture before the text rather than after it, other studies demonstrated the opposite effect. Against the backdrop of theories on…

  14. The effects of generative testing on text retention and text comprehension

    NARCIS (Netherlands)

    Dirkx, Kim; Kester, Liesbeth; Kirschner, Paul A.

    2011-01-01

    Dirkx, K. J. H., Kester, L., & Kirschner, P. A. (2011, 30 August). The effects of generative testing methods on text retention and text comprehension. Paper presented at the annual meeting of the European Association for Research on Learning and Instruction, Exeter, United Kingdom.

  15. The Link between Text Difficulty, Reading Speed and Exploration of Printed Text during Shared Book Reading

    Science.gov (United States)

    Roy-Charland, Annie; Perron, Melanie; Turgeon, Krystle-Lee; Hoffman, Nichola; Chamberland, Justin A.

    2016-01-01

    In the current study the reading speed of the narration and the difficulty of the text were manipulated and links were explored with children's attention to the printed text in shared book reading. Thirty-nine children (24 grade 1 and 15 grade 2) were presented easy and difficult books at slow (syllable by syllable) or fast (adult reading speed)…

  16. Comparative Effects of Computer-Based Concept Maps, Refutational Texts, and Expository Texts on Science Learning

    Science.gov (United States)

    Adesope, Olusola O.; Cavagnetto, Andy; Hunsu, Nathaniel J.; Anguiano, Carlos; Lloyd, Joshua

    2017-01-01

    This study used a between-subjects experimental design to examine the effects of three different computer-based instructional strategies (concept map, refutation text, and expository scientific text) on science learning. Concept maps are node-link diagrams that show concepts as nodes and relationships among the concepts as labeled links.…

  17. Fostering Multiple Text Comprehension: How Metacognitive Strategies and Motivation Moderate the Text-Belief Consistency Effect

    Science.gov (United States)

    Maier, Johanna; Richter, Tobias

    2014-01-01

    Learners often have difficulties comprehending multiple texts about controversial scientific issues. In particular, learners with strong prior beliefs tend to construct a one-sided mental representation that is biased towards belief-consistent information (text-belief consistency effect). In the present study we examined the effectiveness of…

  18. What can measures of text comprehension tell us about creative text production?

    NARCIS (Netherlands)

    Bos, Lisanne T.; de Koning, Bjorn; van Wesel, F.; Boonstra, Marije; van der Schoot, Menno

    2015-01-01

    Evidence is accumulating that the level of text comprehension is dependent on the situatedness and sensory richness of a child's mental representation formed during reading. This study investigated whether these factors involved in text comprehension also serve a functional role in writing a narrati

  19. "Romeo and Juliet" in the Minneapolis Public Schools: Accurate Text or Bowdlerized Text?

    Science.gov (United States)

    Reed, Margaret A.

    In 1984, parents of a Minneapolis, Minnesota, ninth grader came before the school district's "Students' Right to Learn Committee" to object to what they described as a bowdlerized version of "Romeo and Juliet" in the Scott, Foresman text, and the publisher's failure to acknowledge in the text that the play was abridged. The committee concurred…

  20. Semantic-based image retrieval by text mining on environmental texts

    Science.gov (United States)

    Yang, Hsin-Chang; Lee, Chung-Hong

    2003-01-01

    In this paper we propose a novel method to bridge the 'semantic gap' between a user's information need and the image content. The semantic gap describes the major deficiency of content-based image retrieval (CBIR) systems, which use visual features extracted from images to describe the images. We overcome this deficiency by extracting the semantics of an image from the environmental texts around it. Since an image generally co-exists with accompanying texts in various formats, we may rely on such environmental texts to discover the semantics of the image. A text mining approach based on self-organizing maps is used to extract the semantics of an image from its environmental texts. We performed experiments on a small set of images and obtained promising results.

  1. Teacher, text, and inquiry science: Mediating instructional conversations on content, reasoning, and informational text

    Science.gov (United States)

    Pesko, Ellen Lawrence

    This dissertation is a case study of an accomplished elementary teacher and her fourth grade students as they work in an inquiry science environment using a text in the form of a scientist's notebook. The text was specifically designed to interplay with students' first-hand investigations. It is a descriptive study of the instructional moves and decisions of this teacher as she negotiated the competing goals of learning how to read informational text, learning science content, and engaging in scientific reasoning. The goal is to provide research on the mediation of text that is discipline specific and designed to complement a first-hand inquiry in the context of an elementary classroom. Although the focus of the study is on the teacher's instructional moves, it is impossible to talk about teacher mediation without discussing the kinds of challenges her students experience in learning from the text. Earlier research with the notebook texts on the topic of light (Cutter, Vincent, Magnusson & Palincsar, 2001; Ford, 1999; Palincsar, Magnusson & Hapgood, 2001) captured the role of the teacher in mediating interactions with text. Findings related to the role of teachers were: (1) teachers help to make explicit connections between the texts and students' first hand experiences, and (2) teachers guide and shape discussions in multiple ways (e.g. modeling their thinking about the scientist's questions, procedures, data and claims) (Magnusson & Palincsar, 2004). It is hoped that this dissertation will extend and add to what we know about teacher mediation of these texts, especially the mediation of features such as figures and tables. Findings of the study include the type of learning community that facilitated instructional goals, the content and reasoning opportunities that were taken up or omitted, and the teacher's instructional moves that supported learning from informational text.

  2. Discovering gene annotations in biomedical text databases

    Directory of Open Access Journals (Sweden)

    Ozsoyoglu Gultekin

    2008-03-01

    Full Text Available Abstract Background: Genes and gene products are frequently annotated with Gene Ontology concepts based on the evidence provided in genomics articles. Manually locating and curating information about a genomic entity from the biomedical literature requires vast amounts of human effort. Hence, there is clearly a need for automated computational tools to annotate the genes and gene products with Gene Ontology concepts by computationally capturing the related knowledge embedded in textual data. Results: In this article, we present an automated genomic entity annotation system, GEANN, which extracts information about the characteristics of genes and gene products in article abstracts from PubMed, and translates the discovered knowledge into Gene Ontology (GO) concepts, a widely used standardized vocabulary of genomic traits. GEANN utilizes textual "extraction patterns" and a semantic matching framework to locate phrases matching a pattern and produce Gene Ontology annotations for genes and gene products. In our experiments, GEANN reached a precision of 78% at a recall of 61%. On a select set of Gene Ontology concepts, GEANN either outperforms or is comparable to two other automated annotation studies. Use of WordNet for semantic pattern matching improves the precision and recall by 24% and 15%, respectively, and the improvement due to semantic pattern matching becomes more apparent as the Gene Ontology terms become more general. Conclusion: GEANN is useful for two distinct purposes: (i) automating the annotation of genomic entities with Gene Ontology concepts, and (ii) providing existing annotations with additional "evidence articles" from the literature. The use of textual extraction patterns that are constructed based on the existing annotations achieves high precision. The semantic pattern matching framework provides a more flexible pattern matching scheme with respect to "exact matching" with the advantage of locating approximate

  3. MeInfoText: associated gene methylation and cancer information from text mining

    Directory of Open Access Journals (Sweden)

    Juan Hsueh-Fen

    2008-01-01

    Full Text Available Abstract Background DNA methylation is an important epigenetic modification of the genome. Abnormal DNA methylation may result in silencing of tumor suppressor genes and is common in a variety of human cancer cells. As more epigenetics research is published electronically, it is desirable to extract relevant information from biological literature. To facilitate epigenetics research, we have developed a database called MeInfoText to provide gene methylation information from text mining. Description MeInfoText presents comprehensive association information about gene methylation and cancer, the profile of gene methylation among human cancer types and the gene methylation profile of a specific cancer type, based on association mining from large amounts of literature. In addition, MeInfoText offers integrated protein-protein interaction and biological pathway information collected from the Internet. MeInfoText also provides pathway cluster information regarding a set of genes that may contribute to the development of cancer due to aberrant methylation. The extracted evidence, with highlighted keywords and the gene names identified from each methylation-related abstract, is also retrieved. The database is now available at http://mit.lifescience.ntu.edu.tw/. Conclusion MeInfoText is a unique database that provides comprehensive gene methylation and cancer association information. It will complement existing DNA methylation information and will be useful in epigenetics research and the prevention of cancer.

  4. Sustainable packaging. Packaging for a circular economy; Duurzaam verpakken. Verpakken voor de circulaire economie

    Energy Technology Data Exchange (ETDEWEB)

    Haffmans, S. [Partners for Innovation, Amsterdam (Netherlands)]; Standhardt, G. [Nederlands Verpakkingscentrum NVC, Gouda (Netherlands)]; Hamer, A. [Agentschap NL, Utrecht (Netherlands)]

    2013-10-15

    What is Sustainable Packaging? And what is the most sustainable packaging for a product? The publication is intended for anyone who wants to take into account the environment in the design of a product and packaging. It offers concrete suggestions and inspiring examples to bring sustainable packaging into practice [Dutch] Wat is Duurzaam Verpakken? En wat is de duurzaamste verpakking voor mijn product? De publicatie is bestemd voor iedereen die rekening wil houden met het milieu bij het ontwerp van een product-verpakkingscombinatie. Ze biedt concrete aanknopingspunten en inspirerende voorbeelden om hier praktisch mee aan de slag te gaan.

  5. Seismic failure of foundations by loss of bearing capacity: the case of circular footings; Rupture sismique des fondations par perte de capacité portante: Le cas des semelles circulaires

    CERN Document Server

    Chatzigogos, Charisis; Salençon, J

    2008-01-01

    Within the context of earthquake-resistant design of shallow foundations, the present study is concerned with the determination of the seismic bearing capacity of a circular footing resting on the surface of a heterogeneous purely cohesive semi-infinite soil layer. In the first part of the paper, a database, containing case histories of civil engineering structures that sustained a foundation seismic bearing capacity failure, is briefly presented, aiming at a better understanding of the studied phenomenon and offering a number of case studies useful for validation of theoretical computations. In the second part of the paper, the aforementioned problem is addressed using the kinematic approach of the Yield Design theory, thus establishing optimal upper bounds for the ultimate seismic loads supported by the soil-footing system. The results lead to the establishment of some very simple guidelines that extend the existing formulae for the seismic bearing capacity contained in the European norms (proposed for st...

  6. AUTOMATED TEXT CLUSTERING OF NEWSPAPER AND SCIENTIFIC TEXTS IN BRAZILIAN PORTUGUESE: ANALYSIS AND COMPARISON OF METHODS

    Directory of Open Access Journals (Sweden)

    Alexandre Ribeiro Afonso

    2014-10-01

    Full Text Available This article reports the findings of an empirical study of Automated Text Clustering applied to scientific articles and newspaper texts in Brazilian Portuguese. The objective was to find the most effective computational method for clustering the input texts into their original groups. The study covered four experiments, and each experiment had four procedures: 1. Corpus Selection (a set of texts is selected for clustering); 2. Word Class Selection (nouns, verbs and adjectives are chosen from each text by using specific algorithms); 3. Filtering Algorithms (a set of terms is selected from the results of the previous stage, a semantic weight is inserted for each term, and an index is generated for each text); 4. Clustering Algorithms (the clustering algorithms Simple K-Means, sIB and EM are applied to the indexes). After those procedures, statistical results on clustering correctness and clustering time were collected. The sIB clustering algorithm is the best choice for both the scientific and the newspaper corpus, with the caveat that sIB requires the number of clusters as input before running (for the newspaper corpus, 68.9% correctness in 1 minute; for the scientific corpus, 77.8% correctness in 1 minute). The EM clustering algorithm additionally guesses the number of clusters without user intervention, but its best case is less than 53% correctness. Considering the experiments carried out, the results of human text classification and automated clustering are far apart; it was also observed that clustering correctness varies with the number of input texts and their topics.
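
    The clustering stage described above can be illustrated with a minimal sketch of plain k-means over term-count vectors. It is not the study's setup: the word class selection and semantic weighting steps are skipped, the sample texts are invented, and the initial centroids are picked by hand (all assumptions made for illustration). Like sIB in the abstract, k-means needs the number of clusters up front, here given implicitly by the number of initial centroids.

```python
from collections import Counter


def vectorize(texts):
    """Turn whitespace-tokenized texts into term-count vectors over a shared vocabulary."""
    vocab = sorted({w for t in texts for w in t.lower().split()})
    vectors = []
    for t in texts:
        counts = Counter(t.lower().split())
        vectors.append([counts[w] for w in vocab])
    return vectors


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def k_means(vectors, initial_centroids, iterations=10):
    """Plain k-means: assign each vector to its nearest centroid, then recompute centroids."""
    centroids = [list(c) for c in initial_centroids]
    labels = [0] * len(vectors)
    for _ in range(iterations):
        labels = [
            min(range(len(centroids)), key=lambda j: squared_distance(v, centroids[j]))
            for v in vectors
        ]
        for j in range(len(centroids)):
            members = [v for v, label in zip(vectors, labels) if label == j]
            if members:  # keep the old centroid if a cluster ends up empty
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels


# Hypothetical mini-corpus: two sports texts and two science texts.
texts = [
    "goal match team goal",
    "team match striker",
    "vaccine trial protein",
    "protein gene trial",
]
vectors = vectorize(texts)
labels = k_means(vectors, initial_centroids=[vectors[0], vectors[2]])
print(labels)
```

    Comparing such labels with the known origin of each text is one way to obtain a correctness figure of the kind the abstract reports, although the abstract does not say how its correctness measure was computed.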

  7. Using LSA and text segmentation to improve automatic Chinese dialogue text summarization

    Institute of Scientific and Technical Information of China (English)

    LIU Chuan-han; WANG Yong-cheng; ZHENG Fei; LIU De-rong

    2007-01-01

    Automatic Chinese text summarization for dialogue style is a relatively new research area. In this paper, Latent Semantic Analysis (LSA) is first used to extract semantic knowledge from a given document, all question paragraphs are identified, an automatic text segmentation approach analogous to TextTiling is exploited to improve the precision of correlating question paragraphs and answer paragraphs, and finally some "important" sentences are extracted from the generic content and the question-answer pairs to generate a complete summary. Experimental results showed that our approach is highly efficient and improves significantly the coherence of the summary while not compromising informativeness.

  8. TextFlow: towards better understanding of evolving topics in text.

    Science.gov (United States)

    Cui, Weiwei; Liu, Shixia; Tan, Li; Shi, Conglei; Song, Yangqiu; Gao, Zekai J; Tong, Xin; Qu, Huamin

    2011-12-01

    Understanding how topics evolve in text data is an important and challenging task. Although much work has been devoted to topic analysis, the study of topic evolution has largely been limited to individual topics. In this paper, we introduce TextFlow, a seamless integration of visualization and topic mining techniques, for analyzing various evolution patterns that emerge from multiple topics. We first extend an existing analysis technique to extract three-level features: the topic evolution trend, the critical event, and the keyword correlation. Then a coherent visualization that consists of three new visual components is designed to convey complex relationships between them. Through interaction, the topic mining model and visualization can communicate with each other to help users refine the analysis result and gain insights into the data progressively. Finally, two case studies are conducted to demonstrate the effectiveness and usefulness of TextFlow in helping users understand the major topic evolution patterns in time-varying text data.

  9. Text4baby: Development and Implementation of a National Text Messaging Health Information Service

    Science.gov (United States)

    Whittaker, Robyn; Meehan, Judy; Jordan, Elizabeth; Stange, Paul; Cash, Amanda; Meyer, Paul; Baitty, Julie; Johnson, Pamela; Ratzan, Scott; Rhee, Kyu

    2012-01-01

    Text4baby is the first free national health text messaging service in the United States that aims to provide timely information to pregnant women and new mothers to help them improve their health and the health of their babies. Here we describe the development of the text messages and the large public–private partnership that led to the national launch of the service in 2010. Promotion at the local, state, and national levels produced rapid uptake across the United States. More than 320 000 people enrolled with text4baby between February 2010 and March 2012. Further evaluations of the effectiveness of the service are ongoing; however, important lessons can be learned from its development and uptake. PMID:23078509

  10. Metacognition and learning from text: Constructing a metacognitive questionnaire for text studying

    NARCIS (Netherlands)

    Schellings, G.; van Hout-Wolters, B.; Maes, A.; Ainsworth, S.

    2008-01-01

    Teaching metacognitive strategies in learning from text is an important educational objective. So, metacognitive assessment methods are necessary within school settings. Although the advantages of metacognitive questionnaires are numerous, the convergent validity (correlation) with the thinking alou

  11. SEGMENTATION OF OVERLAPPING TEXT LINES, CHARACTERS IN PRINTED TELUGU TEXT DOCUMENT IMAGES

    Directory of Open Access Journals (Sweden)

    M Swamy Das

    2010-11-01

    Full Text Available Segmentation is an important task of any OCR system. It separates the text in document images into lines, words and characters. The accuracy of an OCR system mainly depends on the segmentation algorithm being used. Segmenting Telugu text is difficult compared with Latin-based languages because of its structural complexity and larger character set. It contains vowels, consonants and compound characters. Some of the characters may overlap. Profile-based methods can only segment non-overlapping lines and characters. This paper addresses the segmentation of overlapped text lines and characters. The proposed algorithm is based on projection profiles, connected components and spatial vertical relationships. It also uses the nearest neighborhood method to cluster the connected components. Experimental results show that 100% line segmentation accuracy and about 98% character segmentation accuracy can be achieved with overlapping lines and characters.
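
    As a point of reference for the profile-based baseline mentioned in this abstract, the sketch below segments text lines with a horizontal projection profile. It is only the simple baseline, not the proposed algorithm: it assumes a binary image given as a list of rows with 1 for ink and 0 for background, and it splits lines wherever a row has no ink, which is exactly why it cannot separate overlapping lines. The function names and the tiny sample page are hypothetical.

```python
def horizontal_profile(image):
    """Count ink pixels (value 1) in each row of a binary image."""
    return [sum(row) for row in image]


def segment_lines(image, min_ink=1):
    """Return (start_row, end_row) pairs, end exclusive, for runs of rows containing ink."""
    lines = []
    start = None
    for y, ink in enumerate(horizontal_profile(image)):
        if ink >= min_ink and start is None:
            start = y  # a text line begins
        elif ink < min_ink and start is not None:
            lines.append((start, y))  # a blank row closes the line
            start = None
    if start is not None:
        lines.append((start, len(image)))
    return lines


# Hypothetical 6x4 page with two text lines separated by a blank row.
page = [
    [0, 0, 0, 0],
    [1, 1, 0, 1],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
    [1, 0, 0, 1],
    [0, 0, 0, 0],
]
print(segment_lines(page))  # [(1, 3), (4, 5)]
```

    When descenders of one line reach into the rows of the next, no blank row exists between them and the two lines are returned as one span, which is the case the connected-component step in the abstract is meant to handle.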

  12. On the Wording of Texts: A Study of Intra-Text Word Frequency.

    Science.gov (United States)

    Goodman, Kenneth S.; Bird, Lois Bridges

    1984-01-01

    Describes and examines the word choice and frequency in six tests and raises questions about the use of word lists and controlled vocabulary in producing basal readers, judging and manipulating readability of texts, and building vocabulary. (HOD)

  13. Making sense of text : skills that support text comprehension and its development.

    OpenAIRE

    Cain, Kate

    2009-01-01

    Skilled reading involves two main components: word reading and text comprehension. In this article, I focus on three skills that have been shown to support the latter: integration and inference, comprehension monitoring, and knowledge and use of story structure. Research has shown that children with unexpectedly poor reading comprehension have difficulties with each of these text processing skills and that each skill contributes to development in reading comprehension during middle childhood....

  14. TextGen: a realistic text data content generation method for modern storage system benchmarks

    Institute of Scientific and Technical Information of China (English)

    Long-xiang WANG; Xiao-she DONG; Xing-jun ZHANG; Yin-feng WANG; Tao JU; Guo-fu FENG

    2016-01-01

    Modern storage systems incorporate data compressors to improve their performance and capacity. As a result, data content can significantly influence the result of a storage system benchmark. Because real-world proprietary datasets are too large to be copied onto a test storage system, and most data cannot be shared due to privacy issues, a benchmark needs to generate data synthetically. To ensure that the result is accurate, it is necessary to generate data content based on the characterization of real-world data properties that influence the storage system performance during the execution of a benchmark. The existing approach, called SDGen, cannot guarantee that the benchmark result is accurate in storage systems that have built-in word-based compressors. The reason is that SDGen characterizes the properties that influence compression performance only at the byte level, and no properties are characterized at the word level. To address this problem, we present TextGen, a realistic text data content generation method for modern storage system benchmarks. TextGen builds the word corpus by segmenting real-world text datasets, and creates a word-frequency distribution by counting each word in the corpus. To improve data generation performance, the word-frequency distribution is fitted to a lognormal distribution by maximum likelihood estimation. The Monte Carlo approach is used to generate synthetic data. The running time of TextGen generation depends only on the expected data size, which means that the time complexity of TextGen is O(n). To evaluate TextGen, four real-world datasets were used to perform an experiment. The experimental results show that, compared with SDGen, the compression performance and compression ratio of the datasets generated by TextGen deviate less from real-world datasets when end-tagged dense code, a representative of word-based compressors, is evaluated.
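
    The three steps named in this abstract (word counting, a lognormal fit by maximum likelihood, and Monte Carlo generation) can be sketched as follows. This is an illustration under stated assumptions, not TextGen itself: the whitespace segmentation, the function names, and the way the fitted distribution is turned into per-word sampling weights are all hypothetical choices. The maximum likelihood estimates for a lognormal distribution are the mean and the (population) standard deviation of the logarithms of the observations.

```python
import math
import random
from collections import Counter


def word_frequencies(text):
    """Segment text on whitespace and count each word."""
    return Counter(text.lower().split())


def fit_lognormal(values):
    """Maximum likelihood estimates (mu, sigma) of a lognormal distribution."""
    logs = [math.log(v) for v in values]
    mu = sum(logs) / len(logs)
    sigma = math.sqrt(sum((x - mu) ** 2 for x in logs) / len(logs))
    return mu, sigma


def generate(words, mu, sigma, n_words, seed=0):
    """Monte Carlo generation: draw one lognormal weight per word, then sample words.

    Assumption: weights drawn from the fitted distribution stand in for the
    real word frequencies; the abstract does not describe this step in detail.
    """
    rng = random.Random(seed)
    weights = [rng.lognormvariate(mu, sigma) for _ in words]
    return rng.choices(words, weights=weights, k=n_words)


# Hypothetical toy corpus.
freqs = word_frequencies("the cat sat on the mat the end")
mu, sigma = fit_lognormal(list(freqs.values()))
sample = generate(list(freqs), mu, sigma, n_words=20, seed=1)
print(" ".join(sample))
```

    A fixed seed keeps a generated dataset reproducible between benchmark runs, which is a design choice of this sketch rather than something the abstract states.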

  15. PROSAIC TEXTS OF ABBÂS VESIM AND INVESTIGATING OF THE POEMS IN THESE TEXTS

    Directory of Open Access Journals (Sweden)

    İbrahim HALİL TUĞLUK

    2015-12-01

    Full Text Available Classical Turkish literature was shaped to a large extent by Persian literature, developed its own language, and came to form a major period of Turkish literature. Its basic means of expression is poetry. Prose always remained in the shadow of poetry as a second-class form; prose texts usually deal with didactic subjects such as history, geography, religious sciences, astronomy, medicine and biography. Prose wording also differs according to purpose and content. An important feature of the prose form is its effort to approach the poetic form. Harmony in language, created especially by rhythm, brought prose texts still closer to poetic wording. Studies of the poetic passages in prose texts are important for establishing how many such passages occur in Classical Turkish prose literature, for identifying the combinations of meaning and harmony that poetry creates within prose, and for describing the classical cultural foundations of Ottoman society. Many scholars and craftsmen came to prominence in Ottoman history through their literary and scholarly identity. Abbâs Vesîm, who lived in the 18th century, is among them, and his works therefore deserve to be investigated from many perspectives. In this context, it is important to examine the poetic sections of the works on medicine and astrology that Abbâs Vesîm wrote outside literature, in order to establish poetic wording in prose texts and to describe the wording features of prose texts. This study aims to examine the prose works of Abbâs Vesîm, a poet of the 18th century, to identify the copies, to transcribe the poetic sections in these works, and to examine these works in terms of form and content.

  16. Understanding Adolescent Nonresponsiveness to Text Messages: Lessons from the DepoText Trial

    OpenAIRE

    Irons, Mallory; Tomaszewski, Kathy; Muñoz Buchanan, Cara R.; Trent, Maria

    2015-01-01

    Urban adolescents face economic, social, and behavioral challenges in adhering to long-term contraceptive use. Use of text messaging reminders has the potential to increase adherence to family planning appointments and to educate patients about safe sexual health practices; however, nonresponsiveness to messages is difficult to interpret and may jeopardize programmatic success. We aimed to understand why adolescent girls enrolled in a randomized, controlled pilot trial (DepoText) designed to ...

  17. Texting while driving: psychosocial influences on young people's texting intentions and behaviour.

    Science.gov (United States)

    Nemme, Heidi E; White, Katherine M

    2010-07-01

    Despite the dangers and illegality, there is a continued prevalence of texting while driving amongst young Australian drivers. The present study tested an extended theory of planned behaviour (TPB) to predict young drivers' (17-24 years) intentions to [1] send and [2] read text messages while driving. Participants (n=169 university students) completed measures of attitudes, subjective norm, perceived behavioural control, intentions, and the additional social influence measures of group norm and moral norm. One week later, participants reported on the number of texts sent and read while driving in the previous week. Attitude predicted intentions to both send and read texts while driving, and subjective norm and perceived behavioural control determined sending, but not reading, intentions. Further, intention, but not perceptions of control, predicted both texting behaviours 1 week later. In addition, both group norm and moral norm added predictive ability to the model. These findings provide support for the TPB in understanding students' decisions to text while driving as well as the inclusion of additional normative influences within this context, suggesting that a multi-strategy approach is likely to be useful in attempts to reduce the incidence of these risky driving behaviours.

  18. Towards text simplification for poor readers with intellectual disability: when do connectives enhance text cohesion?

    Science.gov (United States)

    Fajardo, Inmaculada; Tavares, Gema; Ávila, Vicenta; Ferrer, Antonio

    2013-04-01

    Cohesive elements of texts such as connectives (e.g., but, in contrast) are expected to facilitate inferential comprehension in poor readers. Two experiments tested this prediction in poor readers with intellectual disability (ID) by: (a) comparing literal and inferential text comprehension of texts with and without connectives and/or high frequency content words (Experiment 1) and (b) exploring the effects of type and familiarity of connectives on two-clause text comprehension by means of a cloze task (Experiment 2). Neither the addition of high frequency content words nor connectives in general produced inferential comprehension improvements. However, although readers with ID were less likely to select the target connective in the cloze task than chronologically age-matched readers (mean age=21 years) in general, their performance was affected by the type of connective and its familiarity. Familiarity had a facilitative effect for additive and contrastive connectives, but interfered in the case of temporal and causal connectives. The average performance of a reading level-matched control group (typically developing children) was similar to the group of readers with ID although the pattern of interaction between familiarity and type of connectives varied between groups. The implications of these findings for the adaptation of texts in special education contexts are discussed.

  19. Social Media Text Classification by Enhancing Well-Formed Text Trained Model

    Directory of Open Access Journals (Sweden)

    Phat Jotikabukkana

    2016-09-01

    Full Text Available Social media are a powerful communication tool in our era of digital information. The large amount of user-generated data is a useful novel source of data, even though it is not easy to extract the treasures from this vast and noisy trove. Since classification is an important part of text mining, many techniques have been proposed to classify this kind of information. We developed an effective technique of social media text classification by semi-supervised learning utilizing an online news source consisting of well-formed text. The computer first automatically extracts news categories, well-categorized by publishers, as classes for topic classification. A bag of words taken from news articles provides the initial keywords related to their category in the form of word vectors. The principal task is to retrieve a set of new productive keywords. Term Frequency-Inverse Document Frequency weighting (TF-IDF) and Word Article Matrix (WAM) are used as the main methods. A modification of WAM is recomputed until it becomes the most effective model for social media text classification. The key success factor was enhancing our model with effective keywords from social media. A promising result of 99.50% accuracy was achieved, with more than 98.5% of Precision, Recall, and F-measure after updating the model three times.
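
    The TF-IDF weighting named in this abstract can be sketched in a few lines. The variant below (relative term frequency multiplied by log(N / df)) is one common textbook form and is an assumption here, since the abstract does not say which variant the authors used; the sample documents are invented, and the Word Article Matrix step is not covered.

```python
import math
from collections import Counter


def tf_idf(documents):
    """Return one {term: weight} dict per tokenized document.

    tf is the relative frequency of the term in the document;
    idf is log(N / df), where df is the number of documents containing the term.
    """
    n_docs = len(documents)
    df = Counter(term for doc in documents for term in set(doc))
    weights = []
    for doc in documents:
        counts = Counter(doc)
        weights.append({
            term: (count / len(doc)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


# Hypothetical news snippets, already tokenized.
docs = [
    "election vote government".split(),
    "football match vote".split(),
    "football goal match".split(),
]
weights = tf_idf(docs)
print(weights[0])
```

    Terms that occur in few documents, such as "election" or "goal" in this toy corpus, receive higher weights than terms shared across documents, which is what makes them candidates for category keywords.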

  20. An enhanced text categorization method based on improved text frequency approach and mutual information algorithm

    Institute of Scientific and Technical Information of China (English)

    2007-01-01

    Text categorization plays an important role in data mining. Feature selection is the most important process of text categorization. Focused on feature selection, we present an improved text frequency method for filtering of low frequency features to deal with the data preprocessing, propose an improved mutual information algorithm for feature selection, and develop an improved tf.idf method for characteristic weights evaluation. The proposed method is applied to the benchmark test set Reuters-21578 Top10 to examine its effectiveness. Numerical results show that the precision, the recall and the value of F1 of the proposed method are all superior to those of existing conventional methods.

  1. Effects of text genre and verbal ability on adult age differences in sensitivity to text structure.

    Science.gov (United States)

    Petros, T V; Norgaard, L; Olson, K; Tabor, L

    1989-06-01

    The present study examined the effects of verbal ability and text genre on adult age differences in sensitivity to the semantic structure of prose. Young and older adults of low or high verbal ability heard narrative and expository passages at different presentation rates. The results demonstrated that older adults recalled less than younger adults and that age differences in recall were larger for low-verbal adults and expository texts. However, subjects from all groups favored the main ideas in their recalls for both types of passages. The results indicated that adult age similarities in the ability to focus on the main ideas when processing prose was not compromised by the verbal ability of the subjects or the organization of the passages used. However, the results also demonstrate how the characteristics of the learner and the characteristics of the text modulate the size of the age differences observed.

  2. The Original Text and Translated Text in Derrida's Deconstruction Theory of Translation

    Institute of Scientific and Technical Information of China (English)

    2007-01-01

    Since the 1960s translation has made great progress on the way to becoming a systematic and scientific discipline. The theory of deconstruction, originating in France, has made great impact on traditional translation. It has become more influential in recent days. Through the discussion of deconstruction and its idea of translation, this thesis clarifies people's skeptical attitudes towards deconstruction and explains radical changes it has brought for translation field, especially in explaining the relationship between the original text and the translated text in Derrida's deconstruction theory. At the end of this thesis, the application and limitations of deconstruction are discussed.

  3. Understanding Adolescent Nonresponsiveness to Text Messages: Lessons from the DepoText Trial.

    Science.gov (United States)

    Irons, Mallory; Tomaszewski, Kathy; Muñoz Buchanan, Cara R; Trent, Maria

    2015-06-01

    Urban adolescents face economic, social, and behavioral challenges in adhering to long-term contraceptive use. Use of text messaging reminders has the potential to increase adherence to family planning appointments and to educate patients about safe sexual health practices; however, nonresponsiveness to messages is difficult to interpret and may jeopardize programmatic success. We aimed to understand why adolescent girls enrolled in a randomized, controlled pilot trial (DepoText) designed to increase attendance at family planning visits were periodically nonresponsive to text messages through conducting structured interviews with participants whose text reply rates were less than 100% during the trial period. Qualitative and quantitative data were collected and classified using descriptive data analysis. Reasons for nonresponsiveness, barriers to continuous cell phone coverage, cell phone plan characteristics, and attitudes toward the DepoText program were the primary endpoints of interest. Most participants (78%) attributed instances of nonresponsiveness to being away from the phone or due to a personal conflict such as school or work. Service interruption due to bill nonpayment (44%), phone loss (28%), and cell phone number change (28%) were significant barriers to continuous coverage during the trial period, and many respondents indicated that the downturn in the economy made it more difficult to maintain their cell phone plan. Almost a third reported having to choose between cell phone and other payments, but the vast majority (88%) considered their cell phone a "need" rather than a "want." Participants universally expressed satisfaction with the text messaging program and reported feeling more connected to the clinic (96%) through the messages serving as reminders (64%), encouragement to assume personal responsibility for their health care (12%), and enhanced personal connection with the clinic staff (4%). Our study suggests that a text messaging program can

  4. Practical text mining and statistical analysis for non-structured text data applications

    CERN Document Server

    Miner, Gary; Hill, Thomas; Nisbet, Robert; Delen, Dursun

    2012-01-01

    The world contains an unimaginably vast amount of digital information which is getting ever vaster ever more rapidly. This makes it possible to do many things that previously could not be done: spot business trends, prevent diseases, combat crime and so on. Managed well, the textual data can be used to unlock new sources of economic value, provide fresh insights into science and hold governments to account. As the Internet expands and our natural capacity to process the unstructured text that it contains diminishes, the value of text mining for information retrieval and search will increase d

  5. Text mining in livestock animal science: introducing the potential of text mining to animal sciences.

    Science.gov (United States)

    Sahadevan, S; Hofmann-Apitius, M; Schellander, K; Tesfaye, D; Fluck, J; Friedrich, C M

    2012-10-01

    In biological research, establishing the prior art by searching and collecting information already present in the domain has equal importance as the experiments done. To obtain a complete overview about the relevant knowledge, researchers mainly rely on 2 major information sources: i) various biological databases and ii) scientific publications in the field. The major difference between the 2 information sources is that information from databases is available, typically well structured and condensed. The information content in scientific literature is vastly unstructured; that is, dispersed among the many different sections of scientific text. The traditional method of information extraction from scientific literature occurs by generating a list of relevant publications in the field of interest and manually scanning these texts for relevant information, which is very time consuming. It is more than likely that in using this "classical" approach the researcher misses some relevant information mentioned in the literature or has to go through biological databases to extract further information. Text mining and named entity recognition methods have already been used in human genomics and related fields as a solution to this problem. These methods can process and extract information from large volumes of scientific text. Text mining is defined as the automatic extraction of previously unknown and potentially useful information from text. Named entity recognition (NER) is defined as the method of identifying named entities (names of real world objects; for example, gene/protein names, drugs, enzymes) in text. In animal sciences, text mining and related methods have been briefly used in murine genomics and associated fields, leaving behind other fields of animal sciences, such as livestock genomics. The aim of this work was to develop an information retrieval platform in the livestock domain focusing on livestock publications and the recognition of relevant data from

  6. Classification of protein-protein interaction full-text documents using text and citation network features.

    Science.gov (United States)

    Kolchinsky, Artemy; Abi-Haidar, Alaa; Kaur, Jasleen; Hamed, Ahmed Abdeen; Rocha, Luis M

    2010-01-01

    We participated (as Team 9) in the Article Classification Task of the BioCreative II.5 Challenge: binary classification of full-text documents relevant for protein-protein interaction. We used two distinct classifiers for the online and offline challenges: 1) the lightweight Variable Trigonometric Threshold (VTT) linear classifier we successfully introduced in BioCreative 2 for binary classification of abstracts and 2) a novel Naive Bayes classifier using features from the citation network of the relevant literature. We supplemented the supplied training data with full-text documents from the MIPS database. The lightweight VTT classifier was very competitive in this new full-text scenario: it was a top-performing submission in this task, taking into account the rank product of the Area Under the interpolated precision and recall Curve, Accuracy, Balanced F-Score, and Matthews Correlation Coefficient performance measures. The novel citation network classifier for the biomedical text mining domain, while not a top performing classifier in the challenge, performed above the central tendency of all submissions, and therefore indicates a promising new avenue to investigate further in bibliome informatics.
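
    Three of the performance measures listed in this abstract (Accuracy, balanced F-score and the Matthews correlation coefficient) follow directly from the four cells of a binary confusion matrix. The sketch below uses the standard definitions; the function name and the example counts are hypothetical and are not results from the challenge. The area under the interpolated precision and recall curve and the rank product are not covered.

```python
import math


def binary_metrics(tp, fp, fn, tn):
    """Accuracy, balanced F-score (F1), and Matthews correlation coefficient."""
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    # MCC is conventionally set to 0 when any marginal total is zero.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {"accuracy": accuracy, "f1": f1, "mcc": mcc}


# Hypothetical counts: 40 true positives, 10 false positives,
# 20 false negatives, 30 true negatives.
print(binary_metrics(tp=40, fp=10, fn=20, tn=30))
```

    Unlike accuracy, the Matthews coefficient uses all four cells symmetrically, so it stays informative when relevant and irrelevant documents are unevenly represented.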

  7. Public Text/Private Text: Making Visible the Voices That Shape Our Social Conscience.

    Science.gov (United States)

    Urion, Marilyn Vogler

    1995-01-01

    Discusses Julia Kristeva's notion of text--the tension between the semiotic and the symbolic--and how the tension can be made visible through typeface variation and other shaping techniques possible with word-processing software. Shares ways the author encourages students in first-year English classes to explore possibilities for incorporating…

  8. Text(ing) in Context: The Future of Workplace Communication in the United States

    Science.gov (United States)

    Kiddie, Thomas J.

    2014-01-01

    Following Rogers's theory of the diffusion of innovations, the author questions whether youth entering the workforce will act as change agents to evolve primary business communication channels from email to text-messaging. Expanding on research performed in 2009, the author investigates three communication scenarios: scheduling meetings,…

  9. “Girls Text Really Weird”: Gender, Texting and Identity Among Teens

    DEFF Research Database (Denmark)

    Ling, Richard; Baron, Naomi; Lenhart, Amanda

    2014-01-01

    This article examines the strategies used by teenagers for interacting with members of the opposite sex when texting. This article uses material from a series of nine focus groups from 2009 in four US cities. It reports on the strategies they use and the problems they encounter as they negotiate...

  10. Enhancing Summarization Skills Using Twin Texts: Instruction in Narrative and Expository Text Structures

    Science.gov (United States)

    Furtado, Leena; Johnson, Lisa

    2010-01-01

    This action-research case study endeavors to enhance the summarization skills of first grade students who are reading at or above the third grade level during the first trimester of the academic school year. Students read "twin text" sources, meaning, fiction and nonfiction literary selections focusing on a common theme to help identify and…

  11. Connected text reading and differences in text reading fluency in adult readers

    NARCIS (Netherlands)

    Wallot, S.; Hollis, G.; Rooij, M. de

    2013-01-01

    The process of connected text reading has received very little attention in contemporary cognitive psychology. This lack of attention is in part due to a research tradition that emphasizes the role of basic lexical constituents, which can be studied in isolated words or sentences. However, this lac

  12. Psychologie des discours et didactique des textes (Psychology of Discourse and the Teaching of Texts).

    Science.gov (United States)

    Bronckart, Jean-Paul, Ed.

    1995-01-01

    This collection of articles on the nature of discourse and writing instruction includes: "Une demarche de psychologie de discours; quelques aspects introductifs" ("An Application of Discourse Psychology; Introductory Thoughts") (Jean-Paul Bronckart); "Les procedes de prise en charge enonciative dans trois genres de textes expositifs" ("The Processes…

  14. Comprehending expository texts: the dynamic neurobiological correlates of building a coherent text representation.

    Science.gov (United States)

    Swett, Katherine; Miller, Amanda C; Burns, Scott; Hoeft, Fumiko; Davis, Nicole; Petrill, Stephen A; Cutting, Laurie E

    2013-01-01

    Little is known about the neural correlates of expository text comprehension. In this study, we sought to identify neural networks underlying expository text comprehension, how those networks change over the course of comprehension, and whether information central to the overall meaning of the text is functionally distinct from peripheral information. Seventeen adult subjects read expository passages while being scanned using functional magnetic resonance imaging (fMRI). By convolving phrase onsets with the hemodynamic response function (HRF), we were able to identify regions that increase and decrease in activation over the course of passage comprehension. We found that expository text comprehension relies on the co-activation of the semantic control network and regions in the posterior midline previously associated with mental model updating and integration [posterior cingulate cortex (PCC) and precuneus (PCU)]. When compared to single word comprehension, left PCC and left Angular Gyrus (AG) were activated only for discourse-level comprehension. Over the course of comprehension, reliance on the same regions in the semantic control network increased, while a parietal region associated with attention [intraparietal sulcus (IPS)] decreased. These results parallel previous findings in narrative comprehension that the initial stages of mental model building require greater visuospatial attention processes, while maintenance of the model increasingly relies on semantic integration regions. Additionally, we used an event-related analysis to examine phrases central to the text's overall meaning vs. peripheral phrases. It was found that central ideas are functionally distinct from peripheral ideas, showing greater activation in the PCC and PCU, while over the course of passage comprehension, central and peripheral ideas increasingly recruit different parts of the semantic control network. The finding that central information elicits greater response in mental model

  15. Mobile characters, mobile texts: homelessness and intertextuality in contemporary texts for young people

    Directory of Open Access Journals (Sweden)

    Mavis Reimer

    2013-06-01

    Full Text Available Since the 1990s, narratives about homelessness for and about young people have proliferated around the world. A cluster of thematic elements shared by many of these narratives of the age of globalization points to the deep anxiety that is being expressed about a social, economic, and cultural system under stress or struggling to find a new formation. More surprisingly, many of the narratives also use canonical cultural texts extensively as intertexts. This article considers three novels from three different national traditions to address the work of intertextuality in narratives about homelessness: Skellig by UK author David Almond, which was published in 1998; Chronicler of the Winds by Swedish author Henning Mankell, which was first published in 1988 in Swedish as Comédia Infantil and published in an English translation in 2006; and Stained Glass by Canadian author Michael Bedard, which was published in 2002. Using Julia Kristeva's definition of intertextuality as the “transposition of one (or several) sign systems into another,” I propose that all intertexts can be thought of as metaphoric texts, in the precise sense that they carry one text into another. In the narratives under discussion in this article, the idea of homelessness is in perpetual motion between texts and intertexts, ground and figure, the literal and the symbolic. What the child characters and the readers who take up the position offered to implied readers are asked to do, I argue, is to put on a way of seeing that does not settle, a way of being that strains forward toward the new.

  16. Caspofungin and voriconazole for fungal infections.

    Science.gov (United States)

    2004-01-01

    Systemic fungal infections are difficult to treat and often fatal. Established treatment options include conventional amphotericin B or one of its lipid-based or liposomal formulations, or a triazole antifungal such as fluconazole or itraconazole. Caspofungin (Cancidas--Merck Sharp & Dohme) and voriconazole (Vfend--Pfizer) are two new antifungals for severe infections caused by Candida spp. (invasive candidiasis) and Aspergillus spp. (invasive aspergillosis). Caspofungin is the first licensed echinocandin antifungal, while voriconazole is a triazole. Promotional claims for caspofungin include that it "provides an effective, yet less toxic, alternative to amphotericin B" while voriconazole is claimed to offer "significantly improved survival in invasive aspergillosis compared with amphotericin B". Here we consider the place of caspofungin and voriconazole in managing patients with severe fungal infections.

  17. Using Collaborative Tagging for Text Classification: From Text Classification to Opinion Mining

    Directory of Open Access Journals (Sweden)

    Eric Charton

    2013-11-01

    Full Text Available Numerous initiatives have allowed users to share knowledge or opinions using collaborative platforms. In most cases, the users provide a textual description of their knowledge, following very limited or no constraints. Here, we tackle the classification of documents written in such an environment. As a use case, our study is made in the context of text mining evaluation campaign material, related to the classification of cooking recipes tagged by users from a collaborative website. This context makes some of the corpus specificities difficult to model for machine-learning-based systems and keyword or lexical-based systems. In particular, different authors might have different opinions on how to classify a given document. The systems presented hereafter were submitted to the Défi Fouille de Textes 2013 evaluation campaign, where they obtained the best overall results, ranking first on task 1 and second on task 2. In this paper, we explain our approach for building relevant and effective systems dealing with such a corpus.

  18. A new graph based text segmentation using Wikipedia for automatic text summarization

    Directory of Open Access Journals (Sweden)

    Mohsen Pourvali

    2012-01-01

    Full Text Available The technology of automatic document summarization is maturing and may provide a solution to the information overload problem. Nowadays, document summarization plays an important role in information retrieval. With a large volume of documents, presenting the user with a summary of each document greatly facilitates the task of finding the desired documents. Document summarization is a process of automatically creating a compressed version of a given document that provides useful information to users, and multi-document summarization aims to produce a summary delivering the majority of the information content from a set of documents about an explicit or implicit main topic. In this paper, we use the knowledge base of Wikipedia and the words of the input text to create independent graphs. We then determine the importance of the graphs and identify the sentences whose topics have high importance. Finally, we extract the sentences with high importance. The experimental results on open benchmark datasets from DUC01 and DUC02 show that our proposed approach can improve the performance compared to state-of-the-art summarization approaches.

  19. Relating interesting quantitative time series patterns with text events and text features

    Science.gov (United States)

    Wanner, Franz; Schreck, Tobias; Jentner, Wolfgang; Sharalieva, Lyubka; Keim, Daniel A.

    2013-12-01

    In many application areas, the key to successful data analysis is the integrated analysis of heterogeneous data. One example is the financial domain, where time-dependent and highly frequent quantitative data (e.g., trading volume and price information) and textual data (e.g., economic and political news reports) need to be considered jointly. Data analysis tools need to support an integrated analysis, which allows studying the relationships between textual news documents and quantitative properties of the stock market price series. In this paper, we describe a workflow and tool that allows a flexible formation of hypotheses about text features and their combinations, which reflect quantitative phenomena observed in stock data. To support such an analysis, we combine the analysis steps of frequent quantitative and text-oriented data using an existing a-priori method. First, based on heuristics we extract interesting intervals and patterns in large time series data. The visual analysis supports the analyst in exploring parameter combinations and their results. The identified time series patterns are then input for the second analysis step, in which all identified intervals of interest are analyzed for frequent patterns co-occurring with financial news. An a-priori method supports the discovery of such sequential temporal patterns. Then, various text features like the degree of sentence nesting, noun phrase complexity, the vocabulary richness, etc. are extracted from the news to obtain meta patterns. Meta patterns are defined by a specific combination of text features which significantly differ from the text features of the remaining news data. Our approach combines a portfolio of visualization and analysis techniques, including time-, cluster- and sequence visualization and analysis functionality. We provide two case studies, showing the effectiveness of our combined quantitative and textual analysis workflow. The workflow can also be generalized to other

  20. On the reduction of generalized polylogarithms to $\text{Li}_n$ and $\text{Li}_{2,2}$ and on the evaluation thereof

    CERN Document Server

    Frellesvig, Hjalte; Wever, Christopher

    2016-01-01

    We give expressions for all generalized polylogarithms up to weight four in terms of the functions log, $\text{Li}_n$, and $\text{Li}_{2,2}$, valid for arbitrary complex variables. Furthermore we provide algorithms for manipulation and numerical evaluation of $\text{Li}_n$ and $\text{Li}_{2,2}$, and add codes in Mathematica and C++ implementing the results. With these results we calculate a number of previously unknown integrals, which we add in App. C.
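
    As a point of reference for the functions named in this abstract, the classical polylogarithm has the defining power series $\text{Li}_n(z) = \sum_{k \ge 1} z^k / k^n$ for $|z| < 1$. The sketch below simply sums that series; it is not the authors' algorithm (their codes are in Mathematica and C++ and cover arbitrary complex arguments and $\text{Li}_{2,2}$), and the function name, tolerance and term limit are illustrative assumptions.

```python
import math


def polylog_series(n, z, tol=1e-15, max_terms=100000):
    """Approximate Li_n(z) by direct summation of its defining series.

    Illustrative sketch only: valid for |z| < 1 and slow as |z| -> 1.
    Returns a complex number.
    """
    if abs(z) >= 1:
        raise ValueError("this series sketch only converges for |z| < 1")
    total = 0j
    power = 1 + 0j
    for k in range(1, max_terms + 1):
        power *= z              # power == z**k
        term = power / k**n
        total += term
        if abs(term) < tol:     # terms shrink geometrically for |z| < 1
            break
    return total


if __name__ == "__main__":
    # Known closed form: Li_2(1/2) = pi^2/12 - (ln 2)^2 / 2
    print(polylog_series(2, 0.5).real)
    print(math.pi**2 / 12 - math.log(2)**2 / 2)
```

    Evaluation outside the unit disc, as described in the abstract, requires transformation identities that this sketch does not attempt.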

  1. Book review: Le corps du texte. Pour une anthropologie des textes de la tradition juive.

    Directory of Open Access Journals (Sweden)

    I. Chiva

    1998-04-01

    Full Text Available It is the privilege of the ethnologist to grasp, more than others, the extent to which the body is unique in its functions and its diversity, in that it offers, as if from inside ourselves, a framework and a language for expressing the most hidden and buried part of our thoughts, feelings and soul. But the body is also, at the same time, a baseline and a language for expressing the part of ourselves that is most rooted in the culture, the most dependent on the social shaping of the indi...

  2. Paratextual Transactions: Text and Off Text in William Blake’s Milton and Jerusalem

    Directory of Open Access Journals (Sweden)

    Annalisa Volpone

    2016-07-01

    Full Text Available The essay focuses on the dynamics between text and paratext in William Blake's last two prophetic books, Milton and Jerusalem. Starting from Genette's reflections in Soglie, it considers how word and image interact in the plates and jointly produce meaning. As an "integrated" work of art, the prophetic books imply active participation by the reader, who must be able to interpret the multiple levels of interaction between the textual and paratextual components, within the complex frame of the apocalyptic literature to which Blake's late production belongs.

  3. Du texte mis entre parenthèses au texte dit à part

    Directory of Open Access Journals (Sweden)

    Xavier Leroux

    2010-12-01

    Full Text Available The study of the aside in La Celestina is based here on a codicological approach to several printed editions of Fernando de Rojas's work. A theoretical definition of the aside makes it possible to specify its different realizations in the theatre: the aside to the audience, the selective aside, and the aside to the self, which is clearly distinct from the monologue. In the dramatic text, however, identifying this dramatic form proves more delicate. The study of the use of parentheses in several printed editions that preserve La Celestina reveals a very careful use of these punctuation marks to signal recourse to the aside.

  4. A text mining approach to detect mentions of protein glycosylation in biomedical text.

    Science.gov (United States)

    Shukla, Daksha; Jayaraman, Valadi K

    2012-01-01

    Protein glycosylation is an important post-translational event that plays a pivotal role in protein folding and protein trafficking. We describe a dictionary-based and a rule-based approach to mine 'mentions' of protein glycosylation in text. The dictionary-based approach relies on a set of manually curated dictionaries specially constructed to address this task. Abstracts are screened for 'mentions' of words from these dictionaries, which are then scored and classified on the basis of a threshold. The rule-based approach also relies on the words in the dictionaries to arrive at the features used for classification. The performance of the system using both approaches has been evaluated on a manually curated corpus of 3133 abstracts. The evaluation suggests that the rule-based approach outperforms the dictionary-based approach.
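
    The dictionary-based step described in this abstract (screen an abstract for dictionary words, score the matches, classify against a threshold) can be sketched as follows. This is a toy illustration under stated assumptions, not the authors' system: the term list, the weights, the threshold and the function names are all hypothetical, and the curated dictionaries from the paper are not reproduced here.

```python
import re

# Hypothetical term weights, invented for illustration only.
GLYCO_TERMS = {
    "glycosylation": 3.0,
    "glycosylated": 3.0,
    "glycoprotein": 2.0,
    "n-linked": 2.0,
    "o-linked": 2.0,
    "glycan": 1.5,
}

# Lowercase word tokens, keeping internal hyphens (e.g. "n-linked").
TOKEN_RE = re.compile(r"[a-z0-9]+(?:-[a-z0-9]+)*")


def score_abstract(text, terms=GLYCO_TERMS):
    """Sum the weights of all dictionary terms found in the text."""
    tokens = TOKEN_RE.findall(text.lower())
    return sum(terms.get(token, 0.0) for token in tokens)


def mentions_glycosylation(text, threshold=3.0):
    """Classify as a 'mention' when the score reaches the (assumed) threshold."""
    return score_abstract(text) >= threshold


if __name__ == "__main__":
    print(mentions_glycosylation("N-linked glycosylation of the protein was observed."))
    print(mentions_glycosylation("The kinase phosphorylates its substrate."))
```

    In practice the threshold would be tuned on a labelled corpus such as the 3133-abstract set the authors mention; the value used here is arbitrary.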

  5. Strategic Use of Multiple Texts for the Evaluation of Arguments

    Science.gov (United States)

    Kobayashi, Keiichi

    2010-01-01

    Two experiments were conducted to examine whether students use arguments with refutation in one text for evaluating the opposite arguments without refutation in another text. Undergraduate students read two conflicting texts in either of the two orders: pro arguments text first and con arguments text first. After reading each text, they evaluated…

  6. A REVIEW ON TEXT MINING IN DATA MINING

    OpenAIRE

    2016-01-01

    Data mining is knowledge discovery in databases, and the goal is to extract patterns and knowledge from large amounts of data. An important part of data mining is text mining. Text mining extracts high-quality information from text. Statistical pattern learning is used to derive high-quality information. High quality in text mining refers to a combination of relevance, novelty and interestingness. Tasks in text mining are text categorization, text clustering, entity extraction and sentim...

  7. The Informational Text Structure Survey (ITS[superscript 2]): An Exploration of Primary Grade Teachers' Sensitivity to Text Structure in Young Children's Informational Texts

    Science.gov (United States)

    Reutzel, D. Ray; Jones, Cindy D.; Clark, Sarah K.; Kumar, Tamara

    2016-01-01

    There has been no research reported about if or how well primary grade teachers can identify information text structures in children's authentic informational texts. The ability to do so accurately and reliably is a prerequisite for teachers to be able to teach students how to recognize and use text structures to assist them in comprehending…

  8. GURMUKHI TEXT EXTRACTION FROM IMAGE USING SUPPORT VECTOR MACHINE (SVM)

    Directory of Open Access Journals (Sweden)

    SUKHWINDER KAUR

    2011-04-01

    Full Text Available Extensive research has been done on image classification for different purposes such as face recognition, identification of different objects, and identification/extraction of text from images with some background. Text identification is an active research area whereby a system tries to identify the text area in a given image. The identified text area is then passed to an OCR system for further recognition of the text. This work is about classifying image areas into two classes, text and non-text, using an SVM (support vector machine). We identified the features and trained a model based on the feature vector, which is then used to classify text and non-text areas in an image. The system reports 70.5% accuracy for caption text images, 70.43% for document text images and 50.40% for scene text images.

  9. Reading Spaced and Unspaced Chinese Text: Evidence from Eye Movements

    Science.gov (United States)

    Bai, Xuejun; Yan, Guoli; Liversedge, Simon P.; Zang, Chuanli; Rayner, Keith

    2008-01-01

    Native Chinese readers' eye movements were monitored as they read text that did or did not demark word boundary information. In Experiment 1, sentences had 4 types of spacing: normal unspaced text, text with spaces between words, text with spaces between characters that yielded nonwords, and finally text with spaces between every character. The…

  10. T-Scan: a new tool for analyzing Dutch text

    NARCIS (Netherlands)

    Pander Maat, H.L.W.; Kraf, R.L.; van den Bosch, Antal; van Gompel, Maarten; Kleijn, S.; Sanders, T.J.M.; van der Sloot, Ko

    2014-01-01

    T-Scan is a new tool for analyzing Dutch text. It aims at extracting text features that are theoretically interesting, in that they relate to genre and text complexity, as well as practically interesting, in that they enable users and text producers to make text-specific diagnoses. T-Scan derives it

  11. On the Functions of Lexical Collocation in English Texts

    Institute of Scientific and Technical Information of China (English)

    XIAO Fuliang

    2016-01-01

    Lexical collocation, as a cohesive device of an English text, is helpful to make up a cohesive and coherent text. Therefore, to better comprehend English text, different patterns and functions of lexical collocation should be guided in detail.

  12. Closely Reading Informational Texts in the Primary Grades

    Science.gov (United States)

    Fisher, Douglas; Frey, Nancy

    2014-01-01

    In this article we discuss the differences between close reading in the primary grades and upper elementary grades. We focus on text selection, initial reading, repeated reading, annotation, text-based discussions, and responding to texts.

  13. Techniques, Applications and Challenging Issue in Text Mining

    Directory of Open Access Journals (Sweden)

    Shaidah Jusoh

    2012-11-01

    Full Text Available Text mining is a very exciting research area as it tries to discover knowledge from unstructured texts. These texts can be found on a desktop, intranets and the internet. The aim of this paper is to give an overview of text mining in the contexts of its techniques, application domains and the most challenging issue. The focus is on fundamental methods of text mining, which include natural language processing and information extraction. This paper also gives a short review of domains which have employed text mining. The challenging issue in text mining, which is caused by the complexity of natural language, is also addressed in this paper.

  14. Acoustic Evaluation as a Variety of Text Metonymy

    OpenAIRE

    Ella V. Nesterik; Anna D. Matrossova

    2013-01-01

    The article deals with sensorial evaluation, namely, acoustic evaluation as a text-forming category, studied in terms of text linguistics and text stylistics. Acoustic evaluation is considered as a variety of text metonymy, a sort of stylistic device expressing characters’ emotional state and time perception metonymically

  15. What Oral Text Reading Fluency Can Reveal about Reading Comprehension

    Science.gov (United States)

    Veenendaal, Nathalie J.; Groen, Margriet A.; Verhoeven, Ludo

    2015-01-01

    Text reading fluency--the ability to read quickly, accurately and with a natural intonation--has been proposed as a predictor of reading comprehension. In the current study, we examined the role of oral text reading fluency, defined as text reading rate and text reading prosody, as a contributor to reading comprehension outcomes in addition to…

  16. Introducing Text Analytics as a Graduate Business School Course

    Science.gov (United States)

    Edgington, Theresa M.

    2011-01-01

    Text analytics refers to the process of analyzing unstructured data from documented sources, including open-ended surveys, blogs, and other types of web dialog. Text analytics has enveloped the concept of text mining, an analysis approach influenced heavily from data mining. While text mining has been covered extensively in various computer…

  17. What oral text reading fluency can reveal about reading comprehension

    NARCIS (Netherlands)

    Veenendaal, N.J.; Groen, M.A.; Verhoeven, L.T.W.

    2015-01-01

    Text reading fluency – the ability to read quickly, accurately and with a natural intonation – has been proposed as a predictor of reading comprehension. In the current study, we examined the role of oral text reading fluency, defined as text reading rate and text reading prosody, as a contributor t

  18. Drawing on Text Features for Reading Comprehension and Composing

    Science.gov (United States)

    Risko, Victoria J.; Walker-Dalhouse, Doris

    2011-01-01

    Students read multiple-genre texts such as graphic novels, poetry, brochures, digitized texts with videos, and informational and narrative texts. Features such as overlapping illustrations and implied cause-and-effect relationships can affect students' comprehension. Teaching with these texts and drawing attention to organizational features hold…

  19. Classroom Writing Tasks and Students' Analytic Text-Based Writing

    Science.gov (United States)

    Matsumura, Lindsay Clare; Correnti, Richard; Wang, Elaine

    2015-01-01

    The Common Core State Standards emphasize students writing analytically in response to texts. Questions remain about the nature of instruction that develops students' text-based writing skills. In the present study, we examined the role that writing task quality plays in students' mastery of analytic text-based writing. Text-based writing tasks…

  20. More Than Words can Tell - Using Multimodal Texts to Support Reading Comprehension of Literary Texts in English

    OpenAIRE

    Leismann, Silke

    2015-01-01

    This thesis explores the possibilities of multimodality in supporting text comprehension of literary texts in language learning of the L2. While multimodal texts offer multiple ways of meaning making that sometimes go beyond the written text, I have focussed on multimodal expressions that mirror the context of a given text. I conducted an empirical study with 114 students (grade 9; 13-14 years) in two schools in Trondheim, Norway. The material I used consisted of three literary texts (e...

  1. Detection of text in images using SUSAN edge detector

    Institute of Scientific and Technical Information of China (English)

    MAO Wen-ge; ZHANG Tian-wen; WANG Li

    2005-01-01

    Text embedded in images is one of many important cues for indexing and retrieval of images and videos. In the paper, we present a novel method of detecting text aligned either horizontally or vertically, in which a pyramid structure is used to represent an image and the features of the text are extracted using SUSAN edge detector. Text regions at each level of the pyramid are identified according to the autocorrelation analysis. New techniques are introduced to split the text regions into basic ones and merge them into text lines. By evaluating the method on a set of images, we obtain a very good performance of text detection.

  2. A New Method to Extract Text from Natural Scenes

    Institute of Scientific and Technical Information of China (English)

    2005-01-01

    This paper presents a new method for text detection, location and binarization from natural scenes. Several morphological steps are used to detect the general position of the text, including English, Chinese and Japanese characters. Next, bounding boxes are processed by a new "Expand, Break and Merge" (EBM) method to get the precise text areas. Finally, text is binarized by a hybrid method based on Otsu and Niblack. This new approach can extract different kinds of text from complicated natural scenes. It is insensitive to noise, distortion, and text orientation. It also performs well on extracting text of various sizes.
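
    The binarization step in this abstract builds on Otsu's method, which picks a single global threshold by maximizing the between-class variance of the grey-level histogram. The sketch below shows only that global Otsu component for 8-bit grey values; Niblack's local thresholding and the paper's hybrid combination are not shown, and the function names and sample data are illustrative assumptions.

```python
def otsu_threshold(pixels):
    """Return a threshold t in 0..255 maximizing between-class variance.

    `pixels` is a flat sequence of integer grey values in 0..255.
    Pixels <= t form one class and pixels > t the other.
    """
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1

    total = len(pixels)
    sum_all = sum(i * h for i, h in enumerate(hist))

    best_t, best_var = 0, -1.0
    w0 = 0      # number of pixels in the lower class
    sum0 = 0    # sum of grey values in the lower class
    for t in range(256):
        w0 += hist[t]
        sum0 += t * hist[t]
        if w0 == 0:
            continue            # lower class still empty
        w1 = total - w0
        if w1 == 0:
            break               # upper class empty from here on
        mu0 = sum0 / w0
        mu1 = (sum_all - sum0) / w1
        # Proportional to the between-class variance.
        between = w0 * w1 * (mu0 - mu1) ** 2
        if between > best_var:
            best_var, best_t = between, t
    return best_t


def binarize(pixels, t):
    """Map pixels above the threshold to 1 and the rest to 0."""
    return [1 if p > t else 0 for p in pixels]


if __name__ == "__main__":
    sample = [10] * 50 + [200] * 50   # made-up bimodal data
    t = otsu_threshold(sample)
    print(t, sum(binarize(sample, t)))
```

    A global threshold of this kind tends to struggle under uneven illumination, which is a common motivation for combining it with a local method such as Niblack's.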

  3. One-pot synthesis of [Formula: see text]-spiroiminolactones and [Formula: see text]-dispiroiminolactones using [Formula: see text]-disubstituted parabanic acid and thioparabanic acid derivatives.

    Science.gov (United States)

    Asghari, Sakineh; Qandalee, Mohammad; Sarmadi, Amir Ali

    2017-02-01

    A direct entry and simple process for the synthesis of [Formula: see text]-spiroiminolactones present in a large number of natural products has been developed. In the first step, the synthesis of parabanic acid derivatives was commenced from the reaction of [Formula: see text]-disubstituted urea and thiourea with oxalyl chloride, then a three-component reaction was carried out with isocyanides, acetylenic esters, and [Formula: see text]-disubstituted parabanic acid derivatives. The method allows the construction of a variety of [Formula: see text]-spiroiminolactone structures in good to high yields starting from readily available precursors. It was found that in the case of [Formula: see text]-diphenyl thioparabanic acid, additional products of [Formula: see text]-dispiroiminolactones have been formed due to the higher electrophilicity of [Formula: see text]-dicarbonyl groups. The structures were fully established using spectroscopic analysis NMR, IR, and Mass spectrometry. The crystal structure of [Formula: see text]-dispiroiminolactone was confirmed from single-crystal X-ray diffraction study.

  4. Presentation video retrieval using automatically recovered slide and spoken text

    Science.gov (United States)

    Cooper, Matthew

    2013-03-01

    Video is becoming a prevalent medium for e-learning. Lecture videos contain text information in both the presentation slides and lecturer's speech. This paper examines the relative utility of automatically recovered text from these sources for lecture video retrieval. To extract the visual information, we automatically detect slides within the videos and apply optical character recognition to obtain their text. Automatic speech recognition is used similarly to extract spoken text from the recorded audio. We perform controlled experiments with manually created ground truth for both the slide and spoken text from more than 60 hours of lecture video. We compare the automatically extracted slide and spoken text in terms of accuracy relative to ground truth, overlap with one another, and utility for video retrieval. Results reveal that automatically recovered slide text and spoken text contain different content with varying error profiles. Experiments demonstrate that automatically extracted slide text enables higher precision video retrieval than automatically recovered spoken text.

  5. Picosecond holmium fibre laser pumped at 1125 nm

    Science.gov (United States)

    Kamynin, V. A.; Filatova, S. A.; Zhluktova, I. V.; Tsvetkov, V. B.

    2016-12-01

    We report a passively mode-locked, all-fibre holmium laser based on nonlinear polarisation rotation. As a pump source use is made of an 1125-nm ytterbium-doped fibre laser. The pulse repetition rate of the holmium laser is 7.5 MHz, and the pulse duration does not exceed 52 ps at wavelengths of 2065 and 2080 nm. The average laser output power reaches 5 mW.

  6. Digital text cycles: from medieval manuscripts to modern markup

    OpenAIRE

    2005-01-01

    The paper argues that the current implementation of digital publishing is a minor step in a long development of digital text cycles. Rather than being a revolution, the digital transformation of text is an evolutionary process heavily influenced by social and cultural factors. The paper introduces the concept of a "text cycle". An examination of basic features of paper-based text cycles and features of digital text cycles demonstrates that digital technology has a potential for change that fa...

  7. Incorporating other texts: Intertextuality in Malaysian CSR reports

    Directory of Open Access Journals (Sweden)

    Kumaran Rajandran

    2016-11-01

    Full Text Available In Malaysia, corporate social responsibility (CSR) is relatively new but corporations have been required to engage in and disclose their CSR. A typical genre for disclosure is CSR reports and these reports often refer to other texts. The article investigates the act of referencing to other texts or intertextuality in Malaysian CSR reports. It creates an archive of CEO Statements and Environment Sections in CSR reports and studies the archive for keywords, which can identify the incorporated texts. The function of these texts is examined in relation to Malaysia’s corporate context. CSR reports contain explicit references to documents (policies, regulations, reports, research, standards) and to individuals/groups (CEOs, stakeholders, expert organizations). The incorporated texts display variation in corporate control, which organizes these texts along an intertextual cline. The cline helps to identify corporate and non-corporate sources among the texts. The selection of incorporated texts may reflect government and stock exchange demands. The texts are not standardized and are relevant for the CSR domain and corporations, where these texts monitor and justify CSR performance. Yet, the incorporated texts may perpetuate inexact reporting because corporations select the texts and the parts of texts to refer to. Since these texts have been employed to scrutinize initiatives and results, CSR reports can claim to represent the “truth” about a corporation’s CSR. Hence, intertextuality serves corporate interests.

  8. A Survey On Various Approaches Of Text Extraction In Images

    Directory of Open Access Journals (Sweden)

    C.P. Sumathi

    2012-09-01

    Full Text Available Text extraction plays a major role in finding vital and valuable information. Text extraction involves detection, localization, tracking, binarization, extraction, enhancement and recognition of the text from the given image. Text characters are difficult to detect and recognize because of variation in size, font, style, orientation, alignment and contrast, and because of complex colored or textured backgrounds. Owing to the rapid growth of available multimedia documents and the growing requirement for information identification, indexing and retrieval, much research has been done on text extraction in images. Several techniques have been developed for extracting the text from an image. The proposed methods were based on morphological operators, wavelet transform, artificial neural network, skeletonization operation, edge detection algorithm, histogram technique, etc. All these techniques have their benefits and restrictions. This article discusses various schemes proposed earlier for extracting the text from an image. It also provides a performance comparison of several existing methods proposed by researchers for extracting the text from an image.

  9. Why Are Some Texts Good and Others Not? Relationship between Text Quality and Management of the Writing Processes

    Science.gov (United States)

    Beauvais, Caroline; Olive, Thierry; Passerault, Jean-Michel

    2011-01-01

    Two experiments examined whether text quality is related to online management of the writing processes. Experiment 1 focused on the relationship between online management and text quality in narrative and argumentative texts. Experiment 2 investigated how this relationship might be affected by a goal emphasizing text quality. In both experiments,…

  10. The Application of the Cooperative Principle in Text Messages

    Institute of Scientific and Technical Information of China (English)

    李军霞

    2015-01-01

    The language of text messages speeds up the transmission of information, shows the richness of languages, and contains all kinds of implication. Much research on text messages has been published, but analysis of the language of text messages from the standpoint of Grice’s Cooperative Principle remains open to investigation. This paper explores the language of text messages based on Grice’s Cooperative Principle (CP) and its maxims, aiming to understand how the theory influences text message communication and creates humorous effects. It is of practical significance to research text messages as a kind of language phenomenon.

  11. The Application of the Cooperative Principle in Text Messages

    Institute of Scientific and Technical Information of China (English)

    李军霞

    2015-01-01

    The language of text messages speeds up the transmission of information, shows the richness of languages, and contains all kinds of implication. Much research on text messages has been published, but analysis of the language of text messages from the standpoint of Grice’s Cooperative Principle remains open to investigation. This paper explores the language of text messages based on Grice’s Cooperative Principle (CP) and its maxims, aiming to understand how the theory influences text message communication and creates humorous effects. It is of practical significance to research text messages as a kind of language phenomenon.

  12. Effects of H3O+, OH-, O2-, NOx- and NOx for Escherichia coli inactivation in atmospheric pressure DC corona discharges

    Science.gov (United States)

    Sekimoto, Kanako; Gonda, Rena; Takayama, Mitsuo

    2015-08-01

    The effects of ionic and neutral species such as H3O+, OH-, O2-, NOx- (x = 2, 3), and NOx on Escherichia coli (E. coli) inactivation in gas and liquid phases were investigated using atmospheric pressure DC corona discharges with point-to-plane electrodes. The above chemical species as well as OH and O3 were selectively irradiated onto E. coli suspensions on agar plates using a needle angle of 45° with respect to the plates, airflow, and a grid plate. Irradiation with the positive ion H3O+ did not inactivate E. coli, while the negative ions OH-/O2- resulted in bactericidal inactivation, in both gas and liquid phases. In contrast, the negative ions NOx- and neutral species NOx in the gas phase had quite strong bactericidal effects on E. coli compared to those species in the liquid phase. These results suggest that liquid-phase HNO3, formed primarily via the reaction of gas-phase NOx- and NOx with H2O in agar, has only a weak inactivation effect on E. coli. Furthermore, using naphthylethylenediamine spectrophotometry, the threshold amount of gas-phase NOx- and NOx for E. coli inactivation was determined to be ≈1.3 × 10^-9 mol mm^-1.

  13. Optimizing the 3R Study Strategy to Learn from Text

    NARCIS (Netherlands)

    Reijners, Pauline; Kester, Liesbeth; Wetzels, Sandra; Kirschner, Paul A.

    2014-01-01

    Learning from text is often very difficult for students. In this presentation the results of a study with the 3R study strategy are presented in which possible mechanisms for stimulating successful text learning are discussed.

  14. EU External Relations Law: Text, Cases and Materials

    DEFF Research Database (Denmark)

    Butler, Graham

    2014-01-01

    EU External Relations Law: Text, Cases and Materials, Bart Van Vooren and Ramses A. Wessel, Cambridge University Press, UK, 2014.

  15. A New Fragile Watermarking Scheme for Text Documents Authentication

    Institute of Scientific and Technical Information of China (English)

    XIANG Huazheng; SUN Xingming; TANG Chengliang

    2006-01-01

    Because text documents are subject to different types of modification, such as deleting and inserting characters, algorithms for image authentication cannot be used directly for text document authentication. A text watermarking scheme for text document authentication is proposed in this paper. By extracting the features of the character cascade together with the user's secret key, the scheme combines the features of the text with the user information as a watermark, which is embedded into the transformed text itself. Receivers can verify the integrity and authenticity of the text through a blind detection technique. Further research demonstrates that the scheme can also localize tampering, classify the type of modification, and recover part of the modified text documents. These conclusions are supported by both our experimental results and analysis.

  16. Text Detection, Tracking and Recognition in Video: A Comprehensive Survey.

    Science.gov (United States)

    Yin, Xu-Cheng; Zuo, Ze-Yu; Tian, Shu; Liu, Cheng-Lin

    2016-04-14

    Intelligent analysis of video data is currently in wide demand because video is a major source of sensory data in our lives. Text is a prominent and direct source of information in video, while recent surveys of text detection and recognition in imagery [1], [2] focus mainly on text extraction from scene images. Here, this paper presents a comprehensive survey of text detection, tracking and recognition in video with three major contributions. First, a generic framework is proposed for video text extraction that uniformly describes detection, tracking, recognition, and their relations and interactions. Second, within this framework, a variety of methods, systems and evaluation protocols of video text extraction are summarized, compared, and analyzed. Existing text tracking techniques, tracking based detection and recognition techniques are specifically highlighted. Third, related applications, prominent challenges, and future directions for video text extraction (especially from scene videos and web videos) are also thoroughly discussed.

  17. Message Encryption Using Deceptive Text and Randomized Hashing

    Directory of Open Access Journals (Sweden)

    VAMSIKRISHNA YENIKAPATI

    2011-02-01

    Full Text Available In this paper a new approach to message encryption using a concept called deceptive text is proposed. In this scheme we do not need to send encrypted plain text to the receiver; instead, we send a meaningful deceptive text and an encrypted special index file to the message receiver. The original message is embedded in the meaningful deceptive text. The positions of the characters of the plain text in the deceptive text are stored in the index file. The receiver decrypts the index file and recovers the original message from the received deceptive text. Authentication is achieved by verifying, at the receiver side, the hash value of the plaintext created by the Message Digest Algorithm. To prevent collision attacks on hashing algorithms that are intended for use with standard digital signature algorithms, we provide an extra layer of security using a randomized hashing method.
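
    The index-file idea in this abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the function names are hypothetical, SHA-256 with a random salt stands in for the unspecified "Message Digest Algorithm" and randomized hashing method, and the encryption of the index file itself is left out.

```python
import hashlib


def build_index(plaintext: str, cover: str) -> list[int]:
    """Record, for each plaintext character, a position where it occurs in the cover text."""
    index = []
    cursor = 0
    for ch in plaintext:
        pos = cover.find(ch, cursor)   # prefer a position after the previous one
        if pos == -1:
            pos = cover.find(ch)       # otherwise search from the start again
        if pos == -1:
            raise ValueError(f"cover text does not contain {ch!r}")
        index.append(pos)
        cursor = pos + 1
    return index


def recover(index: list[int], cover: str) -> str:
    """Rebuild the plaintext by reading the cover text at the indexed positions."""
    return "".join(cover[i] for i in index)


def salted_digest(message: str, salt: bytes) -> str:
    """Salted SHA-256 digest; a simple stand-in for a randomized hashing scheme."""
    return hashlib.sha256(salt + message.encode("utf-8")).hexdigest()
```

    In this sketch the sender would transmit the cover text, the (encrypted) index, the salt and the digest; the receiver calls `recover` and compares `salted_digest` of the result against the received digest.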

  18. Towards Multi Label Text Classification through Label Propagation

    Directory of Open Access Journals (Sweden)

    Shweta C. Dharmadhikari

    2012-06-01

    Full Text Available Classifying text data has been an active area of research for a long time. A text document is a multifaceted object and is often inherently ambiguous. Multi-label learning deals with such ambiguous objects. Classifying such ambiguous text objects often makes the classifier's task of assigning relevant classes to an input document difficult. Traditional single-label and multi-class text classification paradigms cannot efficiently classify such a multifaceted text corpus. In this paper we propose a novel label propagation approach based on semi-supervised learning for multi-label text classification. Our proposed approach models the relationship between class labels and also effectively represents input text documents. We use a semi-supervised learning technique to make effective use of labeled and unlabeled data for classification. The proposed approach promises better classification accuracy and handling of complexity, and is elaborated on the basis of standard datasets such as Enron, Slashdot and Bibtex.

  19. The text plan concept: contributions to the writing planning process

    Directory of Open Access Journals (Sweden)

    Ana Lúcia Tinoco Cabral

    2013-12-01

    Full Text Available Students at different levels, ranging from early grades up to PhD, face problems with both comprehension and text production. This paper focuses on the text plan concept according to the DTA (Discourse Text Analysis) approach, i.e., a principle of organization that allows students to put the production intention into practice and to arrange text information while writing, and that is responsible for the compositional structure of the text (Adam, 2008). The study analyzes the relation between the text plan and the writing planning process, in which the former provides the latter with theoretical support. To develop this research, the study covers some issues related to the reading skill, analyzes an argumentative text in terms of its text plan, and presents some reflections on the writing process, focusing on the relation between the text plan and the writing planning process.

  20. Complex network analysis of literary and scientific texts

    CERN Document Server

    Grabska-Gradzinska, Iwona; Kwapien, Jaroslaw; Drozdz, Stanislaw

    2012-01-01

    We present results from our quantitative study of statistical and network properties of literary and scientific texts written in two languages: English and Polish. We show that Polish texts are described by the Zipf law with the scaling exponent smaller than the one for the English language. We also show that the scientific texts are typically characterized by the rank-frequency plots with relatively short range of power-law behavior as compared to the literary texts. We then transform the texts into their word-adjacency network representations and find another difference between the languages. For the majority of the literary texts in both languages, the corresponding networks revealed the scale-free structure, while this was not always the case for the scientific texts. However, all the network representations of texts were hierarchical. We do not observe any qualitative and quantitative difference between the languages. However, if we look at other network statistics like the clustering coefficient and the...
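
    As a rough illustration of the two analyses this abstract mentions, the following sketch computes a rank-frequency list, estimates a Zipf scaling exponent by a least-squares fit in log-log space, and builds a word-adjacency network. The tokenizer, the fitting method and all function names are assumptions made for illustration; the authors' actual procedure is not described in the abstract.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens (a crude tokenizer)."""
    return re.findall(r"[a-z']+", text.lower())


def rank_frequency(tokens):
    """Return (rank, word, frequency) triples, most frequent word first."""
    counts = Counter(tokens)
    return [(rank, word, freq)
            for rank, (word, freq) in enumerate(counts.most_common(), start=1)]


def zipf_exponent(rank_freq):
    """Least-squares slope of log(frequency) against log(rank), sign-flipped.

    Under Zipf's law, frequency is proportional to rank ** -alpha; this returns alpha.
    Needs at least two distinct ranks.
    """
    xs = [math.log(rank) for rank, _, _ in rank_freq]
    ys = [math.log(freq) for _, _, freq in rank_freq]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return -sxy / sxx


def adjacency_network(tokens):
    """Undirected graph linking every pair of words that occur next to each other."""
    graph = defaultdict(set)
    for left, right in zip(tokens, tokens[1:]):
        if left != right:
            graph[left].add(right)
            graph[right].add(left)
    return graph
```

    Node degrees (`len(graph[word])`) from such a network are the starting point for checking whether the degree distribution looks scale-free.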

  1. Review: Current writing: Text and reception in Southern Africa

    Directory of Open Access Journals (Sweden)

    A. L. Combrink

    1990-05-01

    Full Text Available Current writing: Text and reception in Southern Africa. (Published by the University of Natal under the joint editorship of Margaret Lenta, Michael Chapman, Margaret Daymond and Johan U. Jacobs. Volume 1, 1989 - editor: Margaret Lenta

  2. Modeling, Learning, and Processing of Text Technological Data Structures

    CERN Document Server

    Kühnberger, Kai-Uwe; Lobin, Henning; Lüngen, Harald; Storrer, Angelika; Witt, Andreas

    2012-01-01

    Researchers in many disciplines have been concerned with modeling textual data in order to account for texts as the primary information unit of written communication. The book “Modelling, Learning and Processing of Text-Technological Data Structures” deals with this challenging information unit. It focuses on theoretical foundations of representing natural language texts as well as on concrete operations of automatic text processing. Following this integrated approach, the present volume includes contributions to a wide range of topics in the context of processing of textual data. This relates to the learning of ontologies from natural language texts, the annotation and automatic parsing of texts as well as the detection and tracking of topics in texts and hypertexts. In this way, the book brings together a wide range of approaches to procedural aspects of text technology as an emerging scientific discipline.

  3. Pseudo-Label Generation for Multi-Label Text Classification

    Data.gov (United States)

    National Aeronautics and Space Administration — With the advent and expansion of social networking, the amount of generated text data has seen a sharp increase. In order to handle such a huge volume of text data,...

  4. A Text Categorization Algorithm Based on Sense Group

    Directory of Open Access Journals (Sweden)

    Jing Wan

    2013-02-01

    Full Text Available Giving further consideration to linguistic features, this study proposes an algorithm for Chinese text categorization based on sense groups. The algorithm extracts sense groups by analyzing syntactic and semantic properties of Chinese texts and builds a category sense group library. SVM is used for the text categorization experiment. The experimental results show that the precision and recall of the new sense-group-based algorithm are better than those of traditional algorithms.

  5. Generating an Ordered Data Set from an OCR Text File

    Directory of Open Access Journals (Sweden)

    Jon Crump

    2014-11-01

    Full Text Available This tutorial illustrates strategies for taking raw OCR output from a scanned text, parsing it to isolate and correct essential elements of metadata, and generating an ordered data set (a Python dictionary) from it. These illustrations are specific to a particular text, but the overall strategy, and some of the individual procedures, can be adapted to organize any scanned text, even if it doesn’t look like this one.
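
    The general strategy can be sketched as follows. This is not the tutorial's own code: the input layout (a numbered heading line such as "12. Title" followed by body lines) and the names `HEADER` and `parse_records` are hypothetical, and correcting misread characters is left out.

```python
import re

# Hypothetical heading pattern: a record number, a period, then the title.
HEADER = re.compile(r"^\s*(\d+)\.\s+(.+)$")


def parse_records(lines):
    """Group OCR lines into records keyed by record number, in ascending order."""
    records = {}
    current = None
    for raw in lines:
        line = raw.strip()
        match = HEADER.match(line)
        if match:
            # A heading line starts a new record.
            current = int(match.group(1))
            records[current] = {"title": match.group(2), "body": []}
        elif current is not None and line:
            # Any other non-empty line belongs to the most recent record.
            records[current]["body"].append(line)
    # Dictionaries keep insertion order, so sorting here yields an ordered data set.
    return dict(sorted(records.items()))
```

    A body line that happens to begin with a number and a period would be mistaken for a heading, so a real text usually needs a stricter pattern and a manual check of the resulting key sequence.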

  6. Generating Weather Forecast Texts with Case Based Reasoning

    OpenAIRE

    Adeyanju, Ibrahim

    2015-01-01

    Several techniques have been used to generate weather forecast texts. In this paper, case based reasoning (CBR) is proposed for weather forecast text generation because similar weather conditions occur over time and should have similar forecast texts. CBR-METEO, a system for generating weather forecast texts was developed using a generic framework (jCOLIBRI) which provides modules for the standard components of the CBR architecture. The advantage in a CBR approach is that systems can be built...

  7. Estimation of Morphological Tables Using Text Analysis Results

    Directory of Open Access Journals (Sweden)

    Illia Savchenko

    2016-08-01

    Full Text Available This paper proposes methods for obtaining input data, necessary for the modified morphological analysis method, from the text sources of data using text analysis tools. Several methods are described that are suitable for calculating initial estimates of alternatives and cross-consistency matrix values based on processing text fragments by rule-based categorization and sentiment analysis tools. A practical implementation of this tool set for assessing statements in news regarding Ukraine is considered.

  8. Texting while driving as impulsive choice: A behavioral economic analysis

    OpenAIRE

    Hayashi, Yusuke; Russo, Christopher T.; Wirth, Oliver

    2015-01-01

    The goal of the present study was to examine the utility of a behavioral economic analysis to investigate the role of delay discounting in texting while driving. A sample of 147 college students completed a survey to assess how frequently they send and read text messages while driving. Based on this information, students were assigned to one of two groups: 19 students who frequently text while driving and 19 matched-control students who infrequently text while driving but were similar in gend...

  9. MINING TEXTS TO UNDERSTAND CUSTOMERS' IMAGE OF BRANDS

    Directory of Open Access Journals (Sweden)

    Hyung Jun Ahn

    2013-06-01

    Full Text Available Text mining is becoming increasingly important in understanding customers and markets these days. This paper presents a method of mining texts about customer sentiments using a network analysis technique. A data set collected about two global mobile device manufacturers was used for testing the method. The analysis results show that the method can be effectively used to extract key sentiments in the customers' texts.

  10. Metacognitive Strategies Help Students to Comprehend All Text

    Science.gov (United States)

    Eilers, Linda H.; Pinkley, Christine

    2006-01-01

    Reading comprehension instruction in many classrooms focuses on teacher-generated questions which actually measure comprehension of specific text rather than developing metacognitive strategies for comprehending all text. Explicit instruction in the metacognitive strategies of making text connections, predicting, and sequencing, was evaluated for…

  11. Effect of Alignment on Text Cohesion in the Continuation Task

    Science.gov (United States)

    Jiang, Lin; Xu, Xin

    2016-01-01

    A continuation task provides learners with a text with its ending removed and requires them to complete it through writing in a most coherent and logical way. The current study investigated (a) whether the continuation task had a positive effect on text cohesion and (b) whether texts produced by pairs exhibited higher cohesion than those produced…

  12. Text Mapping Plus: Improving Comprehension through Supported Retellings

    Science.gov (United States)

    Lapp, Diane; Fisher, Douglas; Johnson, Kelly

    2010-01-01

    Modeled in this column is the teaching of a text mapping routine that supports students reading and remembering the salient features of the text. The authors renamed the story mapping technique "text mapping plus" because they found that as students added relational words and graphics to their maps their retells of both fiction and nonnarrative…

  13. Selecting Texts and Tasks for Content Area Reading and Learning

    Science.gov (United States)

    Fisher, Douglas; Frey, Nancy

    2015-01-01

    For students to learn science, social studies, and technical subjects, their teachers have to engage them in meaningful lessons. As part of those lessons, students read informational texts. The selection of those texts is critical. Teachers can select texts worthy of attention and then align instruction and the post-reading tasks such that…

  14. Text Complexity: The Importance of Building the Right Staircase

    Science.gov (United States)

    Papola-Ellis, Aimee L.

    2014-01-01

    As more districts begin implementing the Common Core State Standards, text complexity is receiving a lot of discussion. It is important for educators to understand the numerous factors involved with text complexity and to have a wide range of strategies to support students with challenging text. This paper shares data from three elementary…

  15. 48 CFR 1952.102-2 - Incorporation in full text.

    Science.gov (United States)

    2010-10-01

    ... 48 Federal Acquisition Regulations System ... Clauses 1952.102-2 Incorporation in full text. All IAAR provisions and clauses shall be incorporated in solicitations and/or contracts in full text....

  16. 48 CFR 2852.102-270 - Incorporation in full text.

    Science.gov (United States)

    2010-10-01

    ... 48 Federal Acquisition Regulations System ... 2852.102-270 Incorporation in full text. JAR provisions or clauses shall be incorporated in solicitations and contracts in full text....

  17. Evaluation and Audience Acceptance in Biotech News Texts

    DEFF Research Database (Denmark)

    Holmgreen, Lise-Lotte; Vestergaard, Torben

    2009-01-01

    It is well known that news texts are not value neutral and that in these texts even genuinely factual statements can function as evaluations. Hence, only an analysis of the types of evaluation used will reveal the true picture of the attitudinal import of reporting texts. The paper explores...

  18. Using Versions of Literary Texts to Improve Comprehension.

    Science.gov (United States)

    Samuel, Moses

    1995-01-01

    Discusses the use of the original text of Shakespeare's "Macbeth," a simplified version, and a comic-book version of the play in a college-level English-as-a-Second-Language (ESL) course. The results indicate that multiple versions of a text can help offset the shortcomings of using only the original text or a simplified version. (three…

  19. Science and Technology Text Mining: Management Decision Aids

    Science.gov (United States)

    2007-11-02

    review; data mining; text mining; bibliometrics; scientometrics; resource allocation; project selection; operations research; management science. ...support techniques include roadmaps, metrics, peer review, data and text mining, information retrieval, bibliometrics, and retrospective studies. The

  20. Recognition of pornographic web pages by classifying texts and images.

    Science.gov (United States)

    Hu, Weiming; Wu, Ou; Chen, Zhouyao; Fu, Zhouyu; Maybank, Steve

    2007-06-01

    With the rapid development of the World Wide Web, people benefit more and more from the sharing of information. However, Web pages with obscene, harmful, or illegal content can be easily accessed. It is important to recognize such unsuitable, offensive, or pornographic Web pages. In this paper, a novel framework for recognizing pornographic Web pages is described. A C4.5 decision tree is used to divide Web pages, according to content representations, into continuous text pages, discrete text pages, and image pages. These three categories of Web pages are handled, respectively, by a continuous text classifier, a discrete text classifier, and an algorithm that fuses the results from the image classifier and the discrete text classifier. In the continuous text classifier, statistical and semantic features are used to recognize pornographic texts. In the discrete text classifier, the naive Bayes rule is used to calculate the probability that a discrete text is pornographic. In the image classifier, the object's contour-based features are extracted to recognize pornographic images. In the text and image fusion algorithm, the Bayes theory is used to combine the recognition results from images and texts. Experimental results demonstrate that the continuous text classifier outperforms the traditional keyword-statistics-based classifier, the contour-based image classifier outperforms the traditional skin-region-based image classifier, the results obtained by our fusion algorithm outperform those by either of the individual classifiers, and our framework can be adapted to different categories of Web pages.

  1. Analytic Geometry. Student's Text, Unit No. 64. Revised Edition.

    Science.gov (United States)

    Ayre, H. Glenn; And Others

    This text provides a one-semester study of analytic geometry for secondary school students. It is designed for use at the 12th grade level. A deliberate effort was made to tie this text to previous SMSG texts; the usual language of sets, ordered pairs, number properties, etc. are included. This flavor is what distinguishes this book from others in…

  2. Combining text clustering and retrieval for corpus adaptation

    Science.gov (United States)

    He, Feng; Ding, Xiaoqing

    2007-01-01

    Application-relevant text data are very useful in various natural language applications. Using them can significantly improve vocabulary selection and language modeling, which are widely employed in automatic speech recognition, intelligent input methods, etc. In some situations, however, the relevant data are hard to collect, and the scarcity of application-relevant training text makes these natural language processing tasks difficult. In this paper, using only a small set of application-specific text and combining unsupervised text clustering with text retrieval techniques, the proposed approach finds the relevant text in an unorganized large-scale corpus and thereby adapts the training corpus towards the application area of interest. We use the performance of an n-gram statistical language model, trained on the retrieved text and tested on the application-specific text, to evaluate the relevance of the acquired text and, accordingly, to validate the effectiveness of our corpus adaptation approach. The language models trained from the ranked text bundles present well discriminated perplexities on the application-specific text. The preliminary experiments on short message text and an unorganized large corpus demonstrate the performance of the proposed methods.
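
    The evaluation idea, scoring a candidate training corpus by the perplexity its n-gram model assigns to application-specific text, can be sketched with a small bigram model. The add-one smoothing, the `<unk>` handling and the function names are assumptions chosen for brevity; the abstract does not say which n-gram order or smoothing the authors used.

```python
import math
from collections import Counter


def train_bigram(sentences):
    """Count bigrams and their left contexts over whitespace-tokenized sentences."""
    contexts = Counter()
    bigrams = Counter()
    vocab = {"</s>"}
    for sentence in sentences:
        words = sentence.split()
        vocab.update(words)
        tokens = ["<s>"] + words + ["</s>"]
        contexts.update(tokens[:-1])
        bigrams.update(zip(tokens, tokens[1:]))
    return contexts, bigrams, vocab


def perplexity(sentences, model):
    """Perplexity of the sentences under an add-one smoothed bigram model.

    Expects at least one sentence; unseen words are mapped to <unk>.
    """
    contexts, bigrams, vocab = model
    vocab_size = len(vocab) + 1  # one extra slot for <unk>
    log_prob = 0.0
    predicted = 0
    for sentence in sentences:
        words = [w if w in vocab else "<unk>" for w in sentence.split()]
        tokens = ["<s>"] + words + ["</s>"]
        for prev, word in zip(tokens, tokens[1:]):
            prob = (bigrams[(prev, word)] + 1) / (contexts[prev] + vocab_size)
            log_prob += math.log(prob)
            predicted += 1
    return math.exp(-log_prob / predicted)
```

    Training one such model per retrieved text bundle and comparing the perplexities on a held-out sample of application text gives a ranking of bundles by relevance: lower perplexity suggests closer text.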

  3. Guidelines for Effective Usage of Text Highlighting Techniques.

    Science.gov (United States)

    Strobelt, Hendrik; Oelke, Daniela; Kwon, Bum Chul; Schreck, Tobias; Pfister, Hanspeter

    2016-01-01

    Semi-automatic text analysis involves manual inspection of text. Often, different text annotations (like part-of-speech or named entities) are indicated by using distinctive text highlighting techniques. In typesetting there exist well-known formatting conventions, such as bold typeface, italics, or background coloring, that are useful for highlighting certain parts of a given text. Also, many advanced techniques for visualization and highlighting of text exist; yet, standard typesetting is common, and the effects of standard typesetting on the perception of text are not fully understood. As such, we surveyed and tested the effectiveness of common text highlighting techniques, both individually and in combination, to discover how to maximize pop-out effects while minimizing visual interference between techniques. To validate our findings, we conducted a series of crowdsourced experiments to determine: i) a ranking of nine commonly-used text highlighting techniques; ii) the degree of visual interference between pairs of text highlighting techniques; iii) the effectiveness of techniques for visual conjunctive search. Our results show that increasing font size works best as a single highlighting technique, and that there are significant visual interferences between some pairs of highlighting techniques. We discuss the pros and cons of different combinations as a design guideline to choose text highlighting techniques for text viewers.

  4. 49 CFR 392.80 - Prohibition against texting.

    Science.gov (United States)

    2010-10-01

    ... driver shall engage in texting while driving. (b) Motor Carriers. No motor carrier shall allow or require its drivers to engage in texting while driving. (c) Definition. For the purpose of this section only... § 390.3(f)(1) and (6) are not applicable to this section. (2) Emergency Use. Texting while driving...

  5. Gender differences in psychosocial predictors of texting while driving.

    Science.gov (United States)

    Struckman-Johnson, Cindy; Gaster, Samuel; Struckman-Johnson, Dave; Johnson, Melissa; May-Shinagle, Gabby

    2015-01-01

    A sample of 158 male and 357 female college students at a midwestern university participated in an on-line study of psychosocial motives for texting while driving. Men and women did not differ in self-reported ratings of how often they texted while driving. However, more women sent texts of less than a sentence while more men sent texts of 1-5 sentences. More women than men said they would quit texting while driving due to police warnings, receiving information about texting dangers, being shown graphic pictures of texting accidents, and being in a car accident. A hierarchical regression for men's data revealed that lower levels of feeling distracted by texting while driving (20% of the variance), higher levels of cell phone dependence (11.5% of the variance), risky behavioral tendencies (6.5% of the variance) and impulsivity (2.3% of the variance) were significantly associated with more texting while driving (total model variance=42%). A separate regression for women revealed that higher levels of cell phone dependence (10.4% of the variance), risky behavioral tendencies (9.9% of the variance), texting distractibility (6.2%), crash risk estimates (2.2% of the variance) and driving confidence (1.3% of the variance) were significantly associated with more texting while driving (total model variance=31%). Friendship potential and need for intimacy were not related to men's or women's texting while driving. Implications of the results for gender-specific prevention strategies are discussed.

  6. Using Diagnostic Text Information to Constrain Situation Models

    NARCIS (Netherlands)

    Dutke, S.; Baadte, C.; Hähnel, A.; Hecker, U. von; Rinck, M.

    2010-01-01

    During reading, the model of the situation described by the text is continuously accommodated to new text input. The hypothesis was tested that readers are particularly sensitive to diagnostic text information that can be used to constrain their existing situation model. In 3 experiments, adult part

  7. Comprehension Strategy Instruction: Teaching Narrative Text Structure Awareness

    Science.gov (United States)

    Dymock, Susan

    2007-01-01

    Research shows that students who have a good understanding of narrative text structure have fewer problems comprehending stories. Research also suggests that many students require explicit instruction in how to comprehend this text type. While some children are able to figure out the more elaborate structure of narrative text on their own (i.e.,…

  8. Districts Gear up for Shift to Informational Texts

    Science.gov (United States)

    Gewertz, Catherine

    2012-01-01

    The Common Core State Standards' emphasis on informational text arose in part from research suggesting that employers and college instructors found students weak at comprehending technical manuals, scientific and historical journals, and other texts pivotal to work in those arenas. The common core's vision of informational text includes literary…

  9. PISA - A procedure for analyzing the structure of explanatory texts.

    NARCIS (Netherlands)

    Sanders, T.J.M.; van Wijk, C.

    1996-01-01

    Linguistic analyses of text corpora have contributed to the understanding of natural language processing in both reading and writing. However, the impact of text analysis in psycho-linguistic research has been limited, mainly because the analyses hardly ever concern text structure. Existing models f

  10. On the Concept of Zero Meaning of Text

    Directory of Open Access Journals (Sweden)

    Nikitina E.S.

    2015-08-01

    Full Text Available In the semiotic tradition text is considered a sign with its own content. This content is shaped by three meanings within three spaces of sign: semantic, syntactic and pragmatic. It is crucial that text is heterogeneous from the point of view of meaning organization. Three spaces or three spheres of experience integrated within text existential, rational and communicative focus upon themselves the narrative, typological and paralogical meanings of text. These meanings constitute the true 'pattern' of text. The world of text is the one created, arranged and thought over in great details. The first layer is the level of the plot of existence. And since text is, by definition, an intertextual determinacy, in communication this level of meaning acts as the initial, or zero, meaning in the process of understanding text. However, understanding content only begins at this point, continuing through the typological level and, further on, through interpretational practices, finally reaching the paralogical subtleties of understanding. Text is a reality oriented at being understood. And it is the very structure of meaning of text that shapes the technologies of understanding. One can and must be taught to understand. The paper addresses the concept of zero meaning as the initial, existential layer of meanings that form the subjectness of text

  11. A Theoretical Discussion for E-Text Communication in Learning

    Science.gov (United States)

    Lee, Hye-Jung

    2015-01-01

    With the recent development of internet and mobile technology, a new kind of e-text communication is emerging. From messenger chatting, mobile texting, to social networking through Twitter or Facebook, e-text communication is becoming a main communication channel, especially for the younger generation. However, there has not been sufficient…

  12. Demo: Using RapidMiner for Text Mining

    OpenAIRE

    2013-01-01

    This demo reviews basic text mining techniques using RapidMiner. It describes RapidMiner's basic characteristics and its text mining operators, and presents a text mining example that uses the Naive Bayes algorithm and process modeling.

  13. Education as Text: The Varieties of Educational Hiddenness.

    Science.gov (United States)

    Gordon, David

    1988-01-01

    Using the ideas of Paul Ricoeur and Clifford Geertz, this article develops the notion of education as a "text" and analyzes the "hidden curriculum" of that text as it is read by all members of the society. The hypothesis is proposed that education becomes a text about society's myths and sacred beliefs. (TE)

  14. Text Analytics: the convergence of Big Data and Artificial Intelligence

    Directory of Open Access Journals (Sweden)

    Antonio Moreno

    2016-03-01

    Full Text Available The analysis of the text content in emails, blogs, tweets, forums and other forms of textual communication constitutes what we call text analytics. Text analytics is applicable to most industries: it can help analyze millions of emails; you can analyze customers’ comments and questions in forums; you can perform sentiment analysis using text analytics by measuring positive or negative perceptions of a company, brand, or product. Text Analytics has also been called text mining, and is a subcategory of the Natural Language Processing (NLP) field, which is one of the founding branches of Artificial Intelligence, back in the 1950s, when an interest in understanding text originally developed. Currently Text Analytics is often considered as the next step in Big Data analysis. Text Analytics has a number of subdivisions: Information Extraction, Named Entity Recognition, Semantic Web annotated domain’s representation, and many more. Several techniques are currently used and some of them have gained a lot of attention, such as Machine Learning, to show a semisupervised enhancement of systems, but they also present a number of limitations which make them not always the only or the best choice. We conclude with current and near future applications of Text Analytics.

  15. THE IMPACT OF TEXT DRIVING ON DRIVING SAFETY

    Directory of Open Access Journals (Sweden)

    Sanaz Motamedi

    2016-09-01

    Full Text Available In an increasingly mobile era, the wide availability of technology for texting and the prevalence of hands-free forms have introduced a new safety concern for drivers. To assess this concern, a questionnaire was first deployed online to gain an understanding of drivers’ text driving experiences as well as their demographic information. The results from 232 people revealed that the majority of drivers are aware of the risks associated with texting while driving. However, more than one-fourth of them still frequently send or read text messages while driving. In addition to the questionnaire, through the use of a virtual-reality driving simulator, this study examined drivers’ driving performance while they were engaged in some forms of text driving under different challenging traffic conditions. In a blocked factorial experiment, drivers would either read a text message or respond to it with two levels of text complexity while using either a hand-held or a hands-free texting method. Their driving performance was assessed based on the number of driving violations observed in each scenario. Conclusions regarding the impacts of different forms of texting, text complexity, and response mode on drivers’ driving performance were drawn.

  16. Effects of audience awareness on procedural text writing.

    Science.gov (United States)

    Sato, Koichi; Matsushima, Kazutoshi

    2006-08-01

    Effects of audience awareness were examined. Some participants acted as writers and the others acted as readers. Writers wrote a text describing a geometrical figure. Readers read the text and tried to draw the figure according to the description. In Exp. 1, audience awareness was manipulated among undergraduate students, 11 men and 34 women. Writers in the high audience-awareness condition spent more time planning and writing texts than writers in the low audience-awareness condition. Texts in the high audience-awareness condition consisted of more letters and sentences with descriptions elaborating the texts. In Exp. 2, prototype texts were constructed based on the results of Exp. 1. Undergraduate students, 11 men and 47 women, who read the prototype text in the high audience-awareness condition could draw the figure more accurately. In Exp. 3, effects of feedback from readers were examined. Ninth-grade students, 22 boys and 34 girls, participated as writers and 7th-grade students, 22 boys and 34 girls, participated as readers. Merely being told to attend to an audience did not improve the quality of texts written by 9th-grade students. However, feedback from the readers who were 7th-grade students was effective. Writers could revise the texts appropriately according to feedback and improve the quality of texts. In addition, the experience of revising the text according to feedback transferred to later writing. Educational implications of the results are discussed.

  17. Structure strategy interventions: Increasing reading comprehension of expository text

    Directory of Open Access Journals (Sweden)

    Bonnie J. F. MEYER

    2011-11-01

    Full Text Available In this review of the literature we examine empirical studies designed to teach the structure strategy to increase reading comprehension of expository texts. First, we review the research that has served as a foundation for many of the studies examining the effects of text structure instruction. Text structures generally can be grouped into six categories: comparison, problem-and-solution, causation, sequence, collection, and description. Next, we provide a historical look at research of structure strategy interventions. Strategy interventions employ modeling, practice, and feedback to teach students how to use text structure strategically and eventually automatically. Finally, we review recent text structure interventions for elementary school students. We present similarities and differences among these studies and applications for instruction. Our review of intervention research suggests that direct instruction, modeling, scaffolding, elaborated feedback, and adaptation of instruction to student performance are keys in teaching students to strategically use knowledge about text structure.

  18. Texting Dependence, iPod Dependence, and Delay Discounting.

    Science.gov (United States)

    Ferraro, F Richard; Weatherly, Jeffrey N

    2016-01-01

    We gave 127 undergraduates questionnaires about their iPod and texting dependence and 2 hypothetical delay discounting scenarios related to free downloaded songs and free texting for life. Using regression analyses we found that when iPod dependence was the dependent variable, Text2-excessive use, Text4-psychological and behavioral symptoms, iPod2-excessive use, and iPod3-relationship disruption were significant predictors of discounting. When texting dependence was the dependent variable, Text4-psychological and behavioral symptoms and iPod3-relationship disruption were significant predictors of discounting. These are the first data to show that delay discounting relates to certain aspects of social media, namely iPod and texting dependence. These data also show that across these 2 dependencies, both psychological and behavioral symptoms and relationship disruptions are affected.

  19. Automatic Contextual Text Correction Using The Linguistic Habits Graph Lhg

    Directory of Open Access Journals (Sweden)

    Marcin Gadamer

    2009-01-01

    Full Text Available Automatic text correction is an essential problem of today's text processors and editors. This paper introduces a novel algorithm for automation of contextual text correction using a Linguistic Habit Graph (LHG), also introduced in this paper. A specialist internet crawler has been constructed for searching through web sites in order to build a Linguistic Habit Graph from text corpora gathered from Polish web sites. The correction results achieved by this algorithm using this LHG were compared with commercial programs which also enable text correction: Microsoft Word 2007, Open Office Writer 3.0 and the search engine Google. The achieved results of text correction were much better than the corrections made by these commercial tools.

  20. Text analysis with R for students of literature

    CERN Document Server

    Jockers, Matthew L

    2014-01-01

    Text Analysis with R for Students of Literature is written with students and scholars of literature in mind but will be applicable to other humanists and social scientists wishing to extend their methodological tool kit to include quantitative and computational approaches to the study of text. Computation provides access to information in text that we simply cannot gather using traditional qualitative methods of close reading and human synthesis. Text Analysis with R for Students of Literature provides a practical introduction to computational text analysis using the open source programming language R. R is extremely popular throughout the sciences and because of its accessibility, R is now used increasingly in other research areas. Readers begin working with text right away and each chapter works through a new technique or process such that readers gain a broad exposure to core R procedures and a basic understanding of the possibilities of computational text analysis at both the micro and macro scale. Each c...

  1. Reconceptualising Understandings of Texts, Readers and Contexts: One English Teacher's Response to Using Multimodal Texts and Interactive Whiteboards

    Science.gov (United States)

    Kitson, Lisbeth

    2011-01-01

    The comprehension of multimodal texts is now a key concern with the release of the Australian National Curriculum for English (ACARA, 2010). However, the nature of multimodal texts, the diversity of readers in classrooms, and the complex technological environments through which multimodal texts are mediated, requires English teachers to reconsider…

  2. Comprehension Challenges in the Fourth Grade: The Roles of Text Cohesion, Text Genre, and Readers' Prior Knowledge

    Science.gov (United States)

    McNamara, Danielle S.; Ozuru, Yasuhiro; Floyd, Randy G.

    2011-01-01

    We examined young readers' comprehension as a function of text genre (narrative, science), text cohesion (high, low), and readers' abilities (reading decoding skills and world knowledge). The overarching purpose of this study was to contribute to our understanding of the "fourth grade slump". Children in grade 4 read four texts,…

  3. Learning history by composing synthesis texts: Effects of an instructional programme on learning, reading and writing processes, and text quality

    NARCIS (Netherlands)

    I. Martínez; M. Mateos; E. Martín; G. Rijlaarsdam

    2015-01-01

    The aim of the present study was to improve learning from texts via strategies that train students how to process synthesis texts. Processing such texts requires goal-oriented interaction between reading and writing activities. The participants were 62 sixth-grade students, 33 in the experimental an

  4. Complex Network Analysis of Literary and Scientific Texts

    Science.gov (United States)

    Grabska-Gradzińska, Iwona; Kulig, Andrzej; Kwapień, Jarosław; Drożdż, Stanisław

    2012-07-01

    We present results from our quantitative study of statistical and network properties of literary and scientific texts written in two languages: English and Polish. We show that Polish texts are described by the Zipf law with the scaling exponent smaller than the one for the English language. We also show that the scientific texts are typically characterized by the rank-frequency plots with relatively short range of power-law behavior as compared to the literary texts. We then transform the texts into their word-adjacency network representations and find another difference between the languages. For the majority of the literary texts in both languages, the corresponding networks revealed the scale-free structure, while this was not always the case for the scientific texts. However, all the network representations of texts were hierarchical. We do not observe any qualitative and quantitative difference between the languages. However, if we look at other network statistics like the clustering coefficient and the average shortest path length, the English texts appear to possess a more clustered structure than do the Polish ones. This result was attributed to differences in grammar of both languages, which was also indicated in the Zipf plots. All the texts, however, show network structure that differs from any of the Watts-Strogatz, the Barabási-Albert, and the Erdős-Rényi architectures.
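
    The scaling exponent mentioned in this abstract can be illustrated with a minimal sketch. It assumes the exponent is estimated as the least-squares slope of log frequency against log rank, which is one common choice and not necessarily the fitting procedure the authors used. The function name `zipf_exponent` and the synthetic corpus are hypothetical.

```python
import math
from collections import Counter


def zipf_exponent(tokens):
    """Estimate the Zipf exponent as the negated least-squares slope
    of log(frequency) against log(rank). Illustrative only."""
    freqs = sorted(Counter(tokens).values(), reverse=True)
    if len(freqs) < 2:
        raise ValueError("need at least two distinct tokens")
    xs = [math.log(rank) for rank in range(1, len(freqs) + 1)]
    ys = [math.log(f) for f in freqs]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var = sum((x - mean_x) ** 2 for x in xs)
    return -cov / var


# Synthetic corpus (an assumption for the demo): the k-th most common
# word occurs about 1000/k times, so the fitted exponent should be near 1.
tokens = [f"w{k}" for k in range(1, 51) for _ in range(round(1000 / k))]
print(round(zipf_exponent(tokens), 2))
```

    Fitting over all ranks is a simplification. The abstract notes that scientific texts show only a short power-law range, so a real analysis would restrict the fit to that range.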

  5. TEXT AREA IDENTIFICATION FOR RECOGNIZING DESTINATION PLACES FROM VEHICLES

    Directory of Open Access Journals (Sweden)

    Selvanayaki K.S

    2014-07-01

    Full Text Available Nowadays, automatic detection of text from the vehicles is an important problem in many applications. Text information present in an image can be easily understood by both human and computer. It has wide applications such as license plate reading, sign detection, identification of destination places, mobile text recognition and so on. This problem is challenging due to complex backgrounds, the non-uniform illuminations, variations of text font, size and line orientation. Once the text is identified, it can be analyzed, recognized and interpreted. Hence, there is a need for a better algorithm for detection and localization of text from vehicles. A method is proposed for detecting text from vehicles. The method makes use of features such as Histogram of Oriented Gradients (HOG) and Local Binary Pattern (LBP). These features are stored which can be further used for feature matching at the time of classification. After the text region is being detected, it can be further subjected to character segmentation and recognition thereby identifying the destination places. The ability to recognize text area from the vehicles, especially buses has obvious applications like traffic management in the bus stands. The obtained results are verified and performance parameters like speed, precision and recall are determined.

  6. Automatic extraction of corollaries from semantic structure of text

    Science.gov (United States)

    Nurtazin, Abyz T.; Khisamiev, Zarif G.

    2016-11-01

    The aim of this study is to develop an algorithm for automatic representation of the text of natural language as a formal system for the subsequent automatic extraction as reasonable answers to profound questions in the context of the text, and the deep logical consequences of the text and related areas of knowledge to which the text refers. The most universal method of constructing algorithms of automatic treatment of text for a particular purpose is a representation of knowledge in the form of a graph expressing the semantic values of the text. The paper presents an algorithm of automatic presentation of text and its associated knowledge as a formal logic programming theory for sufficiently strict texts, such as legal texts. This representation is a semantic-syntactic as the causal-investigatory relationships between the various parts are both logical and semantic. This representation of the text allows to resolve the issues of causal-investigatory relationships of present concepts, as methods of the theory and practice of logic programming and methods of model theory as well. In particular, these means of classical branches of mathematics can be used to address such issues as the definition and determination of consequences and questions of consistency of the theory.

  7. Research on BIM-based Construction Domain Text Information Management

    Directory of Open Access Journals (Sweden)

    Shaohua Jiang

    2013-06-01

    Full Text Available A construction project produces a large amount of unstructured information throughout its whole lifecycle, most of which is text information. Building information modeling (BIM) can support lifecycle information management of a construction project, so BIM-based integrated management of construction domain text information can improve the efficiency and quality of project management to a large extent. The concept of BIM and its implementation platform, as well as the data exchange standard, i.e. industry foundation class (IFC), are introduced first. Then this paper puts forward a systematic framework for an unstructured construction domain text information management system, and the implementation of a BIM-based text information integration methodology: the unstructured text information is transformed into structured information by means of text mining to facilitate information retrieval and ranking; then the text information is classified according to the IFC standard, and is associated with entities in BIM to realize the integration of text information and BIM. Finally, this paper takes a contract document as an example for verification. The proposed method can improve the ability and efficiency of text information management in the construction domain.

  8. Text Classification Retrieval Based on Complex Network and ICA Algorithm

    Directory of Open Access Journals (Sweden)

    Hongxia Li

    2013-08-01

    Full Text Available With the development of computer science and information technology, the library is developing toward information and network. The library digital process converts the book into digital information. The high-quality preservation and management are achieved by computer technology as well as text classification techniques. It realizes knowledge appreciation. This paper introduces complex network theory in the text classification process and put forwards the ICA semantic clustering algorithm. It realizes the independent component analysis of complex network text classification. Through the ICA clustering algorithm of independent component, it realizes character words clustering extraction of text classification. The visualization of text retrieval is improved. Finally, we make a comparative analysis of collocation algorithm and ICA clustering algorithm through text classification and keyword search experiment. The paper gives the clustering degree of algorithm and accuracy figure. Through simulation analysis, we find that ICA clustering algorithm increases by 1.2% comparing with text classification clustering degree. Accuracy can be improved by 11.1% at most. It improves the efficiency and accuracy of text classification retrieval. It also provides a theoretical reference for text retrieval classification of eBook

  9. Is searching full text more effective than searching abstracts?

    Directory of Open Access Journals (Sweden)

    Lin Jimmy

    2009-02-01

    Full Text Available Background: With the growing availability of full-text articles online, scientists and other consumers of the life sciences literature now have the ability to go beyond searching bibliographic records (title, abstract, metadata) to directly access full-text content. Motivated by this emerging trend, I posed the following question: is searching full text more effective than searching abstracts? This question is answered by comparing text retrieval algorithms on MEDLINE® abstracts, full-text articles, and spans (paragraphs) within full-text articles using data from the TREC 2007 genomics track evaluation. Two retrieval models are examined: BM25 and the ranking algorithm implemented in the open-source Lucene search engine. Results: Experiments show that treating an entire article as an indexing unit does not consistently yield higher effectiveness compared to abstract-only search. However, retrieval based on spans, or paragraph-sized segments of full-text articles, consistently outperforms abstract-only search. Results suggest that highest overall effectiveness may be achieved by combining evidence from spans and full articles. Conclusion: Users searching full text are more likely to find relevant articles than searching only abstracts. This finding affirms the value of full text collections for text retrieval and provides a starting point for future work in exploring algorithms that take advantage of rapidly-growing digital archives. Experimental results also highlight the need to develop distributed text retrieval algorithms, since full-text articles are significantly longer than abstracts and may require the computational resources of multiple machines in a cluster. The MapReduce programming model provides a convenient framework for organizing such computations.
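
    One of the two retrieval models named in this abstract is BM25. As a rough sketch of how such a ranking function scores documents, the following implements the widely used Okapi BM25 formula. The parameter values k1=1.2 and b=0.75 are conventional defaults assumed here, not the settings of the study. The idf variant with the added 1 is also an assumption, and the function name `bm25_scores` and the toy documents are hypothetical.

```python
import math
from collections import Counter


def bm25_scores(query, docs, k1=1.2, b=0.75):
    """Score each tokenized document against a tokenized query
    with Okapi BM25. Assumes docs is non-empty and not all empty."""
    n_docs = len(docs)
    avg_len = sum(len(d) for d in docs) / n_docs
    # Document frequency: number of documents containing each term.
    doc_freq = Counter(term for d in docs for term in set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in query:
            if term not in tf:
                continue
            n_t = doc_freq[term]
            idf = math.log((n_docs - n_t + 0.5) / (n_t + 0.5) + 1.0)
            # Length normalization: longer documents are penalized via b.
            norm = tf[term] + k1 * (1.0 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1.0) / norm
        scores.append(score)
    return scores


# Toy indexing units (hypothetical); each could be an abstract, a span,
# or a full article in the comparison the abstract describes.
docs = [
    "gene expression in yeast cells".split(),
    "protein folding and gene regulation in yeast".split(),
    "traffic management in bus stands".split(),
]
scores = bm25_scores("yeast gene".split(), docs)
print(scores)
```

    The length normalization term is the part most relevant to the comparison in the abstract: the same term counts earn a lower score in a long full-text article than in a short abstract or span.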

  10. Self-Taught convolutional neural networks for short text clustering.

    Science.gov (United States)

    Xu, Jiaming; Xu, Bo; Wang, Peng; Zheng, Suncong; Tian, Guanhua; Zhao, Jun; Xu, Bo

    2017-01-12

    Short text clustering is a challenging problem due to its sparseness of text representation. Here we propose a flexible Self-Taught Convolutional neural network framework for Short Text Clustering (dubbed STC(2)), which can flexibly and successfully incorporate more useful semantic features and learn non-biased deep text representation in an unsupervised manner. In our framework, the original raw text features are firstly embedded into compact binary codes by using one existing unsupervised dimensionality reduction method. Then, word embeddings are explored and fed into convolutional neural networks to learn deep feature representations, meanwhile the output units are used to fit the pre-trained binary codes in the training process. Finally, we get the optimal clusters by employing K-means to cluster the learned representations. Extensive experimental results demonstrate that the proposed framework is effective and flexible, and that it outperforms several popular clustering methods when tested on three public short text datasets.

  11. Coh-metrix: analysis of text on cohesion and language.

    Science.gov (United States)

    Graesser, Arthur C; McNamara, Danielle S; Louwerse, Max M; Cai, Zhiqiang

    2004-05-01

    Advances in computational linguistics and discourse processing have made it possible to automate many language- and text-processing mechanisms. We have developed a computer tool called Coh-Metrix, which analyzes texts on over 200 measures of cohesion, language, and readability. Its modules use lexicons, part-of-speech classifiers, syntactic parsers, templates, corpora, latent semantic analysis, and other components that are widely used in computational linguistics. After the user enters an English text, Coh-Metrix returns measures requested by the user. In addition, a facility allows the user to store the results of these analyses in data files (such as Text, Excel, and SPSS). Standard text readability formulas scale texts on difficulty by relying on word length and sentence length, whereas Coh-Metrix is sensitive to cohesion relations, world knowledge, and language and discourse characteristics.
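
    To make concrete what a standard readability formula based on word length and sentence length looks like, here is a small sketch of the Automated Readability Index. The abstract does not name a specific formula, so ARI is chosen here only as an illustration. The tokenization rules and the function name `automated_readability_index` are assumptions, and nothing here reflects how Coh-Metrix itself is implemented.

```python
import re


def automated_readability_index(text):
    """ARI = 4.71 * (characters / words) + 0.5 * (words / sentences) - 21.43.
    Sentences are split on ., ! and ?; words are runs of letters,
    digits and apostrophes. Both rules are simplifications."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z0-9']+", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one word")
    chars = sum(len(w.replace("'", "")) for w in words)
    return 4.71 * chars / len(words) + 0.5 * len(words) / len(sentences) - 21.43


easy = "The cat sat. The dog ran."
hard = "Computational linguistics automates sophisticated discourse processing mechanisms."
print(automated_readability_index(easy) < automated_readability_index(hard))
```

    The formula sees only surface lengths. Reordering the sentences of a text, which could change its cohesion considerably, leaves the score unchanged, and that is the limitation the abstract contrasts with Coh-Metrix.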

  12. Analysis Of Aspects Of Messages Hiding In Text Environments

    Directory of Open Access Journals (Sweden)

    Afanasyeva Olesya

    2015-09-01

    Full Text Available This work investigates problems that arise when hiding messages in text environments transmitted over electronic communication channels and the Internet. It analyzes the selection of places in a text environment (TE) that can be replaced by a word from the message. Selection and replacement of words in the text environment is based on semantic analysis of a text fragment consisting of the inserted word and its surroundings in the TE. This analysis uses the concepts of the semantic parameters of word coordination and the semantic value of a separate word, and well-known methods are used to determine the values of these parameters. This makes it possible to move from a qualitative to a quantitative analysis of the semantics of text fragments as they are modified by word substitution. Invisibility of embedded messages is ensured by keeping deviations of the semantic cooperation parameter within preset values.

  13. Intertextuality: On the use of the Bible in mystical texts

    Directory of Open Access Journals (Sweden)

    Kees Waaijman

    2010-02-01

    Full Text Available This article discussed the use of the Bible in mystical texts by focusing on intertextuality as a literary approach which analyses the intersection of texts. It investigated how mystical texts, as phenotexts, relate to the Bible as archetext: firstly, the intertextual relations affect the surface of the text in a mono-causal way and secondly, they govern the production of meaning reciprocally. The article also discussed forms of intersection (quotations, collage, allusions and reproduction before it analysed the three intertextual strategies producing meaning: participation, detachment and change or rearrangement. Finally, six functions and dimensions of meaning were delineated in the intertextual dynamic between the Bible and the mystical texts. In these the Bible serves as an authoritative framework for argumentation, as a guide and blueprint of the mystical way, as a vocabulary of mystical experience, as an initiation into the divine infinity, as the place of mystical transformation in love and as the articulation of transformation in glory.

  14. Patterns of Text Reuse in a Scientific Corpus

    CERN Document Server

    Citron, Daniel T

    2014-01-01

    We consider the incidence of text "reuse" by researchers, via a systematic pairwise comparison of the text content of all articles deposited to arXiv.org from 1991--2012. We measure the global frequencies of three classes of text reuse, and measure how chronic text reuse is distributed among authors in the dataset. We infer a baseline for accepted practice, perhaps surprisingly permissive compared with other societal contexts, and a clearly delineated set of aberrant authors. We find a negative correlation between the amount of reused text in an article and its influence, as measured by subsequent citations. Finally, we consider the distribution of countries of origin of articles containing large amounts of reused text.
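
    The pairwise comparison described in this abstract can be sketched as follows. The abstract does not state how overlap was detected, so this example assumes a simple word 5-gram overlap measure, which is a stand-in and not the authors' method. The function names, the n-gram length and the sample texts are all hypothetical.

```python
from itertools import combinations


def shingles(text, n=5):
    """Set of word n-grams in a text (empty if the text is shorter than n words)."""
    words = text.lower().split()
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}


def reuse_fraction(a, b, n=5):
    """Fraction of a's word n-grams that also occur in b."""
    sa, sb = shingles(a, n), shingles(b, n)
    return len(sa & sb) / len(sa) if sa else 0.0


def pairwise_reuse(articles, n=5):
    """Map each unordered pair of article ids to the larger
    of the two directed reuse fractions."""
    return {
        (i, j): max(reuse_fraction(articles[i], articles[j], n),
                    reuse_fraction(articles[j], articles[i], n))
        for i, j in combinations(sorted(articles), 2)
    }


# Hypothetical miniature corpus standing in for article full texts.
articles = {
    "a": "we measure the global frequencies of three classes of text reuse",
    "b": "here we measure the global frequencies of reused passages",
    "c": "short text clustering is a challenging problem",
}
reuse = pairwise_reuse(articles)
print(reuse)
```

    Comparing every pair is quadratic in the number of articles, so a corpus on the scale of the one described would need an index from n-grams to articles rather than this direct loop.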

  15. Text Linguistics in the Context of the Communication Sciences

    Directory of Open Access Journals (Sweden)

    Silviu Serban

    2011-05-01

    Full Text Available This paper tries to analyse the conditions of emerging of text linguistics, taking into consideration the roots of the preoccupations in its domain, originated in the framework of the communication studies. Thus, the change of the perspective on communication, from the mechanistic transmission to interactivity and the exchange of the meanings, led to the pragmatic orientation of the linguistic researches, not just to the message itself, but also to the elements of the communicative act and to the context where the exchange of the meanings takes place. As a result, text linguistics defines the text as communicational occurrence, involving both the members of the communication and the conditions of the production and the reception of the message, unlike conventional linguistics which studies the text in abstracto, just the message itself, ignoring the world that the text refers to, or the users of the message, the transmitter and the receiver.

  16. College students' prevalence and perceptions of text messaging while driving.

    Science.gov (United States)

    Harrison, Marissa A

    2011-07-01

    By analyzing self-reports from a sample of 91 college students from the United States who are frequent drivers, the present study examined the prevalence of text messaging (or "texting") while driving and the incidence of recklessness and consequences that accompany this behavior. Analyses revealed that 91% of participants reported having used text messaging while driving, with many reporting doing so with passengers, including children, riding in their vehicles. Further, a substantial number of participants reported driving dangerously above the speed limit and drifting into other traffic lanes while texting, and many reported "sexting" and arguing via text messages while driving. However, these young drivers agreed that texting while driving is dangerous and should be illegal. These results and the limitations to the present study are discussed.

  17. Assisted entry mitigates text messaging-based driving detriment.

    Science.gov (United States)

    Sawyer, Benjamin D; Hancock, Peter A

    2012-01-01

    Previous research using cell phones indicates that manual manipulation is not a principal component of text messaging relating driving detriment. This paper suggests that manipulation of a phone in conjunction with the cognitive need to compose the message itself co-act to contribute to driving degradation. This being so, drivers sending text messages might experience reduced interference to the driving task if the text messaging itself were assisted through the predictive T9 system. We evaluated undergraduate drivers in a simulator who drove and texted using either Assisted Text entry, via Nokia's T9 system, or unassisted entry via the multitap interface. Results supported the superiority of the T9 system over the multitap system implying that specific assistive technologies can modulate the degradation of capacity which texting tragically induces.

  18. EXPLOITING RHETORICAL RELATIONS TO MULTIPLE DOCUMENTS TEXT SUMMARIZATION

    Directory of Open Access Journals (Sweden)

    N. Adilah Hanin Zahri

    2015-03-01

    Much previous research has shown that rhetorical relations can enhance applications such as text summarization, question answering and natural language generation. This work proposes an approach that extends the benefit of rhetorical relations to address the redundancy problem in cluster-based text summarization of multiple documents. We exploited the rhetorical relations that exist between sentences to group similar sentences into multiple clusters and thereby identify themes of common information. The candidate summaries were extracted from these clusters. Then, cluster-based text summarization was performed using the Conditional Markov Random Walk Model to measure the saliency scores of the candidate summaries. We evaluated our method by measuring the cohesion and separation of the clusters constructed by exploiting rhetorical relations, and the ROUGE score of the generated summaries. The experimental results show that our method performed well, which indicates the promising potential of applying rhetorical relations in text clustering to benefit text summarization of multiple documents.
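    The idea of scoring sentence saliency with a random walk can be sketched as follows. This is a simplified illustration, not the authors' Conditional Markov Random Walk Model: it runs a plain damped random walk over a cosine-similarity graph of bag-of-words sentences and omits the conditioning on clusters and on rhetorical relations. The function names and the damping value are assumptions.

```python
import math
from collections import Counter


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def random_walk_saliency(sentences, damping=0.85, iterations=50):
    """Score sentences by a damped random walk on their similarity graph."""
    bags = [Counter(s.lower().split()) for s in sentences]
    n = len(bags)
    # Edge weights are pairwise similarities; self-loops are excluded.
    weights = [
        [cosine(bags[i], bags[j]) if i != j else 0.0 for j in range(n)]
        for i in range(n)
    ]
    # Row-normalise into transition probabilities.
    # A sentence with no similar neighbour jumps uniformly at random.
    trans = []
    for row in weights:
        total = sum(row)
        trans.append([w / total for w in row] if total else [1.0 / n] * n)
    scores = [1.0 / n] * n
    for _ in range(iterations):
        scores = [
            (1 - damping) / n
            + damping * sum(scores[i] * trans[i][j] for i in range(n))
            for j in range(n)
        ]
    return scores


if __name__ == "__main__":
    docs = [
        "the cat sat on the mat",
        "the cat lay on the mat",
        "stock prices fell sharply today",
    ]
    for sentence, score in zip(docs, random_walk_saliency(docs)):
        print(f"{score:.3f}  {sentence}")
```

    Sentences that share vocabulary with many others accumulate probability mass and rank higher; an unrelated sentence keeps only the small share it receives from random jumps.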

  19. Topic- and Time-Oriented Visual Text Analysis.

    Science.gov (United States)

    Dou, Wenwen; Liu, Shixia

    2016-01-01

    To facilitate the process of converting textual data into actionable knowledge, visual text analysis has become a popular topic with active research efforts contributed by researchers worldwide. Here the authors present the benefits of combining text analysis (topic models in particular) with interactive visualization. They then highlight examples from prior work on topic- and time-oriented visual text analysis and discuss challenges that warrant additional future research.

  20. Learning Taxonomy for Text Segmentation by Formal Concept Analysis

    CERN Document Server

    Lupea, Mihaiela; Marian, Zsuzsana

    2010-01-01

    In this paper, the problems of deriving a taxonomy from a text and of concept-oriented text segmentation are addressed. The Formal Concept Analysis (FCA) method is applied to solve both of these linguistic problems. The proposed segmentation method offers a conceptual view of text segmentation, using a context-driven clustering of sentences. The Concept-oriented Clustering Segmentation algorithm (COCS) is based on k-means linear clustering of the sentences. Experimental results obtained using the COCS algorithm are presented.
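    The basic FCA construction can be sketched on a toy formal context in which sentences are the objects and terms are the attributes. This is a generic textbook illustration, not the COCS algorithm: it enumerates formal concepts by brute force over subsets of objects, which is only practical for very small contexts. The sentence identifiers, terms and function names are made up for the example.

```python
from itertools import combinations


def common_attributes(objects, context):
    """Attributes shared by every object in `objects` (all attributes if empty)."""
    if not objects:
        return set().union(*context.values())
    return set.intersection(*(context[o] for o in objects))


def objects_having(attrs, context):
    """Objects that possess every attribute in `attrs`."""
    return {o for o, has in context.items() if attrs <= has}


def formal_concepts(context):
    """Enumerate (extent, intent) pairs by closing every subset of objects."""
    concepts = set()
    names = sorted(context)
    for size in range(len(names) + 1):
        for subset in combinations(names, size):
            intent = common_attributes(set(subset), context)
            extent = objects_having(intent, context)
            concepts.add((frozenset(extent), frozenset(intent)))
    return concepts


if __name__ == "__main__":
    # Hypothetical context: which terms occur in which sentence.
    context = {
        "s1": {"text", "segment"},
        "s2": {"text", "concept"},
        "s3": {"concept", "lattice"},
    }
    for extent, intent in sorted(formal_concepts(context), key=lambda c: len(c[0])):
        print(sorted(extent), sorted(intent))
```

    Each resulting pair groups the sentences that share a set of terms with exactly the terms they share; ordering the pairs by extent inclusion yields the concept lattice from which a taxonomy can be read.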