WorldWideScience

Sample records for non-phonetic text abbreviations

  1. Using UMLS lexical resources to disambiguate abbreviations in clinical text.

    Science.gov (United States)

    Kim, Youngjun; Hurdle, John; Meystre, Stéphane M

    2011-01-01

    Clinical text is rich in acronyms and abbreviations, and they are highly ambiguous. As a pre-processing step before subsequent NLP analysis, we are developing and evaluating clinical abbreviation disambiguation methods. The evaluation of two sequential steps, the detection and the disambiguation of abbreviations, is reported here, for various types of clinical notes. For abbreviations detection, our result indicated the SPECIALIST Lexicon LRABR needed to be revised for better abbreviation detection. Our semi-supervised method using generated training data based on expanded form matching for 12 frequent abbreviations in our clinical notes reached over 90% accuracy in five-fold cross validation and unsupervised approach produced comparable results with the semi-supervised methods.

  2. Children's Text Messaging: Abbreviations, Input Methods and Links with Literacy

    Science.gov (United States)

    Kemp, N.; Bushnell, C.

    2011-01-01

    This study investigated the effects of mobile phone text-messaging method (predictive and multi-press) and experience (in texters and non-texters) on children's textism use and understanding. It also examined popular claims that the use of text-message abbreviations, or "textese" spelling, is associated with poor literacy skills. A sample of 86…

  3. Txt Msg N School Literacy: Does Texting and Knowledge of Text Abbreviations Adversely Affect Children's Literacy Attainment?

    Science.gov (United States)

    Plester, Beverly; Wood, Clare; Bell, Victoria

    2008-01-01

    This paper reports on two studies which investigated the relationship between children's texting behaviour, their knowledge of text abbreviations and their school attainment in written language skills. In Study One, 11-12-year-old children provided information on their texting behaviour. They were also asked to translate a standard English…

  4. Text-Message Abbreviations and Language Skills in High School and University Students

    Science.gov (United States)

    De Jonge, Sarah; Kemp, Nenagh

    2012-01-01

    This study investigated the use of text-message abbreviations (textisms) in Australian adolescents and young adults, and relations between textism use and literacy abilities. Fifty-two high school students aged 13-15 years, and 53 undergraduates aged 18-24 years, all users of predictive texting, translated conventional English sentences into…

  5. Automated Disambiguation of Acronyms and Abbreviations in Clinical Texts: Window and Training Size Considerations

    Science.gov (United States)

    Moon, Sungrim; Pakhomov, Serguei; Melton, Genevieve B.

    2012-01-01

    Acronyms and abbreviations within electronic clinical texts are widespread and often associated with multiple senses. Automated acronym sense disambiguation (WSD), a task of assigning the context-appropriate sense to ambiguous clinical acronyms and abbreviations, represents an active problem for medical natural language processing (NLP) systems. In this paper, fifty clinical acronyms and abbreviations with 500 samples each were studied using supervised machine-learning techniques (Support Vector Machines (SVM), Naïve Bayes (NB), and Decision Trees (DT)) to optimize the window size and orientation and determine the minimum training sample size needed for optimal performance. Our analysis of window size and orientation showed best performance using a larger left-sided and smaller right-sided window. To achieve an accuracy of over 90%, the minimum required training sample size was approximately 125 samples for SVM classifiers with inverted cross-validation. These findings support future work in clinical acronym and abbreviation WSD and require validation with other clinical texts. PMID:23304410

  6. An Investigation into the Impact of Abbreviated Didactic Texting on Language Learning

    Directory of Open Access Journals (Sweden)

    Seyyed Reza Mousavinia

    2014-03-01

    Full Text Available This study aimed to investigate whether application of abbreviations in instructional texting (SMS plays any role in promoting students’ performance in learning English through reducing distance and language anxiety. Parallel with examining elliptical features and abbreviations in creating SMS advertisements for addressing their special customers around the world with informal style, to borrow some of the features for the compass of language teaching, 120 participants in two groups at Isfahan university of technology were presented with the same type of content, namely, English grammar notes. They used directions with different lexemes and grammars. To compare the participants’ grammar learning, t-test was run. Results indicated that the difference between the performance of learners of the groups was statistically significant. Analyses showed that the didactic SMS with abbreviations and elliptical forms was significantly more effective than the SMS without such features in reducing learners’ anxiety, thereby enhancing their language learning. The findings of this study can have implications for both designing texting for advertisements and didactic SMS.

  7. Automatic Word Sense Disambiguation of Acronyms and Abbreviations in Clinical Texts

    Science.gov (United States)

    Moon, Sungrim

    2012-01-01

    The use of acronyms and abbreviations is increasing profoundly in the clinical domain in large part due to the greater adoption of electronic health record (EHR) systems and increased electronic documentation within healthcare. A single acronym or abbreviation may have multiple different meanings or senses. Comprehending the proper meaning of an…

  8. Deciphering Journal Abbreviations with JAbbr

    Directory of Open Access Journals (Sweden)

    Keith Jenkins

    2009-06-01

    Full Text Available JAbbr is an online tool developed at Cornell University to help users decipher journal title abbreviations. This article discusses why these abbreviations are so problematic, and how traditional tools are often insufficient, and then describes the novel approach used by JAbbr. Given an abbreviation, JAbbr creates a regular expression for fuzzy matching, tests it against a list of serial titles extracted from the library catalog, and returns a list of possible matches to the user. JAbbr is available as a web site and as a web service.

  9. Abbreviations in Maritime English

    Science.gov (United States)

    Yang, Zhirong

    2011-01-01

    Aiming at the phenomena that more and more abbreviations occur in maritime English correspondences, the composing laws of the abbreviations in maritime English correspondence are analyzed, and the correct methods to answer the abbreviations are pointed out, and the translation method of abbreviations are summarized in this article, and the…

  10. FDA Acronyms and Abbreviations

    Data.gov (United States)

    U.S. Department of Health & Human Services — The FDA Acronyms and Abbreviations database provides a quick reference to acronyms and abbreviations related to Food and Drug Administration (FDA) activities

  11. MBA: a literature mining system for extracting biomedical abbreviations

    Directory of Open Access Journals (Sweden)

    Lei YiMing

    2009-01-01

    Full Text Available Abstract Background The exploding growth of the biomedical literature presents many challenges for biological researchers. One such challenge is from the use of a great deal of abbreviations. Extracting abbreviations and their definitions accurately is very helpful to biologists and also facilitates biomedical text analysis. Existing approaches fall into four broad categories: rule based, machine learning based, text alignment based and statistically based. State of the art methods either focus exclusively on acronym-type abbreviations, or could not recognize rare abbreviations. We propose a systematic method to extract abbreviations effectively. At first a scoring method is used to classify the abbreviations into acronym-type and non-acronym-type abbreviations, and then their corresponding definitions are identified by two different methods: text alignment algorithm for the former, statistical method for the latter. Results A literature mining system MBA was constructed to extract both acronym-type and non-acronym-type abbreviations. An abbreviation-tagged literature corpus, called Medstract gold standard corpus, was used to evaluate the system. MBA achieved a recall of 88% at the precision of 91% on the Medstract gold-standard EVALUATION Corpus. Conclusion We present a new literature mining system MBA for extracting biomedical abbreviations. Our evaluation demonstrates that the MBA system performs better than the others. It can identify the definition of not only acronym-type abbreviations including a little irregular acronym-type abbreviations (e.g., , but also non-acronym-type abbreviations (e.g., .

  12. Abbreviation and acronym disambiguation in clinical discourse.

    Science.gov (United States)

    Pakhomov, Sergeui; Pedersen, Ted; Chute, Christopher G

    2005-01-01

    Use of abbreviations and acronyms is pervasive in clinical reports despite many efforts to limit the use of ambiguous and unsanctioned abbreviations and acronyms. Due to the fact that many abbreviations and acronyms are ambiguous with respect to their sense, complete and accurate text analysis is impossible without identification of the sense that was intended for a given abbreviation or acronym. We present the results of an experiment where we used the contexts harvested from the Internet through Google API to collect contextual data for a set of 8 acronyms found in clinical notes at the Mayo Clinic. We then used the contexts to disambiguate the sense of abbreviations in a manually annotated corpus.

  13. ISO Abbreviations for Names of Polymeric Substances

    Directory of Open Access Journals (Sweden)

    V. Jarm

    2011-04-01

    Full Text Available The use of abbreviations for the names of polymers is practical and economic in written and spoken language. Taking into consideration the several hundreds of polymers appearing in literature annually, some of them having complicated structures, it is almost impossible to derive a systematic and unique abbreviation to polymer structures. Therefore, IUPAC has taken over the well-established ISO list of abbreviated terms (about 120 items mainly selected on the basis of the scale of production. The presented ISO nomenclature is not necessarily in accord with IUPAC recommendations.

  14. PHONETIC AND NON-PHONETIC LANGUAGES: A CONTRASTIVE STUDY OF ENGLISH AND TURKISH PHONOLOGY FOCUSING ON THE ORTHOGRAPHY-INDUCED PRONUNCIATION PROBLEMS OF TURKISH LEARNERS OF ENGLISH AS A FOREIGN LANGUAGE (TURKISH EFL LEARNERS

    Directory of Open Access Journals (Sweden)

    Amir KHALILZADEH

    2014-04-01

    Full Text Available The present study aims to investigate the pronunciation problems of Turkish learners of English as a foreign language (Turkish EFL learners due to the orthography system of English. Orthography is a standardized system for using a particular writing system (script to write a particular language. It includes rules of spelling, and may also concern other elements of the written language such as punctuation and capitalization. It is clear that English is a non-phonetic and Turkish is a phonetic language, so it is very natural for the Turkish EFL learners to have some phonological problems in learning English. The author has done a contrastive study concerning three linguistic systems, i.e. consonants, vowels and syllable structures of English and Turkish to find the causes of the problems to be used in teaching English as a foreign language to Turks. The results of the study showed that the problems under discussion are caused by some differences between the orthography and the phonology of the two languages. As a result, English teachers, to be helpful, should focus on the differences and help the Turkish learners overcome the pronunciation problems. The author of the paper believes that an English teacher should be both aware of the differences and be able to teach them effectively to the Turkish EFL learners.

  15. Deciphering Journal Abbreviations with JAbbr

    OpenAIRE

    Keith Jenkins

    2009-01-01

    JAbbr is an online tool developed at Cornell University to help users decipher journal title abbreviations. This article discusses why these abbreviations are so problematic, and how traditional tools are often insufficient, and then describes the novel approach used by JAbbr. Given an abbreviation, JAbbr creates a regular expression for fuzzy matching, tests it against a list of serial titles extracted from the library catalog, and returns a list of possible matches to the user. JAbbr is ava...

  16. Global change: Acronyms and abbreviations

    Energy Technology Data Exchange (ETDEWEB)

    Woodard, C.T. [Oak Ridge National Lab., TN (United States); Stoss, F.W. [Univ. of Tennessee, Knoxville, TN (United States). Energy, Environment and Resources Center

    1995-05-01

    This list of acronyms and abbreviations is compiled to provide the user with a ready reference to dicipher the linguistic initialisms and abridgements for the study of global change. The terms included in this first edition were selected from a wide variety of sources: technical reports, policy documents, global change program announcements, newsletters, and other periodicals. The disciplinary interests covered by this document include agriculture, atmospheric science, ecology, environmental science, oceanography, policy science, and other fields. In addition to its availability in hard copy, the list of acronyms and abbreviations is available in DOS-formatted diskettes and through CDIAC`s anonymous File Transfer Protocol (FTP) area on the Internet.

  17. Will Banning Foreign Abbreviations Help?

    Institute of Scientific and Technical Information of China (English)

    2010-01-01

    @@ From early April, China's national broad-caster CCTV banned the use of borrowed English abbreviations such as NBA, GDP, WTO and CPI in all its programs. The move was launched in line with a government directive after several national legislators and political advisors called for the preservation of the Chinese language's purity.

  18. Classification and Translation of Chinese Abbreviations

    Institute of Scientific and Technical Information of China (English)

    郭颖婷

    2014-01-01

    Chinese abbreviation, containing fewer words and delivering a wealth of information, is a vital component of Chinese language. But the tremendous differences between Chinese and English make it an arduous task to translate Chinese abbreviations into English. Based on the analyses of the structure and patterns of word-formation of Chinese abbreviations, it makes a classifi-cation of Chinese abbreviations, summarize the translation methods, and point out some attention points in translation. A system-atic analysis on the structure and classification of Chinese abbreviations will be beneficial to reduce the mistakes in its translation.

  19. Abbreviations

    OpenAIRE

    2014-01-01

    CBA Cost-Benefit Analysis CBD Convention on Biological Diversity CGAPS Coordinating Group on Alien Pest Species CITES Convention on International Trade in Endangered Species of Wild Fauna and Flora CRC Cooperative Research Center for Australian Weed Management DAE Direction des affaires économiques, New Caledonia DAVAR Direction des affaires vétérinaire, alimentaire et rurale, New Caledonia DDE-E Direction du développement économique et de l’environnement, New Caledonia. DDR Direction du déve...

  20. Abbreviations

    OpenAIRE

    2013-01-01

    "AB" The official French logo for certified organic produce ("Agriculture Biologique") CF Conventional farming EF Ecological farming IFS Integrated farming systems LIF Low-input farming OF Organic farming OFgc Organic farming under group certification AFSAA Agence Française de Sécurité Sanitaire des Aliments (French food safety agency) AMAP Association pour le Maintien d'une Agriculture Paysanne (Association for the maintenance of small-scale farming – there is a network of such associations ...

  1. Acronyms, initialisms, and abbreviations: Fourth Revision

    Energy Technology Data Exchange (ETDEWEB)

    Tolman, B.J. [comp.

    1994-04-01

    This document lists acronyms used in technical writing. The immense list is supplemented by an appendix containing chemical elements, classified information access, common abbreviations used for functions, conversion factors for selected SI units, a flowcharting template, greek alphabet, metrix terminology, proofreader`s marks, signs and symbols, and state abbreviations.

  2. Processes and changes in Minas Gerais’ 18th century abbreviations: regularity and rupture

    Directory of Open Access Journals (Sweden)

    Aléxia Teles Duchowny

    2015-02-01

    Full Text Available This study analyzed 18th century abbreviations from documents written in Arraial do Tijuco, today Diamantina, in Minas Gerais, Brazil. Brachygraphic resources used in religious brotherhoods’ commitments from different social strata were compared to test two hypotheses: (i abbreviations reflect differences between strata and therefore (ii they allow identifying the degree of literacy of writing subjects. The analysis undertaken do not attest the correctness of assumptions, but the generalizations reached indicate that abbreviations, as any other linguistic phenomenon, suffer systematic, organized and multiple change processes, a different result from those that the meagre literature on the subject provides.

  3. Pharmacist and Physician Interpretation of Abbreviations for Acetaminophen Intended for Use in a Consumer Icon

    Directory of Open Access Journals (Sweden)

    Saul Shiffman

    2015-10-01

    Full Text Available Concomitant use of multiple acetaminophen medications is associated with overdose. To help patients identify acetaminophen medications and thus avoid concomitant use, an icon with an abbreviation for “acetaminophen” has been proposed for all acetaminophen medications. This study assessed pharmacists’ and physicians’ use and interpretation of abbreviations for “acetaminophen”, to identify abbreviations with other meanings that might cause confusion. Physicians (n = 150 reported use and interpretation of candidate abbreviations Ac and Acm. Pharmacists (n = 150 interpretations of prescription orders using the candidate abbreviations APAP, Ac, Ace and Acm in typed, handwritten or spoken form, were judged for critical confusions likely to cause patient harm. Critical confusion was rare, except for omission by pharmacists of the acetaminophen dose for Hydrocodone/APAP prescriptions (10%. Ac was in common use to indicate “before meals”, and was interpreted as such, but some physicians (8% said they use Ac to indicate anticoagulant drugs. Most pharmacists (54% interpreted Ace as acetaminophen, and none interpreted it as referring to ACE-inhibitors. Acm was rarely used in prescriptions, had no common interfering meanings, and was often (63% interpreted as acetaminophen, especially when prescribed in combination with an opiate (85%. The data validated concerns about abbreviations in prescribing: all abbreviations resulted in some misinterpretations. However, Acm was rarely misinterpreted, was readily associated with “acetaminophen”, and seemed appropriate for use in a graphic icon to help consumers/patients identify acetaminophen medications.

  4. 40 CFR 600.403-77 - Abbreviations.

    Science.gov (United States)

    2010-07-01

    ... ECONOMY AND CARBON-RELATED EXHAUST EMISSIONS OF MOTOR VEHICLES Fuel Economy Regulations for 1977 and Later Model Year Automobiles-Dealer Availability of Fuel Economy Information § 600.403-77 Abbreviations....

  5. 40 CFR 86.203-94 - Abbreviations.

    Science.gov (United States)

    2010-07-01

    ... 40 Protection of Environment 18 2010-07-01 2010-07-01 false Abbreviations. 86.203-94 Section 86.203-94 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) AIR PROGRAMS (CONTINUED... Later Model Year Gasoline-Fueled New Light-Duty Vehicles, New Light-Duty Trucks and New...

  6. 40 CFR 72.3 - Measurements, abbreviations, and acronyms.

    Science.gov (United States)

    2010-07-01

    ... 40 Protection of Environment 16 2010-07-01 2010-07-01 false Measurements, abbreviations, and acronyms. 72.3 Section 72.3 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) AIR..., abbreviations, and acronyms. Measurements, abbreviations, and acronyms used in this part are defined as follows...

  7. 40 CFR 96.303 - Measurements, abbreviations, and acronyms.

    Science.gov (United States)

    2010-07-01

    ... 40 Protection of Environment 20 2010-07-01 2010-07-01 false Measurements, abbreviations, and acronyms. 96.303 Section 96.303 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) AIR..., abbreviations, and acronyms. Measurements, abbreviations, and acronyms used in this subpart and subparts BBBB...

  8. 40 CFR 91.303 - Acronyms and abbreviations.

    Science.gov (United States)

    2010-07-01

    ... 40 Protection of Environment 20 2010-07-01 2010-07-01 false Acronyms and abbreviations. 91.303 Section 91.303 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) AIR PROGRAMS....303 Acronyms and abbreviations. (a) The acronyms and abbreviations in § 91.5 apply to this subpart. (b...

  9. 40 CFR 89.3 - Acronyms and abbreviations.

    Science.gov (United States)

    2010-07-01

    ... 40 Protection of Environment 20 2010-07-01 2010-07-01 false Acronyms and abbreviations. 89.3...) CONTROL OF EMISSIONS FROM NEW AND IN-USE NONROAD COMPRESSION-IGNITION ENGINES General § 89.3 Acronyms and abbreviations. The following acronyms and abbreviations apply to part 89. AECD Auxiliary emission control device...

  10. 40 CFR 60.4103 - Measurements, abbreviations, and acronyms.

    Science.gov (United States)

    2010-07-01

    ... 40 Protection of Environment 6 2010-07-01 2010-07-01 false Measurements, abbreviations, and acronyms. 60.4103 Section 60.4103 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) AIR....4103 Measurements, abbreviations, and acronyms. Measurements, abbreviations, and acronyms used in this...

  11. 40 CFR 91.4 - Acronyms and abbreviations.

    Science.gov (United States)

    2010-07-01

    ... 40 Protection of Environment 20 2010-07-01 2010-07-01 false Acronyms and abbreviations. 91.4...) CONTROL OF EMISSIONS FROM MARINE SPARK-IGNITION ENGINES General § 91.4 Acronyms and abbreviations. The following acronyms and abbreviations apply to this part 91. AECD—Auxiliary emission control device ASME...

  12. 40 CFR 90.5 - Acronyms and abbreviations.

    Science.gov (United States)

    2010-07-01

    ... 40 Protection of Environment 20 2010-07-01 2010-07-01 false Acronyms and abbreviations. 90.5...) CONTROL OF EMISSIONS FROM NONROAD SPARK-IGNITION ENGINES AT OR BELOW 19 KILOWATTS General § 90.5 Acronyms and abbreviations. The following acronyms and abbreviations apply to part 90. AECD—Auxiliary emission...

  13. The stylistic coloring of abbreviation in Business English

    Institute of Scientific and Technical Information of China (English)

    李美玲

    2011-01-01

    It is obvious to apply different vocabulary in various styles. As application of vocabulary becomes more, the distinct stylistic coloring is forming gradually. In this paper, it studies the definitions, structures, features ,of abbreviation in Business English. After that, it summarizes the stylistic coloring of abbreviation in Business English, and proves the necessity and importance to study abbreviation in Business English.

  14. Abbreviated Title of the Artwork in the System of Signs by Ch. Peirce

    Directory of Open Access Journals (Sweden)

    Grigoriy Valeryevich Tokarev

    2015-09-01

    Full Text Available The article is devoted to the semiotic aspect of the functioning of the abbreviated title of the postmodern artwork. The authors analyze the relationship of title-sign to the object which it replaces. The title is considered from the perspective of three main features peculiar of the sign in accordance with the Charles S. Peirce's theory. This fact allows us to conclude that, being a sign, the abbreviated title replaces a literary text, which is also expressed in symbolic form of the author's knowledge of reality. In this aspect the title becomes the metasign of its text. It is shown that in this connection, decoding and interpretation process take place in two stages – before reading the text and in the process of its reading and interpretation. It is alleged that the result of the interpretation of the title depends on the reader's competence which is determined by their individual literary scope, as well as by the skills of productive work with the text. On the basis of the classification of signs created by Charles Pierce, it was found that the abbreviated title has a complex semiotic nature combining the features of indexicality, conventionality, and iconicity, the latter of which may be present only in the abbreviated title.

  15. The Maximum Entropy Approach to Record Abbreviation for Optimal Record Control.

    Science.gov (United States)

    Goyal, P.

    1983-01-01

    Tests performed on 6,260 titles from 3 machine-readable British National Bibliography files using an entropy based technique for abbreviation of text strings for use as a control code found that more than 94 percent of the titles generated a unique seven character code. Six references and an illustrative example are appended. (EJS)

  16. Symbolic Capital in a Virtual Heterosexual Market: Abbreviation and Insertion in Italian iTV SMS

    Science.gov (United States)

    Herring, Susan C.; Zelenkauskaite, Asta

    2009-01-01

    This study analyzes gender variation in nonstandard typography--specifically, abbreviations and insertions--in mobile phone text messages (SMS) posted to a public Italian interactive television (iTV) program. All broadcast SMS were collected for a period of 2 days from the Web archive for the iTV program, and the frequency and distribution of…

  17. Symbolic Capital in a Virtual Heterosexual Market: Abbreviation and Insertion in Italian iTV SMS

    Science.gov (United States)

    Herring, Susan C.; Zelenkauskaite, Asta

    2009-01-01

    This study analyzes gender variation in nonstandard typography--specifically, abbreviations and insertions--in mobile phone text messages (SMS) posted to a public Italian interactive television (iTV) program. All broadcast SMS were collected for a period of 2 days from the Web archive for the iTV program, and the frequency and distribution of…

  18. Enhancing acronym/abbreviation knowledge bases with semantic information.

    Science.gov (United States)

    Torii, Manabu; Liu, Hongfang

    2007-10-11

    In the biomedical domain, a terminology knowledge base that associates acronyms/abbreviations (denoted as SFs) with the definitions (denoted as LFs) is highly needed. For the construction such terminology knowledge base, we investigate the feasibility to build a system automatically assigning semantic categories to LFs extracted from text. Given a collection of pairs (SF,LF) derived from text, we i) assess the coverage of LFs and pairs (SF,LF) in the UMLS and justify the need of a semantic category assignment system; and ii) automatically derive name phrases annotated with semantic category and construct a system using machine learning. Utilizing ADAM, an existing collection of (SF,LF) pairs extracted from MEDLINE, our system achieved an f-measure of 87% when assigning eight UMLS-based semantic groups to LFs. The system has been incorporated into a web interface which integrates SF knowledge from multiple SF knowledge bases. Web site: http://gauss.dbb.georgetown.edu/liblab/SFThesurus.

  19. Abbreviated guide pneumatic conveying design guide

    CERN Document Server

    Mills, David

    1990-01-01

    Abbreviated Guide: Pneumatic Conveying Design Guide describes the selection, design, and specification of conventional pneumatic conveying systems. The design procedure uses previous test data on the materials to be conveyed. The book also discusses system economics, operating costs, the choice of appropriate components or systems, system control, and system flexibility. The design system involves the type of conveying system for installation, the pipeline parameters, and also the plant components. System selection covers the properties of the material to be conveyed, plant layout, material pr

  20. Open architecture for multilingual parallel texts

    CERN Document Server

    Benitez, M T Carrasco

    2008-01-01

    Multilingual parallel texts (abbreviated to parallel texts) are linguistic versions of the same content ("translations"); e.g., the Maastricht Treaty in English and Spanish are parallel texts. This document is about creating an open architecture for the whole Authoring, Translation and Publishing Chain (ATP-chain) for the processing of parallel texts.

  1. 40 CFR 97.203 - Measurements, abbreviations, and acronyms.

    Science.gov (United States)

    2010-07-01

    ... acronyms. 97.203 Section 97.203 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) AIR... Trading Program General Provisions § 97.203 Measurements, abbreviations, and acronyms. Measurements, abbreviations, and acronyms used in this subpart and subparts BBB through III are defined as follows: Btu...

  2. 40 CFR 97.103 - Measurements, abbreviations, and acronyms.

    Science.gov (United States)

    2010-07-01

    ... acronyms. 97.103 Section 97.103 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) AIR... Annual Trading Program General Provisions § 97.103 Measurements, abbreviations, and acronyms. Measurements, abbreviations, and acronyms used in this subpart and subparts BB through II are defined as...

  3. 40 CFR 97.303 - Measurements, abbreviations, and acronyms.

    Science.gov (United States)

    2010-07-01

    ... acronyms. 97.303 Section 97.303 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) AIR... Ozone Season Trading Program General Provisions § 97.303 Measurements, abbreviations, and acronyms. Measurements, abbreviations, and acronyms used in this subpart and subparts BBBB through IIII are defined as...

  4. 40 CFR 96.3 - Measurements, abbreviations, and acronyms.

    Science.gov (United States)

    2010-07-01

    ... 40 Protection of Environment 20 2010-07-01 2010-07-01 false Measurements, abbreviations, and acronyms. 96.3 Section 96.3 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) AIR... acronyms. Measurements, abbreviations, and acronyms used in this part are defined as follows: Btu—British...

  5. 40 CFR 96.103 - Measurements, abbreviations, and acronyms.

    Science.gov (United States)

    2010-07-01

    ... 40 Protection of Environment 20 2010-07-01 2010-07-01 false Measurements, abbreviations, and acronyms. 96.103 Section 96.103 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) AIR... acronyms. Measurements, abbreviations, and acronyms used in this subpart and subparts BB through II are...

  6. 40 CFR 97.3 - Measurements, abbreviations, and acronyms.

    Science.gov (United States)

    2010-07-01

    ... acronyms. 97.3 Section 97.3 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) AIR... Trading Program General Provisions § 97.3 Measurements, abbreviations, and acronyms. Measurements, abbreviations, and acronyms used in this part are defined as follows: Btu-British thermal unit. CO2-carbon...

  7. 40 CFR 96.203 - Measurements, abbreviations, and acronyms.

    Science.gov (United States)

    2010-07-01

    ... 40 Protection of Environment 20 2010-07-01 2010-07-01 false Measurements, abbreviations, and acronyms. 96.203 Section 96.203 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) AIR... acronyms. Measurements, abbreviations, and acronyms used in this subpart and subparts BBB through III are...

  8. 7 CFR 1951.852 - Definitions and abbreviations.

    Science.gov (United States)

    2010-01-01

    ... 7 Agriculture 14 2010-01-01 2009-01-01 true Definitions and abbreviations. 1951.852 Section 1951....852 Definitions and abbreviations. (a) General definitions. The following definitions are applicable...) Low-income. The level of income of a person or family which is at or below the Poverty Guidelines as...

  9. 32 CFR 516.3 - Explanation of abbreviations and terms.

    Science.gov (United States)

    2010-07-01

    ... AUTHORITIES AND PUBLIC RELATIONS LITIGATION General § 516.3 Explanation of abbreviations and terms. (a) The Glossary contains explanations of abbreviations and terms. (b) The masculine gender has been used throughout this regulation for simplicity and consistency. Any reference to the masculine gender is...

  10. 40 CFR 205.155 - Motorcycle class and manufacturer abbreviation.

    Science.gov (United States)

    2010-07-01

    ... 40 Protection of Environment 24 2010-07-01 2010-07-01 false Motorcycle class and manufacturer abbreviation. 205.155 Section 205.155 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED... Motorcycle class and manufacturer abbreviation. (a) Motorcycles must be grouped into classes determined...

  11. On Non-Phonetic Errors in“Topic Talk”Items in the Mandarin Proficiency Test and Test-Training Strategies%普通话水平测试“命题说话”项的非语音失误与应试培训策略

    Institute of Scientific and Technical Information of China (English)

    贾淑云

    2015-01-01

    “Topic Talk”is the key part in the Mandarin Proficiency Test. It is quite difficult,but the score is high. In this part,the examinees’weaknesses are easily exposed so that it becomes the most important in test-training tutorials. This paper mainly analyses the examinees’non-phonetic errors in“topic talk”items and their causes,and points out that these errors could be avoided in“topic talk”in the Mandarin Proficiency Test. So long as the examinees make the full preparation,they will do better and have a contented result. Therefore,the paper suggests that,in the tutorials before the examination,teachers not only lay emphasis on speech training,but also specially strengthen the test-taking guidance. Based on the marking criterion for“topic talk”,teachers should give the examinees more training on the mistakes that they often make so as to help them reduce their non-phonetic errors,increase the scoring average of“topic talk”,and improve the examinees’scores in the Mandarin Proficiency Test.

  12. Abbreviations and acronyms: the case of Tlhalosi ya

    African Journals Online (AJOL)

    user

    creation of new abbreviations and acronyms because of new technologies such as mobile ... single words, as in NATO, NASA or UNESCO. Landau (1989: 27), on ..... Technology Research and Innovation); BOCRA (Botswana Communications.

  13. 32 CFR Attachment 1 to Part 855 - Glossary of References, Abbreviations, Acronyms, and Terms

    Science.gov (United States)

    2010-07-01

    ... 32 National Defense 6 2010-07-01 2010-07-01 false Glossary of References, Abbreviations, Acronyms... Attachment 1 to Part 855—Glossary of References, Abbreviations, Acronyms, and Terms Section A—References AFPD... Carriers Section B—Abbreviations and Acronyms Abbreviations and acronyms Definitions AFI Air Force...

  14. McArthur-Bates Communicative Development Inventory (CDI: Proposal of an abbreviate version

    Directory of Open Access Journals (Sweden)

    Chamarrita Farkas Klein

    2011-01-01

    Full Text Available The McArthur-Bates Communicative Development Inventories (CDI assesses language development en children, through a significant caregiver report. The first inventory assesses verbal and non verbal language in infants who are from 8 to 18 months old and it is composed of 949 items distributed in 6 scales. This study proposes an abbreviate form of this instrument, and was tested on families and educators of 130 Chilean children of 11-15 months old. Analyses related to the items, reliability and validity of the instrument and factorial analyses of subscales were realized. The abbreviate version consider 241 items distributed in 4 scales. The evaluation of the psychometric properties of the instrument was acceptable, demonstrating adequate reliability and validity.

  15. The Impact of Texting on Comprehension

    Directory of Open Access Journals (Sweden)

    Jamal K. M. Ali

    2015-07-01

    Full Text Available This paper presents a study of the effects of texting on English language comprehension. The authors believe that English used in texting causes a lack of comprehension for English speakers, learners, and texters. Wei, Xian-hai and Jiang (2008:3 declare “In Netspeak, there are some newly-created vocabularies, which people cannot comprehend them either from their partial pronunciation or from their figures.” Crystal (2007:23 claims; “variation causes problems of comprehension and acceptability. If you speak or write differently from the way I do, we may fail to understand each other.”  In this paper, the authors conducted a questionnaire at Aligarh Muslim University to ninety respondents from five different Faculties and four different levels. To measure respondents’ comprehension of English texting, the authors gave the respondents abbreviations used by texters and asked them to write the full forms of the abbreviations. The authors found that many abbreviations were not understood, which suggested that most of the respondents did not understand and did not use these abbreviations.

  16. BUSINESS ENGLISH OUTSIDE THE BOX. BUSINESS JARGON AND ABBREVIATIONS IN BUSINESS COMMUNICATION

    Directory of Open Access Journals (Sweden)

    Pop Anamaria-Mirabela

    2014-12-01

    Full Text Available Business English is commonly understood language, yet Harvard Business Review called business jargon “The Silent Killer of Big Companies”. As we all have been taught in school, we are aware of the fact that in communication we must comply with linguistic rules so that our message gets across succinctly. Yet, there is one place where all these rules can be omitted (at least in the recent decades: the corporate office. Here, one can use euphemisms and clichés, can capitalize any word that is considered important, the passive voice is used wherever possible and abbreviations occur in every sentence. The worst part is that all of these linguistic enormities are carried out deliberately. The purpose of this paper is to analyse to what extent business jargon and abbreviations have affected business communication (which most of the time, it is filled with opaque language to mask different activities and operations and the reasons for which these linguistic phenomena have become so successful in the present. One of the reasons for the research is that in business English, jargon can be annoying because it overcomplicates. It is frequently unnecessary and it can transform a simple idea or instruction into something very confusing. It is true that every field has its jargon. Education, journalism, law, politics, medicine, urban planning – no filed is immune. Yet, it seems that business jargon has been described as “the most annoying”. Another reason is that jargon tends to be elitist. Those who do not understand the terms feel confused and uncertain. The paper starts with defining these two concepts, business jargon and abbreviations, and then it attempts to explain the “unusual” pervasion of these, both in business communication and in everyday communication. For this, the paper includes a list with the most common business jargon and abbreviations. In this view, the authors have accessed different economic blogs and specialty journals

  17. The Only Safe SMS Texting Is No SMS Texting.

    Science.gov (United States)

    Toth, Cheryl; Sacopulos, Michael J

    2015-01-01

    Many physicians and practice staff use short messaging service (SMS) text messaging to communicate with patients. But SMS text messaging is unencrypted, insecure, and does not meet HIPAA requirements. In addition, the short and abbreviated nature of text messages creates opportunities for misinterpretation, and can negatively impact patient safety and care. Until recently, asking patients to sign a statement that they understand and accept these risks--as well as having policies, device encryption, and cyber insurance in place--would have been enough to mitigate the risk of using SMS text in a medical practice. But new trends and policies have made SMS text messaging unsafe under any circumstance. This article explains these trends and policies, as well as why only secure texting or secure messaging should be used for physician-patient communication.

  18. 32 CFR Appendix B to Part 806 - Abbreviations and Acronyms

    Science.gov (United States)

    2010-07-01

    ... 32 National Defense 6 2010-07-01 2010-07-01 false Abbreviations and Acronyms B Appendix B to Part 806 National Defense Department of Defense (Continued) DEPARTMENT OF THE AIR FORCE ADMINISTRATION AIR... Acronyms AFCA—Air Force Communications Agency AFCIC—Air Force Communications and Information Center AFRC...

  19. 40 CFR 87.2 - Acronyms and abbreviations.

    Science.gov (United States)

    2010-07-01

    ... 40 Protection of Environment 20 2010-07-01 2010-07-01 false Acronyms and abbreviations. 87.2 Section 87.2 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) AIR PROGRAMS (CONTINUED) CONTROL OF AIR POLLUTION FROM AIRCRAFT AND AIRCRAFT ENGINES General Provisions § 87.2 Acronyms and...

  20. 7 CFR 770.2 - Abbreviations and definitions.

    Science.gov (United States)

    2010-01-01

    ... Interior pursuant to the Indian Reorganization Act. Reserve is an account established for loans approved in... AGRICULTURE SPECIAL PROGRAMS INDIAN TRIBAL LAND ACQUISITION LOANS § 770.2 Abbreviations and definitions. (a... requirements of part 761 of this chapter. Applicant is a Native American tribe or tribal corporation...

  1. 27 CFR 19.726 - Authorized abbreviations to identify spirits.

    Science.gov (United States)

    2010-04-01

    ... records: Kinds of spirits Abbreviations Alcohol A Brandy BR Bourbon Whisky BW Canadian Whisky CNW Completely Denatured Alcohol CDA Corn Whisky CW Grain Spirits GS Irish Whisky IW Light Whisky LW Malt Whisky MW Neutral Spirits NS Neutral Spirits Grain NSG Rye Whisky RW Scotch Whisky SW Specially...

  2. Abbreviated Pandemic Influenza Planning Template for Primary Care Offices

    Energy Technology Data Exchange (ETDEWEB)

    HCTT CHE

    2010-01-01

    The Abbreviated Pandemic Influenza Plan Template for Primary Care Provider Offices is intended to assist primary care providers and office managers with preparing their offices for quickly putting a plan in place to handle an increase in patient calls and visits, whether during the 2009-2010 influenza season or future influenza seasons.

  3. 16 CFR 300.9 - Abbreviations, ditto marks, and asterisks.

    Science.gov (United States)

    2010-01-01

    ... 16 Commercial Practices 1 2010-01-01 2010-01-01 false Abbreviations, ditto marks, and asterisks..., ditto marks, and asterisks. (a) In disclosing required information, words or terms shall not be designated by ditto marks or appear in footnotes referred to by asterisks or other symbols in...

  4. 76 FR 44013 - Draft Guidance for Industry: Implementation of Acceptable Full-Length and Abbreviated Donor...

    Science.gov (United States)

    2011-07-22

    .... The Plasma Protein Therapeutics Association (PPTA) Source Plasma donor history questionnaires and... Full- Length and Abbreviated Donor History Questionnaires and Accompanying Materials for Use in... entitled ``Guidance for Industry: Implementation of Acceptable Full-Length and Abbreviated Donor History...

  5. 76 FR 13880 - Investigational New Drug Applications and Abbreviated New Drug Applications; Technical Amendment

    Science.gov (United States)

    2011-03-15

    ... HUMAN SERVICES Food and Drug Administration 21 CFR Parts 312 and 314 Investigational New Drug Applications and Abbreviated New Drug Applications; Technical Amendment AGENCY: Food and Drug Administration... amending its investigational new drug application (IND) regulations and abbreviated new drug...

  6. Predicting Chinese Abbreviations from Definitions: An Empirical Learning Approach Using Support Vector Regression

    Institute of Scientific and Technical Information of China (English)

    Xu Sun; Hou-Feng Wang; Bo Wang

    2008-01-01

    In Chinese, phrases and named entities play a central role in information retrieval. Abbreviations, however,make keyword-based approaches less effective. This paper presents an empirical learning approach to Chinese abbreviation prediction. In this study, each abbreviation is taken as a reduced form of the corresponding definition (expanded form),and the abbreviation prediction is formalized as a scoring and ranking problem among abbreviation candidates, which are automatically generated from the corresponding definition. By employing Support Vector Regression (SVR) for scoring,we can obtain multiple abbreviation candidates together with their SVR values, which are used for candidate ranking.Experimental results show that the SVR method performs better than the popular heuristic rule of abbreviation prediction.In addition, in abbreviation prediction, the SVR method outperforms the hidden Markov model (HMM).

  7. An abbreviated version of the brief assessment of cognition in schizophrenia (BACS

    Directory of Open Access Journals (Sweden)

    MD Yasuhiro Kaneda

    2015-06-01

    Full Text Available Background and Objectives: A short version of the Brief Assessment of Cognition in Schizophrenia (BACS was derived. Methods: We calculated the corrected item-total correlation (CITC for each test score relative to the composite score, and then computed the proportion of variance that each test shares with the global score excluding that test (Rt² = CITCt² and the variance explained per minute of administration time for each test (Rt²/mint. Results and Conclusions: The 3 tests with the highest Rt²/mint, Symbol Coding, Digit Sequencing, and Token Motor, were selected for the Abbreviated BACS.

  8. 21 CFR 314.127 - Refusal to approve an abbreviated new drug application.

    Science.gov (United States)

    2010-04-01

    ... 21 Food and Drugs 5 2010-04-01 2010-04-01 false Refusal to approve an abbreviated new drug... HUMAN SERVICES (CONTINUED) DRUGS FOR HUMAN USE APPLICATIONS FOR FDA APPROVAL TO MARKET A NEW DRUG FDA Action on Applications and Abbreviated Applications § 314.127 Refusal to approve an abbreviated new...

  9. Abbreviated Case Studies in Organizational Communication

    Science.gov (United States)

    Wanguri, Deloris McGee

    2005-01-01

    The cases contained within organizational communication texts are generally two to three pages, often followed by questions. These case studies are certainly useful. They generally describe events in the present, provide some type of organizational context, include first-hand data, include a record of what people say and think, develop a…

  10. Text Mining.

    Science.gov (United States)

    Trybula, Walter J.

    1999-01-01

    Reviews the state of research in text mining, focusing on newer developments. The intent is to describe the disparate investigations currently included under the term text mining and provide a cohesive structure for these efforts. A summary of research identifies key organizations responsible for pushing the development of text mining. A section…

  11. Text Mining.

    Science.gov (United States)

    Trybula, Walter J.

    1999-01-01

    Reviews the state of research in text mining, focusing on newer developments. The intent is to describe the disparate investigations currently included under the term text mining and provide a cohesive structure for these efforts. A summary of research identifies key organizations responsible for pushing the development of text mining. A section…

  12. Abbreviated MRI Protocols: Wave of the Future for Breast Cancer Screening.

    Science.gov (United States)

    Chhor, Chloe M; Mercado, Cecilia L

    2017-02-01

    The purpose of this article is to describe the use of abbreviated breast MRI protocols for improving access to screening for women at intermediate risk. Breast MRI is not a cost-effective modality for screening women at intermediate risk, including those with dense breast tissue as the only risk. Abbreviated breast MRI protocols have been proposed as a way of achieving efficiency and rapid throughput. Use of these abbreviated protocols may increase availability and provide women with greater access to breast MRI.

  13. Text Illustrations.

    Science.gov (United States)

    Duchastel, Philippe C.

    1983-01-01

    Discusses three roles of textbook illustrations--to arrest the reader's attention and arouse interest, to provide explanation and clarification of complex verbal descriptions, and to aid retention of the information presented in the text. It is recommended that illustrations be designed with their specific role(s) in mind. (EAO)

  14. Validation of an abbreviated quality of life scale for schizophrenia.

    Science.gov (United States)

    Fervaha, Gagan; Remington, Gary

    2013-09-01

    The field of therapeutics in schizophrenia is redefining optimal outcome, moving beyond clinical remission to a more comprehensive model that also includes functional recovery. The Quality of Life Scale (QLS) has been adopted by many large clinical trials, including CATIE and CUtLASS, as a measure of functioning. The QLS is a 21-item semi-structured interview that takes approximately 45min to administer. Although the QLS is considered comprehensive, its length limits its applicability across studies. To circumvent this issue, short scales of the QLS have been created that estimate total scores with high accuracy. However, these abbreviated measures have not been adequately cross-validated in a large enough sample to allow for subsample estimations nor has its predictive ability been compared to the full scale. Here, we used data from the CATIE trial (n=1460) to demonstrate the validity and utility of an abbreviated 7-item QLS. The shortened QLS was robust in estimating total scores (r=0.953, p<0.001) across subsamples and demonstrated predictive ability similar to the full QLS in multiple regression models. The abridged QLS is recommended as a surrogate measure of psychosocial functioning, especially in cases where functioning is not the primary outcome. Copyright © 2012 Elsevier B.V. and ECNP. All rights reserved.

  15. 78 FR 26785 - Guidance for Industry: Implementation of an Acceptable Abbreviated Donor History Questionnaire...

    Science.gov (United States)

    2013-05-08

    ...The Food and Drug Administration (FDA) is announcing the availability of a document entitled ``Guidance for Industry: Implementation of an Acceptable Abbreviated Donor History Questionnaire and Accompanying Materials for Use in Screening Frequent Donors of Blood and Blood Components'' dated May 2013. The guidance document recognizes the abbreviated donor history questionnaire and accompanying......

  16. 76 FR 65735 - Draft Guidance for Industry: Implementation of Acceptable Abbreviated Donor History Questionnaire...

    Science.gov (United States)

    2011-10-24

    ...The Food and Drug Administration (FDA) is announcing the availability of a draft document entitled ``Guidance for Industry: Implementation of Acceptable Abbreviated Donor History Questionnaire and Accompanying Materials for Use in Screening Frequent Donors of Blood and Blood Components'' dated October 2011. The draft guidance document recognizes the abbreviated donor history questionnaire and......

  17. 21 CFR 314.153 - Suspension of approval of an abbreviated new drug application.

    Science.gov (United States)

    2010-04-01

    ... 21 Food and Drugs 5 2010-04-01 2010-04-01 false Suspension of approval of an abbreviated new drug... HUMAN SERVICES (CONTINUED) DRUGS FOR HUMAN USE APPLICATIONS FOR FDA APPROVAL TO MARKET A NEW DRUG FDA... new drug application. (a) Suspension of approval. The approval of an abbreviated new drug...

  18. 21 CFR 314.101 - Filing an application and receiving an abbreviated new drug application.

    Science.gov (United States)

    2010-04-01

    ... new drug application. 314.101 Section 314.101 Food and Drugs FOOD AND DRUG ADMINISTRATION, DEPARTMENT... A NEW DRUG FDA Action on Applications and Abbreviated Applications § 314.101 Filing an application and receiving an abbreviated new drug application. (a)(1) Within 60 days after FDA receives...

  19. 77 FR 50702 - Ranbaxy Laboratories Limited; Withdrawal of Approval of 27 Abbreviated New Drug Applications

    Science.gov (United States)

    2012-08-22

    ... Abbreviated New Drug Applications AGENCY: Food and Drug Administration, HHS. ACTION: Notice. ] SUMMARY: The Food and Drug Administration (FDA) is withdrawing approval of 27 abbreviated new drug applications... introduction into interstate commerce of products without approved new drug applications violates section...

  20. 16 CFR 303.5 - Abbreviations, ditto marks, and asterisks prohibited.

    Science.gov (United States)

    2010-01-01

    ... 16 Commercial Practices 1 2010-01-01 2010-01-01 false Abbreviations, ditto marks, and asterisks... Abbreviations, ditto marks, and asterisks prohibited. (a) In disclosing required information, words or terms shall not be designated by ditto marks or appear in footnotes referred to by asterisks or other...

  1. Compression and the origins of Zipf's law of abbreviation

    CERN Document Server

    Ferrer-i-Cancho, R; Seguin, C

    2015-01-01

    Languages across the world exhibit Zipf's law of abbreviation, namely more frequent words tend to be shorter. The generalized version of the law, namely an inverse relationship between the frequency of a unit and its magnitude, holds also for the behaviors of other species and the genetic code. The apparent universality of this pattern in human language and its ubiquity in other domains calls for a theoretical understanding of its origins. We generalize the information theoretic concept of mean code length as a mean energetic cost function over the probability and the magnitude of the symbols of the alphabet. We show that the minimization of that cost function and a negative correlation between probability and the magnitude of symbols are intimately related.

  2. 78 FR 13071 - Guidance for Industry: Implementation of an Acceptable Full-Length and Abbreviated Donor History...

    Science.gov (United States)

    2013-02-26

    ... Plasma Protein Therapeutics Association (PPTA) Source Plasma donor history questionnaires and...- Length and Abbreviated Donor History Questionnaires and Accompanying Materials for Use in Screening... ``Guidance for Industry: Implementation of an Acceptable Full-Length and Abbreviated Donor History...

  3. 76 FR 26307 - Guidance for Industry on the Submission of Summary Bioequivalence Data for Abbreviated New Drug...

    Science.gov (United States)

    2011-05-06

    ... HUMAN SERVICES Food and Drug Administration Guidance for Industry on the Submission of Summary Bioequivalence Data for Abbreviated New Drug Applications; Availability AGENCY: Food and Drug Administration, HHS... guidance for industry entitled ``Submission of Summary Bioequivalence Data for Abbreviated New...

  4. 21 CFR 314.152 - Notice of withdrawal of approval of an application or abbreviated application for a new drug.

    Science.gov (United States)

    2010-04-01

    ... or abbreviated application for a new drug. 314.152 Section 314.152 Food and Drugs FOOD AND DRUG... APPROVAL TO MARKET A NEW DRUG FDA Action on Applications and Abbreviated Applications § 314.152 Notice of withdrawal of approval of an application or abbreviated application for a new drug. If the Food and...

  5. Adaptation of abbreviated mathematics anxiety rating scale for engineering students

    Science.gov (United States)

    Nordin, Sayed Kushairi Sayed; Samat, Khairul Fadzli; Sultan, Al Amin Mohamed; Halim, Bushra Abdul; Ismail, Siti Fatimah; Mafazi, Nurul Wirdah

    2015-05-01

    Mathematics is an essential and fundamental tool used by engineers to analyse and solve problems in their field. Due to this, most engineering education programs involve a concentration of study in mathematics courses whereby engineering students have to take mathematics courses such as numerical methods, differential equations and calculus in the first two years and continue to do so until the completion of the sequence. However, the students struggled and had difficulties in learning courses that require mathematical abilities. Hence, this study presents the factors that caused mathematics anxiety among engineering students using Abbreviated Mathematics Anxiety Rating Scale (AMARS) through 95 students of Universiti Teknikal Malaysia Melaka (UTeM). From 25 items in AMARS, principal component analysis (PCA) suggested that there are four mathematics anxiety factors, namely experiences of learning mathematics, cognitive skills, mathematics evaluation anxiety and students' perception on mathematics. Minitab 16 software was used to analyse the nonparametric statistics. Kruskal-Wallis Test indicated that there is a significant difference in the experience of learning mathematics and mathematics evaluation anxiety among races. The Chi-Square Test of Independence revealed that the experience of learning mathematics, cognitive skills and mathematics evaluation anxiety depend on the results of their SPM additional mathematics. Based on this study, it is recommended to address the anxiety problems among engineering students at the early stage of studying in the university. Thus, lecturers should play their part by ensuring a positive classroom environment which encourages students to study mathematics without fear.

  6. Frontiers of biomedical text mining: current progress

    Science.gov (United States)

    Zweigenbaum, Pierre; Demner-Fushman, Dina; Yu, Hong; Cohen, Kevin B.

    2008-01-01

    It is now almost 15 years since the publication of the first paper on text mining in the genomics domain, and decades since the first paper on text mining in the medical domain. Enormous progress has been made in the areas of information retrieval, evaluation methodologies and resource construction. Some problems, such as abbreviation-handling, can essentially be considered solved problems, and others, such as identification of gene mentions in text, seem likely to be solved soon. However, a number of problems at the frontiers of biomedical text mining continue to present interesting challenges and opportunities for great improvements and interesting research. In this article we review the current state of the art in biomedical text mining or ‘BioNLP’ in general, focusing primarily on papers published within the past year. PMID:17977867

  7. 75 FR 77897 - Long Walk National Historic Trail Feasibility Study, Abbreviated Final Environmental Impact...

    Science.gov (United States)

    2010-12-14

    ... trail would be designated, emphasizing the removal experiences common to both tribes. An auto tour route... National Park Service Long Walk National Historic Trail Feasibility Study, Abbreviated Final Environmental Impact Statement, National Trails Intermountain Region, NM AGENCY: National Park Service,...

  8. Self efficacy for fruit, vegetable and water intakes: Expanded and abbreviated scales from item response modeling analyses

    Directory of Open Access Journals (Sweden)

    Cullen Karen W

    2010-03-01

    Full Text Available Abstract Objective To improve an existing measure of fruit and vegetable intake self efficacy by including items that varied on levels of difficulty, and testing a corresponding measure of water intake self efficacy. Design Cross sectional assessment. Items were modified to have easy, moderate and difficult levels of self efficacy. Classical test theory and item response modeling were applied. Setting One middle school at each of seven participating sites (Houston TX, Irvine CA, Philadelphia PA, Pittsburg PA, Portland OR, rural NC, and San Antonio TX. Subjects 714 6th grade students. Results Adding items to reflect level (low, medium, high of self efficacy for fruit and vegetable intake achieved scale reliability and validity comparable to existing scales, but the distribution of items across the latent variable did not improve. Selecting items from among clusters of items at similar levels of difficulty along the latent variable resulted in an abbreviated scale with psychometric characteristics comparable to the full scale, except for reliability. Conclusions The abbreviated scale can reduce participant burden. Additional research is necessary to generate items that better distribute across the latent variable. Additional items may need to tap confidence in overcoming more diverse barriers to dietary intake.

  9. The Convergent, Discriminant, and Concurrent Validity of Scores on the Abbreviated Self-Leadership Questionnaire

    Directory of Open Access Journals (Sweden)

    Faruk Şahin

    2015-10-01

    Full Text Available The present study reports the psychometric properties of a short measure of self-leadership in the Turkish context: the Abbreviated Self-Leadership Questionnaire (ASLQ. The ASLQ was examined using two samples and showed sound psychometric properties. Confirmatory factor analysis showed that nine-item ASLQ measured a single construct of self-leadership. The results supported the convergent and discriminant validity of the one-factor model of the ASLQ in relation to the 35-item Revised Self-Leadership Questionnaire and General Self-Efficacy scale, respectively. With regard to internal consistency and test-retest reliability, the ASLQ showed acceptable results. Furthermore, the results provided evidence that scores on the ASLQ positively predicted individual's self-reported task performance and self-efficacy mediated this relationship. Taken together, these findings suggest that the Turkish version of the ASLQ is a reliable and valid measure that can be used to measure self-leadership as one variable of interest in the future studies.

  10. Reliability and validity of the Farsi version of the standardized assessment of personality-abbreviated scale

    Directory of Open Access Journals (Sweden)

    Maryam Sepehri

    2017-06-01

    Full Text Available Introduction: A short screening tool for high-risk individuals with personality disorder (PD is useful both for clinicians and researchers. The aim of this study was to assess the validity and reliability of the Farsi version of the Standardized Assessment of Personality-Abbreviated Scale (SAPAS. Methods: The original English version of the SAPAS questionnaire was translated into Farsi, and then, translated back into English by two professionals. A survey was then conducted using the questionnaire on 150 clients of primary health care centers in Tabriz, Iran. A total of 235 medical students were also studied for the reliability assessment of the questionnaire. The SAPAS was compared to the short form of Minnesota Multiphasic Personality Inventory (MMPI. The data analysis was performed using receiver operating characteristic (ROC curve technique, operating characteristic for diagnostic efficacy, Cronbach's alpha, and test-retest for reliability evaluation. Results: We found an area under the curve (AUC of 0.566 [95% confidence intervals (CI: 0.455-0.677]; sensitivity of 0.89 and specificity of 0.26 at the cut-off score of 2 and higher. The total Cronbach's alpha coefficient was 0.38 and Cohen's kappa ranged between 0.5 and 0.8. Conclusion: The current study showed that the Farsi version of the SAPAS was relatively less efficient, in term of validity and reliability, in the screening of PD in the population.

  11. A Confirmatory Factor Analysis of the Structure of Abbreviated Math Anxiety Scale

    Directory of Open Access Journals (Sweden)

    Farahman Farrokhi

    2011-06-01

    Full Text Available "nObjective: The aim of this study is to explore the confirmatory factor analysis results of the Persian adaptation of Abbreviated Math Anxiety Scale (AMAS, proposed by Hopko, Mahadevan, Bare & Hunt. "nMethod: The validity and reliability assessments of the scale were performed on 298 college students chosen randomly from Tabriz University in Iran. The confirmatory factor analysis (CFA was carried out to determine the factor structures of the Persian version of AMAS. "nResults: As expected, the two-factor solution provided a better fit to the data than a single factor. Moreover, multi-group analyses showed that this two-factor structure was invariant across sex. Hence, AMAS provides an equally valid measure for use among college students. "nConclusions:  Brief AMAS demonstrates adequate reliability and validity. The AMAS scores can be used to compare symptoms of math anxiety between male and female students. The study both expands and adds support to the existing body of math anxiety literature.

  12. Screening for personality disorder with the Standardised Assessment of Personality: Abbreviated Scale (SAPAS: further evidence of concurrent validity

    Directory of Open Access Journals (Sweden)

    Moran Paul

    2010-01-01

    Full Text Available Abstract Background The assessment of personality disorders (PD is costly and time-consuming. There is a need for a brief screen for personality disorders that can be used in routine clinical settings and epidemiological surveys. Aims: To test the validity of the Standardised Assessment of Personality: Abbreviated Scale (SAPAS as a screen for PD in a clinical sample of substance abusers. Methods Convergent validity of the SAPAS with both categorical and dimensional representations of personality disorders was estimated. Results In this sample, the SAPAS correlated well with dimensional representations of cluster A and C personality disorders, even after controlling for ADHD symptoms, anxiety/depression symptoms and recent substance use. The SAPAS was also significantly associated with total number of PD criteria, although correlation with categorical measures of PD was weak. Conclusions The SAPAS is an valid brief screen for PD as assessed dimensionally.

  13. Abbreviated protocol for breast MRI: Are multiple sequences needed for cancer detection?

    Energy Technology Data Exchange (ETDEWEB)

    Mango, Victoria L., E-mail: vlm2125@columbia.edu [Columbia University Medical Center, Herbert Irving Pavilion, 161 Fort Washington Avenue, 10th Floor, New York, NY 10032 (United States); Memorial Sloan-Kettering Cancer Center, Breast and Imaging Center, 300 East 66th Street, New York, NY 10065 (United States); Morris, Elizabeth A., E-mail: morrise@mskcc.org [Memorial Sloan-Kettering Cancer Center, Breast and Imaging Center, 300 East 66th Street, New York, NY 10065 (United States); David Dershaw, D., E-mail: dershawd@mskcc.org [Memorial Sloan-Kettering Cancer Center, Breast and Imaging Center, 300 East 66th Street, New York, NY 10065 (United States); Abramson, Andrea, E-mail: abramsoa@mskcc.org [Memorial Sloan-Kettering Cancer Center, Breast and Imaging Center, 300 East 66th Street, New York, NY 10065 (United States); Fry, Charles, E-mail: charles_fry@nymc.edu [Memorial Sloan-Kettering Cancer Center, Breast and Imaging Center, 300 East 66th Street, New York, NY 10065 (United States); New York Medical College, 40 Sunshine Cottage Rd, Valhalla, NY 10595 (United States); Moskowitz, Chaya S. [Memorial Sloan-Kettering Cancer Center, Breast and Imaging Center, 300 East 66th Street, New York, NY 10065 (United States); Hughes, Mary, E-mail: hughesm@mskcc.org [Memorial Sloan-Kettering Cancer Center, Breast and Imaging Center, 300 East 66th Street, New York, NY 10065 (United States); Kaplan, Jennifer, E-mail: kaplanj@mskcc.org [Memorial Sloan-Kettering Cancer Center, Breast and Imaging Center, 300 East 66th Street, New York, NY 10065 (United States); Jochelson, Maxine S., E-mail: jochelsm@mskcc.org [Memorial Sloan-Kettering Cancer Center, Breast and Imaging Center, 300 East 66th Street, New York, NY 10065 (United States)

    2015-01-15

    Highlights: • Abbreviated breast MR demonstrates high sensitivity for breast carcinoma detection. • Time to perform/interpret the abbreviated exam is shorter than a standard MRI exam. • An abbreviated breast MRI could reduce costs and make MRI screening more available. - Abstract: Objective: To evaluate the ability of an abbreviated breast magnetic resonance imaging (MRI) protocol, consisting of a precontrast T1 weighted (T1W) image and single early post-contrast T1W image, to detect breast carcinoma. Materials and methods: A HIPAA compliant Institutional Review Board approved review of 100 consecutive breast MRI examinations in patients with biopsy proven unicentric breast carcinoma. 79% were invasive carcinomas and 21% were ductal carcinoma in situ. Four experienced breast radiologists, blinded to carcinoma location, history and prior examinations, assessed the abbreviated protocol evaluating only the first post-contrast T1W image, post-processed subtracted first post-contrast and subtraction maximum intensity projection images. Detection and localization of tumor were compared to the standard full diagnostic examination consisting of 13 pre-contrast, post-contrast and post-processed sequences. Results: All 100 cancers were visualized on initial reading of the abbreviated protocol by at least one reader. The mean sensitivity for each sequence was 96% for the first post-contrast sequence, 96% for the first post-contrast subtraction sequence and 93% for the subtraction MIP sequence. Within each sequence, there was no significant difference between the sensitivities among the 4 readers (p = 0.471, p = 0.656, p = 0.139). Mean interpretation time was 44 s (range 11–167 s). The abbreviated imaging protocol could be performed in approximately 10–15 min, compared to 30–40 min for the standard protocol. Conclusion: An abbreviated breast MRI protocol allows detection of breast carcinoma. One pre and post-contrast T1W sequence may be adequate for detecting

  14. 76 FR 64951 - Apothecon et al.; Withdrawal of Approval of 103 New Drug Applications and 35 Abbreviated New Drug...

    Science.gov (United States)

    2011-10-19

    ... HUMAN SERVICES Food and Drug Administration Apothecon et al.; Withdrawal of Approval of 103 New Drug Applications and 35 Abbreviated New Drug Applications; Correction AGENCY: Food and Drug Administration, HHS... new drug applications (NDAs) and 35 abbreviated new drug applications (ANDAs) from multiple...

  15. 78 FR 25749 - Submission of New Drug Application/Abbreviated New Drug Application Field Alert Reports: Notice...

    Science.gov (United States)

    2013-05-02

    ... From the Federal Register Online via the Government Publishing Office DEPARTMENT OF HEALTH AND HUMAN SERVICES Food and Drug Administration Submission of New Drug Application/Abbreviated New Drug... submit new drug application (NDA) and abbreviated new drug application (ANDA) Field Alert Reports...

  16. Towards a Theory and View of Teaching Compressed and Abbreviated Research Methodology and Statistics Courses

    Directory of Open Access Journals (Sweden)

    James Carifio

    2007-01-01

    Full Text Available One of the highly questionable effects of educational reform and other curriculum reshaping factors at both the high school, post-secondary and graduate levels has been the shift to teaching compressed, pared-down or abbreviated courses in still needed or required subject-matter that became de-emphasized in the current educational reformation. Research methodology, particularly the highly quantitative and experimental kind and statistics, are two still needed to some degree subject matters that has been especially affected by this demotion and compression movement at the pre-service, in-service, professional development, undergraduate, continuing education and graduate levels, even though the professional areas of education, science, business, politics and most other areas (including history have become far more quantitative and objective research oriented than in the past. Until there are more enlightened policy shifts, effective means of teaching such compressed courses need to be devised and tested, if only to lessen the negative outcomes of such critical courses. This article, therefore, analyzes compressed courses from the point of view of cognitive learning and then describes 5 methods and approaches that were tested to improve the effectiveness of research methodology and statistics courses taught in these formats. Each of the formats helped to reduce student stress and anxiety about the content and its compressed presentation and improved understanding and achievement. The theory and view developed in this article is also applicable to similar compressed courses for scientific and/or technical content which are currently prevalent in allied health and biotechnology areas.

  17. Abbreviated quality of life scales for schizophrenia: comparison and utility of two brief community functioning measures.

    Science.gov (United States)

    Fervaha, Gagan; Foussias, George; Siddiqui, Ishraq; Agid, Ofer; Remington, Gary

    2014-04-01

    The Heinrichs-Carpenter Quality of Life Scale (QLS) is the most extensively used real-world community functioning scale in schizophrenia research. However, the extensive time required to administer it and the inclusion of items that overlap conceptually with negative symptoms limit its use across studies. The present study examined the validity and utility of two abbreviated QLS measures against the full QLS excluding negative symptom items. The sample included 1427 patients with schizophrenia who completed the baseline visit in the CATIE study. The validity of two abbreviated QLS measures (7-item and 4-item) were examined with the full QLS, excluding the intrapsychic foundations subscale, using correlation analysis. The utility of the abbreviated measures was explored by examining associations between the functioning scales and clinical variables and longitudinal change. Both abbreviated QLS measures were highly predictive of the full QLS (both r=0.91, pschizophrenia, especially when assessment of functional outcome is not the focus. Copyright © 2014 Elsevier B.V. All rights reserved.

  18. 77 FR 12877 - Record of Decision for the General Management Plan/Abbreviated Final Environmental Impact...

    Science.gov (United States)

    2012-03-02

    ... National Park Service Record of Decision for the General Management Plan/Abbreviated Final Environmental... Management Plan for New River Gorge National River, West Virginia. The Record of Decision selects the... the Record of Decision selecting Alternative 5 as the approved General Management Plan for New River...

  19. 21 CFR 314.150 - Withdrawal of approval of an application or abbreviated application.

    Science.gov (United States)

    2010-04-01

    ... HEALTH AND HUMAN SERVICES (CONTINUED) DRUGS FOR HUMAN USE APPLICATIONS FOR FDA APPROVAL TO MARKET A NEW... that is described in the application or abbreviated application and that is essential to show that the... or contract research organization that conducted a bioavailability or bioequivalence study...

  20. New drug applications and abbreviated new drug applications; technical amendment. Final rule; technical amendment.

    Science.gov (United States)

    2009-03-06

    The Food and Drug Administration (FDA) is amending its new drug application (NDA) and abbreviated new drug application (ANDA) regulations to update agency contacts for patent information and patent notifications and to correct an inaccurate cross-reference. This action is being taken to ensure accuracy and clarity in the agency's regulations.

  1. 75 FR 37295 - Change of Address; Abbreviated New Drug Applications; Technical Amendment

    Science.gov (United States)

    2010-06-29

    ...The Food and Drug Administration (FDA) is amending its regulations to update the address for applicants to submit abbreviated new drug applications (ANDAs) and ANDA amendments, supplements, and resubmissions. FDA is also updating the address for ANDA applicants to submit investigational new drug applications (INDs) for in vivo bioavailability and bioequivalence studies in humans that are......

  2. Evaluating an Abbreviated Version of the Hispanic Stress Inventory for Immigrants

    Science.gov (United States)

    Cavazos-Rehg, Patricia A.; Zayas, Luis H.; Walker, Mark S.; Fisher, Edwin B.

    2006-01-01

    This study evaluates an abbreviated version of the Hispanic Stress Inventory-Immigrant version (HSI-I) with a nonclinical sample of 143 adult Hispanic immigrants residing in a large midwestern city. The HSI-I consists of 73 items and 5 distinct subscales that assess psychosocial experiences on five dimensions, namely, occupational/economic,…

  3. Psychometric Properties of the Abbreviated Perceived Motivational Climate in Exercise Questionnaire

    Science.gov (United States)

    Moore, E. Whitney G.; Brown, Theresa C.; Fry, Mary D.

    2015-01-01

    The purpose of this study was to develop an abbreviated version of the Perceived Motivational Climate in Exercise Questionnaire (PMCEQ-A) to provide a more practical instrument for use in applied exercise settings. In the calibration step, two shortened versions' measurement and latent model values were compared to each other and the original…

  4. Relationship between Acceptable Noise Level and the Abbreviated Profile of Hearing Aid Benefit

    Science.gov (United States)

    Freyaldenhoven, Melinda C.; Nabelek, Anna K.; Tampas, Joanna W.

    2008-01-01

    Purpose: This study investigated the relationship between acceptable noise levels (ANLs) and the Abbreviated Profile of Hearing Aid Benefit (APHAB; R. M. Cox & G. C. Alexander, 1995). This study further examined the APHAB's ability to predict hearing aid use. Method: ANL and APHAB data were collected for 191 listeners with impaired hearing,…

  5. 78 FR 52931 - Draft Guidance for Industry on Abbreviated New Drug Applications: Stability Testing of Drug...

    Science.gov (United States)

    2013-08-27

    ... HUMAN SERVICES Food and Drug Administration Draft Guidance for Industry on Abbreviated New Drug Applications: Stability Testing of Drug Substances and Products, Questions and Answers; Availability AGENCY... announcing the availability of a draft guidance for industry entitled ``ANDAs: Stability Testing of...

  6. 77 FR 58999 - Draft Guidance for Industry on Abbreviated New Drug Applications: Stability Testing of Drug...

    Science.gov (United States)

    2012-09-25

    ... HUMAN SERVICES Food and Drug Administration Draft Guidance for Industry on Abbreviated New Drug... availability of a draft guidance for industry entitled ``ANDAs: Stability Testing of Drug Substances and... of a draft guidance for industry entitled ``ANDAs: Stability Testing of Drug Substances and...

  7. 75 FR 73108 - Guidance for Industry on Abbreviated New Drug Applications: Impurities in Drug Products...

    Science.gov (United States)

    2010-11-29

    ... HUMAN SERVICES Food and Drug Administration Guidance for Industry on Abbreviated New Drug Applications...: The Food and Drug Administration (FDA) is announcing the availability of a guidance for industry...) guidance for industry ``Q3B(R) Impurities in New Drug Products,'' which was announced in August 2006....

  8. 78 FR 37231 - Guidance for Industry; Guidance on Abbreviated New Drug Applications: Stability Testing of Drug...

    Science.gov (United States)

    2013-06-20

    ... HUMAN SERVICES Food and Drug Administration Guidance for Industry; Guidance on Abbreviated New Drug... the availability of a guidance for industry entitled ``ANDAs: Stability Testing of Drug Substances and... generic drug review, FDA is recommending that the generic drug industry follow the approach in...

  9. The Use of Abbreviations in English-Medium Astrophysics Research Paper Titles: A Problematic Issue

    Science.gov (United States)

    Méndez, David I.; Alcaraz, M. Ángeles

    2015-01-01

    In this study, we carry out a qualitative and quantitative analysis of abbreviations in 300 randomly collected research paper titles published in the most prestigious European and US-based Astrophysics journals written in English. Our main results show that the process of shortening words and groups of words is one of the most characteristic and…

  10. A novel abbreviation standard for organobromine, organochlorine and organophosphorus flame retardants and some characteristics of the chemicals.

    Science.gov (United States)

    Bergman, Ake; Rydén, Andreas; Law, Robin J; de Boer, Jacob; Covaci, Adrian; Alaee, Mehran; Birnbaum, Linda; Petreas, Myrto; Rose, Martin; Sakai, Shinichi; Van den Eede, Nele; van der Veen, Ike

    2012-11-15

    Ever since the interest in organic environmental contaminants first emerged 50years ago, there has been a need to present discussion of such chemicals and their transformation products using simple abbreviations so as to avoid the repetitive use of long chemical names. As the number of chemicals of concern has increased, the number of abbreviations has also increased dramatically, sometimes resulting in the use of different abbreviations for the same chemical. In this article, we propose abbreviations for flame retardants (FRs) substituted with bromine or chlorine atoms or including a functional group containing phosphorus, i.e. BFRs, CFRs and PFRs, respectively. Due to the large number of halogenated and organophosphorus FRs, it has become increasingly important to develop a strategy for abbreviating the chemical names of FRs. In this paper, a two step procedure is proposed for deriving practical abbreviations (PRABs) for the chemicals discussed. In the first step, structural abbreviations (STABs) are developed using specific STAB criteria based on the FR structure. However, since several of the derived STABs are complicated and long, we propose instead the use of PRABs. These are, commonly, an extract of the most essential part of the STAB, while also considering abbreviations previously used in the literature. We indicate how these can be used to develop an abbreviation that can be generally accepted by scientists and other professionals involved in FR related work. Tables with PRABs and STABs for BFRs, CFRs and PFRs are presented, including CAS (Chemical Abstract Service) numbers, notes of abbreviations that have been used previously, CA (Chemical Abstract) name, common names and trade names, as well as some fundamental physico-chemical constants.

  11. A novel abbreviation standard for organobromine, organochlorine and organophosphorus flame retardants and some characteristics of the chemicals

    Science.gov (United States)

    Bergman, Åke; Rydén, Andreas; Law, Robin J.; de Boer, Jacob; Covaci, Adrian; Alaee, Mehran; Birnbaum, Linda; Petreas, Myrto; Rose, Martin; Sakai, Shinichi; Van den Eede, Nele; van der Veen, Ike

    2012-01-01

    Ever since the interest in organic environmental contaminants first emerged 50 years ago, there has been a need to present discussion of such chemicals and their transformation products using simple abbreviations so as to avoid the repetitive use of long chemical names. As the number of chemicals of concern has increased, the number of abbreviations has also increased dramatically, sometimes resulting in the use of different abbreviations for the same chemical. In this article, we propose abbreviations for flame retardants (FRs) substituted with bromine or chlorine atoms or including a functional group containing phosphorus, i.e. BFRs, CFRs and PFRs, respectively. Due to the large number of halogenated and organophosphorus FRs, it has become increasingly important to develop a strategy for abbreviating the chemical names of FRs. In this paper, a two step procedure is proposed for deriving practical abbreviations (PRABs) for the chemicals discussed. In the first step, structural abbreviations (STABs) are developed using specific STAB criteria based on the FR structure. However, since several of the derived STABs are complicated and long, we propose instead the use of PRABs. These are, commonly, an extract of the most essential part of the STAB, while also considering abbreviations previously used in the literature. We indicate how these can be used to develop an abbreviation that can be generally accepted by scientists and other professionals involved in FR related work. Tables with PRABs and STABs for BFRs, CFRs and PFRs are presented, including CAS (Chemical Abstract Service) numbers, notes of abbreviations that have been used previously, CA (Chemical Abstract) name, common names and trade names, as well as some fundamental physico-chemical constants. PMID:22982223

  12. Measuring health outcomes of a multidisciplinary care approach in individuals with chronic environmental conditions using an abbreviated symptoms questionnaire

    Directory of Open Access Journals (Sweden)

    Roy Fox

    2008-12-01

    Full Text Available Roy Fox1, Tara Sampalli1, Jonathan Fox11Nova Scotia Environmental Health Centre, Fall River, NS, CanadaAbstract: The Nova Scotia Environmental Health Centre is a treatment facility for individuals with chronic environmental conditions such as multiple chemical sensitivity, chronic fatigue syndrome, fibromyalgia, chronic respiratory conditions and in some cases chronic pain. The premise of care is to provide a patient-centred multidisciplinary care approach leading to self-management strategies. In order to measure the outcome of the treatment in these complex problems, with overlapping diagnoses, symptoms in many body systems and suspected environmental triggers, a detailed symptoms questionnaire was developed specifically for this patient population and validated. Results from a pilot study in which an abbreviated symptoms questionnaire based on the top reported symptoms captured in previous research was used to measure the efficacy of a multidisciplinary care approach in individuals with multiple chemical sensitivity are presented in this paper. The purpose of this study was to examine the extent, type and patterns of changes over time in the top reported symptoms with treatment measured using the abbreviated symptoms questionnaire. A total of 183 active and 109 discharged patients participated in the study where the health status was measured at different time periods of follow up since the commencement of treatment at the Centre. The findings from this study were successful in generating an initial picture of the nature and type of changes in these symptoms. For instance, symptoms such as difficulty concentrating, sinus conditions and tiredness showed early improvement, within the first 6 months of being in treatment, while others, such as fatigue, hoarseness or loss of voice, took longer while others showed inconsistent changes warranting further enquiry. A controlled longitudinal study is planned to confirm the findings of the pilot study

  13. A Customizable Text Classifier for Text Mining

    Directory of Open Access Journals (Sweden)

    Yun-liang Zhang

    2007-12-01

    Full Text Available Text mining deals with complex and unstructured texts. Usually a particular collection of texts that is specified to one or more domains is necessary. We have developed a customizable text classifier for users to mine the collection automatically. It derives from the sentence category of the HNC theory and corresponding techniques. It can start with a few texts, and it can adjust automatically or be adjusted by user. The user can also control the number of domains chosen and decide the standard with which to choose the texts based on demand and abundance of materials. The performance of the classifier varies with the user's choice.

  14. Morpheme matching based text tokenization for a scarce resourced language.

    Science.gov (United States)

    Rehman, Zobia; Anwar, Waqas; Bajwa, Usama Ijaz; Xuan, Wang; Chaoying, Zhou

    2013-01-01

    Text tokenization is a fundamental pre-processing step for almost all the information processing applications. This task is nontrivial for the scarce resourced languages such as Urdu, as there is inconsistent use of space between words. In this paper a morpheme matching based approach has been proposed for Urdu text tokenization, along with some other algorithms to solve the additional issues of boundary detection of compound words, affixation, reduplication, names and abbreviations. This study resulted into 97.28% precision, 93.71% recall, and 95.46% F1-measure; while tokenizing a corpus of 57000 words by using a morpheme list with 6400 entries.

  15. Development and validation of a complementary map to enhance the existing 1998 to 2008 Abbreviated Injury Scale map

    Directory of Open Access Journals (Sweden)

    McLellan Susan

    2011-05-01

    Full Text Available Abstract Introduction Many trauma registries have used the Abbreviated Injury Scale 1990 Revision Update 98 (AIS98 to classify injuries. In the current AIS version (Abbreviated Injury Scale 2005 Update 2008 - AIS08, injury classification and specificity differ substantially from AIS98, and the mapping tools provided in the AIS08 dictionary are incomplete. As a result, data from different AIS versions cannot currently be compared. The aim of this study was to develop an additional AIS98 to AIS08 mapping tool to complement the current AIS dictionary map, and then to evaluate the completed map (produced by combining these two maps using double-coded data. The value of additional information provided by free text descriptions accompanying assigned codes was also assessed. Methods Using a modified Delphi process, a panel of expert AIS coders established plausible AIS08 equivalents for the 153 AIS98 codes which currently have no AIS08 map. A series of major trauma patients whose injuries had been double-coded in AIS98 and AIS08 was used to assess the maps; both of the AIS datasets had already been mapped to another AIS version using the AIS dictionary maps. Following application of the completed (enhanced map with or without free text evaluation, up to six AIS codes were available for each injury. Datasets were assessed for agreement in injury severity measures, and the relative performances of the maps in accurately describing the trauma population were evaluated. Results The double-coded injuries sustained by 109 patients were used to assess the maps. For data conversion from AIS98, both the enhanced map and the enhanced map with free text description resulted in higher levels of accuracy and agreement with directly coded AIS08 data than the currently available dictionary map. Paired comparisons demonstrated significant differences between direct coding and the dictionary maps, but not with either of the enhanced maps. Conclusions The newly

  16. Text Maps: Helping Students Navigate Informational Texts.

    Science.gov (United States)

    Spencer, Brenda H.

    2003-01-01

    Notes that a text map is an instructional approach designed to help students gain fluency in reading content area materials. Discusses how the goal is to teach students about the important features of the material and how the maps can be used to build new understandings. Presents the procedures for preparing and using a text map. (SG)

  17. Financial reporting by small companies in the UK: the demand for abbreviated accounts

    OpenAIRE

    Collis, Jill

    2003-01-01

    The aim of the study is to provide generalisble evidence of the utility of the statutory financial statements of small companies to the directors. It took the form of a postal questionnaire survey of the directors of a tranche of 385 companies meeting the EC size criteria for a small company. This paper focuses on the factors that influence the filing choices of the directors of these small companies and the demand for abbreviated accounts.

  18. Validation of an abbreviated Treatment Satisfaction Questionnaire for Medication (TSQM-9 among patients on antihypertensive medications

    Directory of Open Access Journals (Sweden)

    Desrosiers Marie-Pierre

    2009-04-01

    Full Text Available Abstract Background The 14-item Treatment Satisfaction Questionnaire for Medication (TSQM Version 1.4 is a reliable and valid instrument to assess patients' satisfaction with medication, providing scores on four scales – side effects, effectiveness, convenience and global satisfaction. In naturalistic studies, administering the TSQM with the side effects domain could provoke the physician to assess the presence or absence of adverse events in a way that is clinically atypical, carrying the potential to interfere with routine medical care. As a result, an abbreviated 9-item TSQM (TSQM-9, derived from the TSQM Version 1.4 but without the five items of the side effects domain was created. In this study, an interactive voice response system (IVRS-administered TSQM-9 was psychometrically evaluated among patients taking antihypertensive medication. Methods A total of 3,387 subjects were invited to participate in the study from an online panel who self-reported taking a prescribed antihypertensive medication. The subjects were asked to complete the IVRS-administered TSQM-9 at the start of the study, along with the modified Morisky scale, and again within 7 to 14 days. Standard psychometric analyses were conducted; including Cronbach's alpha, intraclass correlation coefficients, structural equation modeling, Spearman correlation coefficients and analysis of covariance (ANCOVA. Results A total of 396 subjects completed all the study procedures. Approximately 50% subjects were male with a good racial/ethnic mix: 58.3% white, 18.9% black, 17.7% Hispanic and 5.1% either Asian or other. There was evidence of construct validity of the TSQM-9 based on the structural equation modeling findings of the observed data fitting the Decisional Balance Model of Treatment Satisfaction even without the side effects domain. TSQM-9 domains had high internal consistency as evident from Cronbach's alpha values of 0.84 and greater. TSQM-9 domains also demonstrated good test

  19. Abbreviated epitaxial growth mode (AGM) method for reducing cost and improving quality of LEDs and lasers

    Science.gov (United States)

    Tansu, Nelson; Chan, Helen M; Vinci, Richard P; Ee, Yik-Khoon; Biser, Jeffrey

    2013-09-24

    The use of an abbreviated GaN growth mode on nano-patterned AGOG sapphire substrates, which utilizes a process of using 15 nm low temperature GaN buffer and bypassing etch-back and recovery processes during epitaxy, enables the growth of high-quality GaN template on nano-patterned AGOG sapphire. The GaN template grown on nano-patterned AGOG sapphire by employing abbreviated growth mode has two orders of magnitude lower threading dislocation density than that of conventional GaN template grown on planar sapphire. The use of abbreviated growth mode also leads to significant reduction in cost of the epitaxy. The growths and characteristics of InGaN quantum wells (QWs) light emitting diodes (LEDs) on both templates were compared. The InGaN QWs LEDs grown on the nano-patterned AGOG sapphire demonstrated at least a 24% enhancement of output power enhancement over that of LEDs grown on conventional GaN templates.

  20. Abbreviated epitaxial growth mode (AGM) method for reducing cost and improving quality of LEDs and lasers

    Energy Technology Data Exchange (ETDEWEB)

    Tansu, Nelson; Chan, Helen M; Vinci, Richard P; Ee, Yik-Khoon; Biser, Jeffrey

    2013-09-24

    The use of an abbreviated GaN growth mode on nano-patterned AGOG sapphire substrates, which utilizes a process of using 15 nm low temperature GaN buffer and bypassing etch-back and recovery processes during epitaxy, enables the growth of high-quality GaN template on nano-patterned AGOG sapphire. The GaN template grown on nano-patterned AGOG sapphire by employing abbreviated growth mode has two orders of magnitude lower threading dislocation density than that of conventional GaN template grown on planar sapphire. The use of abbreviated growth mode also leads to significant reduction in cost of the epitaxy. The growths and characteristics of InGaN quantum wells (QWs) light emitting diodes (LEDs) on both templates were compared. The InGaN QWs LEDs grown on the nano-patterned AGOG sapphire demonstrated at least a 24% enhancement of output power enhancement over that of LEDs grown on conventional GaN templates.

  1. Math Anxiety Assessment with the Abbreviated Math Anxiety Scale: Applicability and usefulness: insights from the Polish adaptation

    Directory of Open Access Journals (Sweden)

    Krzysztof eCipora

    2015-11-01

    Full Text Available Math anxiety has an important impact on mathematical development and performance. However, although math anxiety is supposed to be a transcultural trait, assessment instruments are scarce and are validated mainly for Western cultures so far. Therefore, we aimed at examining the transcultural generality of math anxiety by a thorough investigation of the validity of math anxiety assessment in Eastern Europe. We investigated the validity and reliability of a Polish adaptation of the Abbreviated Math Anxiety Scale (AMAS, known to have very good psychometric characteristics in its original, American-English version as well as in its Italian and Iranian adaptations.We also observed high reliability, both for internal consistency and test-retest stability of the AMAS in the Polish sample. The results also show very good construct, convergent and discriminant validity: The factorial structure in Polish adult participants (n = 857 was very similar to the one previously found in other samples; AMAS scores correlated moderately in expected directions with state and trait anxiety, self-assessed math achievement and skill as well temperamental traits of emotional reactivity, briskness, endurance and perseverance. Average scores obtained by participants as well as gender differences and correlations with external measures were also similar across cultures. Beyond the cultural comparison, we used path model analyses to show that math anxiety relates to math grades and self-competence when controlling for trait anxiety.The current study shows transcultural validity of math anxiety assessment with the AMAS.

  2. Contextual Text Mining

    Science.gov (United States)

    Mei, Qiaozhu

    2009-01-01

    With the dramatic growth of text information, there is an increasing need for powerful text mining systems that can automatically discover useful knowledge from text. Text is generally associated with all kinds of contextual information. Those contexts can be explicit, such as the time and the location where a blog article is written, and the…

  3. Text-Fabric

    NARCIS (Netherlands)

    Roorda, Dirk

    2016-01-01

    Text-Fabric is a Python3 package for Text plus Annotations. It provides a data model, a text file format, and a binary format for (ancient) text plus (linguistic) annotations. The emphasis of this all is on: data processing; sharing data; and contributing modules. A defining characteristic is that T

  4. Contextual Text Mining

    Science.gov (United States)

    Mei, Qiaozhu

    2009-01-01

    With the dramatic growth of text information, there is an increasing need for powerful text mining systems that can automatically discover useful knowledge from text. Text is generally associated with all kinds of contextual information. Those contexts can be explicit, such as the time and the location where a blog article is written, and the…

  5. Quality text editing

    Directory of Open Access Journals (Sweden)

    Gyöngyi Bujdosó

    2009-10-01

    Full Text Available Text editing is more than the knowledge of word processing techniques. Originally typographers, printers, text editors were the ones qualified to edit texts, which were well structured, legible, easily understandable, clear, and were able to emphasize the coreof the text. Time has changed, and nowadays everyone has access to computers as well as to text editing software and most users believe that having these tools is enough to edit texts. However, text editing requires more skills. Texts appearing either in printed or inelectronic form reveal that most of the users do not realize that they are not qualified to edit and publish their works. Analyzing the ‘text-products’ of the last decade a tendency can clearly be drawn. More and more documents appear, which instead of emphasizingthe subject matter, are lost in the maze of unstructured text slices. Without further thoughts different font types, colors, sizes, strange arrangements of objects, etc. are applied. We present examples with the most common typographic and text editing errors. Our aim is to call the attention to these mistakes and persuadeusers to spend time to educate themselves in text editing. They have to realize that a well-structured text is able to strengthen the effect on the reader, thus the original message will reach the target group.

  6. Abbreviations, acronyms, and initialisms frequently used by Martin Marietta Energy Systems, Inc.. Second edition

    Energy Technology Data Exchange (ETDEWEB)

    Miller, J.T.

    1994-09-01

    Guidelines are given for using abbreviations, acronyms, and initialisms (AAIs) in documents prepared by US Department of Energy facilities managed by Martin Marietta Energy Systems, Inc., in Oak Ridge, Tennessee. The more than 10,000 AAIs listed represent only a small portion of those found in recent documents prepared by contributing editors of the Information Management Services organization of Oak Ridge National Laboratory, the Oak Ridge K-25 Site, and the Oak Ridge Y-12 Plant. This document expands on AAIs listed in the Document Preparation Guide and is intended as a companion document

  7. Attenuation by phentolamine of hypoxia and levcromakalim-induced abbreviation of the cardiac action potential.

    OpenAIRE

    Tweedie, D.; Boachie-Anash, G.; Henderson, C. G.; Kane, K. A.

    1993-01-01

    1. The effects of phentolamine (5-30 microM) and glibenclamide (10 microM) on action potential characteristics were examined in guinea-pig papillary muscle exposed to either hypoxia or levcromakalim (20 microM). 2. The hypoxia-induced abbreviation of action potential duration (APD) and effective refractory period (ERP) were attenuated but not abolished by glibenclamide (10 microM). Hypoxia reduced APD by 24 +/- 2 vs 65 +/- 4% in glibenclamide- and vehicle-treated tissue, respectively. 3. Phen...

  8. Semantic Text Indexing

    Directory of Open Access Journals (Sweden)

    Zbigniew Kaleta

    2014-01-01

    Full Text Available This article presents a specific issue of the semantic analysis of texts in natural language – text indexing and describes one field of its application (web browsing.The main part of this article describes the computer system assigning a set of semantic indexes (similar to keywords to a particular text. The indexing algorithm employs a semantic dictionary to find specific words in a text, that represent a text content. Furthermore it compares two given sets of semantic indexes to determine texts’ similarity (assigning numerical value. The article describes the semantic dictionary – a tool essentialto accomplish this task and its usefulness, main concepts of the algorithm and test results.

  9. Text Mining: (Asynchronous Sequences

    Directory of Open Access Journals (Sweden)

    Sheema Khan

    2014-12-01

    Full Text Available In this paper we tried to correlate text sequences those provides common topics for semantic clues. We propose a two step method for asynchronous text mining. Step one check for the common topics in the sequences and isolates these with their timestamps. Step two takes the topic and tries to give the timestamp of the text document. After multiple repetitions of step two, we could give optimum result.

  10. Text Coherence in Translation

    Science.gov (United States)

    Zheng, Yanping

    2009-01-01

    In the thesis a coherent text is defined as a continuity of senses of the outcome of combining concepts and relations into a network composed of knowledge space centered around main topics. And the author maintains that in order to obtain the coherence of a target language text from a source text during the process of translation, a translator can…

  11. Development of Abbreviated Eight-Item Form of the Penn Verbal Reasoning Test

    Science.gov (United States)

    Bilker, Warren B.; Wierzbicki, Michael R.; Brensinger, Colleen M.; Gur, Raquel E.; Gur, Ruben C.

    2014-01-01

    The ability to reason with language is a highly valued cognitive capacity that correlates with IQ measures and is sensitive to damage in language areas. The Penn Verbal Reasoning Test (PVRT) is a 29-item computerized test for measuring abstract analogical reasoning abilities using language. The full test can take over half an hour to administer, which limits its applicability in large-scale studies. We previously described a procedure for abbreviating a clinical rating scale and a modified procedure for reducing tests with a large number of items. Here we describe the application of the modified method to reducing the number of items in the PVRT to a parsimonious subset of items that accurately predicts the total score. As in our previous reduction studies, a split sample is used for model fitting and validation, with cross-validation to verify results. We find that an 8-item scale predicts the total 29-item score well, achieving a correlation of .9145 for the reduced form for the model fitting sample and .8952 for the validation sample. The results indicate that a drastically abbreviated version, which cuts administration time by more than 70%, can be safely administered as a predictor of PVRT performance. PMID:24577310

  12. Planning Argumentative Texts

    CERN Document Server

    Huang, X

    1994-01-01

    This paper presents \\proverb\\, a text planner for argumentative texts. \\proverb\\'s main feature is that it combines global hierarchical planning and unplanned organization of text with respect to local derivation relations in a complementary way. The former splits the task of presenting a particular proof into subtasks of presenting subproofs. The latter simulates how the next intermediate conclusion to be presented is chosen under the guidance of the local focus.

  13. Mining text data

    CERN Document Server

    Aggarwal, Charu C

    2012-01-01

    Text mining applications have experienced tremendous advances because of web 2.0 and social networking applications. Recent advances in hardware and software technology have lead to a number of unique scenarios where text mining algorithms are learned. ""Mining Text Data"" introduces an important niche in the text analytics field, and is an edited volume contributed by leading international researchers and practitioners focused on social networks & data mining. This book contains a wide swath in topics across social networks & data mining. Each chapter contains a comprehensive survey including

  14. Instant Sublime Text starter

    CERN Document Server

    Haughee, Eric

    2013-01-01

    A starter which teaches the basic tasks to be performed with Sublime Text with the necessary practical examples and screenshots. This book requires only basic knowledge of the Internet and basic familiarity with any one of the three major operating systems, Windows, Linux, or Mac OS X. However, as Sublime Text 2 is primarily a text editor for writing software, many of the topics discussed will be specifically relevant to software development. That being said, the Sublime Text 2 Starter is also suitable for someone without a programming background who may be looking to learn one of the tools of

  15. [A study on the abbreviated form of the Eysenck Personality Questionnaire Revised-Abbreviated (EPQR-A) in a student population].

    Science.gov (United States)

    Bouvard, M; Aulard-Jaccod, J; Pessonneaux, S; Hautekeete, M; Rogé, B

    2010-12-01

    The aim of this paper is to examine the short questionnaire of the Eysenck Personality Questionnaire Revised (the Eysenck Personality Questionnaire Revised-Abbreviated [EPQR-A]) among a student population. University students were invited, in groups, to fill in the forms proposed. Three sites were compared, representing a sample of 346 participants (Chambéry=118 subjects [44 males and 74 females]; Lille=110 subjects [50 males and 60 females] and Toulouse=118 subjects [60 males and 58 females]). The three groups of students have comparable scores on the EPQR-A wherever they live (Chambéry, Lille or Toulouse). Moreover, neither the age nor the gender allowed the detection of differences between subjects. Our sample of students is situated in the range of a "normal" group of students. Regarding the internal consistency coefficients, the French version we used of the neuroticism and the extraversion scales of the EPQR-A obtained a satisfactory result. The internal consistency coefficient of psychoticism was rather low (<70). This unsatisfactory level of internal reliability for the psychoticism is also found in the English version [7]. The four-factor model of the EPQR-A is judged to be an adequate explanation of the data. In the end, self-esteem correlated positively with extraversion and negatively with neuroticism. On the other hand, there is no link between psychoticism and self-esteem.

  16. Linguistics in Text Interpretation

    DEFF Research Database (Denmark)

    Togeby, Ole

    2011-01-01

    A model for how text interpretation proceeds from what is pronounced, through what is said to what is comunicated, and definition of the concepts 'presupposition' and 'implicature'.......A model for how text interpretation proceeds from what is pronounced, through what is said to what is comunicated, and definition of the concepts 'presupposition' and 'implicature'....

  17. Making Sense of Texts

    Science.gov (United States)

    Harper, Rebecca G.

    2014-01-01

    This article addresses the triadic nature regarding meaning construction of texts. Grounded in Rosenblatt's (1995; 1998; 2004) Transactional Theory, research conducted in an undergraduate Language Arts curriculum course revealed that when presented with unfamiliar texts, students used prior experiences, social interactions, and literary strategies…

  18. Systematic text condensation

    DEFF Research Database (Denmark)

    Malterud, Kirsti

    2012-01-01

    To present background, principles, and procedures for a strategy for qualitative analysis called systematic text condensation and discuss this approach compared with related strategies.......To present background, principles, and procedures for a strategy for qualitative analysis called systematic text condensation and discuss this approach compared with related strategies....

  19. Clustering Text Data Streams

    Institute of Scientific and Technical Information of China (English)

    Yu-Bao Liu; Jia-Rong Cai; Jian Yin; Ada Wai-Chee Fu

    2008-01-01

    Clustering text data streams is an important issue in data mining community and has a number of applications such as news group filtering, text crawling, document organization and topic detection and tracing etc. However, most methods are similarity-based approaches and only use the TF*IDF scheme to represent the semantics of text data and often lead to poor clustering quality. Recently, researchers argue that semantic smoothing model is more efficient than the existing TF.IDF scheme for improving text clustering quality. However, the existing semantic smoothing model is not suitable for dynamic text data context. In this paper, we extend the semantic smoothing model into text data streams context firstly. Based on the extended model, we then present two online clustering algorithms OCTS and OCTSM for the clustering of massive text data streams. In both algorithms, we also present a new cluster statistics structure named cluster profile which can capture the semantics of text data streams dynamically and at the same time speed up the clustering process. Some efficient implementations for our algorithms are also given. Finally, we present a series of experimental results illustrating the effectiveness of our technique.

  20. [Traditional midwife texts].

    Science.gov (United States)

    Lundgren, Ingela; Stolt, Carl-Magnus

    2007-01-01

    We report an hermeneutic text study in two early midwife text books. In Louise Bourgeois book from early 17th century the individual caring perspective is more present than in Helena Malhiems book from the middle of the 18th century. In both books, however, non-technological aspects of child birth delivery is more prominent than in books written by doctors.

  1. Extracting Text from Video

    Directory of Open Access Journals (Sweden)

    Jayshree Ghorpade

    2011-09-01

    Full Text Available The text data present in images and video contain certain useful information for automatic annotation,indexing, and structuring of images. However variations of the text due to differences in text style, font, size, orientation, alignment as well as low image contrast and complex background make the problem of automatic text extraction extremely difficult and challenging job. A large number of techniques have been proposed to address this problem and the purpose of this paper is to design algorithms for each phase of extracting text from a video using java libraries and classes. Here first we frame the input video into stream of images using the Java Media Framework (JMF with the input being a real time or a video from the database. Then we apply pre processing algorithms to convert the image to gray scale and remove the disturbances like superimposed lines over the text, discontinuity removal, and dot removal.Then we continue with the algorithms for localization, segmentation and recognition for which we use the neural network pattern matching technique. The performance of our approach is demonstrated by presenting experimental results for a set of static images.

  2. EXTRACTING TEXT FROM VIDEO

    Directory of Open Access Journals (Sweden)

    Jayshree Ghorpade

    2011-06-01

    Full Text Available The text data present in images and video contain certain useful information for automatic annotation,indexing, and structuring of images. However variations of the text due to differences in text style, font, size, orientation, alignment as well as low image contrast and complex background make the problem of automatic text extraction extremely difficult and challenging job. A large number of techniques have been proposed to address this problem and the purpose of this paper is to design algorithms for each phase of extracting text from a video using java libraries and classes. Here first we frame the input video into stream of images using the Java Media Framework (JMF with the input being a real time or a video from the database. Then we apply pre processing algorithms to convert the image to gray scale and remove the disturbances like superimposed lines over the text, discontinuity removal, and dot removal.Then we continue with the algorithms for localization, segmentation and recognition for which we use the neural network pattern matching technique. The performance of our approach is demonstrated by presenting experimental results for a set of static images.

  3. About CABI Full Text

    Institute of Scientific and Technical Information of China (English)

    2012-01-01

    <正>Centre for Agriculture and Bioscience International( CABI) is a not-for-profit international Agricultural Information Institute with headquarters in Britain. It aims to improve people’s lives by providing information and applying scientific expertise to solve problems in agriculture and the environment. CABI Full-text is one of the publishing products of CABI.CABI’s full text repository is growing rapidly and has now been integrated into all our databases including CAB Abstracts,Global Health,our Internet Resources and Abstract Journals. There are currently over 60,000 full text articles available to access. These documents,made possible by agreement with third

  4. About CABI Full Text

    Institute of Scientific and Technical Information of China (English)

    2014-01-01

    <正>Centre for Agriculture and Bioscience International(CABI)is a not-for-profit international Agricultural Information Institute with headquarters in Britain.It aims to improve people’s lives by providing information and applying scientific expertise to solve problems in agriculture and the environment.CABI Full-text is one of the publishing products of CABI.CABI’s full text repository is growing rapidly

  5. Emotion Detection from Text

    CERN Document Server

    Shivhare, Shiv Naresh

    2012-01-01

    Emotion can be expressed in many ways that can be seen such as facial expression and gestures, speech and by written text. Emotion Detection in text documents is essentially a content - based classification problem involving concepts from the domains of Natural Language Processing as well as Machine Learning. In this paper emotion recognition based on textual data and the techniques used in emotion detection are discussed.

  6. Abbreviated bibliography on energy development—A focus on the Rocky Mountain Region

    Science.gov (United States)

    Montag, Jessica M.; Willis, Carolyn J.; Glavin, Levi W.

    2011-01-01

    Energy development of all types continues to grow in the Rocky Mountain Region of the western United States. Federal resource managers increasingly need to balance energy demands, effects on the natural landscape and public perceptions towards these issues. To assist in efficient access to valuable information, this abbreviated bibliography provides citations to relevant information for myriad of issues for which resource managers must contend. The bibliography is organized by seven large topics with various sup-topics: broad energy topics (energy crisis, conservation, supply and demand, etc.); energy sources (fossil fuel, nuclear, renewable, etc.); natural landscape effects (climate change, ecosystem, mitigation, restoration, and reclamation, wildlife, water, etc.); human landscape effects (attitudes and perceptions, economics, community effects, health, Native Americans, etc.); research and technology; international research; and, methods and modeling. A large emphasis is placed on the natural and human landscape effects.

  7. Densitometric properties of rapid manual processing solutions: abbreviated versus complete rapid processing.

    Science.gov (United States)

    Geist, J R; Gleason, M J

    1995-04-01

    Rapid manual processing solutions produce wet, readable radiographs in 1 to 2 min. However, some manufacturers permit time reductions for various processing steps to obtain images even more quickly. Differences in densitometric characteristics and spatial resolution between abbreviated rapid processing (ARP) and complete rapid processing were examined in four rapid manual processing systems on D- and E-speed film. When compared with films processed conventionally in an automatic processor, films processed in rapid manual processing chemistries had more fog and generally lower levels of speed and contrast. ARP radiographs were excessively stained unless they were washed for at least 60 s after fixing. The most severe depreciation in ARP film quality occurred when developing time was reduced by 50%; the complete rapid processing developing time should always be used. E-speed films produced radiographs with comparable densitometric and resolution characteristics to D-speed films for ARP and complete rapid processing techniques while requiring 40% less radiation.

  8. About CABI Full Text

    Institute of Scientific and Technical Information of China (English)

    2013-01-01

    <正>Centre for Agriculture and Bioscience International(CABI)is a not-for-profit international Agricultural Information Institute with headquarters in Britain.It aims to improve people’s lives by providing information and applying scientific expertise to solve problems in agriculture and the environment.CABI Full-text is one of the publishing products of CABI.CABI’s full text repository is growing rapidly and has now been integrated into all our databases including CAB Abstracts,Global Health

  9. About CABI Full Text

    Institute of Scientific and Technical Information of China (English)

    2013-01-01

    <正>Centre for Agriculture and Bioscience International(CABI)is a not-for-profit international Agricultural Information Institute with headquarters in Britain.It aims to improve people’s lives by providing information and applying scientific expertise to solve problems in agriculture and the environment.CABI Full-text is one of the publishing products of CABI.CABI’s full text repository is growing rapidly and has now been integrated into all our databases including CAB Abstracts,Global Health,our Internet Resources and Jour-

  10. Performance of an Abbreviated Version of the Lubben Social Network Scale among Three European Community-Dwelling Older Adult Populations

    Science.gov (United States)

    Lubben, James; Blozik, Eva; Gillmann, Gerhard; Iliffe, Steve; von Renteln-Kruse, Wolfgang; Beck, John C.; Stuck, Andreas E.

    2006-01-01

    Purpose: There is a need for valid and reliable short scales that can be used to assess social networks and social supports and to screen for social isolation in older persons. Design and Methods: The present study is a cross-national and cross-cultural evaluation of the performance of an abbreviated version of the Lubben Social Network Scale…

  11. Matching Element Symbols with State Abbreviations: A Fun Activity for Browsing the Periodic Table of Chemical Elements

    Science.gov (United States)

    Woelk, Klaus

    2009-01-01

    A classroom activity is presented in which students are challenged to find matches between the United States two-letter postal abbreviations for states and chemical element symbols. The activity aims to lessen negative apprehensions students might have when the periodic table of the elements with its more than 100 combinations of letters is first…

  12. Matching Element Symbols with State Abbreviations: A Fun Activity for Browsing the Periodic Table of Chemical Elements

    Science.gov (United States)

    Woelk, Klaus

    2009-01-01

    A classroom activity is presented in which students are challenged to find matches between the United States two-letter postal abbreviations for states and chemical element symbols. The activity aims to lessen negative apprehensions students might have when the periodic table of the elements with its more than 100 combinations of letters is first…

  13. 21 CFR 314.430 - Availability for public disclosure of data and information in an application or abbreviated...

    Science.gov (United States)

    2010-04-01

    ... 21 Food and Drugs 5 2010-04-01 2010-04-01 false Availability for public disclosure of data and... APPROVAL TO MARKET A NEW DRUG Miscellaneous Provisions § 314.430 Availability for public disclosure of data... acknowledged, no data or information in the application or abbreviated application is available for...

  14. The Revised Junior Eysenck Personality Questionnaire (JEPQ-R): Dutch replications of the full length, short, and abbreviated forms

    NARCIS (Netherlands)

    Scholte, R.H.J.; Bruyn, E.E.J. De

    2001-01-01

    This study examines the full-length, short and abbreviated forms of the Revised Junior Eysenck Personality Questionnaire (JEPQ-R) in a Dutch sample of 215 boys and 207 girls, aged 12–14. The reliability and concurrent validity of the scales of the full-length form (JEPQ-R, 81 items), short form (JEP

  15. 78 FR 60292 - Draft Guidance for Industry on Abbreviated New Drug Application Submissions-Refuse-to-Receive...

    Science.gov (United States)

    2013-10-01

    ... HUMAN SERVICES Food and Drug Administration Draft Guidance for Industry on Abbreviated New Drug Application Submissions--Refuse-to-Receive Standards; Availability AGENCY: Food and Drug Administration, HHS. ACTION: Notice. SUMMARY: The Food and Drug Administration (FDA) is announcing the availability of a...

  16. Development of an Abbreviated Social Phobia and Anxiety Inventory (SPAI) Using Item Response Theory: The SPAI-23

    Science.gov (United States)

    Roberson-Nay, Roxann; Strong, David R.; Nay, William T.; Beidel, Deborah C.; Turner, Samuel M.

    2007-01-01

    An abbreviated version of the Social Phobia and Anxiety Inventory (SPAI) was developed using methods based in nonparametric item response theory. Participants included a nonclinical sample of 1,482 undergraduates (52% female, mean age = 19.4 years) as well as a clinical sample of 105 individuals (56% female, mean age = 36.4 years) diagnosed with…

  17. Dictionaries for text production

    DEFF Research Database (Denmark)

    Fuertes-Olivera, Pedro; Bergenholtz, Henning

    2018-01-01

    and free online dictionaries. The Diccionario español para la producción de textos is an example of a general text production dictionary that makes use of internet technologies, is based on a lexicographic theory, contains all the lexicographic data that users need in a production situation, and aims...

  18. Text as Image.

    Science.gov (United States)

    Woal, Michael; Corn, Marcia Lynn

    As electronically mediated communication becomes more prevalent, print is regaining the original pictorial qualities which graphemes (written signs) lost when primitive pictographs (or picture writing) and ideographs (simplified graphemes used to communicate ideas as well as to represent objects) evolved into first written, then printed, texts of…

  19. About CABI Full Text

    Institute of Scientific and Technical Information of China (English)

    2013-01-01

    <正>Centre for Agriculture and Bioscience International( CABI) is a not-for-profit international Agricultural Information Institute with headquarters in Britain. It aims to improve people’s lives by providing information and applying scientific expertise to solve problems in agriculture and the environment. CABI Full-text is one of the publishing products of CABI.

  20. About CABI Full Text

    Institute of Scientific and Technical Information of China (English)

    2013-01-01

    <正>Centre for Agriculture and Bioscience International(CABI) is a not-for-profit international Agricultural Information Institute with headquarters in Britain. It aims to improve people’s lives by providing information and applying scientific expertise to solve problems in agriculture and the environment. CABI Full-text is one of the publishing products of CABI.

  1. About CABI Full Text

    Institute of Scientific and Technical Information of China (English)

    2011-01-01

    <正>Centre for Agriculture and Bioscience International(CABI)is a not-for-profit international Agricultural Information Institute with headquarters in Britain. It aims to improve people s lives by providing information and applying scientific expertise to solve problems in agriculture and the environment. CABI Full-text is one of the publishing products of CABI.

  2. Text analysis and computers

    OpenAIRE

    1995-01-01

    Content: Erhard Mergenthaler: Computer-assisted content analysis (3-32); Udo Kelle: Computer-aided qualitative data analysis: an overview (33-63); Christian Mair: Machine-readable text corpora and the linguistic description of danguages (64-75); Jürgen Krause: Principles of content analysis for information retrieval systems (76-99); Conference Abstracts (100-131).

  3. The Emar Lexical Texts

    NARCIS (Netherlands)

    Gantzert, Merijn

    2011-01-01

    This four-part work provides a philological analysis and a theoretical interpretation of the cuneiform lexical texts found in the Late Bronze Age city of Emar, in present-day Syria. These word and sign lists, commonly dated to around 1100 BC, were almost all found in the archive of a single school.

  4. E-text

    DEFF Research Database (Denmark)

    Finnemann, Niels Ole

    2017-01-01

    modality, which is both an independent modality and a container in which other modalities may be contained. In the first case, the notion of electronic text would be paradigmatically formed around the e-book, conceived as a digital copy a printed book, but now produced as a deliberately closed work. Even...

  5. New mathematical cuneiform texts

    CERN Document Server

    Friberg, Jöran

    2016-01-01

    This monograph presents in great detail a large number of both unpublished and previously published Babylonian mathematical texts in the cuneiform script. It is a continuation of the work A Remarkable Collection of Babylonian Mathematical Texts (Springer 2007) written by Jöran Friberg, the leading expert on Babylonian mathematics. Focussing on the big picture, Friberg explores in this book several Late Babylonian arithmetical and metro-mathematical table texts from the sites of Babylon, Uruk and Sippar, collections of mathematical exercises from four Old Babylonian sites, as well as a new text from Early Dynastic/Early Sargonic Umma, which is the oldest known collection of mathematical exercises. A table of reciprocals from the end of the third millennium BC, differing radically from well-documented but younger tables of reciprocals from the Neo-Sumerian and Old-Babylonian periods, as well as a fragment of a Neo-Sumerian clay tablet showing a new type of a labyrinth are also discussed. The material is presen...

  6. Polymorphous Perversity in Texts

    Science.gov (United States)

    Johnson-Eilola, Johndan

    2012-01-01

    Here's the tricky part: If we teach ourselves and our students that texts are made to be broken apart, remixed, remade, do we lose the polymorphous perversity that brought us pleasure in the first place? Does the pleasure of transgression evaporate when the borders are opened?

  7. Texts On-Line.

    Science.gov (United States)

    Thomas, Jean-Jacques

    1993-01-01

    Maintains that the study of signs is divided between those scholars who use the Saussurian binary sign (semiology) and those who prefer the Peirce tripartite sign (semiotics). Concludes that neither the Saussurian nor Peircian analysis methods can produce a semiotic interpretation based on a hierarchy of the text's various components. (CFR)

  8. Summarizing Expository Texts

    Science.gov (United States)

    Westby, Carol; Culatta, Barbara; Lawrence, Barbara; Hall-Kenyon, Kendra

    2010-01-01

    Purpose: This article reviews the literature on students' developing skills in summarizing expository texts and describes strategies for evaluating students' expository summaries. Evaluation outcomes are presented for a professional development project aimed at helping teachers develop new techniques for teaching summarization. Methods: Strategies…

  9. Texts On-Line.

    Science.gov (United States)

    Thomas, Jean-Jacques

    1993-01-01

    Maintains that the study of signs is divided between those scholars who use the Saussurian binary sign (semiology) and those who prefer the Peirce tripartite sign (semiotics). Concludes that neither the Saussurian nor Peircian analysis methods can produce a semiotic interpretation based on a hierarchy of the text's various components. (CFR)

  10. Use of global context for handling noisy names in discussion texts of a homeopathy discussion forum

    Directory of Open Access Journals (Sweden)

    Mukta Majumder

    2014-03-01

    Full Text Available The task of identifying named entities from the discussion texts in Web forums faces the challenge of noisy names. As the names are often misspelled or abbreviated, the conventional techniques have failed to detect the noisy names properly. In this paper we propose a global context based framework for handling the noisy names. The framework is tested on a named entity recognition system designed to identify the names from the discussion texts in a homeopathy diagnosis discussion forum. The proposed global context-based framework is found to be effective in improving the accuracy of the named entity recognition system.

  11. Stemming and N-gram matching for term conflation in Turkish texts

    Directory of Open Access Journals (Sweden)

    F. Çuna Ekmekçioglu

    1996-01-01

    Full Text Available One of the main problems involved in the use of free text for indexing and retrieval is the variation in word forms that is likely to be encountered. The most common type of variations are spelling errors, alternative spellings, multi-word concepts, transliteration, affixes and abbreviations. One way to alleviate this problem is to use a conflation algorithm, a computational procedure that is designed to bring together words that are semantically related, and to reduce them to a single form for retrieval purposes. In this paper, we discuss the use of conflation techniques for Turkish text databases.

  12. Wisdom Texts and Philosophy

    Directory of Open Access Journals (Sweden)

    Anthony Preus

    2013-11-01

    Full Text Available The last essay of this issue concerns to a more "technical" subject: in many ancient cultures, literary monuments are mainly "wisdom literature". In these early works. Philosophy and Literature are more closely related than in many contemporary approaches. The author here tries to sketch the relationships between the ancient wisdom literatures of Egipt, Greece and Israel, and to show how this literary genre precedes "philosophy".

  13. Weaving with text

    DEFF Research Database (Denmark)

    Hagedorn-Rasmussen, Peter

    This paper explores how a school principal by means of practical authorship creates reservoirs of language that provide a possible context for collective sensemaking. The paper draws upon a field study in which a school principal, and his managerial team, was shadowed in a period of intensive cha...... changes. The paper explores how the manager weaves with text, extracted from stakeholders, administration, politicians, employees, public discourse etc., as a means of creating a new fabric, a texture, of diverse perspectives that aims for collective sensemaking....

  14. Metacomprehension of text material.

    Science.gov (United States)

    Maki, R H; Berry, S L

    1984-10-01

    Subjects' abilities to predict future multiple-choice test performance after reading sections of text were investigated in two experiments. In Experiment 1, subjects who scored above median test performance showed some accuracy in their predictions of that test performance. They gave higher mean ratings to material related to correct than to incorrect test answers. Subjects who scored below median test performance did not show this prediction accuracy. The retention interval between reading and the test was manipulated in Experiment 2. Subjects who were tested after at least a 24-hr delay showed results identical to those of Experiment 1. However, when subjects were tested immediately after reading, subjects above and below median test performance gave accurate predictions for the first immediate test. In contrast, both types of subjects gave inaccurate predictions for the second immediate test. Structural variables, such as length, serial position, and hierarchical level of the sections of text were related to subjects' predictions. These variables, in general, were not related to test performance, although the predictions were related to test performance in the conditions described above.

  15. Interconnectedness und digitale Texte

    Directory of Open Access Journals (Sweden)

    Detlev Doherr

    2013-04-01

    Full Text Available Zusammenfassung Die multimedialen Informationsdienste im Internet werden immer umfangreicher und umfassender, wobei auch die nur in gedruckter Form vorliegenden Dokumente von den Bibliotheken digitalisiert und ins Netz gestellt werden. Über Online-Dokumentenverwaltungen oder Suchmaschinen können diese Dokumente gefunden und dann in gängigen Formaten wie z.B. PDF bereitgestellt werden. Dieser Artikel beleuchtet die Funktionsweise der Humboldt Digital Library, die seit mehr als zehn Jahren Dokumente von Alexander von Humboldt in englischer Übersetzung im Web als HDL (Humboldt Digital Library kostenfrei zur Verfügung stellt. Anders als eine digitale Bibliothek werden dabei allerdings nicht nur digitalisierte Dokumente als Scan oder PDF bereitgestellt, sondern der Text als solcher und in vernetzter Form verfügbar gemacht. Das System gleicht damit eher einem Informationssystem als einer digitalen Bibliothek, was sich auch in den verfügbaren Funktionen zur Auffindung von Texten in unterschiedlichen Versionen und Übersetzungen, Vergleichen von Absätzen verschiedener Dokumente oder der Darstellung von Bilden in ihrem Kontext widerspiegelt. Die Entwicklung von dynamischen Hyperlinks auf der Basis der einzelnen Textabsätze der Humboldt‘schen Werke in Form von Media Assets ermöglicht eine Nutzung der Programmierschnittstelle von Google Maps zur geographischen wie auch textinhaltlichen Navigation. Über den Service einer digitalen Bibliothek hinausgehend, bietet die HDL den Prototypen eines mehrdimensionalen Informationssystems, das mit dynamischen Strukturen arbeitet und umfangreiche thematische Auswertungen und Vergleiche ermöglicht. Summary The multimedia information services on Internet are becoming more and more comprehensive, even the printed documents are digitized and republished as digital Web documents by the libraries. Those digital files can be found by search engines or management tools and provided as files in usual formats as

  16. Indonesian Text-To-Speech System Using Diphone Concatenative Synthesis

    Directory of Open Access Journals (Sweden)

    Sutarman

    2015-02-01

    Full Text Available In this paper, we describe the design and develop a database of Indonesian diphone synthesis using speech segment of recorded voice to be converted from text to speech and save it as audio file like WAV or MP3. In designing and develop a database of Indonesian diphone there are several steps to follow; First, developed Diphone database includes: create a list of sample of words consisting of diphones organized by prioritizing looking diphone located in the middle of a word if not at the beginning or end; recording the samples of words by segmentation. ;create diphones made with a tool Diphone Studio 1.3. Second, develop system using Microsoft Visual Delphi 6.0, includes: the conversion system from the input of numbers, acronyms, words, and sentences into representations diphone. There are two kinds of conversion (process alleged in analyzing the Indonesian text-to-speech system. One is to convert the text to be sounded to phonem and two, to convert the phonem to speech. Method used in this research is called Diphone Concatenative synthesis, in which recorded sound segments are collected. Every segment consists of a diphone (2 phonems. This synthesizer may produce voice with high level of naturalness. The Indonesian Text to Speech system can differentiate special phonemes like in ‘Beda’ and ‘Bedak’ but sample of other spesific words is necessary to put into the system. This Indonesia TTS system can handle texts with abbreviation, there is the facility to add such words.

  17. The Modified Abbreviated Math Anxiety Scale: A Valid and Reliable Instrument for Use with Children.

    Science.gov (United States)

    Carey, Emma; Hill, Francesca; Devine, Amy; Szűcs, Dénes

    2017-01-01

    Mathematics anxiety (MA) can be observed in children from primary school age into the teenage years and adulthood, but many MA rating scales are only suitable for use with adults or older adolescents. We have adapted one such rating scale, the Abbreviated Math Anxiety Scale (AMAS), to be used with British children aged 8-13. In this study, we assess the scale's reliability, factor structure, and divergent validity. The modified AMAS (mAMAS) was administered to a very large (n = 1746) cohort of British children and adolescents. This large sample size meant that as well as conducting confirmatory factor analysis on the scale itself, we were also able to split the sample to conduct exploratory and confirmatory factor analysis of items from the mAMAS alongside items from child test anxiety and general anxiety rating scales. Factor analysis of the mAMAS confirmed that it has the same underlying factor structure as the original AMAS, with subscales measuring anxiety about Learning and Evaluation in math. Furthermore, both exploratory and confirmatory factor analysis of the mAMAS alongside scales measuring test anxiety and general anxiety showed that mAMAS items cluster onto one factor (perceived to represent MA). The mAMAS provides a valid and reliable scale for measuring MA in children and adolescents, from a younger age than is possible with the original AMAS. Results from this study also suggest that MA is truly a unique construct, separate from both test anxiety and general anxiety, even in childhood.

  18. Abbreviated New Drug Applications and 505(b)(2) Applications. Final rule.

    Science.gov (United States)

    2016-10-06

    The Food and Drug Administration (FDA, the Agency, or we) is issuing a final rule to implement Title XI of the Medicare Prescription Drug, Improvement, and Modernization Act of 2003 (MMA), which amended provisions of the Federal Food, Drug, and Cosmetic Act (the FD&C Act) that govern the approval of 505(b)(2) applications and abbreviated new drug applications (ANDAs). This final rule implements portions of Title XI of the MMA that pertain to provision of notice to each patent owner and the new drug application (NDA) holder of certain patent certifications made by applicants submitting 505(b)(2) applications or ANDAs; the availability of 30-month stays of approval on 505(b)(2) applications and ANDAs that are otherwise ready to be approved; submission of amendments and supplements to 505(b)(2) applications and ANDAs; and the types of bioavailability and bioequivalence data that can be used to support these applications. This final rule also amends certain regulations regarding 505(b)(2) applications and ANDAs to facilitate compliance with and efficient enforcement of the FD&C Act.

  19. Effectiveness of abbreviated CBT for insomnia in psychiatric outpatients: sleep and depression outcomes.

    Science.gov (United States)

    Wagley, J Nile; Rybarczyk, Bruce; Nay, William T; Danish, Steven; Lund, Hannah G

    2013-10-01

    To test the efficacy of cogntive-behavioral therapy for insomnia (CBT-I) as a supplement treatment for psychiatric outpatients. Comorbid insomnia is prevalent among individuals with varied psychiatric disorders and evidence indicates that CBT-I may be effective for reducing insomnia and other psychiatric symptoms. The present study randomly assigned 30 psychiatric outpatients (mean duration of treatment = 3.6 years) with low sleep quality and residual depressive symptoms to two sessions of CBT-I or a treatment as usual control group. Assessment included the Pittsburgh Sleep Quality Index (PSQI) for insomnia and the Patient Health Questionnaire (PHQ-9) for depression at pretreatment and 4 and 8 weeks posttreatment. Patients who received CBT-I demonstrated within group changes in PSQI and the PHQ-9 scores at both 4 and 8 weeks posttreatment, but did not show between-group differences. Additionally, 38% of the treatment participants achieved normal sleep at follow-up compared with none in the control condition. This study provides preliminary evidence that abbreviated behavioral treatment has beneficial effects on residual insomnia and depression in long-term psychiatric outpatients. © 2012 Wiley Periodicals, Inc.

  20. Development of an abbreviated version of the delirium motor subtyping scale (DMSS-4).

    Science.gov (United States)

    Meagher, D; Adamis, D; Leonard, M; Trzepacz, P; Grover, S; Jabbar, F; Meehan, K; O'Connor, M; Cronin, C; Reynolds, P; Fitzgerald, J; O'Regan, N; Timmons, S; Slor, C; de Jonghe, J; de Jonghe, A; van Munster, B C; de Rooij, S E; Maclullich, A

    2014-04-01

    Delirium is a common neuropsychiatric syndrome with considerable heterogeneity in clinical profile. Identification of clinical subtypes can allow for more targeted clinical and research efforts. We sought to develop a brief method for clinical subtyping in clinical and research settings. A multi-site database, including motor symptom assessments conducted in 487 patients from palliative care, adult and old age consultation-liaison psychiatry services was used to document motor activity disturbances as per the Delirium Motor Checklist (DMC). Latent class analysis (LCA) was used to identify the class structure underpinning DMC data and also items for a brief subtyping scale. The concordance of the abbreviated scale was then compared with the original Delirium Motor Subtype Scale (DMSS) in 375 patients having delirium as per the American Psychiatric Association's Diagnostic and Statistical Manual (4th edition) criteria. Latent class analysis identified four classes that corresponded closely with the four recognized motor subtypes of delirium. Further, LCA of items (n = 15) that loaded >60% to the model identified four features that reliably identified the classes/subtypes, and these were combined as a brief motor subtyping scale (DMSS-4). There was good concordance for subtype attribution between the original DMSS and the DMSS-4 (κ = 0.63). The DMSS-4 allows for rapid assessment of clinical subtypes in delirium and has high concordance with the longer and well-validated DMSS. More consistent clinical subtyping in delirium can facilitate better delirium management and more focused research effort.

  1. Increasing body mass index portends abbreviated survival following pancreatoduodenectomy for pancreatic adenocarcinoma.

    Science.gov (United States)

    Mathur, Abhishek; Luberice, Kenneth; Paul, Harold; Franka, Co; Rosemurgy, Alexander

    2015-06-01

    Body mass index (BMI), a common surrogate marker for grading obesity, does not differentiate between metabolically active visceral fat and the relatively inert subcutaneous fat. We aim to determine the utility of BMI as a prognostic marker for the impact of obesity on outcomes and survival following pancreatoduodenectomy for pancreatic adenocarcinoma. From a database of over 1,000 patients who had undergone pancreatoduodenectomy, 228 patients with a diagnosis of pancreatic adenocarcinoma were identified. Demographic data including BMI and perioperative parameters-operative time, estimated blood loss, length of stay, survival, nodal status, and American Joint Committee on Cancer stage-were obtained. Data are presented as median. One hundred ninety-two patients had a BMI less than or equal to 29 and 36 patients had a BMI greater than or equal to 30 (24 vs. 34, P obese patients had positive nodes (69% vs. 62%, P pancreatic adenocarcinoma undergoing pancreatoduodenectomy, obesity does not impact operative complexity or length of stay but results in a shortened survival. Therefore, we conclude that BMI is an important prognostic marker that portends an abbreviated survival following pancreatoduodenectomy for pancreatic adenocarcinoma. Copyright © 2015 Elsevier Inc. All rights reserved.

  2. Validity and Reliability of the Abbreviated Barratt Impulsiveness Scale in Spanish (BIS-15S)*

    Science.gov (United States)

    Orozco-Cabal, Luis; Rodríguez, Maritza; Herin, David V.; Gempeler, Juanita; Uribe, Miguel

    2010-01-01

    Objective This study determined the validity and reliability of a new, abbreviated version of the Spanish Barratt Impulsiveness Scale (BIS-15S) in Colombian subjects. Method The BIS-15S was tested in non-clinical (n=283) and clinical (n=164) native Spanish-speakers. Intra-scale reliability was calculated using Cronbach’s α, and test-retest reliability was measured with Pearson correlations. Psychometric properties were determined using standard statistics. A factor analysis was performed to determine BIS-15S factor structure. Results 447 subjects participated in the study. Clinical subjects were older and more educated compared to non-clinical subjects. Impulsivity scores were normally distributed in each group. BIS-15S total, motor, non-planning and attention scores were significantly lower in non-clinical vs. clinical subjects. Subjects with substance-related disorders had the highest BIS-15S total scores, followed by subjects with bipolar disorders and bulimia nervosa/binge eating. Internal consistency was 0.793 and test-retest reliability was 0.80. Factor analysis confirmed a three-factor structure (attention, motor, non-planning) accounting for 47.87% of the total variance in BIS-15S total scores. Conclusions The BIS-15S is a valid and reliable self-report measure of impulsivity in this population. Further research is needed to determine additional components of impulsivity not investigated by this measure. PMID:21152412

  3. Development of an abbreviated Career Indecision Profile-65 using item response theory: The CIP-Short.

    Science.gov (United States)

    Xu, Hui; Tracey, Terence J G

    2017-03-01

    The current study developed an abbreviated version of the Career Indecision Profile-65 (CIP-65; Hacker, Carr, Abrams, & Brown, 2013) by using item response theory. In order to improve the efficiency of the CIP-65 in measuring career indecision, the individual item performance of the CIP-65 was examined with respect to the ordering of response occurrence and gender differential item functioning. The best 5 items of each scale of the CIP-65 (i.e., neuroticism/negative affectivity, choice/commitment anxiety, lack of readiness, and interpersonal conflicts) were retained in the CIP-Short using a sample of 588 college students. A validation sample (N = 174) supported the reliability and structural validity of the CIP-Short. The convergent and divergent validity of the CIP-Short was additionally supported in the findings of a hypothesized differential relational pattern in a separate sample (N = 360). While the current study supported the CIP-Short being a sound brief measure of career indecision, the limitations of this study and suggestions for future research were discussed as well. (PsycINFO Database Record

  4. Proposal for a revised taxonomy of the family Filoviridae: classification, names of taxa and viruses, and virus abbreviations

    OpenAIRE

    Jens H. Kuhn; Becker, Stephan (Prof. Dr.); Ebihara, Hideki; Geisbert, Thomas W.; Johnson, Karl M.; Kawaoka, Yoshihiro; Lipkin, W. Ian; Negredo, Ana I.; Netesov, Sergey V.; Stuart T Nichol; Palacios, Gustavo; Peters, Clarence J.; Tenorio, Antonio; Volchkov, Viktor E.; Jahrling, Peter B.

    2010-01-01

    The taxonomy of the family Filoviridae (marburgviruses and ebolaviruses) has changed several times since the discovery of its members, resulting in a plethora of species and virus names and abbreviations. The current taxonomy has only been partially accepted by most laboratory virologists. Confusion likely arose for several reasons: species names that consist of several words or which (should) contain diacritical marks, the current orthographic identity of species and virus names, and the sim...

  5. THE SPECIFICS OF THE TRANSLATION OF SOCIAL AND POLITICAL TEXTS

    Directory of Open Access Journals (Sweden)

    N. D. PASHKOWSKAYA

    2015-01-01

    Full Text Available The article is about the aspect of translation in the process of foreign language teaching. The main attention is paid to translation of social and political texts such as statements, addresses and speeches of state, party and public figures, all sorts of articles and publications of international, governmental and non-governmental organizations, the number of which is very large in mass media. They contain information of the leading economists on the activities of international financial institutions, reflect the real world of business, and offer a wide range of views and opinions on political events. Socio-political texts in Russian, as well as in foreign languages, not only provide information about various events or problems, but also make special impact on readers. Particular attention is paid to the lexical units which are used in translation of such texts. Such lexical units reflect the evolution of society, define public relationships. They are in continual «movement», replacing or supplementing the existing notation systems by words and abbreviations which reflect the emergence of new facts and concepts in the relevant field of social life. Knowledge of this kind of vocabulary enables students to solve practical problems of translation, summarization and annotation of this kind of literature.

  6. Abbreviated injury scale unification: the case for a unified injury system for global use.

    Science.gov (United States)

    Garthe, E; States, J D; Mango, N K

    1999-08-01

    The Abbreviated Injury Scale (AIS), developed by the Association for the Advancement of Automotive Medicine is the most widely used anatomic injury severity scale in the world (Association for the Advancement of Automotive Medicine. The Abbreviated Injury Scale; 1985 and 1990 revisions. Des Plaines, IL: Association for the Advancement of Automotive Medicine). However, different user groups have modified the AIS system to fit their needs, and these modifications prevent ready comparison and trending of data collected in these systems in the United States and throughout the world. The United States currently has five AIS-based severity systems and two AIS-based impairment systems in use, with additional revisions forthcoming. Other modified AIS systems are known to be in use in the United Kingdom and Japan. The data collected in these systems cannot be accurately combined or compared without re-coding or the use of complex "mapping" methodologies. Furthermore, the expanding use of data linked from multiple databases to answer complex medical, engineering, or policy issues emphasizes the need for coordination between severity and other injury systems. Linkage of state-wide motor vehicle crash data with data from hospital injury classification systems, mortality files, trauma registry, and national crash databases brings into immediate focus the lack of well defined relationships between the severity coding systems and these other widely used injury systems (Mango N, Garthe E. SAE Congress, February, 1998; Johnson, S, Walker, J. NHTSA Technical Report. DOT HS 808 338, Washington, DC: NHTSA; January, 1996). With the expanding use of linked data in state and national policy decisions, it is vital that consistent standards for injury descriptions, severities, and impairments be available for clinical, engineering, and policy users. This paper compares five anatomic severity systems and two impairment systems in terms of purpose, code structure, and use and discusses the

  7. Teaching Text Structure: Examining the Affordances of Children's Informational Texts

    Science.gov (United States)

    Jones, Cindy D.; Clark, Sarah K.; Reutzel, D. Ray

    2016-01-01

    This study investigated the affordances of informational texts to serve as model texts for teaching text structure to elementary school children. Content analysis of a random sampling of children's informational texts from top publishers was conducted on text structure organization and on the inclusion of text features as signals of text…

  8. Abbreviated MRI protocols for detecting breast cancer in women with dense breasts

    Energy Technology Data Exchange (ETDEWEB)

    Chen, Shung Qing; Huang, Min; Shen, Yu Ying; Liu, Chen Lu; Xu, Chuan Xiao [The Affiliated Suzhou Hospital, Nanjing Medical University, Suzhou (China)

    2017-06-15

    To evaluate the validity of two abbreviated protocols (AP) of MRI in breast cancer screening of dense breast tissue. This was a retrospective study in 356 participants with dense breast tissue and negative mammography results. The study was approved by the Nanjing Medical University Ethics Committee. Patients were imaged with a full diagnostic protocol (FDP) of MRI. Two APs (AP-1 consisting of the first post-contrast subtracted [FAST] and maximum-intensity projection [MIP] images, and AP-2 consisting of AP-1 combined with diffusion-weighted imaging [DWI]) and FDP images were analyzed separately, and the sensitivities and specificities of breast cancer detection were calculated. Of the 356 women, 67 lesions were detected in 67 women (18.8%) by standard MR protocol, and histological examination revealed 14 malignant lesions and 53 benign lesions. The average interpretation time of AP-1 and AP-2 were 37 seconds and 54 seconds, respectively, while the average interpretation time of the FDP was 3 minutes and 25 seconds. The sensitivities of the AP-1, AP-2, and FDP were 92.9, 100, and 100%, respectively, and the specificities of the three MR protocols were 86.5, 95.0, and 96.8%, respectively. There was no significant difference among the three MR protocols in the diagnosis of breast cancer (p > 0.05). However, the specificity of AP-1 was significantly lower than that of AP-2 (p = 0.031) and FDP (p = 0.035), while there was no difference between AP-2 and FDP (p > 0.05). The AP may be efficient in the breast cancer screening of dense breast tissue. FAST and MIP images combined with DWI of MRI are helpful to improve the specificity of breast cancer detection.

  9. The Modified Abbreviated Math Anxiety Scale: A Valid and Reliable Instrument for Use with Children

    Science.gov (United States)

    Carey, Emma; Hill, Francesca; Devine, Amy; Szűcs, Dénes

    2017-01-01

    Mathematics anxiety (MA) can be observed in children from primary school age into the teenage years and adulthood, but many MA rating scales are only suitable for use with adults or older adolescents. We have adapted one such rating scale, the Abbreviated Math Anxiety Scale (AMAS), to be used with British children aged 8–13. In this study, we assess the scale's reliability, factor structure, and divergent validity. The modified AMAS (mAMAS) was administered to a very large (n = 1746) cohort of British children and adolescents. This large sample size meant that as well as conducting confirmatory factor analysis on the scale itself, we were also able to split the sample to conduct exploratory and confirmatory factor analysis of items from the mAMAS alongside items from child test anxiety and general anxiety rating scales. Factor analysis of the mAMAS confirmed that it has the same underlying factor structure as the original AMAS, with subscales measuring anxiety about Learning and Evaluation in math. Furthermore, both exploratory and confirmatory factor analysis of the mAMAS alongside scales measuring test anxiety and general anxiety showed that mAMAS items cluster onto one factor (perceived to represent MA). The mAMAS provides a valid and reliable scale for measuring MA in children and adolescents, from a younger age than is possible with the original AMAS. Results from this study also suggest that MA is truly a unique construct, separate from both test anxiety and general anxiety, even in childhood. PMID:28154542

  10. AJI BLEGODAWA TEXT IN THE PERSPECTIVE OF FUNCTIONAL SYSTEMIC LINGUISTICTS

    Directory of Open Access Journals (Sweden)

    I Wayan Rasna

    2012-11-01

    Full Text Available This research give answers to the following five problems; they are (1 the lexico grammar of AjiBlegodawa Text (Text Aji Blegodawa; hereon abbreviated to TAB; (2 the context of situation (registerand the context of culture (genre of TAB; (4 the ideational, interpersonal and textual meanings of TAB;and (5 the values in TAB. Note taking method was employed for collecting the data needed for thelexicogrammar, the context of situation, the functions, meanings, and the values. The data needed for thecultural context were collected by note taking, questionnaire, observation and structured interview.Structured interview, in which eleven informants were interviewed, was also employed for collecting dataneeded for the values. Functional system linguistics (hereon abbreviated to FSL introduced by Hallidaywas employed to analyze the data (Halliday, 1985: 2004; 2005; (Halliday and Maththiessen, 2004.The findings show that the frequencies of the processes in the text are as follows: the materialprocess appears 674 times (52.29%; the relational process takes place 233 times (18.08% and the mentalprocess occurs 177 (13.73%.With regard to circumstances, the circumstance of location is the most dominant followed by thecircumstance of manner. From the context of situation, it can be identified that the field is black magic;from the participants, it can be identified that the main participant is Blegodawa. The mode issimultaneously used to form the configuration of meaning. It can be revealed that the main participantsupported by the supporting participants kill the victim. Viewed from the cultural point of view, thecultural norms referred to in TAB destroy life. The linguistic functionsin TAB are: 1 ideational function which includes belief, the tradition of the magic world, taboo,historical relationship and ritual; 2 interpersonal function which includes interactive function and selfexpressive function and 3 textual function. The meanings in TAB include

  11. Improving Feature Representation Based on a Neural Network for Author Profiling in Social Media Texts

    Directory of Open Access Journals (Sweden)

    Helena Gómez-Adorno

    2016-01-01

    Full Text Available We introduce a lexical resource for preprocessing social media data. We show that a neural network-based feature representation is enhanced by using this resource. We conducted experiments on the PAN 2015 and PAN 2016 author profiling corpora and obtained better results when performing the data preprocessing using the developed lexical resource. The resource includes dictionaries of slang words, contractions, abbreviations, and emoticons commonly used in social media. Each of the dictionaries was built for the English, Spanish, Dutch, and Italian languages. The resource is freely available.

  12. Important Text Characteristics for Early-Grades Text Complexity

    Science.gov (United States)

    Fitzgerald, Jill; Elmore, Jeff; Koons, Heather; Hiebert, Elfrieda H.; Bowen, Kimberly; Sanford-Moore, Eleanor E.; Stenner, A. Jackson

    2015-01-01

    The Common Core set a standard for all children to read increasingly complex texts throughout schooling. The purpose of the present study was to explore text characteristics specifically in relation to early-grades text complexity. Three hundred fifty primary-grades texts were selected and digitized. Twenty-two text characteristics were identified…

  13. Improving text recognition by distinguishing scene and overlay text

    Science.gov (United States)

    Quehl, Bernhard; Yang, Haojin; Sack, Harald

    2015-02-01

    Video texts are closely related to the content of a video. They provide a valuable source for indexing and interpretation of video data. Text detection and recognition task in images or videos typically distinguished between overlay and scene text. Overlay text is artificially superimposed on the image at the time of editing and scene text is text captured by the recording system. Typically, OCR systems are specialized on one kind of text type. However, in video images both types of text can be found. In this paper, we propose a method to automatically distinguish between overlay and scene text to dynamically control and optimize post processing steps following text detection. Based on a feature combination a Support Vector Machine (SVM) is trained to classify scene and overlay text. We show how this distinction in overlay and scene text improves the word recognition rate. Accuracy of the proposed methods has been evaluated by using publicly available test data sets.

  14. Retinal locus for scanning text.

    Science.gov (United States)

    Timberlake, George T; Sharma, Manoj K; Grose, Susan A; Maino, Joseph H

    2006-01-01

    A method of mapping the retinal location of text during reading is described in which text position is plotted cumulatively on scanning laser ophthalmoscope retinal images. Retinal locations that contain text most often are the brightest in the cumulative plot, and locations that contain text least often are the darkest. In this way, the retinal area that most often contains text is determined. Text maps were plotted for eight control subjects without vision loss and eight subjects with central scotomas from macular degeneration. Control subjects' text maps showed that the fovea contained text most often. Text maps of five of the subjects with scotomas showed that they used the same peripheral retinal area to scan text and fixate. Text maps of the other three subjects with scotomas showed that they used separate areas to scan text and fixate. Retinal text maps may help evaluate rehabilitative strategies for training individuals with central scotomas to use a particular retinal area to scan text.

  15. The utility of abbreviated patient-reported outcomes for predicting survival in early stage colorectal cancer.

    Science.gov (United States)

    Hsu, Tina; Speers, Caroline H; Kennecke, Hagen F; Cheung, Winson Y

    2017-05-15

    Patient-reported outcomes (PROs) are increasingly used in clinical settings. Prior research suggests that PROs collected at baseline may be associated with cancer survival, but most of those studies were conducted in patients with breast or lung cancer. The objective of this study was to determine the correlation between prospectively collected PROs and cancer-specific outcomes in patients with early stage colorectal cancer. Patients who had newly diagnosed stage II or III colorectal cancer from 2009 to 2010 and had a consultation at the British Columbia Cancer Agency completed the brief Psychosocial Screen for Cancer (PSSCAN) questionnaire, which collects data on patients' perceived social supports, quality of life (QOL), anxiety and depression, and general health. PROs from the PSSCAN were linked with the Gastrointestinal Cancers Outcomes Database, which contains information on patient and tumor characteristics, treatment details, and cancer outcomes. Cox regression models were constructed for overall survival (OS), and Fine and Gray regression models were developed for disease-specific survival (DSS). In total, 692 patients were included. The median patient age was 67 years (range, 26-95 years), and the majority had colon cancer (61%), were diagnosed with stage III disease (54%), and received chemotherapy (58%). In general, patients felt well supported and reported good overall health and QOL. On multivariate analysis, increased fatigue was associated with worse OS (hazard ratio [HR], 1.99; P = .00007) and DSS (HR, 1.63; P = .03), as was lack of emotional support (OS: HR, 4.36; P = .0003; DSS: HR, 1.92; P = .02). Although most patients described good overall health and QOL and indicated that they were generally well supported, patients who experienced more pronounced fatigue or lacked emotional support had a higher likelihood of worse OS and DSS. These findings suggest that abbreviated PROs can inform and assist clinicians to identify patients who have a worse

  16. Automatic Text Decomposition and Structuring.

    Science.gov (United States)

    Salton, Gerard; And Others

    1996-01-01

    Text similarity measurements are used to determine relationships between natural-language texts and text excerpts. The resulting linked hypertext maps can be broken down into text segments and themes used to identify different text types and structures, leading to improved information access and utilization. Examples are provided for text…

  17. Does a booster intervention augment the preventive effects of an abbreviated version of the coping power program for aggressive children?

    Science.gov (United States)

    Lochman, John E; Baden, Rachel E; Boxmeyer, Caroline L; Powell, Nicole P; Qu, Lixin; Salekin, Karen L; Windle, Michel

    2014-01-01

    Booster interventions have been presumed to be important methods for maintaining the effects of evidence-based programs for children with behavioral problems, but there has been remarkably little empirical attention to this assumption. The present study examines the effect of a child-oriented booster preventive intervention with children who had previously received an abbreviated version (24 child sessions, 10 parent sessions) of the Coping Power targeted prevention program. Two hundred and forty-one children (152 boys, 89 girls) were screened as having moderate to high levels of aggressive behavior in 4th grade, then half were randomly assigned to receive the abbreviated Coping Power program in 5th grade, and half of the preventive intervention children were then randomly assigned to a Booster condition in 6th grade. The Booster sessions consisted of brief monthly individual contacts, and were primarily with the children. Five assessments across 4 years were collected from teachers, providing a three-year follow-up for all children who participated in the project. Results indicated that the abbreviated Coping Power program (one-third shorter than the full intervention) had long-term effects in reducing children's externalizing problem behaviors, proactive and reactive aggression, impulsivity traits and callous-unemotional traits. The Booster intervention did not augment these prevention effects. These findings indicate that a briefer and more readily disseminated form of an evidence-based targeted preventive intervention was effective. The findings have potential implications for policy and guidelines about possible intervention length and booster interventions.

  18. Is the full version of the AUDIT really necessary? Study of the validity and internal construct of its abbreviated versions.

    Science.gov (United States)

    Meneses-Gaya, Carolina; Zuardi, Antonio W; Loureiro, Sonia R; Hallak, Jaime E C; Trzesniak, Clarissa; de Azevedo Marques, João M; Machado-de-Sousa, João P; Chagas, Marcos H N; Souza, Roberto M; Crippa, José A S

    2010-08-01

    This study was aimed at assessing the psychometric qualities of the abbreviated versions of the Alcohol Use Disorders Identification Test (AUDIT-3, AUDIT-4, AUDIT-C, AUDIT-PC, AUDIT-QF, FAST, and Five-Shot) and at comparing them to the 10-item AUDIT and the CAGE in 2 samples of Brazilian adults. The validity and internal consistency of the scales were assessed in a sample of 530 subjects attended at an emergency department and at a Psychosocial Care Center for Alcohol and Drugs. The Structured Clinical Interview for DSM-IV was used as the diagnostic comparative measure for the predictive validity assessment. The concurrent validity between the scales was analyzed by means of Pearson's correlation coefficient. The assessment of the predictive validity of the abbreviated versions showed high sensitivity (of 0.78 to 0.96) and specificity (of 0.74 to 0.94) indices, with areas under the curve as elevated as those of the AUDIT (0.89 and 0.92 to screen for abuse and 0.93 and 0.95 in the screening of dependence). The CAGE presented lower indices: 0.81 for abuse and 0.87 for dependence. The analysis of the internal consistency of the AUDIT and its versions exhibited Cronbach's alpha coefficients between 0.83 and 0.94, while the coefficient for the CAGE was 0.78. Significant correlations were found between the 10-item AUDIT and its versions, ranging from 0.91 to 0.99. Again, the results for the CAGE were satisfactory (0.77), although inferior to the other instruments. The results obtained in this study confirm the validity of the abbreviated versions of the AUDIT for the screening of alcohol use disorders and show that their psychometric properties are as satisfactory as those of the 10-item AUDIT and the CAGE.

  19. Eye movements when reading text messaging (txt msgng).

    Science.gov (United States)

    Perea, Manuel; Acha, Joana; Carreiras, Manuel

    2009-08-01

    The growing popularity of mobile-phone technology has led to changes in the way people--particularly younger people--communicate. A clear example of this is the advent of Short Message Service (SMS) language, which includes orthographic abbreviations (e.g., omitting vowels, as in wk, week) and phonetic respelling (e.g., using u instead of you). In the present study, we examined the pattern of eye movements during reading of SMS sentences (e.g., my hols wr gr8), relative to normally written sentences, in a sample of skilled "texters". SMS sentences were created by using (mostly) orthographic or phonological abbreviations. Results showed that there is a reading cost--both at a local level and at a global level--for individuals who are highly expert in SMS language. Furthermore, phonological abbreviations resulted in a greater cost than orthographic abbreviations.

  20. Temporal Adverbials in Text Structuring: On Temporal Text Strategy.

    Science.gov (United States)

    Virtanen, Tuija

    This paper discusses clause-initial adverbials of time functioning as signals of the temporal text strategy. A chain of such markers creates cohesion and coherence by forming continuity in the text and also signals textual boundaries that occur on different hierarchic levels. The temporal text strategy is closely associated with narrative text.…

  1. Text analysis methods, text analysis apparatuses, and articles of manufacture

    Science.gov (United States)

    Whitney, Paul D; Willse, Alan R; Lopresti, Charles A; White, Amanda M

    2014-10-28

    Text analysis methods, text analysis apparatuses, and articles of manufacture are described according to some aspects. In one aspect, a text analysis method includes accessing information indicative of data content of a collection of text comprising a plurality of different topics, using a computing device, analyzing the information indicative of the data content, and using results of the analysis, identifying a presence of a new topic in the collection of text.

  2. A longitudinal study of children's text messaging and literacy development.

    Science.gov (United States)

    Wood, Clare; Meachem, Sally; Bowyer, Samantha; Jackson, Emma; Tarczynski-Bowles, M Luisa; Plester, Beverly

    2011-08-01

    Recent studies have shown evidence of positive concurrent relationships between children's use of text message abbreviations ('textisms') and performance on standardized assessments of reading and spelling. This study aimed to determine the direction of this association. One hundred and nineteen children aged between 8 and 12 years were assessed on measures of general ability, reading, spelling, rapid phonological retrieval, and phonological awareness at the beginning and end of an academic year. The children were also asked to provide a sample of the text messages that they sent over a 2-day period. These messages were analyzed to determine the extent to which textisms were used. It was found that textism use at the beginning of the academic year was able to predict unique variance in spelling performance at the end of the academic year after controlling for age, verbal IQ, phonological awareness, and spelling ability at the beginning of the year. When the analysis was reversed, reading and spelling ability were unable to predict unique variance in textism usage. These data suggest that there is some evidence of a causal contribution of textism usage to spelling performance in children aged 8-12 years. However, when the measure of rapid phonological retrieval (rapid picture naming) was controlled in the analysis, the relationship between textism use and spelling ability just failed to reach statistical significance, suggesting that phonological access skills may mediate some of the relationship between textism use and spelling performance. ©2011 The British Psychological Society.

  3. Short Text Classification: A Survey

    Directory of Open Access Journals (Sweden)

    Ge Song

    2014-05-01

    Full Text Available With the recent explosive growth of e-commerce and online communication, a new genre of text, short text, has been extensively applied in many areas. So many researches focus on short text mining. It is a challenge to classify the short text owing to its natural characters, such as sparseness, large-scale, immediacy, non-standardization. It is difficult for traditional methods to deal with short text classification mainly because too limited words in short text cannot represent the feature space and the relationship between words and documents. Several researches and reviews on text classification are shown in recent times. However, only a few of researches focus on short text classification. This paper discusses the characters of short text and the difficulty of short text classification. Then we introduce the existing popular works on short text classifiers and models, including short text classification using sematic analysis, semi-supervised short text classification, ensemble short text classification, and real-time classification. The evaluations of short text classification are analyzed in our paper. Finally we summarize the existing classification technology and prospect for development trend of short text classification

  4. Mining the Text: 34 Text Features that Can Ease or Obstruct Text Comprehension and Use

    Science.gov (United States)

    White, Sheida

    2012-01-01

    This article presents 34 characteristics of texts and tasks ("text features") that can make continuous (prose), noncontinuous (document), and quantitative texts easier or more difficult for adolescents and adults to comprehend and use. The text features were identified by examining the assessment tasks and associated texts in the national…

  5. Mining the Text: 34 Text Features that Can Ease or Obstruct Text Comprehension and Use

    Science.gov (United States)

    White, Sheida

    2012-01-01

    This article presents 34 characteristics of texts and tasks ("text features") that can make continuous (prose), noncontinuous (document), and quantitative texts easier or more difficult for adolescents and adults to comprehend and use. The text features were identified by examining the assessment tasks and associated texts in the national…

  6. The Challenge of Challenging Text

    Science.gov (United States)

    Shanahan, Timothy; Fisher, Douglas; Frey, Nancy

    2012-01-01

    The Common Core State Standards emphasize the value of teaching students to engage with complex text. But what exactly makes a text complex, and how can teachers help students develop their ability to learn from such texts? The authors of this article discuss five factors that determine text complexity: vocabulary, sentence structure, coherence,…

  7. Text-Attentional Convolutional Neural Network for Scene Text Detection.

    Science.gov (United States)

    He, Tong; Huang, Weilin; Qiao, Yu; Yao, Jian

    2016-06-01

    Recent deep learning models have demonstrated strong capabilities for classifying text and non-text components in natural images. They extract a high-level feature globally computed from a whole image component (patch), where the cluttered background information may dominate true text features in the deep representation. This leads to less discriminative power and poorer robustness. In this paper, we present a new system for scene text detection by proposing a novel text-attentional convolutional neural network (Text-CNN) that particularly focuses on extracting text-related regions and features from the image components. We develop a new learning mechanism to train the Text-CNN with multi-level and rich supervised information, including text region mask, character label, and binary text/non-text information. The rich supervision information enables the Text-CNN with a strong capability for discriminating ambiguous texts, and also increases its robustness against complicated background components. The training process is formulated as a multi-task learning problem, where low-level supervised information greatly facilitates the main task of text/non-text classification. In addition, a powerful low-level detector called contrast-enhancement maximally stable extremal regions (MSERs) is developed, which extends the widely used MSERs by enhancing intensity contrast between text patterns and background. This allows it to detect highly challenging text patterns, resulting in a higher recall. Our approach achieved promising results on the ICDAR 2013 data set, with an F-measure of 0.82, substantially improving the state-of-the-art results.

  8. Text Classification using Artificial Intelligence

    CERN Document Server

    Kamruzzaman, S M

    2010-01-01

    Text classification is the process of classifying documents into predefined categories based on their content. It is the automated assignment of natural language texts to predefined categories. Text classification is the primary requirement of text retrieval systems, which retrieve texts in response to a user query, and text understanding systems, which transform text in some way such as producing summaries, answering questions or extracting data. Existing supervised learning algorithms for classifying text need sufficient documents to learn accurately. This paper presents a new algorithm for text classification using artificial intelligence technique that requires fewer documents for training. Instead of using words, word relation i.e. association rules from these words is used to derive feature set from pre-classified text documents. The concept of na\\"ive Bayes classifier is then used on derived features and finally only a single concept of genetic algorithm has been added for final classification. A syste...

  9. Text Classification using Data Mining

    CERN Document Server

    Kamruzzaman, S M; Hasan, Ahmed Ryadh

    2010-01-01

    Text classification is the process of classifying documents into predefined categories based on their content. It is the automated assignment of natural language texts to predefined categories. Text classification is the primary requirement of text retrieval systems, which retrieve texts in response to a user query, and text understanding systems, which transform text in some way such as producing summaries, answering questions or extracting data. Existing supervised learning algorithms to automatically classify text need sufficient documents to learn accurately. This paper presents a new algorithm for text classification using data mining that requires fewer documents for training. Instead of using words, word relation i.e. association rules from these words is used to derive feature set from pre-classified text documents. The concept of Naive Bayes classifier is then used on derived features and finally only a single concept of Genetic Algorithm has been added for final classification. A system based on the...

  10. Text analysis devices, articles of manufacture, and text analysis methods

    Science.gov (United States)

    Turner, Alan E; Hetzler, Elizabeth G; Nakamura, Grant C

    2013-05-28

    Text analysis devices, articles of manufacture, and text analysis methods are described according to some aspects. In one aspect, a text analysis device includes processing circuitry configured to analyze initial text to generate a measurement basis usable in analysis of subsequent text, wherein the measurement basis comprises a plurality of measurement features from the initial text, a plurality of dimension anchors from the initial text and a plurality of associations of the measurement features with the dimension anchors, and wherein the processing circuitry is configured to access a viewpoint indicative of a perspective of interest of a user with respect to the analysis of the subsequent text, and wherein the processing circuitry is configured to use the viewpoint to generate the measurement basis.

  11. Text-Attentional Convolutional Neural Network for Scene Text Detection

    Science.gov (United States)

    He, Tong; Huang, Weilin; Qiao, Yu; Yao, Jian

    2016-06-01

    Recent deep learning models have demonstrated strong capabilities for classifying text and non-text components in natural images. They extract a high-level feature computed globally from a whole image component (patch), where the cluttered background information may dominate true text features in the deep representation. This leads to less discriminative power and poorer robustness. In this work, we present a new system for scene text detection by proposing a novel Text-Attentional Convolutional Neural Network (Text-CNN) that particularly focuses on extracting text-related regions and features from the image components. We develop a new learning mechanism to train the Text-CNN with multi-level and rich supervised information, including text region mask, character label, and binary text/nontext information. The rich supervision information enables the Text-CNN with a strong capability for discriminating ambiguous texts, and also increases its robustness against complicated background components. The training process is formulated as a multi-task learning problem, where low-level supervised information greatly facilitates main task of text/non-text classification. In addition, a powerful low-level detector called Contrast- Enhancement Maximally Stable Extremal Regions (CE-MSERs) is developed, which extends the widely-used MSERs by enhancing intensity contrast between text patterns and background. This allows it to detect highly challenging text patterns, resulting in a higher recall. Our approach achieved promising results on the ICDAR 2013 dataset, with a F-measure of 0.82, improving the state-of-the-art results substantially.

  12. Text-Attentional Convolutional Neural Networks for Scene Text Detection.

    Science.gov (United States)

    He, Tong; Huang, Weilin; Qiao, Yu; Yao, Jian

    2016-03-28

    Recent deep learning models have demonstrated strong capabilities for classifying text and non-text components in natural images. They extract a high-level feature computed globally from a whole image component (patch), where the cluttered background information may dominate true text features in the deep representation. This leads to less discriminative power and poorer robustness. In this work, we present a new system for scene text detection by proposing a novel Text-Attentional Convolutional Neural Network (Text-CNN) that particularly focuses on extracting text-related regions and features from the image components. We develop a new learning mechanism to train the Text-CNN with multi-level and rich supervised information, including text region mask, character label, and binary text/nontext information. The rich supervision information enables the Text-CNN with a strong capability for discriminating ambiguous texts, and also increases its robustness against complicated background components. The training process is formulated as a multi-task learning problem, where low-level supervised information greatly facilitates main task of text/non-text classification. In addition, a powerful low-level detector called Contrast- Enhancement Maximally Stable Extremal Regions (CE-MSERs) is developed, which extends the widely-used MSERs by enhancing intensity contrast between text patterns and background. This allows it to detect highly challenging text patterns, resulting in a higher recall. Our approach achieved promising results on the ICDAR 2013 dataset, with a F-measure of 0.82, improving the state-of-the-art results substantially.

  13. Zipf's Law of Abbreviation and the Principle of Least Effort: Language users optimise a miniature lexicon for efficient communication.

    Science.gov (United States)

    Kanwal, Jasmeen; Smith, Kenny; Culbertson, Jennifer; Kirby, Simon

    2017-08-01

    The linguist George Kingsley Zipf made a now classic observation about the relationship between a word's length and its frequency; the more frequent a word is, the shorter it tends to be. He claimed that this "Law of Abbreviation" is a universal structural property of language. The Law of Abbreviation has since been documented in a wide range of human languages, and extended to animal communication systems and even computer programming languages. Zipf hypothesised that this universal design feature arises as a result of individuals optimising form-meaning mappings under competing pressures to communicate accurately but also efficiently-his famous Principle of Least Effort. In this study, we use a miniature artificial language learning paradigm to provide direct experimental evidence for this explanatory hypothesis. We show that language users optimise form-meaning mappings only when pressures for accuracy and efficiency both operate during a communicative task, supporting Zipf's conjecture that the Principle of Least Effort can explain this universal feature of word length distributions. Copyright © 2017 Elsevier B.V. All rights reserved.

  14. Coelomogenesis during the abbreviated development of the echinoid Heliocidaris erythrogramma and the developmental origin of the echinoderm pentameral body plan.

    Science.gov (United States)

    Morris, Valerie B

    2011-01-01

    The development of the coeloms is described in an echinoid with an abbreviated larval development and shows the early morphogenesis of the coeloms of the adult stage. The development is described from images obtained by laser scanning confocal microscopy. The development in Heliocidaris erythrogramma is asymmetric with a larger left coelom forming on the larval-left side and a smaller right coelom forming on the larval-right side. The right coelom forms after the development of the left coelom is well advanced. The hydrocoele forms from the anterior part of the left coelom. The five lobes of the hydrocoele from which the pentamery of the adult derives take shape on the outer, distal wall of the anterior part of the left coelom. The hydrocoele separates from the more posterior part of the left coelom, which becomes the left posterior coelom. The lobes of the hydrocoele are named, based on the site of the connexion of the stone canal to the hydrocoele. The mouth is assumed to form by penetration through only the outer, distal wall of the hydrocoele and the ectoderm. Both larval and adult polarities are evident in this larva. A comparison with coelomogenesis in the asteroid Parvulastra exigua, which also has an abbreviated development, leads to predictions of homology between the echinoderm and chordate phyla that do not require the hypothesis of a dorsoventral inversion event in chordates.

  15. Abbreviated mindfulness intervention for job satisfaction, quality of life, and compassion in primary care clinicians: a pilot study.

    Science.gov (United States)

    Fortney, Luke; Luchterhand, Charlene; Zakletskaia, Larissa; Zgierska, Aleksandra; Rakel, David

    2013-01-01

    Burnout, attrition, and low work satisfaction of primary care physicians are growing concerns and can have a negative influence on health care. Interventions for clinicians that improve work-life balance are few and poorly understood. We undertook this study as a first step in investigating whether an abbreviated mindfulness intervention could increase job satisfaction, quality of life, and compassion among primary care clinicians. A total of 30 primary care clinicians participated in an abbreviated mindfulness course. We used a single-sample, pre-post design. At 4 points in time (baseline, and 1 day, 8 weeks, and 9 months postintervention), participants completed a set of online measures assessing burnout, anxiety, stress, resilience, and compassion. We used a linear mixed-effects model analysis to assess changes in outcome measures. Participants had improvements compared with baseline at all 3 follow-up time points. At 9 months postintervention, they had significantly better scores (1) on all Maslach Burnout Inventory burnout subscales-Emotional Exhaustion (P =.009), Depersonalization (P = .005), and Personal Accomplishment (P job burnout, depression, anxiety, and stress. Modified mindfulness training may be a time-efficient tool to help support clinician health and well-being, which may have implications for patient care.

  16. Contrastive Study of Coherence in Chinese Text and English Text

    Institute of Scientific and Technical Information of China (English)

    王婷

    2013-01-01

    The paper presents the text-linguistic concepts on which the analysis of textual structure is based including text and discourse, coherence and cohesive. In addition we try to discover different manifestations of text between ET and CT, including different coherent structures.

  17. Supported eText: Assistive Technology through Text Transformations

    Science.gov (United States)

    Anderson-Inman, Lynne; Horney, Mark A.

    2007-01-01

    To gain meaningful access to the curriculum, students with reading difficulties must overcome substantial barriers imposed by the printed materials they are asked to read. Technology can assist students to overcome these challenges by enabling a shift from printed text to electronic text. By electronic text it means textual material read using a…

  18. Test of Picture-Text Amalgams in Procedural Texts.

    Science.gov (United States)

    Stone, David Edey

    Designed to assess how people read and comprehend information presented in picture-text amalgams in procedural texts, this instrument presents various combinations of text information and illustrative information on slides. Subjects are assigned to one of four conditions and directed to follow the instructions presented on the slides. Videotapes…

  19. Bacterial toxins activation of abbreviated urea cycle in porcine cerebral vascular smooth muscle cells.

    Science.gov (United States)

    Mishra, Rajesh G; Tseng, Tzu-Ling; Chen, Mei-Fang; Chen, Po-Yi; Lee, Tony J-F

    2016-12-01

    Nitric oxide (NO) overproduction via induction of inducible nitric oxide synthase (iNOS) is implicated in vasodilatory shock in sepsis, leading to septic encephalopathy and accelerating cerebral ischemic injury. An abbreviated urea-cycle (l-citrulline-l-arginine-NO cycle) has been demonstrated in cerebral perivascular nitrergic nerves and endothelial cells but not in normal cerebral vascular smooth muscle cell (CVSMC). This cycle indicates that argininosuccinate synthase (ASS) catalyzes l-citrulline (l-cit) conversion to form argininosuccinate (AS), and subsequent AS cleavage by argininosuccinate lyase (ASL) forms l-arginine (l-arg), the substrate for NO synthesis. The possibility that ASS enzyme in this cycle was induced in the CVSMC in sepsis was examined. Blood-vessel myography technique was used for measuring porcine isolated basilar arterial tone. NO in cultured CVSMC and in condition mediums were estimated by diaminofluorescein (DAF)-induced fluorescence and Griess reaction, respectively. Immunohistochemical and immunoblotting analyses were used to examine iNOS and ASS induction. l-cit and l-arg, which did not relax endothelium-denuded normal basilar arteries precontracted by U-46619, induced significant vasorelaxation with increased NO production in these arteries and the CVSMCs following 6-hour exposure to 20μg/ml lipopolysaccharide (LPS) or lipoteichoic acid (LTA). Pre-treatment with pyrrolidine dithiocarbamate (PDTC) and salicylate (SAL) (NFκB inhibitors), aminoguanidine (AG, an iNOS inhibitor), and nitro-l-arg (NLA, a non-specific NOS inhibitor) blocked NO synthesis in the CVSMC and attenuated l-cit- and l-arg-induced relaxation of LPS- and LTA-treated arteries. Furthermore, immunohistochemical and immunoblotting studies demonstrated that expression of basal iNOS and ASS in the smooth muscle cell of arterial segments denuded of endothelium and the cultured CVSMCs was significantly increased following 6-hour incubation with LPS or LTA. This increased i

  20. Cyclosporine therapy monitored with abbreviated area under curve in nephrotic syndrome.

    Science.gov (United States)

    Rinaldi, Stefano; Sesto, Antonella; Barsotti, Paola; Faraggiana, Tullio; Sera, Francesco; Rizzoni, Gianfranco

    2005-01-01

    Cyclosporin A (CsA) is an effective therapy for children with long-lasting nephrotic syndrome (NS). Long-term treatment can result in chronic CsA nephropathy (CsAN) and there is controversy concerning its incidence and severity. Trough levels are commonly used to monitor the drug concentration. We report a retrospective clinical and histological analysis of 18 children (12 males, 6 females) with steroid-dependent nephrotic syndrome (15 patients) and partially steroid-sensitive nephrotic syndrome (3 patients) treated with CsA for a long-term period (mean 4.9 years, range 2.2-6.9). Before CsA treatment all patients had normal creatinine clearance. CsA was started at a dose of 5 mg/kg per day administered orally in two divided doses and adjusted to maintain the mean CsA blood concentration between 250 and 350 ng/ml obtained from abbreviated area under the curve (AUC). A renal biopsy was performed after a mean period of 3.9 years (range 2.2-6.2) from the start of CsA treatment. Tubular, interstitial, and arteriolar lesions were evaluated in order to assess CsAN. The mean CsA dose and the mean CsA blood concentration were 4.4 mg/kg per day (range 3.6-5.8) and 276.6 ng/ml (range 162-346), respectively. No child had a worsening creatinine clearance during CsA treatment and follow-up after CsA discontinuation. If compared with the year before the start of CsA treatment, NS relapses and prednisone (PDN) dose significantly decreased during CsA treatment, 4/year versus 0.8/year (P <0.0001) and 0.9 mg/kg per day versus 0.2 mg/kg per day (P <0.0001), respectively. Histological analysis showed 15 patients with minimal change disease and 3 with focal segmental glomerulosclerosis. Clear-cut lesions diagnostic of CsAN were never found and only mild lesions were observed in 5 children (suggestive of CsAN in 2 patients and consistent with CsAN in 3 patients). Long-term CsA treatment is confirmed to be effective in preventing NS relapses and reducing PDN dose. Renal function is not a

  1. Text mining from ontology learning to automated text processing applications

    CERN Document Server

    Biemann, Chris

    2014-01-01

    This book comprises a set of articles that specify the methodology of text mining, describe the creation of lexical resources in the framework of text mining and use text mining for various tasks in natural language processing (NLP). The analysis of large amounts of textual data is a prerequisite to build lexical resources such as dictionaries and ontologies and also has direct applications in automated text processing in fields such as history, healthcare and mobile applications, just to name a few. This volume gives an update in terms of the recent gains in text mining methods and reflects

  2. Working with text tools, techniques and approaches for text mining

    CERN Document Server

    Tourte, Gregory J L

    2016-01-01

    Text mining tools and technologies have long been a part of the repository world, where they have been applied to a variety of purposes, from pragmatic aims to support tools. Research areas as diverse as biology, chemistry, sociology and criminology have seen effective use made of text mining technologies. Working With Text collects a subset of the best contributions from the 'Working with text: Tools, techniques and approaches for text mining' workshop, alongside contributions from experts in the area. Text mining tools and technologies in support of academic research include supporting research on the basis of a large body of documents, facilitating access to and reuse of extant work, and bridging between the formal academic world and areas such as traditional and social media. Jisc have funded a number of projects, including NaCTem (the National Centre for Text Mining) and the ResDis programme. Contents are developed from workshop submissions and invited contributions, including: Legal considerations in te...

  3. Text Signals Influence Team Artifacts

    Science.gov (United States)

    Clariana, Roy B.; Rysavy, Monica D.; Taricani, Ellen

    2015-01-01

    This exploratory quasi-experimental investigation describes the influence of text signals on team visual map artifacts. In two course sections, four-member teams were given one of two print-based text passage versions on the course-related topic "Social influence in groups" downloaded from Wikipedia; this text had two paragraphs, each…

  4. Too Dumb for Complex Texts?

    Science.gov (United States)

    Bauerlein, Mark

    2011-01-01

    High school students' lack of experience and practice with reading complex texts is a primary cause of their difficulties with college-level reading. Filling the syllabus with digital texts does little to address this deficiency. Complex texts demand three dispositions from readers: a willingness to probe works characterized by dense meanings, the…

  5. Mensuração da gravidade do trauma com as versões 1998 e 2005 da Abbreviated Injury Scale

    OpenAIRE

    Maria Carolina Barbosa Teixeira Lopes; Iveth Yamaguchi Whitaker

    2014-01-01

    Objetivo: Comparar a gravidade das lesões e do trauma mensurada pelas versões da Abbreviated Injury Scale 1998 e 2005 e verificar a mortalidade nos escores Injury Severity Score e New Injury Severity Score nas duas versões.Método: Estudo transversal e retrospectivo analisou lesões de pacientes de trauma, de três hospitais universitários do município de São Paulo, Brasil. Cada lesão foi codificada com Abbreviated Injury Scale 1998 e 2005. Os testes estatísticos aplicados foram Wilcoxon, McNema...

  6. Multilingual Text Analysis for Text-to-Speech Synthesis

    CERN Document Server

    Sproat, R

    1996-01-01

    We present a model of text analysis for text-to-speech (TTS) synthesis based on (weighted) finite-state transducers, which serves as the text-analysis module of the multilingual Bell Labs TTS system. The transducers are constructed using a lexical toolkit that allows declarative descriptions of lexicons, morphological rules, numeral-expansion rules, and phonological rules, inter alia. To date, the model has been applied to eight languages: Spanish, Italian, Romanian, French, German, Russian, Mandarin and Japanese.

  7. Predicting Prosody from Text for Text-to-Speech Synthesis

    CERN Document Server

    Rao, K Sreenivasa

    2012-01-01

    Predicting Prosody from Text for Text-to-Speech Synthesis covers the specific aspects of prosody, mainly focusing on how to predict the prosodic information from linguistic text, and then how to exploit the predicted prosodic knowledge for various speech applications. Author K. Sreenivasa Rao discusses proposed methods along with state-of-the-art techniques for the acquisition and incorporation of prosodic knowledge for developing speech systems. Positional, contextual and phonological features are proposed for representing the linguistic and production constraints of the sound units present in the text. This book is intended for graduate students and researchers working in the area of speech processing.

  8. A Survey on Web Text Information Retrieval in Text Mining

    Directory of Open Access Journals (Sweden)

    Tapaswini Nayak

    2015-08-01

    Full Text Available In this study we have analyzed different techniques for information retrieval in text mining. The aim of the study is to identify web text information retrieval. Text mining almost alike to analytics, which is a process of deriving high quality information from text. High quality information is typically derived in the course of the devising of patterns and trends through means such as statistical pattern learning. Typical text mining tasks include text categorization, text clustering, concept/entity extraction, creation of coarse taxonomies, sentiment analysis, document summarization and entity relation modeling. It is used to mine hidden information from not-structured or semi-structured data. This feature is necessary because a large amount of the Web information is semi-structured due to the nested structure of HTML code, is linked and is redundant. Web content categorization with a content database is the most important tool to the efficient use of search engines. A customer requesting information on a particular subject or item would otherwise have to search through hundred of results to find the most relevant information to his query. Hundreds of results through use of mining text are reduced by this step. This eliminates the aggravation and improves the navigation of information on the Web.

  9. Text comprehension practice in school

    Directory of Open Access Journals (Sweden)

    Hernández, José Emilio

    2010-01-01

    Full Text Available The starting point of the study is the existence of relations between the two dimensions of text compression: the instrumental dimension and the cognitive dimension. The first one includes the system of actions, the second one the system of knowledge. A description of identifying, describing, inferring apprising and creating actions are suggested for each type of text. Likewise, the importance of implementing text comprehension is outlined on the basis of the assumption that the text is a tool for preserving and communicating culture, that allows human beings to wide their respective cultural horizons and develop cognitive and affective process that allow them to get universal morals.

  10. Knowledge Representation in Travelling Texts

    DEFF Research Database (Denmark)

    Mousten, Birthe; Locmele, Gunta

    2014-01-01

    and the purpose of the text in a new context as well as on predefined parameters for text travel. For texts used in marketing and in technology, the question is whether culture-bound knowledge representation should be domesticated or kept as foreign elements, or should be mirrored or moulded—or should not travel...... at all! When should semantic and pragmatic elements in a text be replaced and by which other elements? The empirical basis of our work is marketing and technical texts in English, which travel into the Latvian and Danish markets, respectively....

  11. TEXT DEIXIS IN NARRATIVE SEQUENCES

    Directory of Open Access Journals (Sweden)

    Josep Rivera

    2007-06-01

    Full Text Available This study looks at demonstrative descriptions, regarding them as text-deictic procedures which contribute to weave discourse reference. Text deixis is thought of as a metaphorical referential device which maps the ground of utterance onto the text itself. Demonstrative expressions with textual antecedent-triggers, considered as the most important text-deictic units, are identified in a narrative corpus consisting of J. M. Barrie’s Peter Pan and its translation into Catalan. Some linguistic and discourse variables related to DemNPs are analysed to characterise adequately text deixis. It is shown that this referential device is usually combined with abstract nouns, thus categorising and encapsulating (non-nominal complex discourse entities as nouns, while performing a referential cohesive function by means of the text deixis + general noun type of lexical cohesion.

  12. Text mining: A Brief survey

    Directory of Open Access Journals (Sweden)

    Falguni N. Patel , Neha R. Soni

    2012-12-01

    Full Text Available The unstructured texts which contain massive amount of information cannot simply be used for further processing by computers. Therefore, specific processing methods and algorithms are required in order to extract useful patterns. The process of extracting interesting information and knowledge from unstructured text completed by using Text mining. In this paper, we have discussed text mining, as a recent and interesting field with the detail of steps involved in the overall process. We have also discussed different technologies that teach computers with natural language so that they may analyze, understand, and even generate text. In addition, we briefly discuss a number of successful applications of text mining which are used currently and in future.

  13. Proposal for a revised taxonomy of the family Filoviridae: classification, names of taxa and viruses, and virus abbreviations.

    Science.gov (United States)

    Kuhn, Jens H; Becker, Stephan; Ebihara, Hideki; Geisbert, Thomas W; Johnson, Karl M; Kawaoka, Yoshihiro; Lipkin, W Ian; Negredo, Ana I; Netesov, Sergey V; Nichol, Stuart T; Palacios, Gustavo; Peters, Clarence J; Tenorio, Antonio; Volchkov, Viktor E; Jahrling, Peter B

    2010-12-01

    The taxonomy of the family Filoviridae (marburgviruses and ebolaviruses) has changed several times since the discovery of its members, resulting in a plethora of species and virus names and abbreviations. The current taxonomy has only been partially accepted by most laboratory virologists. Confusion likely arose for several reasons: species names that consist of several words or which (should) contain diacritical marks, the current orthographic identity of species and virus names, and the similar pronunciation of several virus abbreviations in the absence of guidance for the correct use of vernacular names. To rectify this problem, we suggest (1) to retain the current species names Reston ebolavirus, Sudan ebolavirus, and Zaire ebolavirus, but to replace the name Cote d'Ivoire ebolavirus [sic] with Taï Forest ebolavirus and Lake Victoria marburgvirus with Marburg marburgvirus; (2) to revert the virus names of the type marburgviruses and ebolaviruses to those used for decades in the field (Marburg virus instead of Lake Victoria marburgvirus and Ebola virus instead of Zaire ebolavirus); (3) to introduce names for the remaining viruses reminiscent of jargon used by laboratory virologists but nevertheless different from species names (Reston virus, Sudan virus, Taï Forest virus), and (4) to introduce distinct abbreviations for the individual viruses (RESTV for Reston virus, SUDV for Sudan virus, and TAFV for Taï Forest virus), while retaining that for Marburg virus (MARV) and reintroducing that used over decades for Ebola virus (EBOV). Paying tribute to developments in the field, we propose (a) to create a new ebolavirus species (Bundibugyo ebolavirus) for one member virus (Bundibugyo virus, BDBV); (b) to assign a second virus to the species Marburg marburgvirus (Ravn virus, RAVV) for better reflection of now available high-resolution phylogeny; and (c) to create a new tentative genus (Cuevavirus) with one tentative species (Lloviu cuevavirus) for the recently

  14. Proposal for a revised taxonomy of the family Filoviridae: classification, names of taxa and viruses, and virus abbreviations

    Science.gov (United States)

    Kuhn, Jens H.; Becker, Stephan; Ebihara, Hideki; Geisbert, Thomas W.; Johnson, Karl M.; Kawaoka, Yoshihiro; Lipkin, W. Ian; Negredo, Ana I.; Netesov, Sergey V.; Nichol, Stuart T.; Palacios, Gustavo; Peters, Clarence J.; Tenorio, Antonio; Volchkov, Viktor E.; Jahrling, Peter B.

    2011-01-01

    The taxonomy of the family Filoviridae (marburgviruses and ebolaviruses) has changed several times since the discovery of its members, resulting in a plethora of species and virus names and abbreviations. The current taxonomy has only been partially accepted by most laboratory virologists. Confusion likely arose for several reasons: species names that consist of several words or which (should) contain diacritical marks, the current orthographic identity of species and virus names, and the similar pronunciation of several virus abbreviations in the absence of guidance for the correct use of vernacular names. To rectify this problem, we suggest (1) to retain the current species names Reston ebolavirus, Sudan ebolavirus, and Zaire ebolavirus, but to replace the name Cote d'Ivoire ebolavirus [sic] with Taï Forest ebolavirus and Lake Victoria marburgvirus with Marburg marburgvirus; (2) to revert the virus names of the type marburgviruses and ebolaviruses to those used for decades in the field (Marburg virus instead of Lake Victoria marburgvirus and Ebola virus instead of Zaire ebolavirus); (3) to introduce names for the remaining viruses reminiscent of jargon used by laboratory virologists but nevertheless different from species names (Reston virus, Sudan virus, Taï Forest virus), and (4) to introduce distinct abbreviations for the individual viruses (RESTV for Reston virus, SUDV for Sudan virus, and TAFV for Taï Forest virus), while retaining that for Marburg virus (MARV) and reintroducing that used over decades for Ebola virus (EBOV). Paying tribute to developments in the field, we propose (a) to create a new ebolavirus species (Bundibugyo ebolavirus) for one member virus (Bundibugyo virus, BDBV); (b) to assign a second virus to the species Marburg marburgvirus (Ravn virus, RAVV) for better reflection of now available high-resolution phylogeny; and (c) to create a new tentative genus (Cuevavirus) with one tentative species (Lloviu cuevavirus) for the recently

  15. Abbreviated breast dynamic contrast-enhanced MR imaging for lesion detection and characterization: the experience of an Italian oncologic center.

    Science.gov (United States)

    Petrillo, Antonella; Fusco, Roberta; Sansone, Mario; Cerbone, Marilena; Filice, Salvatore; Porto, Annamaria; Rubulotta, Maria Rosaria; D'Aiuto, Massimiliano; Avino, Franca; Di Bonito, Maurizio; Botti, Gerardo

    2017-07-01

    To evaluate the performance of an abbreviated dynamic contrast-enhanced MR imaging (MRI) protocol for breast cancer detection; a comparison with the complete diagnostic protocol has been conducted. A retrospective analysis on 508 patients was performed. Abbreviated protocol (AP) included one pre-contrast and the first post-contrast T1-weighted series. Complete protocol (CP) consisted of four post-contrast and one pre-contrast T1-weighted series. Diagnostic performance was assessed for AP and CP. Performance comparison was made using McNemar's test for sensitivity and specificity and Moskowitz and Pepe's method as regards negative predictive value (NPV) and positive predictive value (PPV). AP has been realized in two different ways (AP1 and AP2) and they were compared by means of Cohen's κ. Both CP and AP revealed 206 of 207 cancers. There were no statistically significant differences between AP and CP diagnostic performance (P > 0.05). NPVs of CP and both versions of AP (99.57 vs. 99.56%, P = 0.39), as well as the specificity (77.08 vs. 75.42%, P = 0.18), were substantially equivalent. Relative predictive value method did not reveal the presence of a statistically significant difference between the PPV of CP and both versions of AP (74.91 vs. 73.57%, P = 0.099). Analysis for single lesion confirmed that both CP and AP had equivalent results: CP and AP revealed 280 of 281 malignancies. NPVs of CP and both AP versions, as well as the specificity (P > 0.05), were substantially equivalent. Relative predictive value method did not reveal the presence of a significant difference between the PPV of CP and both AP versions (70.89 vs. 70.18%, P = 0.25; 70.89 vs. 70.00%, P = 0.13). Abbreviated approach to breast MRI examination reduces the image acquisition and the reading time associated with MR substantially without influencing the diagnostic accuracy (high sensitivity and NPV >99.5%). AP could translate into cost-savings and could enable a higher number of

  16. Texting while driving: is speech-based text entry less risky than handheld text entry?

    Science.gov (United States)

    He, J; Chaparro, A; Nguyen, B; Burge, R J; Crandall, J; Chaparro, B; Ni, R; Cao, S

    2014-11-01

    Research indicates that using a cell phone to talk or text while maneuvering a vehicle impairs driving performance. However, few published studies directly compare the distracting effects of texting using a hands-free (i.e., speech-based interface) versus handheld cell phone, which is an important issue for legislation, automotive interface design and driving safety training. This study compared the effect of speech-based versus handheld text entries on simulated driving performance by asking participants to perform a car following task while controlling the duration of a secondary text-entry task. Results showed that both speech-based and handheld text entries impaired driving performance relative to the drive-only condition by causing more variation in speed and lane position. Handheld text entry also increased the brake response time and increased variation in headway distance. Text entry using a speech-based cell phone was less detrimental to driving performance than handheld text entry. Nevertheless, the speech-based text entry task still significantly impaired driving compared to the drive-only condition. These results suggest that speech-based text entry disrupts driving, but reduces the level of performance interference compared to text entry with a handheld device. In addition, the difference in the distraction effect caused by speech-based and handheld text entry is not simply due to the difference in task duration.

  17. Knowledge Representation in Travelling Texts

    DEFF Research Database (Denmark)

    Mousten, Birthe; Locmele, Gunta

    2014-01-01

    Today, information travels fast. Texts travel, too. In a corporate context, the question is how to manage which knowledge elements should travel to a new language area or market and in which form? The decision to let knowledge elements travel or not travel highly depends on the limitation...... and the purpose of the text in a new context as well as on predefined parameters for text travel. For texts used in marketing and in technology, the question is whether culture-bound knowledge representation should be domesticated or kept as foreign elements, or should be mirrored or moulded—or should not travel...... at all! When should semantic and pragmatic elements in a text be replaced and by which other elements? The empirical basis of our work is marketing and technical texts in English, which travel into the Latvian and Danish markets, respectively....

  18. Text Mining Applications and Theory

    CERN Document Server

    Berry, Michael W

    2010-01-01

    Text Mining: Applications and Theory presents the state-of-the-art algorithms for text mining from both the academic and industrial perspectives.  The contributors span several countries and scientific domains: universities, industrial corporations, and government laboratories, and demonstrate the use of techniques from machine learning, knowledge discovery, natural language processing and information retrieval to design computational models for automated text analysis and mining. This volume demonstrates how advancements in the fields of applied mathematics, computer science, machine learning

  19. Survey of Text Plagiarism Detection

    Directory of Open Access Journals (Sweden)

    Albaraa Abuobieda

    2012-06-01

    Full Text Available In this paper we are going to review and list the advantages and limitations of the significant effective techniques employed or developed in text plagiarism detection.  It was found that many of the proposed methods for plagiarism detection have a weakness and lacking for detecting some types of plagiarized text. This paper discussed several important issues in plagiarism detection such as; plagiarism detection Tasks, plagiarism detection process and some of the current plagiarism detection techniques.

  20. Typesafe Modeling in Text Mining

    CERN Document Server

    Steeg, Fabian

    2011-01-01

    Based on the concept of annotation-based agents, this report introduces tools and a formal notation for defining and running text mining experiments using a statically typed domain-specific language embedded in Scala. Using machine learning for classification as an example, the framework is used to develop and document text mining experiments, and to show how the concept of generic, typesafe annotation corresponds to a general information model that goes beyond text processing.

  1. Text Type and Translation Strategy

    Institute of Scientific and Technical Information of China (English)

    刘福娟

    2015-01-01

    Translation strategy and translation standards are undoubtedly the core problems translators are confronted with in translation. There have arisen many kinds of translation strategies in translation history, among which the text type theory is considered an important breakthrough and a significant complement of traditional translation standards. This essay attempts to demonstrate the value of text typology (informative, expressive, and operative) to translation strategy, emphasizing the importance of text types and their communicative functions.

  2. SparkText: Biomedical Text Mining on Big Data Framework.

    Science.gov (United States)

    Ye, Zhan; Tafti, Ahmad P; He, Karen Y; Wang, Kai; He, Max M

    Many new biomedical research articles are published every day, accumulating rich information, such as genetic variants, genes, diseases, and treatments. Rapid yet accurate text mining on large-scale scientific literature can discover novel knowledge to better understand human diseases and to improve the quality of disease diagnosis, prevention, and treatment. In this study, we designed and developed an efficient text mining framework called SparkText on a Big Data infrastructure, which is composed of Apache Spark data streaming and machine learning methods, combined with a Cassandra NoSQL database. To demonstrate its performance for classifying cancer types, we extracted information (e.g., breast, prostate, and lung cancers) from tens of thousands of articles downloaded from PubMed, and then employed Naïve Bayes, Support Vector Machine (SVM), and Logistic Regression to build prediction models to mine the articles. The accuracy of predicting a cancer type by SVM using the 29,437 full-text articles was 93.81%. While competing text-mining tools took more than 11 hours, SparkText mined the dataset in approximately 6 minutes. This study demonstrates the potential for mining large-scale scientific articles on a Big Data infrastructure, with real-time update from new articles published daily. SparkText can be extended to other areas of biomedical research.

  3. Hermeneutic reading of classic texts.

    Science.gov (United States)

    Koskinen, Camilla A-L; Lindström, Unni Å

    2013-09-01

    The purpose of this article is to broaden the understandinfg of the hermeneutic reading of classic texts. The aim is to show how the choice of a specific scientific tradition in conjunction with a methodological approach creates the foundation that clarifies the actual realization of the reading. This hermeneutic reading of classic texts is inspired by Gadamer's notion that it is the researcher's own research tradition and a clearly formulated theoretical fundamental order that shape the researcher's attitude towards texts and create the starting point that guides all reading, uncovering and interpretation. The researcher's ethical position originates in a will to openness towards what is different in the text and which constantly sets the researcher's preunderstanding and research tradition in movement. It is the researcher's attitude towards the text that allows the text to address, touch and arouse wonder. Through a flexible, lingering and repeated reading of classic texts, what is different emerges with a timeless value. The reading of classic texts is an act that may rediscover and create understanding for essential dimensions and of human beings' reality on a deeper level. The hermeneutic reading of classic texts thus brings to light constantly new possibilities of uncovering for a new envisioning and interpretation for a new understanding of the essential concepts and phenomena within caring science.

  4. Risk adapted transmission prophylaxis to prevent vertical HIV–1 transmission: Effectiveness and safety of an abbreviated regimen of postnatal oral Zidovudine

    Directory of Open Access Journals (Sweden)

    Neubert Jennifer

    2013-01-01

    Full Text Available Abstract Background Antiretroviral drugs including zidovudine (ZDV are effective in reducing HIV mother to child transmission (MTCT, however safety concern remains. The optimal duration of postnatal ZDV has not been established in clinical studies and there is a lack of consensus regarding optimal management. The objective of this study was to investigate the effectiveness and safety of a risk adapted two week course of oral postnatal ZDV as part of a combined intervention to reduce MTCT. Methods 118 mother infant pairs were treated according to the German-Austrian recommendations for HIV therapy in pregnancy and in HIV exposed newborns between 2000–2010. In the absence of factors associated with an increased HIV–1 transmission risk, children were assigned to the low risk group and treated with an abbreviated postnatal regimen with oral ZDV for 2 weeks. In the presence of risk factors, postnatal ZDV was escalated accordingly. Results Of 118 mother-infant pairs 79 were stratified to the low risk group, 27 to the high risk group and 11 to the very high risk group for HIV–1 MTCT. 4 children were lost to follow up. Overall Transmission risk in the group regardless of risk factors and completion of prophylaxis was 1.8% (95% confidence interval (CI 0.09–6.6. If transmission prophylaxis was complete, transmission risk was 0.9% (95% CI 0.01-5.7. In the low risk group receiving two week oral ZDV transmission risk was 1.4% (95% CI 0.01–8.4 Conclusion These data demonstrate the effectiveness of a short neonatal ZDV regimen in infants of women on stable ART and effective HIV–1 suppression. Further evaluation is needed in larger studies.

  5. Screening for personality disorder in incarcerated adolescent boys: preliminary validation of an adolescent version of the standardised assessment of personality – abbreviated scale (SAPAS-AV

    Directory of Open Access Journals (Sweden)

    Kongerslev Mickey

    2012-07-01

    Full Text Available Abstract Background Personality disorder (PD is associated with significant functional impairment and an elevated risk of violent and suicidal behaviour. The prevalence of PD in populations of young offenders is likely to be high. However, because the assessment of PD is time-consuming, it is not routinely assessed in this population. A brief screen for the identification of young people who might warrant further detailed assessment of PD could be particularly valuable for clinicians and researchers working in juvenile justice settings. Method We adapted a rapid screen for the identification of PD in adults (Standardised Assessment of Personality – Abbreviated Scale; SAPAS for use with adolescents and then carried out a study of the reliability and validity of the adapted instrument in a sample of 80 adolescent boys in secure institutions. Participants were administered the screen and shortly after an established diagnostic interview for DSM-IV PDs. Nine days later the screen was readministered. Results A score of 3 or more on the screening interview correctly identified the presence of DSM-IV PD in 86% of participants, yielding a sensitivity and specificity of 0.87 and 0.86 respectively. Internal consistency was modest but comparable to the original instrument. 9-days test-retest reliability for the total score was excellent. Convergent validity correlations with the total number of PD criteria were large. Conclusion This study provides preliminary evidence of the validity, reliability, and usefulness of the screen in secure institutions for adolescent male offenders. It can be used in juvenile offender institutions with limited resources, as a brief, acceptable, staff-administered routine screen to identify individuals in need of further assessment of PD or by researchers conducting epidemiological surveys.

  6. Validation of an abbreviated version of the structured interview of reported symptoms in outpatient psychiatric and community settings.

    Science.gov (United States)

    Green, Debbie; Rosenfeld, Barry; Dole, Tia; Pivovarova, Ekaterina; Zapf, Patricia A

    2008-04-01

    This study examined the effectiveness of an abbreviated version of the Structured Interview of Reported Symptoms (SIRS-A) in identifying malingered mental illness. The SIRS-A is comprised of 69 items drawn from the SIRS (R. Rogers et al. 1992, SIRS: Structured Interview of Reported Symptoms: Professional Manual. Odessa, FL: Psychological Assessment Resources, Inc.), substantially reducing the administration time. A simulation design was used with three samples; 87 psychiatric outpatients who responded honestly were compared to 29 community-dwelling adults and 24 psychiatric patients instructed to malinger psychopathology. The SIRS-A generated sensitivity comparable to or exceeding that of the SIRS normative data, but specificity was poorer; many genuinely impaired patients were misclassified as malingering. Although these findings suggest the SIRS-A may be an effective means to assess malingering in psychiatric populations, further research assessing the reasons for the elevated false positive rates is necessary.

  7. An Abbreviated Protocol for In Vitro Generation of Functional Human Embryonic Stem Cell-Derived Beta-Like Cells

    DEFF Research Database (Denmark)

    Massumi, Mohammad; Pourasgari, Farzaneh; Nalla, Amarnadh

    2016-01-01

    developed an abbreviated five-stage protocol (25-30 days) to generate human Embryonic Stem Cell-Derived Beta-like Cells (ES-DBCs). We showed that Geltrex, as an extracellular matrix, could support the generation of ES-DBCs more efficiently than that of the previously described culture systems......The ability to yield glucose-responsive pancreatic beta-cells from human pluripotent stem cells in vitro will facilitate the development of the cell replacement therapies for the treatment of Type 1 Diabetes. Here, through the sequential in vitro targeting of selected signaling pathways, we have...... positive cells, 1% insulin and glucagon positive cells and 30% insulin and NKX6.1 co-expressing cells. Functionally, ES-DBCs were responsive to high glucose in static incubation and perifusion studies, and could secrete insulin in response to successive glucose stimulations. Mitochondrial metabolic flux...

  8. Dangers of Texting While Driving

    Science.gov (United States)

    ... nhtsa.gov/risky-driving/distracted-driving . Print Out Texting While Driving Guide (pdf) File a Complaint with the FCC ... Office: Consumer and Governmental Affairs Tags: Consumers - Distracted Driving - Health and Safety - Texting Federal Communications Commission 445 12th Street SW, Washington, ...

  9. Text analysis for knowledge graphs

    NARCIS (Netherlands)

    Popping, Roel

    2007-01-01

    The concept of knowledge graphs is introduced as a method to represent the state of the art in a specific scientific discipline. Next the text analysis part in the construction of such graphs is considered. Here the 'translation' from text to graph takes place. The method that is used here is compar

  10. Text Retrieval on a Microcomputer.

    Science.gov (United States)

    Giordano, Richard; And Others

    1988-01-01

    Presents description of the Generalized Automatic Text Organization and Retrieval system (GATOR), a database system that indexes and retrieves information from machine-readable texts such as interviews and case histories. Qualitative and quantitative analyses are discussed, and integrating GATOR with standard statistical packages is described.…

  11. Cluster Based Text Classification Model

    DEFF Research Database (Denmark)

    2011-01-01

    We propose a cluster based classification model for suspicious email detection and other text classification tasks. The text classification tasks comprise many training examples that require a complex classification model. Using clusters for classification makes the model simpler and increases th...... datasets. Our model also outperforms A Decision Cluster Classification (ADCC) and the Decision Cluster Forest Classification (DCFC) models on the Reuters-21578 dataset....

  12. Strategies for Translating Vocative Texts

    Directory of Open Access Journals (Sweden)

    Olga COJOCARU

    2014-12-01

    Full Text Available The paper deals with the linguistic and cultural elements of vocative texts and the techniques used in translating them by giving some examples of texts that are typically vocative (i.e. advertisements and instructions for use. Semantic and communicative strategies are popular in translation studies and each of them has its own advantages and disadvantages in translating vocative texts. The advantage of semantic translation is that it takes more account of the aesthetic value of the SL text, while communicative translation attempts to render the exact contextual meaning of the original text in such a way that both content and language are readily acceptable and comprehensible to the readership. Focus is laid on the strategies used in translating vocative texts, strategies that highlight and introduce a cultural context to the target audience, in order to achieve their overall purpose, that is to sell or persuade the reader to behave in a certain way. Thus, in order to do that, a number of advertisements from the field of cosmetics industry and electronic gadgets were selected for analysis. The aim is to gather insights into vocative text translation and to create new perspectives on this field of research, now considered a process of innovation and diversion, especially in areas as important as economy and marketing.

  13. Improve Reading with Complex Texts

    Science.gov (United States)

    Fisher, Douglas; Frey, Nancy

    2015-01-01

    The Common Core State Standards have cast a renewed light on reading instruction, presenting teachers with the new requirements to teach close reading of complex texts. Teachers and administrators should consider a number of essential features of close reading: They are short, complex texts; rich discussions based on worthy questions; revisiting…

  14. Text Genres in Information Organization

    Science.gov (United States)

    Nahotko, Marek

    2016-01-01

    Introduction: Text genres used by so-called information organizers in the processes of information organization in information systems were explored in this research. Method: The research employed text genre socio-functional analysis. Five genre groups in information organization were distinguished. Every genre group used in information…

  15. Linguistic Dating of Biblical Texts

    DEFF Research Database (Denmark)

    Ehrensvärd, Martin Gustaf

    2003-01-01

    For two centuries, scholars have pointed to consistent differences in the Hebrew of certain biblical texts and interpreted these differences as reflecting the date of composition of the texts. Until the 1980s, this was quite uncontroversial as the linguistic findings largely confirmed...... the chronology of the texts established by other means: the Hebrew of Genesis-2 Kings was judged to be early and that of Esther, Daniel, Ezra, Nehemiah, and Chronicles to be late. In the current debate where revisionists have questioned the traditional dating, linguistic arguments in the dating of texts have...... come more into focus. The study critically examines some linguistic arguments adduced to support the traditional position, and reviewing the arguments it points to weaknesses in the linguistic dating of EBH texts to pre-exilic times. When viewing the linguistic evidence in isolation it will be clear...

  16. Text mining for systems biology.

    Science.gov (United States)

    Fluck, Juliane; Hofmann-Apitius, Martin

    2014-02-01

    Scientific communication in biomedicine is, by and large, still text based. Text mining technologies for the automated extraction of useful biomedical information from unstructured text that can be directly used for systems biology modelling have been substantially improved over the past few years. In this review, we underline the importance of named entity recognition and relationship extraction as fundamental approaches that are relevant to systems biology. Furthermore, we emphasize the role of publicly organized scientific benchmarking challenges that reflect the current status of text-mining technology and are important in moving the entire field forward. Given further interdisciplinary development of systems biology-orientated ontologies and training corpora, we expect a steadily increasing impact of text-mining technology on systems biology in the future.

  17. Impact of abbreviated lecture with interactive mini-cases vs traditional lecture on student performance in the large classroom.

    Science.gov (United States)

    Marshall, Leisa L; Nykamp, Diane L; Momary, Kathryn M

    2014-12-15

    To compare the impact of 2 different teaching and learning methods on student mastery of learning objectives in a pharmacotherapy module in the large classroom setting. Two teaching and learning methods were implemented and compared in a required pharmacotherapy module for 2 years. The first year, multiple interactive mini-cases with inclass individual assessment and an abbreviated lecture were used to teach osteoarthritis; a traditional lecture with 1 inclass case discussion was used to teach gout. In the second year, the same topics were used but the methods were flipped. Student performance on pre/post individual readiness assessment tests (iRATs), case questions, and subsequent examinations were compared each year by the teaching and learning method and then between years by topic for each method. Students also voluntarily completed a 20-item evaluation of the teaching and learning methods. Postpresentation iRATs were significantly higher than prepresentation iRATs for each topic each year with the interactive mini-cases; there was no significant difference in iRATs before and after traditional lecture. For osteoarthritis, postpresentation iRATs after interactive mini-cases in year 1 were significantly higher than postpresentation iRATs after traditional lecture in year 2; the difference in iRATs for gout per learning method was not significant. The difference between examination performance for osteoarthritis and gout was not significant when the teaching and learning methods were compared. On the student evaluations, 2 items were significant both years when answers were compared by teaching and learning method. Each year, students ranked their class participation higher with interactive cases than with traditional lecture, but both years they reported enjoying the traditional lecture format more. Multiple interactive mini-cases with an abbreviated lecture improved immediate mastery of learning objectives compared to a traditional lecture format, regardless of

  18. Myeloproliferative neoplasm (MPN) symptom assessment form total symptom score: Prospective international assessment of an abbreviated symptom burden scoring system among patients with MPNs

    NARCIS (Netherlands)

    R.M. Emanuel (Robyn); A.C. Dueck (Amylou); H.L. Geyer (Holly); J.J. Kiladjian; S. Slot (Stefanie); S. Zweegman (Sonja); P.A.W. te Boekhorst (Peter); S. Commandeur (Suzan); H. Schouten (Harry); F. Sackmann (Federico); A.K. Fuentes (Ana Kerguelen); D. Hernández-Maraver (Dolores); C. Pahl (Clemens); M. Griesshammer (Martin); F. Stegelmann (Frank); K. Doehner (Konstanze); T. Lehmann (Thomas); K. Bonatz (Karin); A. Reiter (Alfred); F. Boyer (Francoise); J. Etienne (Jerome); J.-C. Ianotto (Jean-Christophe); D. Ranta (Dana); L. Roy (Lydia); J.-Y. Cahn (Jean-Yves); C.N. Harrison (Claire); D. Radia (Deepti); P. Muxi (Pablo); N. Maldonado (Norman); C. Besses (Carlos); F. Cervantes (Francisco); P.L. Johansson (Peter); T. Barbui (Tiziano); G. Barosi (Giovanni); A.M. Vannucchi (Alessandro); F. Passamonti (Francesco); B. Andreasson (Bjorn); M.L. Ferarri (Maria); A. Rambaldi (Alessandro); J. Samuelsson (Jan); G. Birgegard (Gunnar); A. Tefferi (Ayalew); A.A. Mesa

    2012-01-01

    textabstractPurpose: Myeloproliferative neoplasm (MPN) symptoms are troublesome to patients, and alleviation of this burden represents a paramount treatment objective in the development of MPN-directed therapies. We aimed to assess the utility of an abbreviated symptom score for the most pertinent a

  19. 21 CFR 314.107 - Effective date of approval of a 505(b)(2) application or abbreviated new drug application under...

    Science.gov (United States)

    2010-04-01

    ... introduction into interstate commerce when approval of the application or abbreviated application for the drug... for 5 years of exclusive marketing under § 314.108(b)(2) and the patent owner or its representative or... application first commences commercial marketing of its drug product; or (ii) The date of a decision of...

  20. The Self-report Standardized Assessment of Personality-abbreviated Scale: Preliminary results of a brief screening test for personality disorders

    NARCIS (Netherlands)

    Germans, S.; Heck, G.L. van; Moran, P.; Hodiamont, P.P.G.

    2009-01-01

    Objective The internal consistency, test-retest reliability and validity of the Self-report Standardized Assessment of Personality-abbreviated Scale (SAPAS-SR) as a screening instrument for personality disorders were studied in a random sample of 195 Dutch psychiatric outpatients, using the Structu

  1. Myeloproliferative neoplasm (MPN) symptom assessment form total symptom score: Prospective international assessment of an abbreviated symptom burden scoring system among patients with MPNs

    NARCIS (Netherlands)

    R.M. Emanuel (Robyn); A.C. Dueck (Amylou); H.L. Geyer (Holly); J.J. Kiladjian; S. Slot (Stefanie); S. Zweegman (Sonja); P.A.W. te Boekhorst (Peter); S. Commandeur (Suzan); H. Schouten (Harry); F. Sackmann (Federico); A.K. Fuentes (Ana Kerguelen); D. Hernández-Maraver (Dolores); C. Pahl (Clemens); M. Griesshammer (Martin); F. Stegelmann (Frank); K. Doehner (Konstanze); T. Lehmann (Thomas); K. Bonatz (Karin); A. Reiter (Alfred); F. Boyer (Francoise); J. Etienne (Jerome); J.-C. Ianotto (Jean-Christophe); D. Ranta (Dana); L. Roy (Lydia); J.-Y. Cahn (Jean-Yves); C.N. Harrison (Claire); D. Radia (Deepti); P. Muxi (Pablo); N. Maldonado (Norman); C. Besses (Carlos); F. Cervantes (Francisco); P.L. Johansson (Peter); T. Barbui (Tiziano); G. Barosi (Giovanni); A.M. Vannucchi (Alessandro); F. Passamonti (Francesco); B. Andreasson (Bjorn); M.L. Ferarri (Maria); A. Rambaldi (Alessandro); J. Samuelsson (Jan); G. Birgegard (Gunnar); A. Tefferi (Ayalew); A.A. Mesa

    2012-01-01

    textabstractPurpose: Myeloproliferative neoplasm (MPN) symptoms are troublesome to patients, and alleviation of this burden represents a paramount treatment objective in the development of MPN-directed therapies. We aimed to assess the utility of an abbreviated symptom score for the most pertinent

  2. Text structures in medical text processing: empirical evidence and a text understanding prototype.

    Science.gov (United States)

    Hahn, U; Romacker, M

    1997-01-01

    We consider the role of textual structures in medical texts. In particular, we examine the impact the lacking recognition of text phenomena has on the validity of medical knowledge bases fed by a natural language understanding front-end. First, we review the results from an empirical study on a sample of medical texts considering, in various forms of local coherence phenomena (anaphora and textual ellipses). We then discuss the representation bias emerging in the text knowledge base that is likely to occur when these phenomena are not dealt with--mainly the emergence of referentially incoherent and invalid representations. We then turn to a medical text understanding system designed to account for local text coherence.

  3. Text Analytics to Data Warehousing

    Directory of Open Access Journals (Sweden)

    Kalli Srinivasa Nageswara Prasad

    2010-09-01

    Full Text Available Information hidden or stored in unstructured data can play a critical role in making decisions, understanding and conducting other business functions. Integrating data stored in both structured and unstructured formats can add significant value to an organization. With the extent of development happening in Text Mining and technologies to deal with unstructured and semi structured data like XML and MML(Mining Markup Language to extract and analyze data, textanalytics has evolved to handle unstructured data to helps unlock and predict business results via Business Intelligence and Data Warehousing. Text mining involves dealing with texts in documents and discovering hidden patterns, but Text Analytics enhances InformationRetrieval in form of search and enabling clustering of results and more over Text Analytics is text mining and visualization. In this paper we would discuss on handling unstructured data that are in documents so that they fit into business applications like Data Warehouses for further analysis and it helps in the framework we have used for the solution.

  4. Biomarker Identification Using Text Mining

    Directory of Open Access Journals (Sweden)

    Hui Li

    2012-01-01

    Full Text Available Identifying molecular biomarkers has become one of the important tasks for scientists to assess the different phenotypic states of cells or organisms correlated to the genotypes of diseases from large-scale biological data. In this paper, we proposed a text-mining-based method to discover biomarkers from PubMed. First, we construct a database based on a dictionary, and then we used a finite state machine to identify the biomarkers. Our method of text mining provides a highly reliable approach to discover the biomarkers in the PubMed database.

  5. Anomaly Detection with Text Mining

    Data.gov (United States)

    National Aeronautics and Space Administration — Many existing complex space systems have a significant amount of historical maintenance and problem data bases that are stored in unstructured text forms. The...

  6. Text Steganographic Approaches: A Comparison

    Directory of Open Access Journals (Sweden)

    Monika Agarwal

    2013-02-01

    Full Text Available This paper presents three novel approaches of text steganography. The first approach uses the theme ofmissing letter puzzle where each character of message is hidden by missing one or more letters in a wordof cover. The average Jaro score was found to be 0.95 indicating closer similarity between cover andstego file. The second approach hides a message in a wordlist where ASCII value of embedded characterdetermines length and starting letter of a word. The third approach conceals a message, withoutdegrading cover, by using start and end letter of words of the cover. For enhancing the security of secretmessage, the message is scrambled using one-time pad scheme before being concealed and cipher text isthen concealed in cover. We also present an empirical comparison of the proposed approaches with someof the popular text steganographic approaches and show that our approaches outperform the existingapproaches.

  7. Adaptive Personality Recogntion from Text

    OpenAIRE

    Celli, Fabio

    2012-01-01

    We address the issue of domain adaptation for automatic Personality Recognition from Text (PRT). The PRT task consists in the classification of the personality traits of some authors, given some pieces of text they wrote. The purpose of our work is to improve current approaches to PRT in order to extract personality information from social network sites, which is a really challenging task. We argue that current approaches, based on supervised learning, have several limitations for th...

  8. Functional Stylistics and Peripeteic Texts

    DEFF Research Database (Denmark)

    Borchmann, Simon

    2008-01-01

    Using a pragmatically based linguistic description apparatus on literary use of language is not unproblematic. Observations show that literary use of language violates the norms contained by this apparatus. With this paper I suggest how we can deal with this problem by setting up a frame for the ...... for the use of a functional linguistic description apparatus on literary texts. As an extension of this suggestion I present a model for describing a specific type of literary texts....

  9. Text messaging during simulated driving.

    Science.gov (United States)

    Drews, Frank A; Yazdani, Hina; Godfrey, Celeste N; Cooper, Joel M; Strayer, David L

    2009-10-01

    This research aims to identify the impact of text messaging on simulated driving performance. In the past decade, a number of on-road, epidemiological, and simulator-based studies reported the negative impact of talking on a cell phone on driving behavior. However, the impact of text messaging on simulated driving performance is still not fully understood. Forty participants engaged in both a single task (driving) and a dual task (driving and text messaging) in a high-fidelity driving simulator. Analysis of driving performance revealed that participants in the dual-task condition responded more slowly to the onset of braking lights and showed impairments in forward and lateral control compared with a driving-only condition. Moreover, text-messaging drivers were involved in more crashes than drivers not engaged in text messaging. Text messaging while driving has a negative impact on simulated driving performance. This negative impact appears to exceed the impact of conversing on a cell phone while driving. The results increase our understanding of driver distraction and have potential implications for public safety and device development.

  10. A comparison study on algorithms of detecting long forms for short forms in biomedical text

    Directory of Open Access Journals (Sweden)

    Wu Cathy H

    2007-11-01

    Full Text Available Abstract Motivation With more and more research dedicated to literature mining in the biomedical domain, more and more systems are available for people to choose from when building literature mining applications. In this study, we focus on one specific kind of literature mining task, i.e., detecting definitions of acronyms, abbreviations, and symbols in biomedical text. We denote acronyms, abbreviations, and symbols as short forms (SFs and their corresponding definitions as long forms (LFs. The study was designed to answer the following questions; i how well a system performs in detecting LFs from novel text, ii what the coverage is for various terminological knowledge bases in including SFs as synonyms of their LFs, and iii how to combine results from various SF knowledge bases. Method We evaluated the following three publicly available detection systems in detecting LFs for SFs: i a handcrafted pattern/rule based system by Ao and Takagi, ALICE, ii a machine learning system by Chang et al., and iii a simple alignment-based program by Schwartz and Hearst. In addition, we investigated the conceptual coverage of two terminological knowledge bases: i the UMLS (the Unified Medical Language System, and ii the BioThesaurus (a thesaurus of names for all UniProt protein records. We also implemented a web interface that provides a virtual integration of various SF knowledge bases. Results We found that detection systems agree with each other on most cases, and the existing terminological knowledge bases have a good coverage of synonymous relationship for frequently defined LFs. The web interface allows people to detect SF definitions from text and to search several SF knowledge bases. Availability The web site is http://gauss.dbb.georgetown.edu/liblab/SFThesaurus.

  11. Analysing ESP Texts, but How?

    Directory of Open Access Journals (Sweden)

    Borza Natalia

    2015-03-01

    Full Text Available English as a second language (ESL teachers instructing general English and English for specific purposes (ESP in bilingual secondary schools face various challenges when it comes to choosing the main linguistic foci of language preparatory courses enabling non-native students to study academic subjects in English. ESL teachers intending to analyse English language subject textbooks written for secondary school students with the aim of gaining information about what bilingual secondary school students need to know in terms of language to process academic textbooks cannot avoiding deal with a dilemma. It needs to be decided which way it is most appropriate to analyse the texts in question. Handbooks of English applied linguistics are not immensely helpful with regard to this problem as they tend not to give recommendation as to which major text analytical approaches are advisable to follow in a pre-college setting. The present theoretical research aims to address this lacuna. Respectively, the purpose of this pedagogically motivated theoretical paper is to investigate two major approaches of ESP text analysis, the register and the genre analysis, in order to find the more suitable one for exploring the language use of secondary school subject texts from the point of view of an English as a second language teacher. Comparing and contrasting the merits and limitations of the two contrastive approaches allows for a better understanding of the nature of the two different perspectives of text analysis. The study examines the goals, the scope of analysis, and the achievements of the register perspective and those of the genre approach alike. The paper also investigates and reviews in detail the starkly different methods of ESP text analysis applied by the two perspectives. Discovering text analysis from a theoretical and methodological angle supports a practical aspect of English teaching, namely making an informed choice when setting out to analyse

  12. USADA BHUDA KACAPI: BALINESE TRADITIONAL THERAPY (USADA LITERARY TEXT

    Directory of Open Access Journals (Sweden)

    I Ketut Jirnaya

    2015-01-01

    Full Text Available Usada Budha Kacapi (abbreviated to UBK text, which contains the basic Balinese traditional therapy, is a text which is in the form of narration. The Balinese traditional therapy (usada texts generally contain collections of names of diseases, medicinal substances, and how to cure such diseases; however, the UBK is in the form of narration, containing characters, setting, themes, and literary language. The UBK text, after being edited, is recorded in a number of palm-leaf manuscripts. The title is the same but the content varies. Budha Kecapi is the main character, which has inspired many other writers; therefore, the works produced still use the same language units as used by Budha Kacapi. Such works are Budha Kacapi Cemeng, Budha Kacapi Putih, and Budha Kacapi Sastrasanga . It is this which has inspired the researcher to explore the UBK in order to know who and what Budha Kacapi is. In order to be able to identify the message transmitted to the reader or the community, and its totality, it is necessary to know, understand, and analyze the signs it contains. Therefore, two theories are used in this study; they are the theory of intertextuality and the theory of semiotics. The results of analysis show that the writers wish to teach and guide those who desire to be professional indigenous medical practitioners ‘dukun’, namely, the ones who are highly knowledgeable of traditional therapy, ethical and not easily defeated by diseases. That, according to Budha Kacapi, can be achieved through ‘yogasastra’. The indigenous medical practitioners should improve their quality through yoga (meditation and aksara suci (holy scripts as the means. A set of learning materials related to the basic knowledge needed by the indigenous medical practitioners are systematically organized, starting from how to recruit the prospective learners, the learning method, how to diagnose (nenger, the philosophy of life and death, the philosophy of diseases, the concept

  13. Assessment of the effects and limitations of the 1998 to 2008 Abbreviated Injury Scale map using a large population-based dataset

    Directory of Open Access Journals (Sweden)

    Franklyn Melanie

    2011-01-01

    Full Text Available Abstract Background Trauma systems should consistently monitor a given trauma population over a period of time. The Abbreviated Injury Scale (AIS and derived scores such as the Injury Severity Score (ISS are commonly used to quantify injury severities in trauma registries. To reflect contemporary trauma management and treatment, the most recent version of the AIS (AIS08 contains many codes which differ in severity from their equivalents in the earlier 1998 version (AIS98. Consequently, the adoption of AIS08 may impede comparisons between data coded using different AIS versions. It may also affect the number of patients classified as major trauma. Methods The entire AIS98-coded injury dataset of a large population based trauma registry was retrieved and mapped to AIS08 using the currently available AIS98-AIS08 dictionary map. The percentage of codes which had increased or decreased in severity, or could not be mapped, was examined in conjunction with the effect of these changes to the calculated ISS. The potential for free text information accompanying AIS coding to improve the quality of AIS mapping was explored. Results A total of 128280 AIS98-coded injuries were evaluated in 32134 patients, 15471 patients of whom were classified as major trauma. Although only 4.5% of dictionary codes decreased in severity from AIS98 to AIS08, this represented almost 13% of injuries in the registry. In 4.9% of patients, no injuries could be mapped. ISS was potentially unreliable in one-third of patients, as they had at least one AIS98 code which could not be mapped. Using AIS08, the number of patients classified as major trauma decreased by between 17.3% and 30.3%. Evaluation of free text descriptions for some injuries demonstrated the potential to improve mapping between AIS versions. Conclusions Converting AIS98-coded data to AIS08 results in a significant decrease in the number of patients classified as major trauma. Many AIS98 codes are missing from the

  14. Princess Brambilla - images/text

    Directory of Open Access Journals (Sweden)

    Maria Aparecida Barbosa

    2016-01-01

    Full Text Available Read the illustrated literary text is simultaneously think pictures and words. This articulation between the written text and pictures adds potential, expands and becomes complex. Coincides with nowadays discussions on Giorgio Agamben's "contemporary" that add to what adheres to respectively time the displacement and the distance needed to understand it, shakes linear notions of historical chronology. Somehow the coincidence is related to the current interest in the concept of "Nachleben" (survival, which assumes the images of the past ransom, postulated by the art historian Aby Warburg in a research on ancient art of motion characteristics in Renaissance pictures Botticelli's. For the translation of the Princesa Brambilla – um capriccio segundo Jakob Callot, de E. T. A. Hoffmann, com 8 gravuras cunhadas a partir de moldes originais de Callot (1820 to Portuguese such discussions were fundamental, as I try to present in this article.

  15. Text Recognition from an Image

    Directory of Open Access Journals (Sweden)

    Shrinath Janvalkar

    2014-04-01

    Full Text Available To achieve high speed in data processing it is necessary to convert the analog data into digital data. Storage of hard copy of any document occupies large space and retrieving of information from that document is time consuming. Optical character recognition system is an effective way in recognition of printed character. It provides an easy way to recognize and convert the printed text on image into the editable text. It also increases the speed of data retrieval from the image. The image which contains characters can be scanned through scanner and then recognition engine of the OCR system interpret the images and convert images of printed characters into machine-readable characters [8].It improving the interface between man and machine in many applications

  16. Quality Inspection of Printed Texts

    DEFF Research Database (Denmark)

    Pedersen, Jesper Ballisager; Nasrollahi, Kamal; Moeslund, Thomas B.

    2016-01-01

    Inspecting the quality of printed texts has its own importance in many industrial applications. To do so, this paper proposes a grading system which evaluates the performance of the printing task using some quality measures for each character and symbols. The purpose of these grading system is two......-folded: for costumers of the printing and verification system, the overall grade used to verify if the text is of sufficient quality, while for printer's manufacturer, the detailed character/symbols grades and quality measurements are used for the improvement and optimization of the printing task. The proposed system...

  17. Identifying issue frames in text.

    Directory of Open Access Journals (Sweden)

    Eyal Sagi

    Full Text Available Framing, the effect of context on cognitive processes, is a prominent topic of research in psychology and public opinion research. Research on framing has traditionally relied on controlled experiments and manually annotated document collections. In this paper we present a method that allows for quantifying the relative strengths of competing linguistic frames based on corpus analysis. This method requires little human intervention and can therefore be efficiently applied to large bodies of text. We demonstrate its effectiveness by tracking changes in the framing of terror over time and comparing the framing of abortion by Democrats and Republicans in the U.S.

  18. A Guide Text or Many Texts? "That is the Question”

    Directory of Open Access Journals (Sweden)

    Delgado de Valencia Sonia

    2001-08-01

    Full Text Available The use of supplementary materials in the classroom has always been an essential part of the teaching and learning process. To restrict our teaching to the scope of one single textbook means to stand behind the advances of knowledge, in any area and context. Young learners appreciate any new and varied support that expands their knowledge of the world: diaries, letters, panels, free texts, magazines, short stories, poems or literary excerpts, and articles taken from Internet are materials that will allow learnersto share more and work more collaboratively. In this article we are going to deal with some of these materials, with the criteria to select, adapt, and create them that may be of interest to the learner and that may promote reading and writing processes. Since no text can entirely satisfy the needs of students and teachers, the creativity of both parties will be necessary to improve the quality of teaching through the adequate use and adaptation of supplementary materials.

  19. Reviving "Walden": Mining the Text.

    Science.gov (United States)

    Hewitt Julia

    2000-01-01

    Describes how the author and her high school English students begin their study of Thoreau's "Walden" by mining the text for quotations to inspire their own writing and discussion on the topic, "How does Thoreau speak to you or how could he speak to someone you know?" (SR)

  20. Reviving "Walden": Mining the Text.

    Science.gov (United States)

    Hewitt Julia

    2000-01-01

    Describes how the author and her high school English students begin their study of Thoreau's "Walden" by mining the text for quotations to inspire their own writing and discussion on the topic, "How does Thoreau speak to you or how could he speak to someone you know?" (SR)

  1. The Return of the Text.

    Science.gov (United States)

    Patrikis, Peter

    2002-01-01

    In celebration of the work of Claire Kramsch, this article affirms her promotion of the literary text "to enrich and enliven the classroom, making the act of reading reflective and self-reflective, and creating a common culture of interpretation and debate within each classroom." (Author/VWL)

  2. Text as an Autopoietic System

    DEFF Research Database (Denmark)

    Nicolaisen, Maria Skou

    2016-01-01

    The aim of the present research article is to discuss the possibilities and limitations in addressing text as an autopoietic system. The theory of autopoiesis originated in the field of biology in order to explain the dynamic processes entailed in sustaining living organisms at cellular level...

  3. Multilingual text induced spelling correction

    NARCIS (Netherlands)

    Reynaert, M.W.C.

    2004-01-01

    We present TISC, a multilingual, language-independent and context-sensitive spelling checking and correction system designed to facilitate the automatic removal of non-word spelling errors in large corpora. Its lexicon is derived from raw text corpora, without supervision, and contains word unigrams

  4. Seductive Texts with Serious Intentions.

    Science.gov (United States)

    Nielsen, Harriet Bjerrum

    1995-01-01

    Debates whether a text claiming to have scientific value is using seduction irresponsibly at the expense of the truth, and discusses who is the subject and who is the object of such seduction. It argues that, rather than being an assault against scientific ethics, seduction is a necessary premise for a sensible conversation to take place. (GR)

  5. Comparison of Text Categorization Algorithms

    Institute of Scientific and Technical Information of China (English)

    SHI Yong-feng; ZHAO Yan-ping

    2004-01-01

    This paper summarizes several automatic text categorization algorithms in common use recently, analyzes and compares their advantages and disadvantages.It provides clues for making use of appropriate automatic classifying algorithms in different fields.Finally some evaluations and summaries of these algorithms are discussed, and directions to further research have been pointed out.

  6. Basic Chad Arabic: Comprehension Texts.

    Science.gov (United States)

    Absi, Samir Abu; Sinaud, Andre

    This text, principally designed for use in a three-volume course on Chad Arabic, complements the pre-speech and active phases of the course in that it provides the answers to comprehension exercises students are required to complete during the course. The comprehension exercises require that students listen to an instructor or tape and write…

  7. COMPENDEX/TEXT-PAC: CIS.

    Science.gov (United States)

    Standera, Oldrich

    This report evaluates the engineering information services provided by the University of Calgary since implementation of the COMPENDEX (tape service of Engineering Index, Inc.) service using the IBM TEXT-PAC system. Evaluation was made by a survey of the users of the Current Information Selection (CIS) service, the interaction between the system…

  8. Functional Stylistics and Peripeteic Texts

    DEFF Research Database (Denmark)

    Borchmann, Simon

    2008-01-01

    Using a pragmatically based linguistic description apparatus on literary use of language is not unproblematic. Observations show that literary use of language violates the norms contained by this apparatus. With this paper I suggest how we can deal with this problem by setting up a frame for the ...... for the use of a functional linguistic description apparatus on literary texts. As an extension of this suggestion I present a model for describing a specific type of literary texts.......Using a pragmatically based linguistic description apparatus on literary use of language is not unproblematic. Observations show that literary use of language violates the norms contained by this apparatus. With this paper I suggest how we can deal with this problem by setting up a frame...

  9. Text Segmentation Using Exponential Models

    CERN Document Server

    Beeferman, D; Lafferty, G D; Beeferman, Doug; Berger, Adam; Lafferty, John

    1997-01-01

    This paper introduces a new statistical approach to partitioning text automatically into coherent segments. Our approach enlists both short-range and long-range language models to help it sniff out likely sites of topic changes in text. To aid its search, the system consults a set of simple lexical hints it has learned to associate with the presence of boundaries through inspection of a large corpus of annotated data. We also propose a new probabilistically motivated error metric for use by the natural language processing and information retrieval communities, intended to supersede precision and recall for appraising segmentation algorithms. Qualitative assessment of our algorithm as well as evaluation using this new metric demonstrate the effectiveness of our approach in two very different domains, Wall Street Journal articles and the TDT Corpus, a collection of newswire articles and broadcast news transcripts.

  10. Locative inferences in medical texts.

    Science.gov (United States)

    Mayer, P S; Bailey, G H; Mayer, R J; Hillis, A; Dvoracek, J E

    1987-06-01

    Medical research relies on epidemiological studies conducted on a large set of clinical records that have been collected from physicians recording individual patient observations. These clinical records are recorded for the purpose of individual care of the patient with little consideration for their use by a biostatistician interested in studying a disease over a large population. Natural language processing of clinical records for epidemiological studies must deal with temporal, locative, and conceptual issues. This makes text understanding and data extraction of clinical records an excellent area for applied research. While much has been done in making temporal or conceptual inferences in medical texts, parallel work in locative inferences has not been done. This paper examines the locative inferences as well as the integration of temporal, locative, and conceptual issues in the clinical record understanding domain by presenting an application that utilizes two key concepts in its parsing strategy--a knowledge-based parsing strategy and a minimal lexicon.

  11. Text as an Autopoietic System

    DEFF Research Database (Denmark)

    Nicolaisen, Maria Skou

    2016-01-01

    The aim of the present research article is to discuss the possibilities and limitations in addressing text as an autopoietic system. The theory of autopoiesis originated in the field of biology in order to explain the dynamic processes entailed in sustaining living organisms at cellular level. Th....... By comparing the biological with the textual account of autopoietic agency, the end conclusion is that a newly derived concept of sociopoiesis might be better suited for discussing the architecture of textual systems....

  12. Survey on Text Document Clustering

    OpenAIRE

    M.Thangamani; Dr.P.Thangaraj

    2010-01-01

    Document clustering is also referred as text clustering, and its concept is merely equal to data clustering. It is hardly difficult to find the selective information from an ‘N’number of series information, so that document clustering came into picture. Basically cluster means a group of similar data, document clustering means segregating the data into different groups of similar data. Clustering can be of mathematical, statistical or numerical domain. Clustering is a fundamental data analysi...

  13. Individual Profiling Using Text Analysis

    Science.gov (United States)

    2016-04-15

    implemented using Latent Dirichlet Allocation (LDA) [3], and n–gram language models were used to extract features to train Support Vector Machine (SVM... extraversion , agreeableness, and neuroticism. 3 Methods, Assumptions and Procedures 3.1 Data and Preprocessing The text provided proved to be quite clean and...assess the affect of various features a 10-fold cross validation was performed on the training data. 3 n–gram language model Throughout early experiments

  14. Learning Context for Text Categorization

    CERN Document Server

    Haribhakta, Y V

    2011-01-01

    This paper describes our work which is based on discovering context for text document categorization. The document categorization approach is derived from a combination of a learning paradigm known as relation extraction and an technique known as context discovery. We demonstrate the effectiveness of our categorization approach using reuters 21578 dataset and synthetic real world data from sports domain. Our experimental results indicate that the learned context greatly improves the categorization performance as compared to traditional categorization approaches.

  15. Text Mining for Protein Docking.

    Directory of Open Access Journals (Sweden)

    Varsha D Badal

    2015-12-01

    Full Text Available The rapidly growing amount of publicly available information from biomedical research is readily accessible on the Internet, providing a powerful resource for predictive biomolecular modeling. The accumulated data on experimentally determined structures transformed structure prediction of proteins and protein complexes. Instead of exploring the enormous search space, predictive tools can simply proceed to the solution based on similarity to the existing, previously determined structures. A similar major paradigm shift is emerging due to the rapidly expanding amount of information, other than experimentally determined structures, which still can be used as constraints in biomolecular structure prediction. Automated text mining has been widely used in recreating protein interaction networks, as well as in detecting small ligand binding sites on protein structures. Combining and expanding these two well-developed areas of research, we applied the text mining to structural modeling of protein-protein complexes (protein docking. Protein docking can be significantly improved when constraints on the docking mode are available. We developed a procedure that retrieves published abstracts on a specific protein-protein interaction and extracts information relevant to docking. The procedure was assessed on protein complexes from Dockground (http://dockground.compbio.ku.edu. The results show that correct information on binding residues can be extracted for about half of the complexes. The amount of irrelevant information was reduced by conceptual analysis of a subset of the retrieved abstracts, based on the bag-of-words (features approach. Support Vector Machine models were trained and validated on the subset. The remaining abstracts were filtered by the best-performing models, which decreased the irrelevant information for ~ 25% complexes in the dataset. The extracted constraints were incorporated in the docking protocol and tested on the Dockground unbound

  16. The Balinese Unicode Text Processing

    Directory of Open Access Journals (Sweden)

    Imam Habibi

    2009-06-01

    Full Text Available In principal, the computer only recognizes numbers as the representation of a character. Therefore, there are many encoding systems to allocate these numbers although not all characters are covered. In Europe, every single language even needs more than one encoding system. Hence, a new encoding system known as Unicode has been established to overcome this problem. Unicode provides unique id for each different characters which does not depend on platform, program, and language. Unicode standard has been applied in a number of industries, such as Apple, HP, IBM, JustSystem, Microsoft, Oracle, SAP, Sun, Sybase, and Unisys. In addition, language standards and modern information exchanges such as XML, Java, ECMA Script (JavaScript, LDAP, CORBA 3.0, and WML make use of Unicode as an official tool for implementing ISO/IEC 10646. There are four things to do according to Balinese script: the algorithm of transliteration, searching, sorting, and word boundary analysis (spell checking. To verify the truth of algorithm, some applications are made. These applications can run on Linux/Windows OS platform using J2SDK 1.5 and J2ME WTK2 library. The input and output of the algorithm/application are character sequence that is obtained from keyboard punch and external file. This research produces a module or a library which is able to process the Balinese text based on Unicode standard. The output of this research is the ability, skill, and mastering of 1. Unicode standard (21-bit as a substitution to ASCII (7-bit and ISO8859-1 (8-bit as the former default character set in many applications. 2. The Balinese Unicode text processing algorithm. 3. An experience of working with and learning from an international team that consists of the foremost experts in the area: Michael Everson (Ireland, Peter Constable (Microsoft US, I Made Suatjana, and Ida Bagus Adi Sudewa.

  17. Text Mining the Biomedical Literature

    Science.gov (United States)

    2007-11-05

    FORECASTING AND SOCIAL CHANGE 72 (7): 798-814. Kostoff, RN; del Rio, JA; Humenik, JA; Garcia , EO; Ramirez , AM. 2001. Citation mining: Integrating text...Author, verify your references - or, the accuracy of references in Israeli medical journals. Israel Journal of Medical Sciences. 27 (2): 109-112...0.3352% DEY, L 3 0.3352% 252 DIAZ, I 3 0.3352% FELDMAN, R 3 0.3352% FREEMAN, RT 3 0.3352% FRIEDMAN, C 3 0.3352% GAO, W 3 0.3352% GARCIA

  18. New Historicism: Text and Context

    Directory of Open Access Journals (Sweden)

    Violeta M. Vesić

    2016-02-01

    Full Text Available During most of the twentieth century history was seen as a phenomenon outside of literature that guaranteed the veracity of literary interpretation. History was unique and it functioned as a basis for reading literary works. During the seventies of the twentieth century there occurred a change of attitude towards history in American literary theory, and there appeared a new theoretical approach which soon became known as New Historicism. Since its inception, New Historicism has been identified with the study of Renaissance and Romanticism, but nowadays it has been increasingly involved in other literary trends. Although there are great differences in the arguments and practices at various representatives of this school, New Historicism has clearly recognizable features and many new historicists will agree with the statement of Walter Cohen that New Historicism, when it appeared in the eighties, represented something quite new in reference to the studies of theory, criticism and history (Cohen 1987, 33. Theoretical connection with Bakhtin, Foucault and Marx is clear, as well as a kind of uneasy tie with deconstruction and the work of Paul de Man. At the center of this approach is a renewed interest in the study of literary works in the light of historical and political circumstances in which they were created. Foucault encouraged readers to begin to move literary texts and to link them with discourses and representations that are not literary, as well as to examine the sociological aspects of the texts in order to take part in the social struggles of today. The study of literary works using New Historicism is the study of politics, history, culture and circumstances in which these works were created. With regard to one of the main fact which is located in the center of the criticism, that history cannot be viewed objectively and that reality can only be understood through a cultural context that reveals the work, re-reading and interpretation of

  19. Efficient Index for Handwritten Text

    Science.gov (United States)

    Kamel, Ibrahim

    This paper deals with one of the new emerging multimedia data types, namely, handwritten cursive text. The paper presents two indexing methods for searching a collection of cursive handwriting. The first index, word-level index, treats word as pictogram and uses global features for retrieval. The word-level index is suitable for large collection of cursive text. While the second one, called stroke-level index, treats the word as a set of strokes. The stroke-level index is more accurate, but more costly than the word level index. Each word (or stroke) can be described with a set of features and, thus, can be stored as points in the feature space. The Karhunen-Loeve transform is then used to minimize the number of features used (data dimensionality) and thus the index size. Feature vectors are stored in an R-tree. We implemented both indexes and carried many simulation experiments to measure the effectiveness and the cost of the search algorithm. The proposed indexes achieve substantial saving in the search time over the sequential search. Moreover, the proposed indexes improve the matching rate up to 46% over the sequential search.

  20. Succincter Text Indexing with Wildcards

    CERN Document Server

    Thachuk, Chris

    2011-01-01

    We study the problem of indexing text with wildcard positions, motivated by the challenge of aligning sequencing data to large genomes that contain millions of single nucleotide polymorphisms (SNPs)---positions known to differ between individuals. SNPs modeled as wildcards can lead to more informed and biologically relevant alignments. We improve the space complexity of previous approaches by giving a succinct index requiring $(2 + o(1))n \\log \\sigma + O(n) + O(d \\log n) + O(k \\log k)$ bits for a text of length $n$ over an alphabet of size $\\sigma$ containing $d$ groups of $k$ wildcards. A key to the space reduction is a result we give showing how any compressed suffix array can be supplemented with auxiliary data structures occupying $O(n) + O(d \\log \\frac{n}{d})$ bits to also support efficient dictionary matching queries. The query algorithm for our wildcard index is faster than previous approaches using reasonable working space. More importantly our new algorithm greatly reduces the query working space to ...

  1. Testing the Abbreviated Food Technology Neophobia Scale and its relation to satisfaction with food-related life in university students.

    Science.gov (United States)

    Schnettler, Berta; Grunert, Klaus G; Miranda-Zapata, Edgardo; Orellana, Ligia; Sepúlveda, José; Lobos, Germán; Hueche, Clementina; Höger, Yesli

    2017-06-01

    The aims of this study were to test the relationships between food neophobia, satisfaction with food-related life and food technology neophobia, distinguishing consumer segments according to these variables and characterizing them according to willingness to purchase food produced with novel technologies. A survey was conducted with 372 university students (mean aged=20.4years, SD=2.4). The questionnaire included the Abbreviated version of the Food Technology Neophobia Scale (AFTNS), Satisfaction with Life Scale (SWLS), and a 6-item version of the Food Neophobia Scale (FNS). Using confirmatory factor analysis, it was confirmed that SWFL correlated inversely with FNS, whereas FNS correlated inversely with AFTNS. No relationship was found between SWFL and AFTNS. Two main segments were identified using cluster analysis; these segments differed according to gender and family size. Group 1 (57.8%) possessed higher AFTNS and FNS scores than Group 2 (28.5%). However, these groups did not differ in their SWFL scores. Group 1 was less willing to purchase foods produced with new technologies than Group 2. The AFTNS and the 6-item version of the FNS are suitable instruments to measure acceptance of foods produced using new technologies in South American developing countries. The AFTNS constitutes a parsimonious alternative for the international study of food technology neophobia. Copyright © 2017 Elsevier Ltd. All rights reserved.

  2. Selecting an optimal abbreviated ICF set for clinical practice among rehabilitants with subacute stroke: retrospective analysis of patient records.

    Science.gov (United States)

    Saltychev, Mikhail; Tarvonen-Schröder, Sinikka; Eskola, Merja; Laimi, Katri

    2013-06-01

    To evaluate the adequacy of abbreviated versions of International Classification of Functioning, Disability and Health (ICF) (the WHO ICF Checklist and the ICF Comprehensive Core Set for Stroke) with respect to the specific clinical needs of a stroke rehabilitation unit before their implementation at a practical level. Common descriptions of functional limitations were identified from patient records of 10 subsequent subacute stroke patients referred to an inpatient multiprofessional rehabilitation unit of a university hospital. These descriptions were then converted into ICF categories, and the list was compared with the ICF Checklist of the WHO and the ICF Comprehensive and Brief Core Sets for Stroke developed by the ICF Research Branch. From the study population (50% women), 71 different, second-level ICF categories were identified, averaging 36.4 categories/patient (SD 5.8, range 28-46). Except for one category, all of the categories identified were also found in the ICF Comprehensive Core Set for Stroke. Of the categories identified, 49 (69%) were found in the WHO ICF Checklist. All except one category included in the ICF Brief Core Set for Stroke were also in our list. The Comprehensive Core Set for Stroke was found to be a good potential starting point for the practical implementation of the ICF in a stroke rehabilitation unit.

  3. Math Anxiety Assessment with the Abbreviated Math Anxiety Scale: Applicability and Usefulness: Insights from the Polish Adaptation

    Science.gov (United States)

    Cipora, Krzysztof; Szczygieł, Monika; Willmes, Klaus; Nuerk, Hans-Christoph

    2015-01-01

    Math anxiety has an important impact on mathematical development and performance. However, although math anxiety is supposed to be a transcultural trait, assessment instruments are scarce and are validated mainly for Western cultures so far. Therefore, we aimed at examining the transcultural generality of math anxiety by a thorough investigation of the validity of math anxiety assessment in Eastern Europe. We investigated the validity and reliability of a Polish adaptation of the Abbreviated Math Anxiety Scale (AMAS), known to have very good psychometric characteristics in its original, American-English version as well as in its Italian and Iranian adaptations. We also observed high reliability, both for internal consistency and test-retest stability of the AMAS in the Polish sample. The results also show very good construct, convergent and discriminant validity: The factorial structure in Polish adult participants (n = 857) was very similar to the one previously found in other samples; AMAS scores correlated moderately in expected directions with state and trait anxiety, self-assessed math achievement and skill as well temperamental traits of emotional reactivity, briskness, endurance, and perseverance. Average scores obtained by participants as well as gender differences and correlations with external measures were also similar across cultures. Beyond the cultural comparison, we used path model analyses to show that math anxiety relates to math grades and self-competence when controlling for trait anxiety. The current study shows transcultural validity of math anxiety assessment with the AMAS. PMID:26648893

  4. Math Anxiety Assessment with the Abbreviated Math Anxiety Scale: Applicability and Usefulness: Insights from the Polish Adaptation.

    Science.gov (United States)

    Cipora, Krzysztof; Szczygieł, Monika; Willmes, Klaus; Nuerk, Hans-Christoph

    2015-01-01

    Math anxiety has an important impact on mathematical development and performance. However, although math anxiety is supposed to be a transcultural trait, assessment instruments are scarce and are validated mainly for Western cultures so far. Therefore, we aimed at examining the transcultural generality of math anxiety by a thorough investigation of the validity of math anxiety assessment in Eastern Europe. We investigated the validity and reliability of a Polish adaptation of the Abbreviated Math Anxiety Scale (AMAS), known to have very good psychometric characteristics in its original, American-English version as well as in its Italian and Iranian adaptations. We also observed high reliability, both for internal consistency and test-retest stability of the AMAS in the Polish sample. The results also show very good construct, convergent and discriminant validity: The factorial structure in Polish adult participants (n = 857) was very similar to the one previously found in other samples; AMAS scores correlated moderately in expected directions with state and trait anxiety, self-assessed math achievement and skill as well temperamental traits of emotional reactivity, briskness, endurance, and perseverance. Average scores obtained by participants as well as gender differences and correlations with external measures were also similar across cultures. Beyond the cultural comparison, we used path model analyses to show that math anxiety relates to math grades and self-competence when controlling for trait anxiety. The current study shows transcultural validity of math anxiety assessment with the AMAS.

  5. Reliability and diagnostic efficiency of the abbreviated-diagnostic interview for borderlines in an adolescent clinical population.

    Science.gov (United States)

    Guilé, Jean Marc; Greenfield, Brian; Berthiaume, Claude; Chapdelaine, Cimon; Bergeron, Lise

    2009-09-01

    Examine the reliability as well as the concurrent validity and diagnostic efficiency of the Abbreviated version of the diagnostic interview for borderlines revised (Ab-DIB) as a screening measure of borderline psychopathology in an adolescent clinical population. The Ab-DIB is a DIB-R-derived self-report covering the impulsiveness as well as the affect and cognitive components of the borderline construct. Its administration lasts 10 min. The Ab-DIB was tested on 139 suicidal youths for reliability and concurrent validity against the DIB-R and the Columbia Impairment Scale (CIS). Internal consistencies and test-retest Intra-Class-Correlations ranged from 0.80 to 0.86 and 0.77 to 0.95, respectively. ROC analysis yielded an area under the curve of 0.87 (p < 0.001). Sensitivity was 0.88 and specificity ranged from 0.82 to 0.73 depending on the age-range. Correlation of the Ab-DIB's continuous score with the CIS was 0.42 (p < 0.001). In conclusion, The Ab-DIB's brief duration and psychometric properties suggest its utility in time-limited settings.

  6. Everyday Life as a Text

    Directory of Open Access Journals (Sweden)

    Michael Lahey

    2016-02-01

    Full Text Available This article explores how audience data are utilized in the tentative partnerships created between television and social media companies. Specially, it looks at the mutually beneficial relationship formed between the social media platform Twitter and television. It calls attention to how audience data are utilized as a way for the television industry to map itself onto the everyday lives of digital media audiences. I argue that the data-intensive monitoring of everyday life offers some measure of soft control over audiences in a digital media landscape. To do this, I explore “Social TV”—the relationships created between social media technologies and television—before explaining how Twitter leverages user data into partnerships with various television companies. Finally, the article explains what is fruitful about understanding the Twitter–television relationship as a form of soft control.

  7. Segmental Rescoring in Text Recognition

    Science.gov (United States)

    2014-02-04

    ttm № tes/m, m* tmvr mowm* a Smyrna Of l δrtA£ACf02S’ A w m - y i p m AmiKSiS € f № ) C № № m .. sg6#?«rA fiθN ; Atφ h Sft№’·’Spxn mm m fim f№b t&m&mm...applying a Hidden Markov Model (HMM) recognition approach. Generating the plurality text hypotheses for the image forming includes generating a first...image. Applying segmental analysis to a segmentation determined by a first OCR engine, such as a segmentation determined by a Hidden Markov Model (HMM

  8. Linguistic dating of biblical texts

    DEFF Research Database (Denmark)

    Young, Ian; Rezetko, Robert; Ehrensvärd, Martin Gustaf

    and diglossia and textual criticism (Chapters 7, 13), and the significance of extra-biblical sources, including Amarna Canaanite, Ugaritic, Aramaic, Hebrew inscriptions of the monarchic period, Qumran and Mishnaic Hebrew, the Hebrew language of Ben Sira and Bar Kochba, and also Egyptian, Akkadian, Persian....... This is followed by an detailed synthesis of the topics introduced in the first volume, a series of detailed case studies on various linguistic issues, extensive tables of grammatical and lexical features, and a comprehensive bibliography. The authors argue that the scholarly use of language in dating biblical...... texts, and even the traditional standpoint on the chronological development of biblical Hebrew, require a thorough re-evaluation, and propose a new perspective on linguistic variety in biblical Hebrew. Early Biblical Hebrew and Late Biblical Hebrew do not represent different chronological periods...

  9. Linguistic dating of biblical texts

    DEFF Research Database (Denmark)

    Young, Ian; Rezetko, Robert; Ehrensvärd, Martin Gustaf

    Since the beginning of critical scholarship biblical texts have been dated using linguistic evidence. In recent years this has become a controversial topic, especially with the publication of Ian Young (ed.), Biblical Hebrew: Studies in Chronology and Typology (2003). However, until now there has...... in the history of biblical Hebrew, but instead represent co-existing styles of literary Hebrew throughout the biblical period....... and diglossia and textual criticism (Chapters 7, 13), and the significance of extra-biblical sources, including Amarna Canaanite, Ugaritic, Aramaic, Hebrew inscriptions of the monarchic period, Qumran and Mishnaic Hebrew, the Hebrew language of Ben Sira and Bar Kochba, and also Egyptian, Akkadian, Persian....... This is followed by an detailed synthesis of the topics introduced in the first volume, a series of detailed case studies on various linguistic issues, extensive tables of grammatical and lexical features, and a comprehensive bibliography. The authors argue that the scholarly use of language in dating biblical...

  10. Abbreviated World Health Organization Quality of Life questionnaire (WHOQOL-Bref) in north Indian patients with bronchial asthma: an evaluation using Rasch analysis

    OpenAIRE

    Aggarwal, Ashutosh N.; Agarwal, Ritesh; Gupta, Dheeraj

    2014-01-01

    Background: There is no disease-specific instrument to describe health-related quality of life (HRQoL) in Indian patients with asthma. However, an abbreviated World Health Organization Quality of Life questionnaire (WHOQOL-Bref), a generic Hindi HRQoL measure, has been developed and validated in India. Aims: To evaluate the WHOQOL-Bref in adult patients with asthma and to test possible modifications to the instrument to improve its psychometric adequacy. Methods: Sixty-seven patients with ast...

  11. Modeling the structure of the attitudes and belief scale 2 using CFA and bifactor approaches: Toward the development of an abbreviated version.

    Science.gov (United States)

    Hyland, Philip; Shevlin, Mark; Adamson, Gary; Boduszek, Daniel

    2014-01-01

    The Attitudes and Belief Scale-2 (ABS-2: DiGiuseppe, Leaf, Exner, & Robin, 1988. The development of a measure of rational/irrational thinking. Paper presented at the World Congress of Behavior Therapy, Edinburg, Scotland.) is a 72-item self-report measure of evaluative rational and irrational beliefs widely used in Rational Emotive Behavior Therapy research contexts. However, little psychometric evidence exists regarding the measure's underlying factor structure. Furthermore, given the length of the ABS-2 there is a need for an abbreviated version that can be administered when there are time demands on the researcher, such as in clinical settings. This study sought to examine a series of theoretical models hypothesized to represent the latent structure of the ABS-2 within an alternative models framework using traditional confirmatory factor analysis as well as utilizing a bifactor modeling approach. Furthermore, this study also sought to develop a psychometrically sound abbreviated version of the ABS-2. Three hundred and thirteen (N = 313) active emergency service personnel completed the ABS-2. Results indicated that for each model, the application of bifactor modeling procedures improved model fit statistics, and a novel eight-factor intercorrelated solution was identified as the best fitting model of the ABS-2. However, the observed fit indices failed to satisfy commonly accepted standards. A 24-item abbreviated version was thus constructed and an intercorrelated eight-factor solution yielded satisfactory model fit statistics. Current results support the use of a bifactor modeling approach to determining the factor structure of the ABS-2. Furthermore, results provide empirical support for the psychometric properties of the newly developed abbreviated version.

  12. Impact of an Abbreviated Cardiac Enzyme Protocol to Aid Rapid Discharge of Patients with Cocaine-associated Chest Pain in the Clinical Decision Unit

    Directory of Open Access Journals (Sweden)

    Faheem W. Guirgis

    2014-03-01

    Full Text Available Introduction: In 2007 there were 64,000 visits to the emergency department (ED for possible myocardial infarction (MI related to cocaine use. Prior studies have demonstrated that low- to intermediate-risk patients with cocaine-associated chest pain can be safely discharged after 9-12 hours of observation. The goal of this study was to determine the safety of an 8-hour protocol for ruling out MI in patients who presented with cocaine-associated chest pain. Methods: We conducted a retrospective review of patients treated with an 8-hour cocaine chest pain protocol between May 1, 2011 and November 30, 2012 who were sent to the clinical decision unit (CDU for observation. The protocol included serial cardiac biomarker testing with Troponin-T, CK-MB (including delta CK-MB, and total CK at 0, 2, 4, and 8 hours after presentation with cardiac monitoring for the observation period. Patients were followed up for adverse cardiac events or death within 30 days of discharge. Results: There were 111 admissions to the CDU for cocaine chest pain during the study period. One patient had a delta CK-MB of 1.6 ng/ml, but had negative Troponin-T at all time points. No patient had a positive Troponin-T or CK-MB at 0, 2, 4 or 8 hours, and there were no MIs or deaths within 30 days of discharge. Most patients were discharged home (103 and there were 8 inpatient admissions from the CDU. Of the admitted patients, 2 had additional stress tests that were negative, 1 had additional cardiac biomarkers that were negative, and all 8 patients were discharged home. The estimated risk of missing MI using our protocol is, with 99% confidence, less than 5.1% and with 95% confidence, less than 3.6% (99% CI, 0-5.1%; 95% CI, 0-3.6%. Conclusion: Application of an abbreviated cardiac enzyme protocol resulted in the safe and rapid discharge of patients presenting to the ED with cocaine-associated chest pain. [West J Emerg Med. 2014;15(2:180–183.

  13. 两种不同类型的缩略词语:用语缩略与造词缩略——兼论海峡两岸缩略词语的类型差异%Two kinds of Abbreviations:Abbreviations about Using and Abbreviation about Coinages——On Different Types across the Taiwan Strait

    Institute of Scientific and Technical Information of China (English)

    刁晏斌

    2011-01-01

    Abbreviations in Modern Chinese are a very complex set which can be divided as abbreviations about using and abbreviation about coinages,depending on their generated motivations and generative process.The former focuses on the convenient form of language use about the relatively fixed form of language.The latter is a kind of combination which is not associated with a relatively fixed strictly corresponded to the original style,using the abbreviated form of word-building materials.They have different focuses and correspond to different issues.The mechanism and procedure of generating and the surface meanings are also different.In the framework of dichotomy,there are significant difference between abbreviations in Taiwan Mandarin and mainland Mandarin: as to abbreviations about using,it includes less Numeral Compact Expressions,more common names and more compaction of Three-syllable words;as to abbreviation about coinages,it includes less fixed terms but more temporary words.%现代汉语缩略词语是一个非常复杂的集合,可以根据其产生动机和生成过程等的差异,分为"用语的缩略"和"造词的缩略",前者因着眼于对已有相对固定语言形式的便捷使用而生,后者则是不与某一相对固定的原式严格对应的、利用缩略性构词材料构成的组合形式。二者的着眼点不同,对应物不同,产生机制和过程不同,在表义上也有差异。在这个二分的框架下,可以看到台湾"国语"缩略词语与普通话的明显差异:就用语缩略来说,是数字略语少、合称多、三音节词的简缩多;就造词缩略来看,则是固定词少而临时词多。

  14. Audio Steganography with Embedded Text

    Science.gov (United States)

    Teck Jian, Chua; Chai Wen, Chuah; Rahman, Nurul Hidayah Binti Ab.; Hamid, Isredza Rahmi Binti A.

    2017-08-01

    Audio steganography is about hiding the secret message into the audio. It is a technique uses to secure the transmission of secret information or hide their existence. It also may provide confidentiality to secret message if the message is encrypted. To date most of the steganography software such as Mp3Stego and DeepSound use block cipher such as Advanced Encryption Standard or Data Encryption Standard to encrypt the secret message. It is a good practice for security. However, the encrypted message may become too long to embed in audio and cause distortion of cover audio if the secret message is too long. Hence, there is a need to encrypt the message with stream cipher before embedding the message into the audio. This is because stream cipher provides bit by bit encryption meanwhile block cipher provide a fixed length of bits encryption which result a longer output compare to stream cipher. Hence, an audio steganography with embedding text with Rivest Cipher 4 encryption cipher is design, develop and test in this project.

  15. A programmed text in statistics

    CERN Document Server

    Hine, J

    1975-01-01

    Exercises for Section 2 42 Physical sciences and engineering 42 43 Biological sciences 45 Social sciences Solutions to Exercises, Section 1 47 Physical sciences and engineering 47 49 Biological sciences 49 Social sciences Solutions to Exercises, Section 2 51 51 PhYSical sciences and engineering 55 Biological sciences 58 Social sciences 62 Tables 2 62 x - tests involving variances 2 63,64 x - one tailed tests 2 65 x - two tailed tests F-distribution 66-69 Preface This project started some years ago when the Nuffield Foundation kindly gave a grant for writing a pro­ grammed text to use with service courses in statistics. The work carried out by Mrs. Joan Hine and Professor G. B. Wetherill at Bath University, together with some other help from time to time by colleagues at Bath University and elsewhere. Testing was done at various colleges and universities, and some helpful comments were received, but we particularly mention King Edwards School, Bath, who provided some sixth formers as 'guinea pigs' for the fir...

  16. Abbreviated half-lives and impaired fuel utilization in carnitine palmitoyltransferase II variant fibroblasts.

    Directory of Open Access Journals (Sweden)

    Min Yao

    Full Text Available Carnitine palmitoyltransferase II (CPT II deficiency is one of the most common causes of fatty acid oxidation metabolism disorders. However, the molecular mechanism between CPT2 gene polymorphisms and metabolic stress has not been fully clarified. We previously reported that a number of patients show a thermal instable phenotype of compound hetero/homozygous variants of CPT II. To understand the mechanism of the metabolic disorder resulting from CPT II deficiency, the present study investigated CPT II variants in patient fibroblasts, [c.1102 G>A (p.V368I] (heterozygous, [c.1102 G>A (p.V368I] (homozygous, and [c.1055 T>G (p.F352C] (heterozygous + [c.1102 G>A (p.V368I] (homozygous compared with fibroblasts from healthy controls. CPT II variants exerted an effect of dominant negative on the homotetrameric proteins that showed thermal instability, reduced residual enzyme activities and a short half-life. Moreover, CPT II variant fibroblasts showed a significant decrease in fatty acid β-oxidation and adenosine triphosphate generation, combined with a reduced mitochondrial membrane potential, resulting in cellular apoptosis. Collectively, our data indicate that the CPT II deficiency induces an energy crisis of the fatty acid metabolic pathway. These findings may contribute to the elucidation of the genetic factors involved in metabolic disorder encephalopathy caused by the CPT II deficiency.

  17. Efficient Retrieval of Text for Biomedical Domain using Expectation Maximization Algorithm

    Directory of Open Access Journals (Sweden)

    Sumit Vashishtha

    2011-11-01

    Full Text Available Data mining, a branch of computer science [1], is the process of extracting patterns from large data sets by combining methods from statistics and artificial intelligence with database management. Data mining is seen as an increasingly important tool by modern business to transform data into business intelligence giving an informational advantage. Biomedical text retrieval refers to text retrieval techniques applied to biomedical resources and literature available of the biomedical and molecular biology domain. The volume of published biomedical research, and therefore the underlying biomedical knowledge base, is expanding at an increasing rate. Biomedical text retrieval is a way to aid researchers in coping with information overload. By discovering predictive relationships between different pieces of extracted data, data-mining algorithms can be used to improve the accuracy of information extraction. However, textual variation due to typos, abbreviations, and other sources can prevent the productive discovery and utilization of hard-matching rules. Recent methods of soft clustering can exploit predictive relationships in textual data. This paper presents a technique for using soft clustering data mining algorithm to increase the accuracy of biomedical text extraction. Experimental results demonstrate that this approach improves text extraction more effectively that hard keyword matching rules.

  18. Extending a Single Residue Switch for Abbreviating Catalysis in Plant ent-Kaurene Synthases

    Directory of Open Access Journals (Sweden)

    Meirong Jia

    2016-11-01

    Full Text Available Production of ent-kaurene as a precursor for important signaling molecules such as the gibberellins seems to have arisen early in plant evolution, with corresponding cyclase(s present in all land plants (i.e., embryophyta. The relevant enzymes seem to represent fusion of the class II diterpene cyclase that produces the intermediate ent-copalyl diphosphate (ent-CPP and the subsequently acting class I diterpene synthase that produces ent-kaurene, although the bifunctionality of the ancestral gene is only retained in certain early diverging plants, with gene duplication and sub-functionalization leading to distinct ent-CPP synthases (CPSs and ent-kaurene synthases (KSs generally observed. This evolutionary scenario implies that plant KSs should have conserved structural features uniquely required for production of ent-kaurene relative to related enzymes that have alternative function. Notably, substitution of threonine for a conserved isoleucine has been shown to short-circuit the complex bicyclization and rearrangement reaction catalyzed by KSs after initial cyclization, leading to predominant production of ent-pimaradiene, at least in KSs from angiosperms. Here this effect is shown to extend to KSs from earlier diverging plants (i.e., bryophytes, including a bifunctional CPS/KS. In addition, attribution of the dramatic effect of this single residue switch on product outcome to electrostatic stabilization of the ent-pimarenyl carbocation intermediate formed upon initial cyclization by the hydroxyl introduced by threonine substitution has been called into question by the observation of similar effects from substitution of alanine. Here further mutational analysis and detailed product analysis is reported that supports the importance of electrostatic stabilization by a hydroxyl or water.

  19. Orientalist discourse in media texts

    Directory of Open Access Journals (Sweden)

    Necla Mora

    2009-10-01

    Full Text Available By placing itself at the center of the world with a Eurocentric point of view, the West exploits other countries and communities through inflicting cultural change and transformation on them either from within via colonialist movements or from outside via “Orientalist” discourses in line with its imperialist objectives.The West has fictionalized the “image of the Orient” in terms of science by making use of social sciences like anthropology, history and philology and launched an intensive propaganda which covers literature, painting, cinema and other fields of art in order to actualize this fiction. Accordingly, the image of the Orient – which has been built firstly in terms of science then socially – has been engraved into the collective memory of both the Westerner and the Easterner.The internalized “Orientalist” point of view and discourse cause the Westerner to see and perceive the Easterner with the image formed in his/her memory while looking at them. The Easterner represents and expresses himself/herself from the eyes of the Westerner and with the image which the Westerner fictionalized for him/her. Hence, in order to gain acceptance from the West, the East tries to shape itself into the “Orientalist” mold which the Westerner fictionalized for it.Artists, intellectuals, writers and media professionals, who embrace and internalize the stereotypical hegemonic-driven “Orientalist” discourse of the Westerner and who rank among the elite group, reflect their internalized “Orientalist” discourse on their own actions. This condition causes the “Orientalist” clichés to be engraved in the memory of the society; causes the society to view itself with an “Orientalist” point of view and perceive itself with the clichés of the Westerner. Consequently, the second ring of the hegemony is reproduced by the symbolic elites who represent the power/authority within the country.The “Orientalist” discourse, which is

  20. Applicability of an abbreviated version of the Child-OIDP inventory among primary schoolchildren in Tanzania

    Directory of Open Access Journals (Sweden)

    Mtaya Matilda

    2007-07-01

    Full Text Available Abstract Background There is a need for studies evaluating oral health related quality of life (OHRQoL of children in developing countries. Aim to assess the psychometric properties, prevalence and perceived causes of the child version of oral impact on daily performance inventory (Child-OIDP among school children in two socio-demographically different districts of Tanzania. Socio-behavioral and clinical correlates of children's OHRQoL were also investigated. Method One thousand six hundred and one children (mean age 13 yr, 60.5% girls attending 16 (urban and rural primary schools in Kinondoni and Temeke districts completed a survey instrument in face to face interviews and participated in a full mouth clinical examination. The survey instrument was designed to measure a Kiswahili translated and culturally adapted Child-OIDP frequency score, global oral health indicators and socio-demographic factors. Results The Kiswahili version of the Child-OIDP inventory preserved the overall concept of the original English version and revealed good reliability in terms of Cronbach's alpha coefficient of 0.77 (Kinondoni: 0.62, Temeke: 0.76. Weighted Kappa scores from a test-retest were 1.0 and 0.8 in Kinondoni and Temeke, respectively. Validity was supported in that the OIDP scores varied systematically and in the expected direction with self-reported oral health measures and socio-behavioral indicators. Confirmatory factor analyses, CFA, confirmed three dimensions identified initially by Principle Component Analysis within the OIDP item pool. A total of 28.6% of the participants had at least one oral impact. The area specific rates for Kinondoni and Temeke were 18.5% and 45.5%. The most frequently reported impacts were problems eating and cleaning teeth, and the most frequently reported cause of impacts were toothache, ulcer in mouth and position of teeth. Conclusion This study showed that the Kiswahili version of the Child-OIDP was applicable for use among

  1. Abbreviation of larval development and extension of brood care as key features of the evolution of freshwater Decapoda.

    Science.gov (United States)

    Vogt, Günter

    2013-02-01

    The transition from marine to freshwater habitats is one of the major steps in the evolution of life. In the decapod crustaceans, four groups have colonized fresh water at different geological times since the Triassic, the freshwater shrimps, freshwater crayfish, freshwater crabs and freshwater anomurans. Some families have even colonized terrestrial habitats via the freshwater route or directly via the sea shore. Since none of these taxa has ever reinvaded its environment of origin the Decapoda appear particularly suitable to investigate life-history adaptations to fresh water. Evolutionary comparison of marine, freshwater and terrestrial decapods suggests that the reduction of egg number, abbreviation of larval development, extension of brood care and lecithotrophy of the first posthatching life stages are key adaptations to fresh water. Marine decapods usually have high numbers of small eggs and develop through a prolonged planktonic larval cycle, whereas the production of small numbers of large eggs, direct development and extended brood care until the juvenile stage is the rule in freshwater crayfish, primary freshwater crabs and aeglid anomurans. The amphidromous freshwater shrimp and freshwater crab species and all terrestrial decapods that invaded land via the sea shore have retained ocean-type planktonic development. Abbreviation of larval development and extension of brood care are interpreted as adaptations to the particularly strong variations of hydrodynamic parameters, physico-chemical factors and phytoplankton availability in freshwater habitats. These life-history changes increase fitness of the offspring and are obviously favoured by natural selection, explaining their multiple origins in fresh water. There is no evidence for their early evolution in the marine ancestors of the extant freshwater groups and a preadaptive role for the conquest of fresh water. The costs of the shift from relative r- to K-strategy in freshwater decapods are traded

  2. An Abbreviated Protocol for In Vitro Generation of Functional Human Embryonic Stem Cell-Derived Beta-Like Cells

    Science.gov (United States)

    Massumi, Mohammad; Pourasgari, Farzaneh; Nalla, Amarnadh; Batchuluun, Battsetseg; Nagy, Kristina; Neely, Eric; Gull, Rida; Nagy, Andras; Wheeler, Michael B.

    2016-01-01

    The ability to yield glucose-responsive pancreatic beta-cells from human pluripotent stem cells in vitro will facilitate the development of the cell replacement therapies for the treatment of Type 1 Diabetes. Here, through the sequential in vitro targeting of selected signaling pathways, we have developed an abbreviated five-stage protocol (25–30 days) to generate human Embryonic Stem Cell-Derived Beta-like Cells (ES-DBCs). We showed that Geltrex, as an extracellular matrix, could support the generation of ES-DBCs more efficiently than that of the previously described culture systems. The activation of FGF and Retinoic Acid along with the inhibition of BMP, SHH and TGF-beta led to the generation of 75% NKX6.1+/NGN3+ Endocrine Progenitors. The inhibition of Notch and tyrosine kinase receptor AXL, and the treatment with Exendin-4 and T3 in the final stage resulted in 35% mono-hormonal insulin positive cells, 1% insulin and glucagon positive cells and 30% insulin and NKX6.1 co-expressing cells. Functionally, ES-DBCs were responsive to high glucose in static incubation and perifusion studies, and could secrete insulin in response to successive glucose stimulations. Mitochondrial metabolic flux analyses using Seahorse demonstrated that the ES-DBCs could efficiently metabolize glucose and generate intracellular signals to trigger insulin secretion. In conclusion, targeting selected signaling pathways for 25–30 days was sufficient to generate ES-DBCs in vitro. The ability of ES-DBCs to secrete insulin in response to glucose renders them a promising model for the in vitro screening of drugs, small molecules or genes that may have potential to influence beta-cell function. PMID:27755557

  3. SLAM-enriched hematopoietic stem cells maintain long-term repopulating capacity after lentiviral transduction using an abbreviated protocol.

    Science.gov (United States)

    Laje, P; Zoltick, P W; Flake, A W

    2010-03-01

    Gene transfer to long-term repopulating hematopoietic stem cells (HSCs) using integrating viral vectors is an important goal in gene therapy. The SLAM (signaling lymphocyte activation molecule)-family receptors have recently been used for the isolation of highly enriched murine HSCs. This HSC enrichment protocol is relatively simple, and results in an HSC population with comparable repopulating capacity to c-kit(+)lin(-)Sca-1(+) (KSL) HSCs. The capacity to withstand genetic manipulation and, most importantly, to maintain long-term repopulating capacity of SLAM-enriched HSC populations has not been reported. In this study, SLAM-enriched HSCs were assessed for transduction efficiency and in vivo long-term repopulating capacity after lentiviral transduction using an abbreviated transduction protocol and KSL-enriched HSCs as a reference population. SLAM- and KSL-enriched HSCs were efficiently transduced by lentiviral vector using a simple protocol that involves minimal in vitro manipulation and no pre-stimulation. SLAM-HSCs are at least equal to KSL-HSCs with respect to efficiency of transduction and maintenance of long-term repopulating capacity. Although there was a reduction in repopulating capacity related to enrichment and culture manipulations relative to freshly isolated bone marrow (BM) cells, no detrimental effects were identified on long-term competitive capacity related to transduction, as transduced cells maintained stable levels of chimerism in competition with non-transduced cells and freshly isolated BM cells. These results support the SLAM-HSC enrichment protocol as a simple and efficient method for HSC enrichment for gene transfer studies.

  4. Use of an abbreviated neuroscience education approach in the treatment of chronic low back pain: a case report.

    Science.gov (United States)

    Louw, Adriaan; Puentedura, Emilio Louie; Mintken, Paul

    2012-01-01

    Chronic low back pain (CLBP) remains prevalent in society, and conservative treatment strategies appear to have little effect. It is proposed that patients with CLBP may have altered cognition and increased fear, which impacts their ability to move, perform exercise, and partake in activities of daily living. Neuroscience education (NE) aims to change a patient's cognition regarding their pain state, which may result in decreased fear, ultimately resulting in confrontation of pain barriers and a resumption of normal activities. A 64-year-old female with history of CLBP was the patient for this case report. A physical examination, the Numeric Pain Rating Scale (NPRS), Oswestry Disability Index (ODI), Fear-Avoidance Beliefs Questionnaire (FABQ), and Zung Depression Scale were assessed during her initial physical therapy visit, immediately after her first physical therapy session, and at 7-month follow-up. Treatment consisted of an abbreviated NE approach, exercises (range of motion, stretches, and cardiovascular), and aquatic therapy. She attended twice a week for 4 weeks, or 8 visits total. Pre-NE, the patient reported NPRS = 9/10; ODI = 54%; FABQ-W = 25/42,; FABQ-PA = 20/24, and Zung = 58. Immediately following the 75-minute evaluation and NE session, the patient reported improvement in all four outcome measures, most notably a reduction in the FABQ-W score to 2/42 and the FABQ-PA to 1/24. At a 7-month follow-up, all outcome measures continued to be improved. NE aimed at decreasing fear associated with movement may be a valuable adjunct to movement-based therapy, such as exercise, for patients with CLBP.

  5. On the recovery of an ancient text: Principles of editing, The diaries of Lady Anne Barnard

    Directory of Open Access Journals (Sweden)

    M. Lenta

    1997-05-01

    Full Text Available The unrevised and handwritten Cape diaries of Lady Anne Barnard for the years 1799 and 1800 have recently been transcribed and are now in the process of being edited. Since they are very long, and would be expensive to publish in their entirely, the question has arisen for their editors what principles of selection and emphasis should be followed in the editorial process. The diaries are private documents, intended to be read by no one but the author herself, and they are frequently non-standard in punctuation, spelling and even at times in syntax. The editors therefore face other issues, concerning their right to correct or standardise the text, which as it stands, is an illustration of the practice of a highly intelligent and experienced woman with almost no formal education - a woman who in many respects is representative of her time and class. The different kinds of interest present within the text - Cape and European history, the history of women, of slaves and of colonialism, as well as of the indigenous peoples of the Cape hinterland, may well represent alternative focuses between which the editors, in an abbreviated text, must choose, since the final decision concerning publication is likely to be an economic one. Finally the editors’ recommendations are likely to be based on the degree of interest possessed by the text in its component parts - are all its subjects equally interesting to the envisaged reader, the amateur of history of the present day?

  6. Measurement of prompt and nonprompt [Formula: see text] production in [Formula: see text] and [Formula: see text] collisions at [Formula: see text].

    Science.gov (United States)

    Sirunyan, A M; Tumasyan, A; Adam, W; Asilar, E; Bergauer, T; Brandstetter, J; Brondolin, E; Dragicevic, M; Erö, J; Flechl, M; Friedl, M; Frühwirth, R; Ghete, V M; Hartl, C; Hörmann, N; Hrubec, J; Jeitler, M; König, A; Krätschmer, I; Liko, D; Matsushita, T; Mikulec, I; Rabady, D; Rad, N; Rahbaran, B; Rohringer, H; Schieck, J; Strauss, J; Waltenberger, W; Wulz, C-E; Dvornikov, O; Makarenko, V; Mossolov, V; Suarez Gonzalez, J; Zykunov, V; Shumeiko, N; Alderweireldt, S; De Wolf, E A; Janssen, X; Lauwers, J; Van De Klundert, M; Van Haevermaet, H; Van Mechelen, P; Van Remortel, N; Van Spilbeeck, A; Abu Zeid, S; Blekman, F; D'Hondt, J; Daci, N; De Bruyn, I; Deroover, K; Lowette, S; Moortgat, S; Moreels, L; Olbrechts, A; Python, Q; Skovpen, K; Tavernier, S; Van Doninck, W; Van Mulders, P; Van Parijs, I; Brun, H; Clerbaux, B; De Lentdecker, G; Delannoy, H; Fasanella, G; Favart, L; Goldouzian, R; Grebenyuk, A; Karapostoli, G; Lenzi, T; Léonard, A; Luetic, J; Maerschalk, T; Marinov, A; Randle-Conde, A; Seva, T; Vander Velde, C; Vanlaer, P; Vannerom, D; Yonamine, R; Zenoni, F; Zhang, F; Cimmino, A; Cornelis, T; Dobur, D; Fagot, A; Gul, M; Khvastunov, I; Poyraz, D; Salva, S; Schöfbeck, R; Tytgat, M; Van Driessche, W; Yazgan, E; Zaganidis, N; Bakhshiansohi, H; Beluffi, C; Bondu, O; Brochet, S; Bruno, G; Caudron, A; De Visscher, S; Delaere, C; Delcourt, M; Francois, B; Giammanco, A; Jafari, A; Komm, M; Krintiras, G; Lemaitre, V; Magitteri, A; Mertens, A; Musich, M; Piotrzkowski, K; Quertenmont, L; Selvaggi, M; Vidal Marono, M; Wertz, S; Beliy, N; Aldá Júnior, W L; Alves, F L; Alves, G A; Brito, L; Hensel, C; Moraes, A; Pol, M E; Rebello Teles, P; Belchior Batista Das Chagas, E; Carvalho, W; Chinellato, J; Custódio, A; Da Costa, E M; Da Silveira, G G; De Jesus Damiao, D; De Oliveira Martins, C; Fonseca De Souza, S; Huertas Guativa, L M; Malbouisson, H; Matos Figueiredo, D; Mora Herrera, C; Mundim, L; Nogima, H; Prado Da Silva, W L; Santoro, A; Sznajder, A; Tonelli Manganote, E J; Torres Da Silva De Araujo, F; Vilela Pereira, A; Ahuja, S; Bernardes, C A; Dogra, S; Fernandez Perez Tomei, T R; Gregores, E M; Mercadante, P G; Moon, C S; Novaes, S F; Padula, Sandra S; Romero Abad, D; Ruiz Vargas, J C; Aleksandrov, A; Hadjiiska, R; Iaydjiev, P; Rodozov, M; Stoykova, S; Sultanov, G; Vutova, M; Dimitrov, A; Glushkov, I; Litov, L; Pavlov, B; Petkov, P; Fang, W; Ahmad, M; Bian, J G; Chen, G M; Chen, H S; Chen, M; Chen, Y; Cheng, T; Jiang, C H; Leggat, D; Liu, Z; Romeo, F; Ruan, M; Shaheen, S M; Spiezia, A; Tao, J; Wang, C; Wang, Z; Zhang, H; Zhao, J; Ban, Y; Chen, G; Li, Q; Liu, S; Mao, Y; Qian, S J; Wang, D; Xu, Z; Avila, C; Cabrera, A; Chaparro Sierra, L F; Florez, C; Gomez, J P; González Hernández, C F; Ruiz Alvarez, J D; Sanabria, J C; Godinovic, N; Lelas, D; Puljak, I; Ribeiro Cipriano, P M; Sculac, T; Antunovic, Z; Kovac, M; Brigljevic, V; Ferencek, D; Kadija, K; Mesic, B; Susa, T; Attikis, A; Mavromanolakis, G; Mousa, J; Nicolaou, C; Ptochos, F; Razis, P A; Rykaczewski, H; Tsiakkouri, D; Finger, M; Finger, M; Carrera Jarrin, E; Assran, Y; Elkafrawy, T; Mahrous, A; Kadastik, M; Perrini, L; Raidal, M; Tiko, A; Veelken, C; Eerola, P; Pekkanen, J; Voutilainen, M; Härkönen, J; Järvinen, T; Karimäki, V; Kinnunen, R; Lampén, T; Lassila-Perini, K; Lehti, S; Lindén, T; Luukka, P; Tuominiemi, J; Tuovinen, E; Wendland, L; Talvitie, J; Tuuva, T; Besancon, M; Couderc, F; Dejardin, M; Denegri, D; Fabbro, B; Faure, J L; Favaro, C; Ferri, F; Ganjour, S; Ghosh, S; Givernaud, A; Gras, P; Hamel de Monchenault, G; Jarry, P; Kucher, I; Locci, E; Machet, M; Malcles, J; Rander, J; Rosowsky, A; Titov, M; Abdulsalam, A; Antropov, I; Arleo, F; Baffioni, S; Beaudette, F; Busson, P; Cadamuro, L; Chapon, E; Charlot, C; Davignon, O; Granier de Cassagnac, R; Jo, M; Lisniak, S; Miné, P; Nguyen, M; Ochando, C; Ortona, G; Paganini, P; Pigard, P; Regnard, S; Salerno, R; Sirois, Y; Strebler, T; Yilmaz, Y; Zabi, A; Zghiche, A; Agram, J-L; Andrea, J; Aubin, A; Bloch, D; Brom, J-M; Buttignol, M; Chabert, E C; Chanon, N; Collard, C; Conte, E; Coubez, X; Fontaine, J-C; Gelé, D; Goerlach, U; Le Bihan, A-C; Van Hove, P; Gadrat, S; Beauceron, S; Bernet, C; Boudoul, G; Carrillo Montoya, C A; Chierici, R; Contardo, D; Courbon, B; Depasse, P; El Mamouni, H; Fay, J; Gascon, S; Gouzevitch, M; Grenier, G; Ille, B; Lagarde, F; Laktineh, I B; Lethuillier, M; Mirabito, L; Pequegnot, A L; Perries, S; Popov, A; Sabes, D; Sordini, V; Vander Donckt, M; Verdier, P; Viret, S; Khvedelidze, A; Tsamalaidze, Z; Autermann, C; Beranek, S; Feld, L; Kiesel, M K; Klein, K; Lipinski, M; Preuten, M; Schomakers, C; Schulz, J; Verlage, T; Albert, A; Brodski, M; Dietz-Laursonn, E; Duchardt, D; Endres, M; Erdmann, M; Erdweg, S; Esch, T; Fischer, R; Güth, A; Hamer, M; Hebbeker, T; Heidemann, C; Hoepfner, K; Knutzen, S

    2017-01-01

    This paper reports the measurement of [Formula: see text] meson production in proton-proton ([Formula: see text]) and proton-lead ([Formula: see text]) collisions at a center-of-mass energy per nucleon pair of [Formula: see text] by the CMS experiment at the LHC. The data samples used in the analysis correspond to integrated luminosities of 28[Formula: see text] and 35[Formula: see text] for [Formula: see text] and [Formula: see text] collisions, respectively. Prompt and nonprompt [Formula: see text] mesons, the latter produced in the decay of [Formula: see text] hadrons, are measured in their dimuon decay channels. Differential cross sections are measured in the transverse momentum range of [Formula: see text], and center-of-mass rapidity ranges of [Formula: see text] ([Formula: see text]) and [Formula: see text] ([Formula: see text]). The nuclear modification factor, [Formula: see text], is measured as a function of both [Formula: see text] and [Formula: see text]. Small modifications to the [Formula: see text] cross sections are observed in [Formula: see text] relative to [Formula: see text] collisions. The ratio of [Formula: see text] production cross sections in [Formula: see text]-going and Pb-going directions, [Formula: see text], studied as functions of [Formula: see text] and [Formula: see text], shows a significant decrease for increasing transverse energy deposited at large pseudorapidities. These results, which cover a wide kinematic range, provide new insight on the role of cold nuclear matter effects on prompt and nonprompt [Formula: see text] production.

  7. The socio-demographics of texting

    DEFF Research Database (Denmark)

    Ling, Richard; Bertel, Troels Fibæk; Sundsøy, Pål

    2012-01-01

    Who texts, and with whom do they text? This article examines the use of texting using metered traffic data from a large dataset (nearly 400 million anonymous text messages). We ask 1) How much do different age groups use mobile phone based texting (SMS)? 2) How wide is the circle of texting partn...

  8. DHARMAYATRA IN THE DWIJENDRA TATTWA TEXT ANALYSIS OF RECEPTION

    Directory of Open Access Journals (Sweden)

    Ida Bagus Rai Putra

    2012-11-01

    Full Text Available The object of the study is Dwijendra Text (hereinafter abbreviated to DT. It containsinteresting narrations and is importantly related to the dharmayatra, the holy religious journeymade by Dang Hyang Nirartha, the charismatic figure, in Bali, Lombok and Sumbawa. Beforethe analysis of reception was conducted, the corpus text of the DT texts completely andstructurally telling the religious journey made by Dang Hyang Nirartha was successfullydetermined. The analysis in this study was made to answer the following questions: what is thenarrative structure of the DT text; what are the enlightenment image entities of the dharmayatraof the DT text; how do people appreciate the dharmayatra of the DT text? The answers to thenarrative structure of the DT text; the image entities and the appreciation provided by people arethe main objectives of this study.The theories adopted in this study are the theory of reception introduced by Jauss, thetheory of semiotics introduced by Pierce and the theory of mythology introduced by Barthes. Asa qualitative study, the data needed were collected by the methods of observation, note taking,documentation and interview supported with a sound recorder and pictures. The results of theanalysis are informally presented, meaning that they are verbally described in the form of wordswhich are systematically composed based on the problems formulated in this study.The analysis of the narrative structure of the DT text contains narrative units which are inthe forms of theme, characters and plots. They all unite to form stories which are mythological,legendary, symbolic, hagiographic and suggestive in nature. Based on the analysis ofenlightenment image entities, it can be concluded that there are three basic entities leading to thecreation of the DT text. They are first enlightenment; second protection of Hinduism; and thirdconstruction of temple institutions. Based on the reception analysis, it can be concluded thatpeople, through

  9. What's so Simple about Simplified Texts? A Computational and Psycholinguistic Investigation of Text Comprehension and Text Processing

    Science.gov (United States)

    Crossley, Scott A.; Yang, Hae Sung; McNamara, Danielle S.

    2014-01-01

    This study uses a moving windows self-paced reading task to assess both text comprehension and processing time of authentic texts and these same texts simplified to beginning and intermediate levels. Forty-eight second language learners each read 9 texts (3 different authentic, beginning, and intermediate level texts). Repeated measures ANOVAs…

  10. MARRIAGE RITUAL TEXT OF BALINESE TRADITIONAL COMMUNITY: AN ANALYSIS OF FUNCTIONAL SYSTEMIC LINGUISTICS

    Directory of Open Access Journals (Sweden)

    Putu Sutama

    2012-11-01

    Full Text Available The Marriage Ritual Text of Balinese Traditional Community (Teks Ritual‘Pewiwahan’ Masyarakat Adat Bali, hereon abbreviated to TRPMAB in this dissertationis analyzed in the perspective of linguistic studies using the functional systemic linguistictheory. TRPMAB is a dialogic text containing a discussion, and is terminologicallytermed as a conversational text. It refers to the use of Balinese language (Bahasa Bali,hereon abbreviated to BB in a marriage ritual. There are two inseparable systems in it;they are BB system and social system, which are widely termed as cultural system.The method employed in this study is field method, meaning that the researcherwent directly to the field or to the location where TRPMAP took place. The researcherdirectly took part as both the active and passive participant. In this way, the researchcould observe TRPMAB directly.The population of the study includes all TPRB in Bali. Considering that thepopulation is too wide, then samples were taken to represent all the population. Thesamples total 10 which were obtained from the biggest marriage processions in Bali. Outof the 10 samples, 6 units of text were selected as the corpus of the study. The selectionwas made based on particular criteria including quality. It is this corpus which wasanalyzed to support and examine the hypothesis related to the text analyzed.The analysis of TRPMAB includes: the structure of the texts, the mood, thetransitivity, the theme-rheme and the logical relationship between the clause and theideology. The findings are as follows:(1 TRPMAB is a text which has a number of structural dimensions such as (acultural structure, (b macro structure, that is, the structure related to the situationalcontext made up of field, tenor and mode, (c micro structure, (d structure of meaning,that is, the structure related to the sequence of meanings between the participants withinthe dialogue, and (e the texture, that is, the intact successive relationship

  11. The principles of designing of algorithm for speech synthesis from texts written in Albanian language

    Directory of Open Access Journals (Sweden)

    Agni Dika

    2012-05-01

    Full Text Available The speech synthesis is artificial generation of human speech from written texts. For this purpose, adequate algorithms are designed, which then through relevant programs make it possible to synthesize texts to speech. The process of converting text into speech is also known as Text-To-Speech (TTS system [5]. In this paper are given basic principles to be used when designing a system to synthesize speech in Albanian language from written texts. Currently there are solutions that enable natural speech generation for various world languages. However, unfortunately these are not universal solutions to be used for other languages too, because the volume generated for other languages is incomprehensible and unnatural. For this reason, for every language one should seek solutions that address the specifics of it, always with the aim of generating voice to suit the nature of language. Generating systems that are currently used mainly rely on the use of the concatenation method [6], during which acoustic segments of text files are joined, which are previously digitized and stored as such in a database. For Albanian language, we consider that on the textual part of the database, as basic segments to be used are: the most frequent words, two-letters and letters [4]. However, in a particular part of the database are included various abbreviations, i.e. textual equivalents and their acoustics files, to be used also during the generation of appropriate speech. Whereas, with the aim of synthesizing the various numerical values written in the decimal system, in database were added values, respectively their corresponding sound files, whereby speech is generated for different numbers. The first part of the paper is a brief presentation of the Albanian language [1], respectively of the alphabet used in writing the language and its most frequent words.

  12. Mortality Risk in Pediatric Motor Vehicle Crash Occupants: Accounting for Developmental Stage and Challenging Abbreviated Injury Scale Metrics.

    Science.gov (United States)

    Doud, Andrea N; Weaver, Ashley A; Talton, Jennifer W; Barnard, Ryan T; Schoell, Samantha L; Petty, John K; Stitzel, Joel D

    2015-01-01

    Survival risk ratios (SRRs) and their probabilistic counterpart, mortality risk ratios (MRRs), have been shown to be at odds with Abbreviated Injury Scale (AIS) severity scores for particular injuries in adults. SRRs have been validated for pediatrics but have not been studied within the context of pediatric age stratifications. We hypothesized that children with similar motor vehicle crash (MVC) injuries may have different mortality risks (MR) based upon developmental stage and that these MRs may not correlate with AIS severity. The NASS-CDS 2000-2011 was used to define the top 95% most common AIS 2+ injuries among MVC occupants in 4 age groups: 0-4, 5-9, 10-14, and 15-18 years. Next, the National Trauma Databank 2002-2011 was used to calculate the MR (proportion of those dying with an injury to those sustaining the injury) and the co-injury-adjusted MR (MRMAIS) for each injury within 6 age groups: 0-4, 5-9, 10-14, 15-18, 0-18, and 19+ years. MR differences were evaluated between age groups aggregately, between age groups based upon anatomic injury patterns and between age groups on an individual injury level using nonparametric Wilcoxon tests and chi-square or Fisher's exact tests as appropriate. Correlation between AIS and MR within each age group was also evaluated. MR and MRMAIS distributions of the most common AIS 2+ injuries were right skewed. Aggregate MR of these most common injuries varied between the age groups, with 5- to 9-year-old and 10- to 14-year-old children having the lowest MRs and 0- to 4-year-old and 15- to 18-year-old children and adults having the highest MRs (all P injuries imparted the greatest mortality risk in all age groups with median MRMAIS ranging from 0 to 6% and 0 to 4.5%, respectively. Injuries to particular body regions also varied with respect to MR based upon age. For example, thoracic injuries in adults had significantly higher MRMAIS than such injuries among 5- to 9-year-olds and 10- to 14-year-olds (P =.04; P injuries

  13. SIAM 2007 Text Mining Competition dataset

    Data.gov (United States)

    National Aeronautics and Space Administration — Subject Area: Text Mining Description: This is the dataset used for the SIAM 2007 Text Mining competition. This competition focused on developing text mining...

  14. Text Categorization with Latent Dirichlet Allocation

    Directory of Open Access Journals (Sweden)

    ZLACKÝ Daniel

    2014-05-01

    Full Text Available This paper focuses on the text categorization of Slovak text corpora using latent Dirichlet allocation. Our goal is to build text subcorpora that contain similar text documents. We want to use these better organized text subcorpora to build more robust language models that can be used in the area of speech recognition systems. Our previous research in the area of text categorization showed that we can achieve better results with categorized text corpora. In this paper we used latent Dirichlet allocation for text categorization. We divided initial text corpus into 2, 5, 10, 20 or 100 subcorpora with various iterations and save steps. Language models were built on these subcorpora and adapted with linear interpolation to judicial domain. The experiment results showed that text categorization using latent Dirichlet allocation can improve the system for automatic speech recognition by creating the language models from organized text corpora.

  15. Text Anomalies Detection Using Histograms of Words

    Directory of Open Access Journals (Sweden)

    Abdulwahed Faraj Almarimi

    2016-01-01

    Full Text Available Authors of written texts mainly can be characterized by some collection of attributes obtained from texts. Texts of the same author are very similar from the style point of view. We can consider that attributes of a full text are very similar to attributes of parts in the same text. In the same thoughts can be compared different parts of the same text. In the paper, we describe an algorithm based on histograms of a mapped text to interval. In the mapping, it is kipped the word order as in the text. Histograms are analyzed from a cluster point of view. If a cluster dispersion is not large, the text is probably written by the same author. If the cluster dispersion is large, the text will be split in two or more parts and the same analysis will be done for the text parts.  The experiments were done on English and Arabic texts. For combined English texts our algorithm covers that texts were not written by one author. We have got the similar results for combined Arabic texts. Our algorithm can be used to basic text analysis if the text was written by one author.       

  16. TEXT CLASSIFICATION TOWARD A SCIENTIFIC FORUM

    Institute of Scientific and Technical Information of China (English)

    2007-01-01

    Text mining, also known as discovering knowledge from the text, which has emerged as a possible solution for the current information explosion, refers to the process of extracting non-trivial and useful patterns from unstructured text. Among the general tasks of text mining such as text clustering,summarization, etc, text classification is a subtask of intelligent information processing, which employs unsupervised learning to construct a classifier from training text by which to predict the class of unlabeled text. Because of its simplicity and objectivity in performance evaluation, text classification was usually used as a standard tool to determine the advantage or weakness of a text processing method, such as text representation, text feature selection, etc. In this paper, text classification is carried out to classify the Web documents collected from XSSC Website (http://www. xssc.ac.cn). The performance of support vector machine (SVM) and back propagation neural network (BPNN) is compared on this task. Specifically, binary text classification and multi-class text classification were conducted on the XSSC documents. Moreover, the classification results of both methods are combined to improve the accuracy of classification. An experiment is conducted to show that BPNN can compete with SVM in binary text classification; but for multi-class text classification, SVM performs much better. Furthermore, the classification is improved in both binary and multi-class with the combined method.

  17. Noticeable Focuses in Reading a Text

    Institute of Scientific and Technical Information of China (English)

    李明

    2007-01-01

    This paper discusses the relationship between commanding those basic information contained in a text and the final p urpose of comprehending in a text-reading process. By using the main topic and the central meaning that all texts have as two main examples, the author mainly illustrates what a reader should pay attention to in reading a text.

  18. What makes a written text written

    Institute of Scientific and Technical Information of China (English)

    赵亦倩

    2008-01-01

    Text can be used for both written and spoken language, and different features of spoken and written texts provide us the possibility to have a general idea of the division of two main categories--spoken English and written English. In this article, an attempt will be given to a sample text in order to discuss the general features of written texts.

  19. Theoretical simulation of CO2 capture by an \\text{A}{{\\text{l}}_{11}}\\text{Mg}_{3}^{-} cluster

    Science.gov (United States)

    Jiang, Yuanyuan; Xie, Xuefang; Hamid, Ilyar; Chen, Chu; Duan, Haiming

    2017-04-01

    In order to have an impact on carbon emissions, new stable materials for carbon capture should be able to adsorb CO2 from a mixture of other gases efficiently. Based on density functional theory calculations, we showed that the \\text{A}{{\\text{l}}11}\\text{Mg}3- cluster has an excellent capture capacity of CO2 and high CO2 selectivity under ambient conditions. \\text{A}{{\\text{l}}11}\\text{Mg}3- has an O2-resist property because this cluster is similar to \\text{Al}13- which contains 40 electrons with a larger energy gap. The \\text{A}{{\\text{l}}11}\\text{Mg}3- cluster prefers to adsorb CO2 compared with CH4, H2 and N2, and the CO2 molecule can be chemically adsorbed on the cluster by overcoming a lower barrier, which originates from the introduction of the Mg atom. When seven CO2 molecules are chemically adsorbed on the cluster, the capture capacity of CO2 can reach up to 18.99 mol kg-1 this means that the \\text{A}{{\\text{l}}11}\\text{Mg}3- cluster can be viewed as a potential candidate material for CO2 capture.

  20. Text To Speech System for Telugu Language

    OpenAIRE

    Siva kumar, M; E. Prakash Babu

    2014-01-01

    Telugu is one of the oldest languages in India. This paper describes the development of Telugu Text-to-Speech System (TTS).In Telugu TTS the input is Telugu text in Unicode. The voices are sampled from real recorded speech. The objective of a text to speech system is to convert an arbitrary text into its corresponding spoken waveform. Speech synthesis is a process of building machinery that can generate human-like speech from any text input to imitate human speakers. Text proc...

  1. A Proposed Arabic Handwritten Text Normalization Method

    Directory of Open Access Journals (Sweden)

    Tarik Abu-Ain

    2014-11-01

    Full Text Available Text normalization is an important technique in document image analysis and recognition. It consists of many preprocessing stages, which include slope correction, text padding, skew correction, and straight the writing line. In this side, text normalization has an important role in many procedures such as text segmentation, feature extraction and characters recognition. In the present article, a new method for text baseline detection, straightening, and slant correction for Arabic handwritten texts is proposed. The method comprises a set of sequential steps: first components segmentation is done followed by components text thinning; then, the direction features of the skeletons are extracted, and the candidate baseline regions are determined. After that, selection of the correct baseline region is done, and finally, the baselines of all components are aligned with the writing line.  The experiments are conducted on IFN/ENIT benchmark Arabic dataset. The results show that the proposed method has a promising and encouraging performance.

  2. Adaptive Text Entry for Mobile Devices

    DEFF Research Database (Denmark)

    Proschowsky, Morten Smidt

    for mobile devices and a framework for adaptive context-aware language models. Based on analysis of current text entry methods, the requirements to the new text entry methods are established. Transparent User guided Prediction (TUP) is a text entry method for devices with one dimensional touch input. It can......The reduced size of many mobile devices makes it difficult to enter text with them. The text entry methods are often slow or complicated to use. This affects the performance and user experience of all applications and services on the device. This work introduces new easy-to-use text entry methods...... to improve the models of human motor behaviour. TUP-Key is a variant of TUP, designed for 12 key phone keyboards. It is introduced in the thesis but has not been implemented or evaluated. Both text entry methods support adaptive context-aware language models. YourText is a framework for adaptive context...

  3. Comparative Discourse Analysis of Parallel Texts

    CERN Document Server

    Van der Eijk, P

    1994-01-01

    A quantitative representation of discourse structure can be computed by measuring lexical cohesion relations among adjacent blocks of text. These representations have been proposed to deal with sub-topic text segmentation. In a parallel corpus, similar representations can be derived for versions of a text in various languages. These can be used for parallel segmentation and as an alternative measure of text-translation similarity.

  4. Multimodal Diversity of Postmodernist Fiction Text

    Directory of Open Access Journals (Sweden)

    U. I. Tykha

    2016-12-01

    Full Text Available The article is devoted to the analysis of structural and functional manifestations of multimodal diversity in postmodernist fiction texts. Multimodality is defined as the coexistence of more than one semiotic mode within a certain context. Multimodal texts feature a diversity of semiotic modes in the communication and development of their narrative. Such experimental texts subvert conventional patterns by introducing various semiotic resources – verbal or non-verbal.

  5. Evaluation Methods of The Text Entities

    Science.gov (United States)

    Popa, Marius

    2006-01-01

    The paper highlights some evaluation methods to assess the quality characteristics of the text entities. The main concepts used in building and evaluation processes of the text entities are presented. Also, some aggregated metrics for orthogonality measurements are presented. The evaluation process for automatic evaluation of the text entities is…

  6. Interdisciplinary Approach to Understanding Literary Texts

    Science.gov (United States)

    Dossanova, Altynay Zh.; Ismakova, Bibissara S.; Tapanova, Saule E.; Ayupova, Gulbagira K.; Gotting, Valentina V.; Kaltayeva, Gulnar K.

    2016-01-01

    The primary purpose is the implementation of the interdisciplinary approach to understanding and the construction of integrative models of understanding literary texts. The interdisciplinary methodological paradigm of studying text understanding, based on the principles of various sciences facilitating the identification of the text understanding…

  7. Text-Picture Relations in Cooking Instructions

    NARCIS (Netherlands)

    van der Sluis, Ielka; Leito, Shadira; Redeker, Gisela; Bunt, Harry

    2016-01-01

    Like many other instructions, recipes on packages with ready-to-use ingredients for a dish combine a series of pictures with short text paragraphs. The information presentation in such multimodal instructions can be compact (either text or picture) and/or cohesive (text and picture). In an explorato

  8. Applying statistical methods to text steganography

    CERN Document Server

    Nechta, Ivan

    2011-01-01

    This paper presents a survey of text steganography methods used for hid- ing secret information inside some covertext. Widely known hiding techniques (such as translation based steganography, text generating and syntactic embed- ding) and detection are considered. It is shown that statistical analysis has an important role in text steganalysis.

  9. Mathematical Texts as Narrative: Rethinking Curriculum

    Science.gov (United States)

    Dietiker, Leslie

    2013-01-01

    This paper proposes a framework for reading mathematics texts as narratives. Building from a narrative framework of Meike Bal, a reader's experience with the mathematical content as it unfolds in the text (the "mathematical story") is distinguished from his or her logical reconstruction of the content beyond the text (the…

  10. Object reading: text recognition for object recognition

    NARCIS (Netherlands)

    Karaoglu, S.; van Gemert, J.C.; Gevers, T.

    2012-01-01

    We propose to use text recognition to aid in visual object class recognition. To this end we first propose a new algorithm for text detection in natural images. The proposed text detection is based on saliency cues and a context fusion step. The algorithm does not need any parameter tuning and can d

  11. Refutation Texts for Effective Climate Change Education

    Science.gov (United States)

    Nussbaum, E. Michael; Cordova, Jacqueline R.; Rehmat, Abeera P.

    2017-01-01

    Refutation texts, which are texts that rebut scientific misconceptions and explain the normative concept, can be effective devices for addressing misconceptions and affecting conceptual change. However, few, if any, refutation texts specifically related to climate change have been validated for effectiveness. In this project, we developed and…

  12. About Reformulation in Full-Text IRS.

    Science.gov (United States)

    Debili, Fathi; And Others

    1989-01-01

    Analyzes different kinds of reformulations used in information retrieval systems where full text databases are accessed through natural language queries. Tests of these reformulations on large full text databases managed by the Syntactic and Probabilistic Indexing and Retrieval of Information in Texts (SPIRIT) system are described, and an expert…

  13. The Ecological Approach to Text Visualization.

    Science.gov (United States)

    Wise, James A.

    1999-01-01

    Presents both theoretical and technical bases on which to build a "science of text visualization." The Spatial Paradigm for Information Retrieval and Exploration (SPIRE) text-visualization system, which images information from free-text documents as natural terrains, serves as an example of the "ecological approach" in its visual metaphor, its…

  14. Teacher Modeling Using Complex Informational Texts

    Science.gov (United States)

    Fisher, Douglas; Frey, Nancy

    2015-01-01

    Modeling in complex texts requires that teachers analyze the text for factors of qualitative complexity and then design lessons that introduce students to that complexity. In addition, teachers can model the disciplinary nature of content area texts as well as word solving and comprehension strategies. Included is a planning guide for think aloud.

  15. Creating and Using Culturally Sustaining Informational Texts

    Science.gov (United States)

    Watanabe Kganetso, Lynne M.

    2017-01-01

    Current standards and assessments emphasize the importance of a variety of genres in students' literacy diets, which has placed increased attention on informational texts. Unfortunately, young students' current exposure to and experiences with informational texts are often limited by the texts' availability, quality, and relevance to children's…

  16. Text Complexity: Primary Teachers' Views

    Science.gov (United States)

    Fitzgerald, Jill; Hiebert, Elfrieda H.; Bowen, Kimberly; Relyea-Kim, E. Jackie; Kung, Melody; Elmore, Jeff

    2015-01-01

    The research question was, "What text characteristics do primary teachers think are most important for early grades text complexity?" Teachers from across the United States accomplished a two-part task. First, to stimulate teachers' thinking about important text characteristics, primary teachers completed an online paired-text…

  17. Role of Terms in Popular Science Text

    Directory of Open Access Journals (Sweden)

    Zhabbarova F. U.

    2013-01-01

    Full Text Available The article examines and determines the specifics of terminological vocabulary used in a popular science text. It differentiates the notions of cohesion and coherence. The article reveals the main terminological means realizing cohesion in the text of a popular science article.

  18. A Non-Inferiority Trial of an Evidence-Based Secondary HIV Prevention Behavioral Intervention Compared to an Adapted, Abbreviated Version: Rationale and Intervention Description

    Science.gov (United States)

    Shrestha, Roman; Krishnan, Archana; Altice, Frederick L.; Copenhaver, Michael

    2015-01-01

    Background Real-world clinical settings like addiction treatment programs are ill-equipped to deploy and sustain the existing-resource-demanding evidence-based interventions (EBIs) that target HIV-infected people who use drugs (PWUDs), and this has left a critical void in current HIV prevention efforts. In response to this unmet need, we have conducted formative research in addiction treatment settings that has resulted in Holistic Health for HIV (3H+) – an empirically adapted, substantially abbreviated version of Holistic Health Recovery Program (HHRP+), a CDC-recommended EBI targeting HIV-infected PWUDs. Methods Using a non-inferiority randomized controlled trial design, we will determine whether the abbreviated 3H+ intervention is comparable (i.e., within a 10% margin) and cost-effective relative to the original HHRP+ intervention in terms of reducing HIV risk behaviors and improving antiretroviral therapy (ART) adherence among HIV-infected PWUDs in addiction treatment who report drug- or sex-related HIV risk behaviors. Conclusions This article provides a description of the development and adaptation of the 3H+ intervention, the innovative non-inferiority comparative experimental design for testing the 3H+ to the HHRP+. Furthermore, it provides empirical evidence from a formal cost-effectiveness analysis justifying the cost-effectiveness of the 3H+ intervention when compared to the HHRP+ intervention. If confirmed to be comparable and more cost-effective, as hypothesized, the 3H+ intervention has the potential to be readily and immediately integrated within common clinical settings where large numbers of HIV-infected PWUDs receive clinical services. PMID:26253181

  19. Acceptance and commitment therapy for chronic pain: evidence of mediation and clinically significant change following an abbreviated interdisciplinary program of rehabilitation.

    Science.gov (United States)

    Vowles, Kevin E; Witkiewitz, Katie; Sowden, Gail; Ashworth, Julie

    2014-01-01

    There is an emerging body of evidence regarding interdisciplinary acceptance and commitment therapy in the rehabilitative treatment of chronic pain. This study evaluated the reliability and clinical significance of change following an open trial that was briefer than that examined in previous work. In addition, the possible mediating effect of psychological flexibility, which is theorized to underlie the acceptance and commitment therapy model, was examined. Participants included 117 completers of an interdisciplinary program of rehabilitation for chronic pain. Assessment took place at treatment onset and conclusion, and at a 3-month follow-up when 78 patients (66.7%) provided data. At the 3-month follow-up, 46.2% of patients achieved clinically significant change, and 58.9% achieved reliable change, in at least 1 key measure of functioning (depression, pain anxiety, and disability). Changes in measures of psychological flexibility significantly mediated changes in disability, depression, pain-related anxiety, number of medical visits, and the number of classes of prescribed analgesics. These results add to the growing body of evidence supporting interdisciplinary acceptance and commitment therapy for chronic pain, particularly with regard to the clinical significance of an abbreviated course of treatment. Further, improvements appear to be mediated by changes in the processes specified within the theoretical model. Outcomes of an abbreviated interdisciplinary treatment for chronic pain based on a particular theoretical model are presented. Analyses indicated that improvements at follow-up mediated change in the theorized treatment process. Clinically significant change was indicated in just under half of participants. These data may be helpful to clinicians and researchers interested in intervention approaches and mechanisms of change. Copyright © 2014 American Pain Society. All rights reserved.

  20. Detecting malingering in traumatic brain injury and chronic pain with an abbreviated version of the Meyers Index for the MMPI-2.

    Science.gov (United States)

    Aguerrevere, Luis E; Greve, Kevin W; Bianchini, Kevin J; Meyers, John E

    2008-01-01

    Meyers, Millis, and Volkert [Meyers, J. E., Millis, S. R., & Volkert, K. (2002). A validity index for the MMPI-2. Archives of Clinical Neuropsychology, 17, 157-169] developed a method to detect malingering in chronic pain patients using seven scales from the Minnesota Multiphasic Inventory-2 (MMPI-2). This method may be impractical because two of the scales (Obvious minus Subtle and Dissimulation-revised) are not reported by the commercially available Pearson computerized scoring system. The current study recalculated the Meyers Index using the five Pearson-provided scales in the chronic pain data sets of Meyers et al. [Meyers, J. E., Millis, S. R., & Volkert, K. (2002). A validity index for the MMPI-2. Archives of Clinical Neuropsychology, 17, 157-169] and Bianchini, Etherton, Greve, Heinly, and Meyers [Bianchini, K. J., Etherton, J. L., Greve, K. W., Heinly, M. T., & Meyers, J. E. (in press). Classification accuracy of MMPI-2 validity scales in the detection of pain-related malingering: A known-groups approach. Assessment], and the traumatic brain injury data of Greve, Bianchini, Love, Brennan, and Heinly [Greve, K. W., Bianchini, K. J., Love, J. M., Brennan, A., & Heinly, M. T. (2006). Sensitivity and specificity of MMPI-2 validity scales and indicators to malingered neurocognitive dysfunction in traumatic brain injury. The Clinical Neuropsychologist, 20, 491-512]. Classification accuracy of the abbreviated Meyers Index was comparable to the original. These findings demonstrate that the abbreviated Meyers Index can be used as a substitute of the original Meyers Index without decrements in classification accuracy.

  1. Figure text extraction in biomedical literature.

    Directory of Open Access Journals (Sweden)

    Daehyun Kim

    Full Text Available BACKGROUND: Figures are ubiquitous in biomedical full-text articles, and they represent important biomedical knowledge. However, the sheer volume of biomedical publications has made it necessary to develop computational approaches for accessing figures. Therefore, we are developing the Biomedical Figure Search engine (http://figuresearch.askHERMES.org to allow bioscientists to access figures efficiently. Since text frequently appears in figures, automatically extracting such text may assist the task of mining information from figures. Little research, however, has been conducted exploring text extraction from biomedical figures. METHODOLOGY: We first evaluated an off-the-shelf Optical Character Recognition (OCR tool on its ability to extract text from figures appearing in biomedical full-text articles. We then developed a Figure Text Extraction Tool (FigTExT to improve the performance of the OCR tool for figure text extraction through the use of three innovative components: image preprocessing, character recognition, and text correction. We first developed image preprocessing to enhance image quality and to improve text localization. Then we adapted the off-the-shelf OCR tool on the improved text localization for character recognition. Finally, we developed and evaluated a novel text correction framework by taking advantage of figure-specific lexicons. RESULTS/CONCLUSIONS: The evaluation on 382 figures (9,643 figure texts in total randomly selected from PubMed Central full-text articles shows that FigTExT performed with 84% precision, 98% recall, and 90% F1-score for text localization and with 62.5% precision, 51.0% recall and 56.2% F1-score for figure text extraction. When limiting figure texts to those judged by domain experts to be important content, FigTExT performed with 87.3% precision, 68.8% recall, and 77% F1-score. FigTExT significantly improved the performance of the off-the-shelf OCR tool we used, which on its own performed with 36

  2. The abbreviated impactor measurement (AIM) concept: part 1--Influence of particle bounce and re-entrainment-evaluation with a "dry" pressurized metered dose inhaler (pMDI)-based formulation.

    Science.gov (United States)

    Mitchell, J P; Nagel, M W; Avvakoumova, V; MacKay, H; Ali, R

    2009-01-01

    The abbreviated impactor measurement concept is a potential improvement to the labor-intensive full-resolution cascade impactor methodology for inhaler aerosol aerodynamic particle size distribution (APSD) measurement by virtue of being simpler and therefore quicker to execute. At the same time, improved measurement precision should be possible by eliminating stages upon which little or no drug mass is collected. Although several designs of abbreviated impactor systems have been developed in recent years, experimental work is lacking to validate the technique with aerosols produced by currently available inhalers. In part 1 of this two-part article that focuses on aerosols produced by pressurized metered dose inhalers (pMDIs), the evaluation of two abbreviated impactor systems (Copley fast screening Andersen impactor and Trudell fast screening Andersen impactor), based on the full-resolution eight-stage Andersen nonviable cascade impactor (ACI) operating principle, is reported with a formulation producing dry particles. The purpose was to investigate the potential for non-ideal collection behavior associated with particle bounce in relation to internal losses to surfaces from which particles containing active pharmaceutical ingredient are not normally recovered. Both abbreviated impactors were found to be substantially equivalent to the full-resolution ACI in terms of extra-fine and fine particle and coarse mass fractions used as metrics to characterize the APSD of these pMDI-produced aerosols when sampled at 28.3 L/min, provided that precautions are taken to coat collection plates to minimize bounce and entrainment.

  3. An Embedded Application for Degraded Text Recognition

    Directory of Open Access Journals (Sweden)

    Thillou Céline

    2005-01-01

    Full Text Available This paper describes a mobile device which tries to give the blind or visually impaired access to text information. Three key technologies are required for this system: text detection, optical character recognition, and speech synthesis. Blind users and the mobile environment imply two strong constraints. First, pictures will be taken without control on camera settings and a priori information on text (font or size and background. The second issue is to link several techniques together with an optimal compromise between computational constraints and recognition efficiency. We will present the overall description of the system from text detection to OCR error correction.

  4. Text line Segmentation of Curved Document Images

    Directory of Open Access Journals (Sweden)

    Anusree.M

    2014-05-01

    Full Text Available Document image analysis has been widely used in historical and heritage studies, education and digital library. Document image analytical techniques are mainly used for improving the human readability and the OCR quality of the document. During the digitization, camera captured images contain warped document due perspective and geometric distortions. The main difficulty is text line detection in the document. Many algorithms had been proposed to address the problem of printed document text line detection, but they failed to extract text lines in curved document. This paper describes a segmentation technique that detects the curled text line in camera captured document images.

  5. Unified layout analysis and text localization framework

    Science.gov (United States)

    Vasilopoulos, Nikos; Kavallieratou, Ergina

    2017-01-01

    A technique appropriate for extracting textual information from documents with complex layouts, such as newspapers and journals, is presented. It is a combination of a foreground analysis and a text localization method. The first one is used to segment the page in text and nontext blocks, whereas the second one is used to detect text that may be embedded inside images, charts, diagrams, tables, etc. Detailed experiments on two public databases showed that mixing layout analysis and text localization techniques can lead to improved page segmentation and text extraction results.

  6. Text-speak processing impairs tactile location.

    Science.gov (United States)

    Head, James; Helton, William; Russell, Paul; Neumann, Ewald

    2012-09-01

    Dual task experiments have highlighted that driving while having a conversation on a cell phone can have negative impacts on driving (Strayer & Drews, 2007). It has also been noted that this negative impact is greater when reading a text-message (Lee, 2007). Commonly used in text-messaging are shortening devices collectively known as text-speak (e.g.,Ys I wll ttyl 2nite, Yes I will talk to you later tonight). To the authors' knowledge, there has been no investigation into the potential negative impacts of reading text-speak on concurrent performance on other tasks. Forty participants read a correctly spelled story and a story presented in text-speak while concurrently monitoring for a vibration around their waist. Slower reaction times and fewer correct vibration detections occurred while reading text-speak than while reading a correctly spelled story. The results suggest that reading text-speak imposes greater cognitive load than reading correctly spelled text. These findings suggest that the negative impact of text messaging on driving may be compounded by the messages being in text-speak, instead of orthographically correct text.

  7. The translated text as re-textualisation

    Directory of Open Access Journals (Sweden)

    Walter Carlos Costa

    2003-01-01

    Full Text Available All texts seem to be, in one way or another, dependent upon other texts, but a translated text is dependent upon one particular text in a very peculiar way. When writing a normal text the writer is in principle free to organise a set of words, clauses and paragraphs, according to his or her intentions and abilities. Yet we all know that this liberty is more apparent than real, since our memory of previous texts, as well as the cultural norms we have internalised, restrict, as a rule, many of our textual movements. The translator, however, works under different conditions. The text he or she writes will be based on a message that already exists in a textual form in another language. The original text constrains the new text in a number of ways. The most inmediate one is that in order to be recognised as a translation, the translator’s text must have a great degree of similarity with its original counterpart. In translation studies this similarity is currently labelled equivalence.

  8. Text segmentation in degraded historical document images

    Directory of Open Access Journals (Sweden)

    A.S. Kavitha

    2016-07-01

    Full Text Available Text segmentation from degraded Historical Indus script images helps Optical Character Recognizer (OCR to achieve good recognition rates for Hindus scripts; however, it is challenging due to complex background in such images. In this paper, we present a new method for segmenting text and non-text in Indus documents based on the fact that text components are less cursive compared to non-text ones. To achieve this, we propose a new combination of Sobel and Laplacian for enhancing degraded low contrast pixels. Then the proposed method generates skeletons for text components in enhanced images to reduce computational burdens, which in turn helps in studying component structures efficiently. We propose to study the cursiveness of components based on branch information to remove false text components. The proposed method introduces the nearest neighbor criterion for grouping components in the same line, which results in clusters. Furthermore, the proposed method classifies these clusters into text and non-text cluster based on characteristics of text components. We evaluate the proposed method on a large dataset containing varieties of images. The results are compared with the existing methods to show that the proposed method is effective in terms of recall and precision.

  9. The nuclear modification of charged particles in Pb-Pb at $\\sqrt{\\text{s}_\\text{NN}} = \\text{5.02}\\,\\text{TeV}$ measured with ALICE

    CERN Document Server

    INSPIRE-00537113

    2016-01-01

    The study of inclusive charged-particle production in heavy-ion collisions provides insights into the density of the medium and the energy-loss mechanisms. The observed suppression of high-$\\textit{p}_\\text{T}$ yield is generally attributed to energy loss of partons as they propagate through a deconfined state of quarks and gluons - Quark-Gluon Plasma (QGP) - predicted by QCD. Such measurements allow the characterization of the QGP by comparison with models. In these proceedings, results on high-$\\textit{p}_\\text{T}$ particle production measured by ALICE in Pb-Pb collisions at $ \\sqrt{\\text{s}_\\text{NN}}\\, = 5.02\\ \\rm{TeV}$ as well as well in pp at $\\sqrt{\\text{s}}\\,=5.02\\ \\rm{TeV}$ are presented for the first time. The nuclear modification factors ($\\text{R}_\\text{AA}$) in Pb-Pb collisions are presented and compared with model calculations.

  10. CRIE: An automated analyzer for Chinese texts.

    Science.gov (United States)

    Sung, Yao-Ting; Chang, Tao-Hsing; Lin, Wei-Chun; Hsieh, Kuan-Sheng; Chang, Kuo-En

    2016-12-01

    Textual analysis has been applied to various fields, such as discourse analysis, corpus studies, text leveling, and automated essay evaluation. Several tools have been developed for analyzing texts written in alphabetic languages such as English and Spanish. However, currently there is no tool available for analyzing Chinese-language texts. This article introduces a tool for the automated analysis of simplified and traditional Chinese texts, called the Chinese Readability Index Explorer (CRIE). Composed of four subsystems and incorporating 82 multilevel linguistic features, CRIE is able to conduct the major tasks of segmentation, syntactic parsing, and feature extraction. Furthermore, the integration of linguistic features with machine learning models enables CRIE to provide leveling and diagnostic information for texts in language arts, texts for learning Chinese as a foreign language, and texts with domain knowledge. The usage and validation of the functions provided by CRIE are also introduced.

  11. The nuclear modification of charged particles in Pb-Pb at $\\sqrt{\\text{s}_\\text{NN}} = \\text{5.02}\\,\\text{TeV}$ measured with ALICE

    CERN Document Server

    Gronefeld, Julius

    2016-09-21

    The study of inclusive charged-particle production in heavy-ion collisions provides insights into the density of the medium and the energy-loss mechanisms. The observed suppression of high-$\\textit{p}_\\text{T}$ yield is generally attributed to energy loss of partons as they propagate through a deconfined state of quarks and gluons - Quark-Gluon Plasma (QGP) - predicted by QCD. Such measurements allow the characterization of the QGP by comparison with models. In these proceedings, results on high-$\\textit{p}_\\text{T}$ particle production measured by ALICE in Pb-Pb collisions at $ \\sqrt{\\text{s}_\\text{NN}}\\, = 5.02\\ \\rm{TeV}$ as well as well in pp at $\\sqrt{\\text{s}}\\,=5.02\\ \\rm{TeV}$ are presented for the first time. The nuclear modification factors ($\\text{R}_\\text{AA}$) in Pb-Pb collisions are presented and compared with model calculations.

  12. Text mining with R a tidy approach

    CERN Document Server

    Silge, Julia

    2017-01-01

    Much of the data available today is unstructured and text-heavy, making it challenging for analysts to apply their usual data wrangling and visualization tools. With this practical book, you'll explore text-mining techniques with tidytext, a package that authors Julia Silge and David Robinson developed using the tidy principles behind R packages like ggraph and dplyr. You'll learn how tidytext and other tidy tools in R can make text analysis easier and more effective. The authors demonstrate how treating text as data frames enables you to manipulate, summarize, and visualize characteristics of text. You'll also learn how to integrate natural language processing (NLP) into effective workflows. Practical code examples and data explorations will help you generate real insights from literature, news, and social media. Learn how to apply the tidy text format to NLP Use sentiment analysis to mine the emotional content of text Identify a document's most important terms with frequency measurements E...

  13. An Exploration of the Views of Teachers Concerning the Effects of Texting on Children’s Literacy Development

    Directory of Open Access Journals (Sweden)

    David Wray

    2015-06-01

    Full Text Available Texting, or text messaging, refers to the use of mobile phones to type and send brief, electronic messages over a telephone network. Because such messages are limited to 160 characters and are typed on a small phone keypad, texters tend to employ a great many abbreviations in conveying their messages. This has led to widespread spelling adaptations, for example, “BRB” (be right back, “LOL” (laughing out loud, and “CUL8ER” (see you later. The research in this paper aimed to examine the views and opinions held by teachers about the impact of texting on children’s literacy development. Twenty-seven primary teachers were interviewed in depth and a number of key themes emerged. These teachers did express some negative view about the impact of texting, and of technology use generally, upon their students’ literacy, although many also mentioned some positive effects. A majority did feel concerned about the effects of textisms, but these feelings were tempered by a range of other factors. None of them blamed the use of textisms exclusively for declining levels of student literacy, suggesting also that the impact of student “street slang” was a significant influence as was the fact that many of their students spoke English as an additional language. These outcomes suggest that the media portrayal of this issue has been over-simplistic at best.

  14. Text To Speech System for Telugu Language

    Directory of Open Access Journals (Sweden)

    M. Siva Kumar

    2014-03-01

    Full Text Available Telugu is one of the oldest languages in India. This paper describes the development of Telugu Text-to-Speech System (TTS.In Telugu TTS the input is Telugu text in Unicode. The voices are sampled from real recorded speech. The objective of a text to speech system is to convert an arbitrary text into its corresponding spoken waveform. Speech synthesis is a process of building machinery that can generate human-like speech from any text input to imitate human speakers. Text processing and speech generation are two main components of a text to speech system. To build a natural sounding speech synthesis system, it is essential that text processing component produce an appropriate sequence of phonemic units. Generation of sequence of phonetic units for a given standard word is referred to as letter to phoneme rule or text to phoneme rule. The complexity of these rules and their derivation depends upon the nature of the language. The quality of a speech synthesizer is judged by its closeness to the natural human voice and understandability. In this paper we described an approach to build a Telugu TTS system using concatenative synthesis method with syllable as a basic unit of concatenation.

  15. NEW TECHNIQUES USED IN AUTOMATED TEXT ANALYSIS

    Directory of Open Access Journals (Sweden)

    M. I strate

    2010-12-01

    Full Text Available Automated analysis of natural language texts is one of the most important knowledge discovery tasks for any organization. According to Gartner Group, almost 90% of knowledge available at an organization today is dispersed throughout piles of documents buried within unstructured text. Analyzing huge volumes of textual information is often involved in making informed and correct business decisions. Traditional analysis methods based on statistics fail to help processing unstructured texts and the society is in search of new technologies for text analysis. There exist a variety of approaches to the analysis of natural language texts, but most of them do not provide results that could be successfully applied in practice. This article concentrates on recent ideas and practical implementations in this area.

  16. Biomechanical patterns of text-message distraction.

    Science.gov (United States)

    Le, Peter; Hwang, Jaejin; Grawe, Sarah; Li, Jing; Snyder, Alison; Lee, Christina; Marras, William S

    2015-01-01

    The objective of this study was to identify biomechanical measures that can distinguish texting distraction in a laboratory-simulated driving environment. The goal would be to use this information to provide an intervention for risky driving behaviour. Sixteen subjects participated in this study. Three independent variables were tested: task (texting, visual targeting, weighted and non-weighted movements), task direction (front and side) and task distance (close and far). Dependent variables consisted of biomechanical moments, head displacement and the length of time to complete each task. Results revealed that the time to complete each task was higher for texting compared to other tasks. Peak moments during texting were only distinguishable from visual targeting. Peak head displacement and cumulative biomechanical exposure measures indicated that texting can be distinguished from other tasks. Therefore, it may be useful to take into account both temporal and biomechanical measures when considering warning systems to detect texting distraction.

  17. The Research of Chinese Text Proofreading Algorithm

    Institute of Scientific and Technical Information of China (English)

    2000-01-01

    Generally, text proofreading consists of two procedures, finding the wrongly used words and then presenting the correct forms. At present, most of the Chinese text proofreading focuses on finding the wrongly used words, but pays less attention to correcting these errors. In this paper, the Chinese text features are interpreted first and then a Chinese text proofreading method and its algorithm are introduced. In this algorithm, text features, including text statistical feature and language structure feature, are properly used. Here, correcting errors goes on at the same time with finding errors. Experimental results show that this method has a performance of detecting 75% of wrongly used Chinese words and correcting about 60% of them with the first candidates.

  18. Algorithm for Generating Train Calendar Texts

    Directory of Open Access Journals (Sweden)

    Karel Greiner

    2013-04-01

    Full Text Available The article describes a possibility of generating train calendar text for the needs of compiling the annual timetable in the conditions of the Czech Republic. Based on the analysis of the types of texts of calendars that appear in various print outputs, a heuristic algorithm was designed to generate a text from a set of calendar days. The algorithm is a part of an application that also provides a tool to define the text of the calendar by using a mask of sub-periods and calendars to be displayed in them. The algorithm was tested on real data of the timetable. In most cases, the algorithm shows the same or better results than the previously used tools. In several cases, however, a better result can be obtained by the user. The described algorithm to generate the text of the calendar is a part of a program that is used for compiling the timetable for trains in the Czech Republic.

  19. Colored-sketch of Text Information

    Directory of Open Access Journals (Sweden)

    Beomjin Kim

    2002-01-01

    Full Text Available This paper presents an information visualization method, which transforms text into abstracted visual representations. The proposed color-coding algorithm converts text into a sequence of colored icons that inform users about the distributional patterns of given queries, as well as the structural overview of a document simultaneously. By presenting the compact, but instructive visual abstraction of texts concurrently, users can compare multiple documents intuitively while alleviating the need to reference the underlying text. The system provides interactive navigation tools to support users' decision-making processes - including multi-level viewing, a tree hierarchy recording previous search activities, and suggestive words for refinement of the search scope. An experimental study evaluating this visual approach for delivering search results has been conducted on text corpora in comparison with a traditional information retrieval system. By informing search results to clientele in a perceptive form, the users' performance in obtaining desired information has been improved, while maintaining the accuracy.

  20. Legal English and Adapted Legal Texts

    Directory of Open Access Journals (Sweden)

    Alvyda Liuolienė

    2012-06-01

    Full Text Available The article aims at analysing the significance of authentic legal English text and adapted legal texts in ESP classes. The authors point out the advantages and disadvantages of legal texts and analyse the possibilities of their efficient application in the teaching process. At the initial stage of teaching English legalese, materials prepared specially for teaching purposes in textbooks seem to be more appropriate as they are adapted for a particular level for law students whereas in more advanced levels, authentic texts in a legal English classroom can more considerably contribute to the learning experience. The usage of both legal authentic materials and adapted legal texts have tangible impact on mastering legal English.

  1. Mining free-text medical records.

    OpenAIRE

    Heinze, D. T.; Morsch, M. L.; Holbrook, J.

    2001-01-01

    Text mining projects can be characterized along four parameters: 1) the demands of the market in terms of target domain and specificity and depth of queries; 2) the volume and quality of text in the target domain; 3) the text mining process requirements; and 4) the quality assurance process that validates the extracted data. In this paper, we provide lessons learned and results from a large-scale commercial project using Natural Language Processing (NLP) for mining the transcriptions of dicta...

  2. Beyond Text Theory: Understanding Literary Response

    OpenAIRE

    Miall, David S.; Kuiken, Don

    1994-01-01

    Approaches to text comprehension that focus on propositional, inferential, and elaborative processes have often been considered capable of extension in principle to literary texts, such as stories or poems. However, we argue that literary response is influenced by stylistic features that result in defamiliarization; that defamiliarization invokes feeling which calls on personal perspectives and meanings; and that these aspects of literary response are not addressed by current text theories. T...

  3. Translation Strategies of Non-literary Texts

    Institute of Scientific and Technical Information of China (English)

    杨静

    2015-01-01

    Translator's subjectivity is closely related to the choice of the style of the translated texts and translation strategies.This paper presents an analytical study of translation strategies of non-literary texts.It introduces different non-literary texts,and then generalizes some factors influencing the selection of translation strategies.Take these Influencing factors into account,Translators should adopt different translation strategies

  4. Colored-sketch of Text Information

    OpenAIRE

    Beomjin Kim; Philip Johnson; Adam S. Huarng

    2002-01-01

    This paper presents an information visualization method, which transforms text into abstracted visual representations. The proposed color-coding algorithm converts text into a sequence of colored icons that inform users about the distributional patterns of given queries, as well as the structural overview of a document simultaneously. By presenting the compact, but instructive visual abstraction of texts concurrently, users can compare multiple documents intuitively while alleviating the need...

  5. Multilingual Text Detection with Nonlinear Neural Network

    Directory of Open Access Journals (Sweden)

    Lin Li

    2015-01-01

    Full Text Available Multilingual text detection in natural scenes is still a challenging task in computer vision. In this paper, we apply an unsupervised learning algorithm to learn language-independent stroke feature and combine unsupervised stroke feature learning and automatically multilayer feature extraction to improve the representational power of text feature. We also develop a novel nonlinear network based on traditional Convolutional Neural Network that is able to detect multilingual text regions in the images. The proposed method is evaluated on standard benchmarks and multilingual dataset and demonstrates improvement over the previous work.

  6. Financial Statement Fraud Detection using Text Mining

    Directory of Open Access Journals (Sweden)

    Rajan Gupta

    2013-01-01

    Full Text Available Data mining techniques have been used enormously by the researchers’ community in detecting financial statement fraud. Most of the research in this direction has used the numbers (quantitative information i.e. financial ratios present in the financial statements for detecting fraud. There is very little or no research on the analysis of text such as auditor’s comments or notes present in published reports. In this study we propose a text mining approach for detecting financial statement fraud by analyzing the hidden clues in the qualitative information (text present in financial statements.

  7. An approach for NL text interpretation

    Directory of Open Access Journals (Sweden)

    Anatol Popescu

    2007-11-01

    Full Text Available For modeling the interpretation process of NL sentences we use the mechanisms implying semantic networks that assure syntactic - semantic text interpretation (SSI, including an understanding axiomatic model, interpretation model and denotation model to represent the result of SSI. These models estimate the correctness and the consistency of texts too. Also it implements an information extraction from texts in NL. Our approach based, mainly, upon semantic networks grammars has an extraordinary interpretation potential implying a system of completely new concepts and processing methods.

  8. Handwritten text line segmentation by spectral clustering

    Science.gov (United States)

    Han, Xuecheng; Yao, Hui; Zhong, Guoqiang

    2017-02-01

    Since handwritten text lines are generally skewed and not obviously separated, text line segmentation of handwritten document images is still a challenging problem. In this paper, we propose a novel text line segmentation algorithm based on the spectral clustering. Given a handwritten document image, we convert it to a binary image first, and then compute the adjacent matrix of the pixel points. We apply spectral clustering on this similarity metric and use the orthogonal kmeans clustering algorithm to group the text lines. Experiments on Chinese handwritten documents database (HIT-MW) demonstrate the effectiveness of the proposed method.

  9. Reading of Foreign Language Technical Texts

    Directory of Open Access Journals (Sweden)

    Metka Brkan

    1997-01-01

    Full Text Available An efficient foreign language reader is one who has approached the reading flexibility of a native speaker as he reads different texts presented in his environment: newspaper articles, magazins, personal letters, business correspondence, official documents, academic textbooks and scientific and technical texts. Flexibility in reading means increased speed as well as enhanced comprehension: an efficient re­ ader should read fast with needed comprehension. A poor reader is one who reacts everything slowly without getting much meaning from reading. The article focuses on techniques for developing foreign language reading skills of university students to cape with the reading of English technical texts.

  10. Absolute frequency measurement of the {{}^{1}}{{\\text{S}}_{0}} – {{}^{3}}{{\\text{P}}_{0}} transition of 171Yb

    Science.gov (United States)

    Pizzocaro, Marco; Thoumany, Pierre; Rauf, Benjamin; Bregolin, Filippo; Milani, Gianmaria; Clivati, Cecilia; Costanzo, Giovanni A.; Levi, Filippo; Calonico, Davide

    2017-02-01

    We report the absolute frequency measurement of the unperturbed transition {{}1}{{\\text{S}}0} – {{}3}{{\\text{P}}0} at 578 nm in 171Yb realized in an optical lattice frequency standard relative to a cryogenic caesium fountain. The measurement result is 518 295 836 590 863.59(31) Hz with a relative standard uncertainty of 5.9× {{10}-16} . This value is in agreement with the ytterbium frequency recommended as a secondary representation of the second in the International System of Units.

  11. [Text comprehension, cognitive resources and aging].

    Science.gov (United States)

    Chesneau, Sophie; Jbabdi, Saad; Champagne-Lavau, Maud; Giroux, Francine; Ska, Bernadette

    2007-03-01

    Aging brings cognitive changes. Language is not immune to these changes. The use of compensation strategies may permit older adults to achieve a performance level identical to the one obtained by younger adults. This research aims to study text comprehension in aging and the reading strategies used for by older and younger adults. Kintsch's cognitive model (1988) allows the identification of different levels of representation within text treatment (linguistic form, macrostructure, microstructure and situation model) and predicts the underlying cognitive components. Eye-tracking analyses during reading permit inference about the moments of reading treatment and detection of reading strategies. Sixty highly educated participants were assessed. They were divided in two age groups (20-40 and 60-80 years old). Participants were asked to read and understand three texts constructed to highlight the features of text comprehension within each one of the different levels of text representation. The amount of detail and the necessity of updating the situation model varied for each text. Eye movements were registered by an eye-tracker (Cambridge research) during the reading process. Specific complementary tasks were administered to evaluate working memory, long-term memory, and executive functions. Variances analyses showed significantly lower performance by older adults regarding: 1) recall of the microstructure of the two texts with a high degree of detail, 2) macrostructure of the text with fewer details, and 3) performance on all tasks that evaluated cognitive components. Aging influenced treatment of levels of text representation depending on text characteristics. However, cluster analysis of the text comprehension and eye-tracker data revealed a group of older adults whose performance in reading comprehension was identical to the performance of younger adults, with the same reading profile. This result seems to show that use of compensation strategies by older adults at

  12. Searches for ttH and tH with $\\text{H}\\rightarrow\\text{b}\\bar{\\text{b}}$

    CERN Document Server

    Schroeder, Matthias

    2016-01-01

    The associated production of a Higgs boson with a top quark-antiquark pair (ttH production) or with a single top quark (tH production) allows a direct measurement of the top-Higgs-Yukawa coupling with minimal model dependence. In this article, recent results of searches for ttH and tH production in the $\\text{H}\\rightarrow\\text{b}\\bar{\\text{b}}$ channel performed by the ATLAS and CMS experiments are reviewed. The analyses use pp collision data collected at a centre-of-mass energy of $13\\,$TeV corresponding to an integrated luminosity of up to 13.2$\\,\\text{fb}^{-1}$.

  13. Learning with Text in the Primary Grades.

    Science.gov (United States)

    Guillaume, Andrea M.

    1998-01-01

    Provides a rationale for learning-with-text experiences for primary-grade children; lists 10 general approaches to foster primary-grade content area reading; and offers a sample lesson incorporating these approaches that promotes comprehension of text and content matter. Suggests that trade books, textbooks, realistic fiction, and other print…

  14. Classifying Written Texts Through Rhythmic Features

    NARCIS (Netherlands)

    Balint, Mihaela; Dascalu, Mihai; Trausan-Matu, Stefan

    2016-01-01

    Rhythm analysis of written texts focuses on literary analysis and it mainly considers poetry. In this paper we investigate the relevance of rhythmic features for categorizing texts in prosaic form pertaining to different genres. Our contribution is threefold. First, we define a set of rhythmic featu

  15. Texts in multiple versions: histories of editions

    NARCIS (Netherlands)

    Giuliani, L.; Brinkman, H.; Lernout, G.; Mathijsen, M.

    2006-01-01

    Texts in multiple versions constitute the core problem of textual scholarship. For texts from antiquity and the medieval period, the many versions may be the result of manuscript transmission, requiring editors and readers to discriminate between levels of authority in variant readings produced

  16. Modeling text with generalizable Gaussian mixtures

    DEFF Research Database (Denmark)

    Hansen, Lars Kai; Sigurdsson, Sigurdur; Kolenda, Thomas

    2000-01-01

    We apply and discuss generalizable Gaussian mixture (GGM) models for text mining. The model automatically adapts model complexity for a given text representation. We show that the generalizability of these models depends on the dimensionality of the representation and the sample size. We discuss...

  17. Touchstone Texts: Fertile Ground for Creativity

    Science.gov (United States)

    Sturgell, Irma

    2008-01-01

    When state and local standards drive instruction, teachers often worry about compromising their creativity for a prescriptive curriculum with predictable outcomes. It is possible for creative teaching to flourish by using touchstone texts, or mentor texts, that engage both teacher and student in exploratory yet purposeful learning. (Contains 1…

  18. A text in Romani from 1622

    DEFF Research Database (Denmark)

    Bakker, Peter

    2015-01-01

    this is a reprint of a 2012 article: A new old text in Romani: Lord's Prayer, 1622. International Journal of Romani Language and Culture 2 (2011): 193-212.......this is a reprint of a 2012 article: A new old text in Romani: Lord's Prayer, 1622. International Journal of Romani Language and Culture 2 (2011): 193-212....

  19. Text comprehension strategy instruction with poor readers

    NARCIS (Netherlands)

    Van den Bos, K.P.; Aarnoudse, C.C.; Brand-Gruwel, S.

    1998-01-01

    The goal of this study was to investigate the effects of teaching text comprehension strategies to children with decoding and reading comprehension problems and with a poor or normal listening ability. Two experiments are reported. Four text comprehension strategies, viz., question generation,

  20. Texts, Transmissions, Receptions. Modern Approaches to Narratives

    NARCIS (Netherlands)

    Lardinois, A.P.M.H.; Levie, S.A.; Hoeken, H.; Lüthy, C.H.

    2015-01-01

    The papers collected in this volume study the function and meaning of narrative texts from a variety of perspectives. The word 'text' is used here in the broadest sense of the term: it denotes literary books, but also oral tales, speeches, newspaper articles and comics. One of the purposes of this v

  1. An Intelligent System For Arabic Text Categorization

    NARCIS (Netherlands)

    Syiam, M.M.; Tolba, Mohamed F.; Fayed, Z.T.; Abdel-Wahab, Mohamed S.; Ghoniemy, Said A.; Habib, Mena Badieh

    Text Categorization (classification) is the process of classifying documents into a predefined set of categories based on their content. In this paper, an intelligent Arabic text categorization system is presented. Machine learning algorithms are used in this system. Many algorithms for stemming and

  2. Learning with Text in the Primary Grades.

    Science.gov (United States)

    Guillaume, Andrea M.

    1998-01-01

    Provides a rationale for learning-with-text experiences for primary-grade children; lists 10 general approaches to foster primary-grade content area reading; and offers a sample lesson incorporating these approaches that promotes comprehension of text and content matter. Suggests that trade books, textbooks, realistic fiction, and other print…

  3. Choices of texts for literary education

    DEFF Research Database (Denmark)

    Skyggebjerg, Anna Karlskov

    . The teaching of literature has a double bind. On the one hand, there is a subject (Danish) and a curriculum with a certain type of texts with cultural and even national connotations, and the limits of the choice of texts and curriculum are decided by the state. On the other hand, there are some concrete...

  4. On the Techniques of Journalistic Text Translation

    Institute of Scientific and Technical Information of China (English)

    林燕

    2015-01-01

    With the development of economy globalization,the translation of journalistic text has become increasingly important to cultural exchanges or economy communication among different countries. This paper briefly introduces the characteristics of news text and provides some feasible techniques for translation from English to Chinese or Chinese to English based on the case study.

  5. The Managed Text: Prose and Qualms.

    Science.gov (United States)

    Kadushin, Charles

    1979-01-01

    Managed texts are written and designed by a team of writers and researchers under the direction and control of a publishing house. How these books got started, what needs they meet, their advantages and disadvantages, and the consequences they are having on college text publishing are addressed. (JMD)

  6. Ontology Assisted Formal Specification Extraction from Text

    Directory of Open Access Journals (Sweden)

    Andreea Mihis

    2010-12-01

    Full Text Available In the field of knowledge processing, the ontologies are the most important mean. They make possible for the computer to understand better the natural language and to make judgments. In this paper, a method which use ontologies in the semi-automatic extraction of formal specifications from a natural language text is proposed.

  7. Texts in multiple versions: histories of editions

    NARCIS (Netherlands)

    L. Giuliani; H. Brinkman; G. Lernout; M. Mathijsen

    2006-01-01

    Texts in multiple versions constitute the core problem of textual scholarship. For texts from antiquity and the medieval period, the many versions may be the result of manuscript transmission, requiring editors and readers to discriminate between levels of authority in variant readings produced alon

  8. Opening Mathematics Texts: Resisting the Seduction

    Science.gov (United States)

    Wagner, David

    2012-01-01

    This analysis of the writing in a grade 7 mathematics textbook distinguishes between closed texts and open texts, which acknowledge multiple possibilities. I use tools that have recently been applied in mathematics contexts, focussing on grammatical features that include personal pronouns, modality, and types of imperatives, as well as on…

  9. Text comprehension strategy instruction with poor readers

    NARCIS (Netherlands)

    Van den Bos, K.P.; Aarnoudse, C.C.; Brand-Gruwel, S.

    1998-01-01

    The goal of this study was to investigate the effects of teaching text comprehension strategies to children with decoding and reading comprehension problems and with a poor or normal listening ability. Two experiments are reported. Four text comprehension strategies, viz., question generation, summa

  10. Extracting Conceptual Feature Structures from Text

    DEFF Research Database (Denmark)

    Andreasen, Troels; Bulskov, Henrik; Lassen, Tine;

    2011-01-01

    This paper describes an approach to indexing texts by their conceptual content using ontologies along with lexico-syntactic information and semantic role assignment provided by lexical resources. The conceptual content of meaningful chunks of text is transformed into conceptual feature structures...

  11. Texts, Troubled Teens, and Troubling Times

    Science.gov (United States)

    Tatum, Alfred W., Ed.

    2009-01-01

    Seeking ways to effectively mediate texts with troubled teens in troubling times is worth the investment. Text is a powerful tool for shaping positive life trajectories, especially for those teens being affected by vulnerable-producing conditions that interrupt positive human development. These conditions, coupled with poor literacy skills…

  12. Texts, Transmissions, Receptions. Modern Approaches to Narratives

    NARCIS (Netherlands)

    Lardinois, A.P.M.H.; Levie, S.A.; Hoeken, H.; Lüthy, C.H.

    2015-01-01

    The papers collected in this volume study the function and meaning of narrative texts from a variety of perspectives. The word 'text' is used here in the broadest sense of the term: it denotes literary books, but also oral tales, speeches, newspaper articles and comics. One of the purposes of this

  13. The Patchwork Text in Teaching Greek Tragedy.

    Science.gov (United States)

    Parker, Jan

    2003-01-01

    Describes the rewards and challenges of using the Patchwork Text to teach Greek Tragedy to Cambridge University English final-year students. The article uses close reading of the students' texts, analysis and reflection to discuss both the products and the process of Patchwork writing. (Author/AEF)

  14. The Limited Benefits of Rereading Educational Texts

    Science.gov (United States)

    Callender, Aimee A.; McDaniel, Mark A.

    2009-01-01

    Though rereading is a study method commonly used by students, theoretical disagreement exists regarding whether rereading a text significantly enhances the representation and retention of the text's contents. In four experiments, we evaluated the effectiveness of rereading relative to a single reading in a context paralleling that faced by…

  15. Text mining and visualization using VOSviewer

    CERN Document Server

    van Eck, Nees Jan

    2011-01-01

    VOSviewer is a computer program for creating, visualizing, and exploring bibliometric maps of science. In this report, the new text mining functionality of VOSviewer is presented. A number of examples are given of applications in which VOSviewer is used for analyzing large amounts of text data.

  16. Text Writing at an Undergraduate College.

    Science.gov (United States)

    Myers, David G.

    Strategies for writing a text are offered by a college professor on the basis of his own experience of writing a text on social psychology. Suggestions are given on creating an efficient office environment, researching the topic, and drafting the manuscript. One way to improve efficiency is to compress teaching into a few days, leaving the…

  17. Undergraduates' Text Messaging Language and Literacy Skills

    Science.gov (United States)

    Grace, Abbie; Kemp, Nenagh; Martin, Frances Heritage; Parrila, Rauno

    2014-01-01

    Research investigating whether people's literacy skill is being affected by the use of text messaging language has produced largely positive results for children, but mixed results for adults. We asked 150 undergraduate university students in Western Canada and 86 in South Eastern Australia to supply naturalistic text messages and to complete…

  18. Student Performance in an Electronic Text Environment.

    Science.gov (United States)

    Friedman, Edward A.; And Others

    1989-01-01

    Describes a project conducted at Stevens Institute of Technology to develop and test the applicability of full-text electronic databases and full-text retrieval technology for use in undergraduate humanities education. The creation of a machine-readable database on Galileo is described, student reactions are discussed, and further work is…

  19. Text mining for the biocuration workflow.

    Science.gov (United States)

    Hirschman, Lynette; Burns, Gully A P C; Krallinger, Martin; Arighi, Cecilia; Cohen, K Bretonnel; Valencia, Alfonso; Wu, Cathy H; Chatr-Aryamontri, Andrew; Dowell, Karen G; Huala, Eva; Lourenço, Anália; Nash, Robert; Veuthey, Anne-Lise; Wiegers, Thomas; Winter, Andrew G

    2012-01-01

    Molecular biology has become heavily dependent on biological knowledge encoded in expert curated biological databases. As the volume of biological literature increases, biocurators need help in keeping up with the literature; (semi-) automated aids for biocuration would seem to be an ideal application for natural language processing and text mining. However, to date, there have been few documented successes for improving biocuration throughput using text mining. Our initial investigations took place for the workshop on 'Text Mining for the BioCuration Workflow' at the third International Biocuration Conference (Berlin, 2009). We interviewed biocurators to obtain workflows from eight biological databases. This initial study revealed high-level commonalities, including (i) selection of documents for curation; (ii) indexing of documents with biologically relevant entities (e.g. genes); and (iii) detailed curation of specific relations (e.g. interactions); however, the detailed workflows also showed many variabilities. Following the workshop, we conducted a survey of biocurators. The survey identified biocurator priorities, including the handling of full text indexed with biological entities and support for the identification and prioritization of documents for curation. It also indicated that two-thirds of the biocuration teams had experimented with text mining and almost half were using text mining at that time. Analysis of our interviews and survey provide a set of requirements for the integration of text mining into the biocuration workflow. These can guide the identification of common needs across curated databases and encourage joint experimentation involving biocurators, text mining developers and the larger biomedical research community.

  20. Flexible frontiers for text division into rows

    Directory of Open Access Journals (Sweden)

    Dan L. Lacrămă

    2009-01-01

    Full Text Available This paper presents an original solution for flexible hand-written text division into rows. Unlike the standard procedure, the proposed method avoids the isolated characters extensions amputation and reduces the recognition error rate in the final stage.

  1. Rapid and effective synthesis of $\\text{}^{40}\\text{Ca}-\\text{}^{27}\\text{Al}$ ion pair towards quantum logic optical clock

    CERN Document Server

    Shang, Junjuan; Cao, Jian; Wang, Shaomao; Shu, Hualin; Huang, Xueren

    2016-01-01

    High precision atomic clocks have been applied not only to very important technological problems such as synchronization and global navigation systems, but to the fundament precision measurement physics. Single $\\text{}^{27}\\text{Al}^+$ is one of the most attractions of selection system due to its very low blackbody radiation effect which dominates frequency shifts in other optical clock systems. Up to now, the $\\text{}^{27}\\text{Al}^+$ still could not be laser-cooled directly by reason that the absence of 167nm laser. Sympathetic cooling is a viable method to solve this problem. In this work, we used a single laser cooled $\\text{}^{40}\\text{Ca}^+$ to sympathetically cool one $\\text{}^{27}\\text{Al}^+$ in linear Paul trap. Comparing to laser ablation method we got a much lower velocity atoms sprayed from a home-made atom oven, which would make loading aluminum ion more efficient and the sympathetic cooling much easier. By the method of precisely measuring the secular frequency of the ion pair, finally we prove...

  2. Using Genetic Algorithms for Texts Classification Problems

    Directory of Open Access Journals (Sweden)

    A. A. Shumeyko

    2009-01-01

    Full Text Available The avalanche quantity of the information developed by mankind has led to concept of automation of knowledge extraction – Data Mining ([1]. This direction is connected with a wide spectrum of problems - from recognition of the fuzzy set to creation of search machines. Important component of Data Mining is processing of the text information. Such problems lean on concept of classification and clustering ([2]. Classification consists in definition of an accessory of some element (text to one of in advance created classes. Clustering means splitting a set of elements (texts on clusters which quantity are defined by localization of elements of the given set in vicinities of these some natural centers of these clusters. Realization of a problem of classification initially should lean on the given postulates, basic of which – the aprioristic information on primary set of texts and a measure of affinity of elements and classes.

  3. Integrating Text Plans for Conciseness and Coherence

    CERN Document Server

    Harvey, T; Harvey, Terrence; Carberry, Sandra

    1998-01-01

    Our experience with a critiquing system shows that when the system detects problems with the user's performance, multiple critiques are often produced. Analysis of a corpus of actual critiques revealed that even though each individual critique is concise and coherent, the set of critiques as a whole may exhibit several problems that detract from conciseness and coherence, and consequently assimilation. Thus a text planner was needed that could integrate the text plans for individual communicative goals to produce an overall text plan representing a concise, coherent message. This paper presents our general rule-based system for accomplishing this task. The system takes as input a \\emph{set} of individual text plans represented as RST-style trees, and produces a smaller set of more complex trees representing integrated messages that still achieve the multiple communicative goals of the individual text plans. Domain-independent rules are used to capture strategies across domains, while the facility for addition...

  4. Rhetorical structure theory and text analysis

    Science.gov (United States)

    Mann, William C.; Matthiessen, Christian M. I. M.; Thompson, Sandra A.

    1989-11-01

    Recent research on text generation has shown that there is a need for stronger linguistic theories that tell in detail how texts communicate. The prevailing theories are very difficult to compare, and it is also very difficult to see how they might be combined into stronger theories. To make comparison and combination a bit more approachable, we have created a book which is designed to encourage comparison. A dozen different authors or teams, all experienced in discourse research, are given exactly the same text to analyze. The text is an appeal for money by a lobbying organization in Washington, DC. It informs, stimulates and manipulates the reader in a fascinating way. The joint analysis is far more insightful than any one team's analysis alone. This paper is our contribution to the book. Rhetorical Structure Theory (RST), the focus of this paper, is a way to account for the functional potential of text, its capacity to achieve the purposes of speakers and produce effects in hearers. It also shows a way to distinguish coherent texts from incoherent ones, and identifies consequences of text structure.

  5. The network of concepts in written texts

    CERN Document Server

    Caldeira, S M G; Andrade, R F S; Neme, A; Miranda, J G V; Caldeira, Silvia M. G.; Lobao, Thierry C. Petit; Neme, Alexis

    2005-01-01

    Complex network theory is used to investigate the structure of meaningful concepts in written texts of individual authors. Networks have been constructed after a two phase filtering, where words with less meaning contents are eliminated, and all remaining words are set to their canonical form, without any number, gender or time flexion. Each sentence in the text is added to the network as a clique. A large number of written texts have been scrutinized, and its found that texts have small-world as well as scale-free structures. The growth process of these networks has also been investigated, and a universal evolution of network quantifiers have been found among the set of texts written by distinct authors. Further analyzes, based on shufling procedures taken either on the texts or on the constructed networks, provide hints on the role played by the word frequency and sentence length distributions to the network structure. Since the meaningful words are related to concepts in the author's mind, results for text...

  6. Chapter 16: text mining for translational bioinformatics.

    Directory of Open Access Journals (Sweden)

    K Bretonnel Cohen

    2013-04-01

    Full Text Available Text mining for translational bioinformatics is a new field with tremendous research potential. It is a subfield of biomedical natural language processing that concerns itself directly with the problem of relating basic biomedical research to clinical practice, and vice versa. Applications of text mining fall both into the category of T1 translational research-translating basic science results into new interventions-and T2 translational research, or translational research for public health. Potential use cases include better phenotyping of research subjects, and pharmacogenomic research. A variety of methods for evaluating text mining applications exist, including corpora, structured test suites, and post hoc judging. Two basic principles of linguistic structure are relevant for building text mining applications. One is that linguistic structure consists of multiple levels. The other is that every level of linguistic structure is characterized by ambiguity. There are two basic approaches to text mining: rule-based, also known as knowledge-based; and machine-learning-based, also known as statistical. Many systems are hybrids of the two approaches. Shared tasks have had a strong effect on the direction of the field. Like all translational bioinformatics software, text mining software for translational bioinformatics can be considered health-critical and should be subject to the strictest standards of quality assurance and software testing.

  7. Integrating image data into biomedical text categorization.

    Science.gov (United States)

    Shatkay, Hagit; Chen, Nawei; Blostein, Dorothea

    2006-07-15

    Categorization of biomedical articles is a central task for supporting various curation efforts. It can also form the basis for effective biomedical text mining. Automatic text classification in the biomedical domain is thus an active research area. Contests organized by the KDD Cup (2002) and the TREC Genomics track (since 2003) defined several annotation tasks that involved document classification, and provided training and test data sets. So far, these efforts focused on analyzing only the text content of documents. However, as was noted in the KDD'02 text mining contest-where figure-captions proved to be an invaluable feature for identifying documents of interest-images often provide curators with critical information. We examine the possibility of using information derived directly from image data, and of integrating it with text-based classification, for biomedical document categorization. We present a method for obtaining features from images and for using them-both alone and in combination with text-to perform the triage task introduced in the TREC Genomics track 2004. The task was to determine which documents are relevant to a given annotation task performed by the Mouse Genome Database curators. We show preliminary results, demonstrating that the method has a strong potential to enhance and complement traditional text-based categorization methods.

  8. How Popular Culture Texts Inform and Shape Students' Discussions of Social Studies Texts

    Science.gov (United States)

    Hall, Leigh A.

    2012-01-01

    In this article, I examine how 6th-grade students used pop culture texts to inform their understandings about social studies texts and shape their discussions of it. Discussions showed that students used pop culture texts in three ways when talking about social studies texts. First, students applied comprehension strategies to pop culture texts to…

  9. Engaging Texts: Effects of Concreteness on Comprehensibility, Interest, and Recall in Four Text Types.

    Science.gov (United States)

    Sadoski, Mark; Goetz, Ernest T.; Rodriguez, Maximo

    2000-01-01

    Investigates concreteness as a text feature that engaged undergraduate readers' comprehension, interest, and learning in four text types: persuasion, exposition, literary stories, and narratives. Results show that concrete texts were recalled better than abstract texts, although the magnitude of the advantage varied across text types. Concreteness…

  10. Text-Based Recall and Extra-Textual Generations Resulting from Simplified and Authentic Texts

    Science.gov (United States)

    Crossley, Scott A.; McNamara, Danielle S.

    2016-01-01

    This study uses a moving windows self-paced reading task to assess text comprehension of beginning and intermediate-level simplified texts and authentic texts by L2 learners engaged in a text-retelling task. Linear mixed effects (LME) models revealed statistically significant main effects for reading proficiency and text level on the number of…

  11. How Popular Culture Texts Inform and Shape Students' Discussions of Social Studies Texts

    Science.gov (United States)

    Hall, Leigh A.

    2012-01-01

    In this article, I examine how 6th-grade students used pop culture texts to inform their understandings about social studies texts and shape their discussions of it. Discussions showed that students used pop culture texts in three ways when talking about social studies texts. First, students applied comprehension strategies to pop culture texts to…

  12. Engaging Texts: Effects of Concreteness on Comprehensibility, Interest, and Recall in Four Text Types.

    Science.gov (United States)

    Sadoski, Mark; Goetz, Ernest T.; Rodriguez, Maximo

    2000-01-01

    Investigates concreteness as a text feature that engaged undergraduate readers' comprehension, interest, and learning in four text types: persuasion, exposition, literary stories, and narratives. Results show that concrete texts were recalled better than abstract texts, although the magnitude of the advantage varied across text types. Concreteness…

  13. NOTICING AND TEXT-BASED CHAT

    Directory of Open Access Journals (Sweden)

    Chun Lai

    2006-09-01

    Full Text Available This study examined the capacity of text-based online chat to promote learners’ noticing of their problematic language productions and of the interactional feedback from their interlocutors. In this study, twelve ESL learners formed six mixed-proficiency dyads. The same dyads worked on two spot-the-difference tasks, one via online chat and the other through face-to-face conversation. Stimulated recall sessions were held subsequently to identify instances of noticing. It was found that text-based online chat promotes noticing more than face-to-face conversations, especially in terms of learners’ noticing of their own linguistic mistakes.

  14. Extracting and Sharing Knowledge from Medical Texts

    Institute of Scientific and Technical Information of China (English)

    曹存根

    2002-01-01

    In recent years, we have been developing a new framework for acquiring medical knowledge from Encyclopedic texts. This framework consists of three major parts. The first part is an extended high-level conceptual language (called HLCL 1.1) for use by knowledge engineers to formalize knowledge texts in an encyclopedia. The other part is an HLCL 1.1compiler for parsing and analyzing the formalized texts into knowledge models. The third part is a set of domain-specific ontologies for sharing knowledge.

  15. A New Text Location Approach Based Wavelet

    Institute of Scientific and Technical Information of China (English)

    Weihua Li; Zhen Fang; Shuozhong Wang

    2002-01-01

    With the advancement of content-based retrieval technology, the importance of semantics for text information contained in images attracts many researchers. An algorithm which will automatically locate the textual regions in the input image will facilitate the retrieving task, and the optical character recognizer can then be applied to only those regions of the image which contain text. In this paper a new text location method based wavelet is described, which can be used to locate textual regions from complex image and video frame. Experimental results show that the textual regions in image can be located effectively and quickly.

  16. A New Text Location Approach Based Wavelet

    Institute of Scientific and Technical Information of China (English)

    Weihua Li; Zhen Fang; Shuozhong Wang

    2002-01-01

    With the advancement of content-based retrieval technology, the importance of semantics for text information contained in images attracts many researchers. An algorithm which will automatically locate the textual regions in the input image will facilitate the retrieving task, and the optical character recognizer can then be applied to only those regions of the image which contain text. In this paper a new text location method is described, which can be used to locate textual regions from complex image and video frame. Experimental results show that the textual regions in image can be located effectively and quickly.

  17. Monolingual Accounting Dictionaries for EFL Text Production

    DEFF Research Database (Denmark)

    Nielsen, Sandro

    2009-01-01

    that deal with these aspects are necessary for the international user group as they produce subject-field specific and register-specific texts in a foreign language, and the data items are relevant for the various stages in text production: draft writing, copyediting, stylistic editing and proofreading....... of information. By identifying and analysing the users' factual and linguistic competences, user needs, use-situations and the stages involved in producing accounting texts in English as a foreign language, lexicographers will have a sound basis for designing the optimal English accounting dictionary for EFL...

  18. Monolingual accounting dictionaries for EFL text production

    DEFF Research Database (Denmark)

    Nielsen, Sandro

    2006-01-01

    that deal with these aspects are necessary for the international user group as they produce subject-field specific and register-specific texts in a foreign language, and the data items are relevant for the various stages in text production: draft writing, copyediting, stylistic editing and proofreading....... of information. By identifying and analysing the users' factual and linguistic competences, user needs, use-situations and the stages involved in producing accounting texts in English as a foreign language, lexicographers will have a sound basis for designing the optimal English accounting dictionary for EFL...

  19. Executive Decision:Text or Talk?

    Institute of Scientific and Technical Information of China (English)

    Rahma; Karam

    2011-01-01

    Nielsen Media, a global market researchcompany, reported in March that spending onvoice calls has gone down significantly over thelast five years, while customers’ text spendingis increasing. It’s anticipated that textingwill eclipse voice calls totally in three years.

  20. Understanding How Headings Influence Text Processing

    Directory of Open Access Journals (Sweden)

    Julie Lemarié

    2012-07-01

    Full Text Available Titles and headings are commonly used signaling devices in expository texts. Researchers in cognitive and educational psychology have demonstrated several important effects of headings and titles on text processing: headings improve memory for text organization; headings influence text comprehension by activating readers’ prior knowledge; and titles can bias text comprehension by their emphasis on a particular text topic. However, the lack of precise linguistic analyses of titles/headings has limited both the scope of empirical research and the precision of conclusions. We present a theory of signaling devices that provides a detailed analysis of variation in titles and headings and generates predictions concerning their effects. We discuss the implications of our analyses for research on titles and headings and summarize recent research findings that illustrate the validity of a central component of our analyses. Finally, we propose some future research directions integrating insights from linguistics for the study of how headings and titles affect text processing.Les titres et intertitres sont des dispositifs de signalisation fréquemment utilisés dans les textes expositifs. De nombreuses recherches réalisées en psychologie cognitive et psychologie des apprentissages ont mis en évidence leurs effets sur le traitement du texte par le lecteur : les intertitres améliorent la représentation mnésique de l’organisation du texte et influencent la compréhension du texte par un mécanisme d’activation des connaissances antérieures du lecteur. Les titres généraux, lorsqu’ils mettent en avant un des thèmes du texte, biaisent la compréhension du texte. Cependant, l’absence d’analyse linguistique approfondie des titres et intertitres a limité la portée de ces travaux et a mené à des conclusions méritant d’être affinées. Nous présentons une théorie générale de la signalisation des textes qui propose un cadre d

  1. The Relationship between Paraphrasing and Text Analysis

    Directory of Open Access Journals (Sweden)

    María Luisa Cepeda Islas

    2013-04-01

    Full Text Available Given the importance of paraphrasing in the process of comprehension for college students, this study assessed the level of implementation of text analysis and paraphrases the response of a sample of senior students of the career psychology. We selected a group of freshmen to the Psychology course, which was asked to answer a questionnaire and carry out the summary of an empirical article. The results showed that participants have a low level of text analysis, at the same time had low levels of paraphrasing. It was seen that the predominant textual copy. They envision some possibilities for the structure of a training workshop not only paraphrasing but on the analysis of text.

  2. Cohesion and Metaphor Aspects in Andabhuana Text

    Directory of Open Access Journals (Sweden)

    Ida Bagus Mahardika

    2015-02-01

    Full Text Available Cohesion and metaphor are the unique and interesting parts of language aspects in Andhabhuan text to research. They are quite dominant aspects in the story in developing its literature aesthetic. This research is based on the arts technical and analytical method. The result of the research on those two aspects shows that traditional aesthetic style in arts, as described in Andabhuana verses emphasize on the reference, meaning, selection and variation of words. The language parts used are aimed at bringing the text ideology to humanity perspective, especially the ?iwatattwa values as parts of Hindu teaching. Hence the cohesion and metaphor in Andabhuana text  are  semiotic description to transform to Balinese Hindus as most of them follow ?iwatattwa belief.

  3. The Educational Objectives of International Relations Texts

    Science.gov (United States)

    Pearson, Frederic S.

    1974-01-01

    Certain educational objectives are proposed for undergraduate and graduate international studies education, investigating the interrelationship of these objectives and evaluating the appropriateness and adequacy of current texts for such objectives. (Author/KM)

  4. Cohesive Function of Lexical Repetition in Text

    Institute of Scientific and Technical Information of China (English)

    张莉; 卢沛沛

    2013-01-01

    Lexical repetition is the most direct form of lexical cohesion,which is the central device for making texts hang together. Although repetition is the most direct way to emphasize,it performs the cohesive effect more apparently.

  5. AUTOMATIC TEXT SUMMARIZATION BASED ON TEXTUAL COHESION

    Institute of Scientific and Technical Information of China (English)

    Chen Yanmin; Liu Bingquan; Wang Xiaolong

    2007-01-01

    This paper presents two different algorithms that derive the cohesion structure in the form of lexical chains from two kinds of language resources HowNet and TongYiCiCiLin.The research that connects the cohesion structure of a text to the derivation of its summary is displayed.A novel model of automatic text summarization is devised,based on the data provided by lexicai chains from original texts.Moreover,the construction rules of lexical chains are modified according to characteristics of the knowledge database in order to be more suitable for Chinese suIninarization.Evaluation results show that high quality indicative summaries are produced from Chinese texts.

  6. Voice to Text Language Translation (VTLT) Project

    Data.gov (United States)

    National Aeronautics and Space Administration — A feasibility analysis of adding a second modality to pilot/Air Traffic Control (ATC) communications. The real time availability of text in Air Traffic Control...

  7. Figures of thought mathematics and mathematical texts

    CERN Document Server

    Reed, David

    2003-01-01

    Examines the ways in which mathematical works can be read as texts, examines their textual strategiesand demonstrates that such readings provide a rich source of philosophical debate regarding mathematics.

  8. QuitNowTXT Text Messaging Library

    Data.gov (United States)

    U.S. Department of Health & Human Services — Overview: The QuitNowTXT text messaging program is designed as a resource that can be adapted to specific contexts including those outside the United States and in...

  9. Strategies to Increase Accuracy in Text Classification

    NARCIS (Netherlands)

    D. Blommesteijn (Dennis)

    2014-01-01

    htmlabstractText classification via supervised learning involves various steps from processing raw data, features extraction to training and validating classifiers. Within these steps implementation decisions are critical to the resulting classifier accuracy. This paper contains a report of the

  10. Text-Filled Stacked Area Graphs

    DEFF Research Database (Denmark)

    Kraus, Martin

    2011-01-01

    Text can add a significant amount of detail and value to an information visualization. In particular, it can integrate more of the data that a visualization is based on, and it can also integrate information that is personally relevant to readers of a visualization. This may influence readers...... to consider a visualization a detailed enrichment of their personal experience instead of an abstract representation of anonymous numbers. However, the integration of textual detail into a visualization is often very challenging. This work discusses one particular approach to this problem, namely text......-filled stacked area graphs; i.e., graphs that feature stacked areas that are filled with small-typed text. Since these graphs allow for computing the text layout automatically, it is possible to include large amounts of textual detail with very little effort. We discuss the most important challenges and some...

  11. Discovery of Recurring Anomalies in Text Reports

    Data.gov (United States)

    National Aeronautics and Space Administration — This paper describes the results of a significant research and development effort conducted at NASA Ames Research Center to develop new text mining algorithms to...

  12. Being Brave: Writing Environmental Education Research Texts.

    Science.gov (United States)

    Lotz-Sisitka, Heila; Burt, Jane

    2002-01-01

    Explores some of the headwork that goes into textwork in environmental education research. Reflects upon some of the institutional and epistemological issues associated with writing social science research texts. (Contains 26 references.) (Author/YDS)

  13. Strategies to Increase Accuracy in Text Classification

    NARCIS (Netherlands)

    Blommesteijn, D.

    2014-01-01

    Text classification via supervised learning involves various steps from processing raw data, features extraction to training and validating classifiers. Within these steps implementation decisions are critical to the resulting classifier accuracy. This paper contains a report of the study performed

  14. [Text Comprehensibility of Hospital Report Cards].

    Science.gov (United States)

    Sander, U; Kolb, B; Christoph, C; Emmert, M

    2016-12-01

    Objectives: Recently, the number of hospital report cards that compare quality of hospitals and present information from German quality reports has greatly increased. Objectives of this study were to a) identify suitable methods for measuring the readability and comprehensibility of hospital report cards, b) to obtain reliable information on the comprehensibility of texts for laymen, c) to give recommendations for improvements and d) to recommend public health actions. Methods: The readability and comprehensibility of the texts were tested with a) a computer-aided evaluation of formal text characteristics (readability indices Flesch (German formula) and 1. Wiener Sachtextformel formula), b) an expert-based heuristic analysis of readability and comprehensibility of texts (counting technical terms and analysis of text simplicity as well as brevity and conciseness using the Hamburg intelligibility model) and c) a survey of subjects about the comprehensibility of individual technical terms, the assessment of the comprehensibility of the presentations and the subjects' decisions in favour of one of the 5 presented clinics due to the better quality of data. In addition, the correlation between the results of the text analysis with the results from the survey of subjects was tested. Results: The assessment of texts with the computer-aided evaluations showed poor comprehensibility values. The assessment of text simplicity using the Hamburg intelligibility model showed poor comprehensibility values (-0.3). On average, 6.8% of the words used were technical terms. A review of 10 technical terms revealed that in all cases only a minority of respondents (from 4.4% to 39.1%) exactly knew what was meant by each of them. Most subjects (62.4%) also believed that unclear terms worsened their understanding of the information offered. The correlation analysis showed that presentations with a lower frequency of technical terms and better values for the text simplicity were better

  15. Text and Voice: Complements, Substitutes or Both?

    OpenAIRE

    Andersson, Kjetil; Foros, Øystein; Steen, Frode

    2006-01-01

    Text messaging has become an important revenue component for European and Asian mobile operators. We develop a simple model of demand for mobile services incorporating the existence of call externalities and network effects. We show that when incoming messages and calls stimulate outgoing communications, services that are perceived as substitutes, such as mobile text and voice, may evolve into complements in terms of the price effect when the network size becomes large. We esti...

  16. Cohesion in Computer Text Generation: Lexical Substitution.

    Science.gov (United States)

    1983-05-01

    substitutions. Paul is able to generate a cohesive text which exhibits the binding of sentences through presupposition dependencies, the marking of old...lexical substitutions, Paul is able to generate a cohesive text - • which exhibits the binding of sentences through presupposition dependencies, the...problem in using these cohesive devices is that it is necessary to guarantee that they are understandable. That is, since these items refer anaphorically

  17. Dress and Identity in Old Babylonian Texts

    OpenAIRE

    Tanaka, Terri-lynn Wai Ping Hong

    2013-01-01

    The present study argues that using dress theory is a productive means of reading cuneiform texts from ancient Mesopotamia. Although anthropological studies on dress have flourished in recent years, and despite the economic and social importance of dress in ancient Mesopotamia, previous research has focused on either archaeological remains or pictorial representations of dress; however, anthropological theories on dress have not yet been applied to ancient Mesopotamian cuneiform texts writte...

  18. Comparison between Two Text Digital Watermarking Algorithms

    Institute of Scientific and Technical Information of China (English)

    TANG Sheng; XUE Xu-ce

    2011-01-01

    In this paper,two text digital watermarking methods are compared in the context of their robustness performances.A nonlinear watermarking algorithm embeds the watermark into the reordered DCT coefficients of a text image,and utilizes a nonlinear detector to detect the watermark in some attacks.Compared with the classical watermarking algorithm,experimental results show that this nonlinear watennarking nlgorithm has some potential merits.

  19. Automatic Text Summarization: Past, Present and Future

    OpenAIRE

    Saggion, Horacio; Poibeau, Thierry

    2012-01-01

    International audience; Automatic text summarization, the computer-based production of condensed versions of documents, is an important technology for the information society. Without summaries it would be practically impossible for human beings to get access to the ever growing mass of information available online. Although research in text summarization is over fifty years old, some efforts are still needed given the insufficient quality of automatic summaries and the number of interesting ...

  20. Reading Comprehension Assessment : From Text Perspectives

    OpenAIRE

    小林, 美代子; コバヤシ, ミヨコ; MIYOKO, KOBAYASHI

    2004-01-01

    This paper investigates the nature of reading comprehension questions. Very few studies have so far examined comprehension questions in relation to text features. Kintsch and Yarbrough (1982) and Shohamy and Inbar (1991) are among the few studies, and their results suggest that there is an interaction between text features and the focus of questions. The present study builds on these findings and examines how Meyer's (1975, 1985) model of content structure analysis can help identify what exac...

  1. Position index preserving compression of text data

    OpenAIRE

    Akhtar, Nasim; Rashid, Mamunur; Islam, Shafiqul; Kashem, Mohammod Abul; Kolybanov, Cyrll Y.

    2011-01-01

    Data compression offers an attractive approach to reducing communication cost by using available bandwidth effectively. It also secures data during transmission for its encoded form. In this paper an index based position oriented lossless text compression called PIPC ( Position Index Preserving Compression) is developed. In PIPC the position of the input word is denoted by ASCII code. The basic philosopy of the secure compression is to preprocess the text and transform it into some intermedia...

  2. Figure-associated text summarization and evaluation.

    Science.gov (United States)

    Polepalli Ramesh, Balaji; Sethi, Ricky J; Yu, Hong

    2015-01-01

    Biomedical literature incorporates millions of figures, which are a rich and important knowledge resource for biomedical researchers. Scientists need access to the figures and the knowledge they represent in order to validate research findings and to generate new hypotheses. By themselves, these figures are nearly always incomprehensible to both humans and machines and their associated texts are therefore essential for full comprehension. The associated text of a figure, however, is scattered throughout its full-text article and contains redundant information content. In this paper, we report the continued development and evaluation of several figure summarization systems, the FigSum+ systems, that automatically identify associated texts, remove redundant information, and generate a text summary for every figure in an article. Using a set of 94 annotated figures selected from 19 different journals, we conducted an intrinsic evaluation of FigSum+. We evaluate the performance by precision, recall, F1, and ROUGE scores. The best FigSum+ system is based on an unsupervised method, achieving F1 score of 0.66 and ROUGE-1 score of 0.97. The annotated data is available at figshare.com (http://figshare.com/articles/Figure_Associated_Text_Summarization_and_Evaluation/858903).

  3. A Survey of Unstructured Text Summarization Techniques

    Directory of Open Access Journals (Sweden)

    Sherif Elfayoumy

    2014-05-01

    Full Text Available Due to the explosive amounts of text data being created and organizations increased desire to leverage their data corpora, especially with the availability of Big Data platforms, there is not usually enough time to read and understand each document and make decisions based on document contents. Hence, there is a great demand for summarizing text documents to provide a representative substitute for the original documents. By improving summarizing techniques, precision of document retrieval through search queries against summarized documents is expected to improve in comparison to querying against the full spectrum of original documents. Several generic text summarization algorithms have been developed, each with its own advantages and disadvantages. For example, some algorithms are particularly good for summarizing short documents but not for long ones. Others perform well in identifying and summarizing single-topic documents but their precision degrades sharply with multi-topic documents. In this article we present a survey of the literature in text summarization. We also surveyed some of the most common evaluation methods for the quality of automated text summarization techniques. Last, we identified some of the challenging problems that are still open, in particular the need for a universal approach that yields good results for mixed types of documents.

  4. Text mining resources for the life sciences.

    Science.gov (United States)

    Przybyła, Piotr; Shardlow, Matthew; Aubin, Sophie; Bossy, Robert; Eckart de Castilho, Richard; Piperidis, Stelios; McNaught, John; Ananiadou, Sophia

    2016-01-01

    Text mining is a powerful technology for quickly distilling key information from vast quantities of biomedical literature. However, to harness this power the researcher must be well versed in the availability, suitability, adaptability, interoperability and comparative accuracy of current text mining resources. In this survey, we give an overview of the text mining resources that exist in the life sciences to help researchers, especially those employed in biocuration, to engage with text mining in their own work. We categorize the various resources under three sections: Content Discovery looks at where and how to find biomedical publications for text mining; Knowledge Encoding describes the formats used to represent the different levels of information associated with content that enable text mining, including those formats used to carry such information between processes; Tools and Services gives an overview of workflow management systems that can be used to rapidly configure and compare domain- and task-specific processes, via access to a wide range of pre-built tools. We also provide links to relevant repositories in each section to enable the reader to find resources relevant to their own area of interest. Throughout this work we give a special focus to resources that are interoperable-those that have the crucial ability to share information, enabling smooth integration and reusability. © The Author(s) 2016. Published by Oxford University Press.

  5. Monolingual accounting dictionaries for EFL text production

    Directory of Open Access Journals (Sweden)

    Sandro Nielsen

    2006-10-01

    Full Text Available Monolingual accounting dictionaries are important for producing financial reporting texts in English in an international setting, because of the lack of specialised bilingual dictionaries. As the intended user groups have different factual and linguistic competences, they require specific types of information. By identifying and analysing the users' factual and linguistic competences, user needs, use-situations and the stages involved in producing accounting texts in English as a foreign language, lexicographers will have a sound basis for designing the optimal English accounting dictionary for EFL text production. The monolingual accounting dictionary needs to include information about UK, US and international accounting terms, their grammatical properties, their potential for being combined with other words in collocations, phrases and sentences in order to meet user requirements. Data items that deal with these aspects are necessary for the international user group as they produce subject-field specific and register-specific texts in a foreign language, and the data items are relevant for the various stages in text production: draft writing, copyediting, stylistic editing and proofreading.

  6. A method of text watermarking using presuppositions

    Science.gov (United States)

    Vybornova, O.; Macq, B.

    2007-02-01

    We propose a method for watermarking texts of arbitrary length using natural-language semantic structures. For the key of our approach we use the linguistic semantic phenomenon of presuppositions. Presupposition is the implicit information considered as well-known or which readers of the text are supposed to treat as well-known; this information is a semantic component of certain linguistic expressions (lexical items and syntactical constructions called presupposition triggers). The same sentence can be used with or without presupposition, or with a different presupposition trigger, provided that all the relations between subjects, objects and other discourse referents are preserved - such transformations will not change the meaning of the sentence. We define the distinct rules for presupposition identification for each trigger and regular transformation rules for using/non-using the presupposition in a given sentence (one bit per sentence in this case). Isolated sentences can carry the proposed watermarks. However, the longer is the text, the more efficient is the watermark. The proposed approach is resilient to main types of random transformations, like passivization, topicalization, extraposition, preposing, etc. The web of resolved presupposed information in the text will hold the watermark of the text (e.g. integrity watermark, or prove of ownership), introducing "secret ordering" into the text structure to make it resilient to "data loss" attacks and "data altering" attacks.

  7. Inspiration and the Texts of the Bible

    Directory of Open Access Journals (Sweden)

    Dirk Buchner

    1997-01-01

    Full Text Available This article seeks to explore what the inspired text of the Old Testament was as it existed for the New Testament authors, particularly for the author of the book of Hebrews. A quick look at the facts makes. it clear that there was, at the time, more than one 'inspired' text, among these were the Septuagint and the Masoretic Text 'to name but two'. The latter eventually gained ascendancy which is why it forms the basis of our translated Old Testament today. Yet we have to ask: what do we make of that other text that was the inspired Bible to the early Church, especially to the writer of the book of Hebrews, who ignored the Masoretic text? This article will take a brief look at some suggestions for a doctrine of inspiration that keeps up with the facts of Scripture. Allied to this, the article is something of a bibliographical study of recent developments in textual research following the discovery of the Dead Sea scrolls.

  8. Text Entry by Gazing and Smiling

    Directory of Open Access Journals (Sweden)

    Outi Tuisku

    2013-01-01

    Full Text Available Face Interface is a wearable prototype that combines the use of voluntary gaze direction and facial activations, for pointing and selecting objects on a computer screen, respectively. The aim was to investigate the functionality of the prototype for entering text. First, three on-screen keyboard layout designs were developed and tested (n=10 to find a layout that would be more suitable for text entry with the prototype than traditional QWERTY layout. The task was to enter one word ten times with each of the layouts by pointing letters with gaze and select them by smiling. Subjective ratings showed that a layout with large keys on the edge and small keys near the center of the keyboard was rated as the most enjoyable, clearest, and most functional. Second, using this layout, the aim of the second experiment (n=12 was to compare entering text with Face Interface to entering text with mouse. The results showed that text entry rate for Face Interface was 20 characters per minute (cpm and 27 cpm for the mouse. For Face Interface, keystrokes per character (KSPC value was 1.1 and minimum string distance (MSD error rate was 0.12. These values compare especially well with other similar techniques.

  9. Chapter 16: text mining for translational bioinformatics.

    Science.gov (United States)

    Cohen, K Bretonnel; Hunter, Lawrence E

    2013-04-01

    Text mining for translational bioinformatics is a new field with tremendous research potential. It is a subfield of biomedical natural language processing that concerns itself directly with the problem of relating basic biomedical research to clinical practice, and vice versa. Applications of text mining fall both into the category of T1 translational research-translating basic science results into new interventions-and T2 translational research, or translational research for public health. Potential use cases include better phenotyping of research subjects, and pharmacogenomic research. A variety of methods for evaluating text mining applications exist, including corpora, structured test suites, and post hoc judging. Two basic principles of linguistic structure are relevant for building text mining applications. One is that linguistic structure consists of multiple levels. The other is that every level of linguistic structure is characterized by ambiguity. There are two basic approaches to text mining: rule-based, also known as knowledge-based; and machine-learning-based, also known as statistical. Many systems are hybrids of the two approaches. Shared tasks have had a strong effect on the direction of the field. Like all translational bioinformatics software, text mining software for translational bioinformatics can be considered health-critical and should be subject to the strictest standards of quality assurance and software testing.

  10. Practical vision based degraded text recognition system

    Science.gov (United States)

    Mohammad, Khader; Agaian, Sos; Saleh, Hani

    2011-02-01

    Rapid growth and progress in the medical, industrial, security and technology fields means more and more consideration for the use of camera based optical character recognition (OCR) Applying OCR to scanned documents is quite mature, and there are many commercial and research products available on this topic. These products achieve acceptable recognition accuracy and reasonable processing times especially with trained software, and constrained text characteristics. Even though the application space for OCR is huge, it is quite challenging to design a single system that is capable of performing automatic OCR for text embedded in an image irrespective of the application. Challenges for OCR systems include; images are taken under natural real world conditions, Surface curvature, text orientation, font, size, lighting conditions, and noise. These and many other conditions make it extremely difficult to achieve reasonable character recognition. Performance for conventional OCR systems drops dramatically as the degradation level of the text image quality increases. In this paper, a new recognition method is proposed to recognize solid or dotted line degraded characters. The degraded text string is localized and segmented using a new algorithm. The new method was implemented and tested using a development framework system that is capable of performing OCR on camera captured images. The framework allows parameter tuning of the image-processing algorithm based on a training set of camera-captured text images. Novel methods were used for enhancement, text localization and the segmentation algorithm which enables building a custom system that is capable of performing automatic OCR which can be used for different applications. The developed framework system includes: new image enhancement, filtering, and segmentation techniques which enabled higher recognition accuracies, faster processing time, and lower energy consumption, compared with the best state of the art published

  11. Native Language Processing using Exegy Text Miner

    Energy Technology Data Exchange (ETDEWEB)

    Compton, J

    2007-10-18

    Lawrence Livermore National Laboratory's New Architectures Testbed recently evaluated Exegy's Text Miner appliance to assess its applicability to high-performance, automated native language analysis. The evaluation was performed with support from the Computing Applications and Research Department in close collaboration with Global Security programs, and institutional activities in native language analysis. The Exegy Text Miner is a special-purpose device for detecting and flagging user-supplied patterns of characters, whether in streaming text or in collections of documents at very high rates. Patterns may consist of simple lists of words or complex expressions with sub-patterns linked by logical operators. These searches are accomplished through a combination of specialized hardware (i.e., one or more field-programmable gates arrays in addition to general-purpose processors) and proprietary software that exploits these individual components in an optimal manner (through parallelism and pipelining). For this application the Text Miner has performed accurately and reproducibly at high speeds approaching those documented by Exegy in its technical specifications. The Exegy Text Miner is primarily intended for the single-byte ASCII characters used in English, but at a technical level its capabilities are language-neutral and can be applied to multi-byte character sets such as those found in Arabic and Chinese. The system is used for searching databases or tracking streaming text with respect to one or more lexicons. In a real operational environment it is likely that data would need to be processed separately for each lexicon or search technique. However, the searches would be so fast that multiple passes should not be considered as a limitation a priori. Indeed, it is conceivable that large databases could be searched as often as necessary if new queries were deemed worthwhile. This project is concerned with evaluating the Exegy Text Miner installed in the

  12. Benchmarking infrastructure for mutation text mining

    Science.gov (United States)

    2014-01-01

    Background Experimental research on the automatic extraction of information about mutations from texts is greatly hindered by the lack of consensus evaluation infrastructure for the testing and benchmarking of mutation text mining systems. Results We propose a community-oriented annotation and benchmarking infrastructure to support development, testing, benchmarking, and comparison of mutation text mining systems. The design is based on semantic standards, where RDF is used to represent annotations, an OWL ontology provides an extensible schema for the data and SPARQL is used to compute various performance metrics, so that in many cases no programming is needed to analyze results from a text mining system. While large benchmark corpora for biological entity and relation extraction are focused mostly on genes, proteins, diseases, and species, our benchmarking infrastructure fills the gap for mutation information. The core infrastructure comprises (1) an ontology for modelling annotations, (2) SPARQL queries for computing performance metrics, and (3) a sizeable collection of manually curated documents, that can support mutation grounding and mutation impact extraction experiments. Conclusion We have developed the principal infrastructure for the benchmarking of mutation text mining tasks. The use of RDF and OWL as the representation for corpora ensures extensibility. The infrastructure is suitable for out-of-the-box use in several important scenarios and is ready, in its current state, for initial community adoption. PMID:24568600

  13. n-Gram-Based Text Compression

    Directory of Open Access Journals (Sweden)

    Vu H. Nguyen

    2016-01-01

    Full Text Available We propose an efficient method for compressing Vietnamese text using n-gram dictionaries. It has a significant compression ratio in comparison with those of state-of-the-art methods on the same dataset. Given a text, first, the proposed method splits it into n-grams and then encodes them based on n-gram dictionaries. In the encoding phase, we use a sliding window with a size that ranges from bigram to five grams to obtain the best encoding stream. Each n-gram is encoded by two to four bytes accordingly based on its corresponding n-gram dictionary. We collected 2.5 GB text corpus from some Vietnamese news agencies to build n-gram dictionaries from unigram to five grams and achieve dictionaries with a size of 12 GB in total. In order to evaluate our method, we collected a testing set of 10 different text files with different sizes. The experimental results indicate that our method achieves compression ratio around 90% and outperforms state-of-the-art methods.

  14. Introduction, Critical Text logy and Textual Criticism

    Directory of Open Access Journals (Sweden)

    فرزاد قائمی

    2013-06-01

    Full Text Available Asadi’s Shahnameh is a great epic consisting of twenty-four thousand distiches and is attributed to Asadi or another poet of the same nickname. This work was created in the same line of development as Ferdowsi’s Shahnameh. The main theme is the old campaign of Soleymān to Iran to confront with Rostam and Keykhosrow and to repeat the pattern of Rostam’s battles with his children in a state of anonymity. The text structure is episodic with numerous central characters. The narratives are for the most part derived from oral literature. Textual evidence demonstrates that the poet is Shiite. The narrative content, chronogram as well as the literary and linguistic style of one of the manuscripts reveal that the text was written in the ninth century (probably 809 A.H.. The article first introduces the text and the origin of its narratives in oral literature; it then proceeds with the study of the narrative structure of the epic using three available manuscripts dating back to the thirteenth and fourteenth centuries (A.H.. Textology and Textual Criticism have been employed as the research methodology. The literary and linguistic features of the text have also been examined at three levels: lexical, syntactic and rhetorical.

  15. ERRORS AND DIFFICULTIES IN TRANSLATING LEGAL TEXTS

    Directory of Open Access Journals (Sweden)

    Camelia, CHIRILA

    2014-11-01

    Full Text Available Nowadays the accurate translation of legal texts has become highly important as the mistranslation of a passage in a contract, for example, could lead to lawsuits and loss of money. Consequently, the translation of legal texts to other languages faces many difficulties and only professional translators specialised in legal translation should deal with the translation of legal documents and scholarly writings. The purpose of this paper is to analyze translation from three perspectives: translation quality, errors and difficulties encountered in translating legal texts and consequences of such errors in professional translation. First of all, the paper points out the importance of performing a good and correct translation, which is one of the most important elements to be considered when discussing translation. Furthermore, the paper presents an overview of the errors and difficulties in translating texts and of the consequences of errors in professional translation, with applications to the field of law. The paper is also an approach to the differences between languages (English and Romanian that can hinder comprehension for those who have embarked upon the difficult task of translation. The research method that I have used to achieve the objectives of the paper was the content analysis of various Romanian and foreign authors' works.

  16. Handwriting segmentation of unconstrained Oriya text

    Indian Academy of Sciences (India)

    N Tripathy; U Pal

    2006-12-01

    Segmentation of handwritten text into lines, words and characters is one of the important steps in the handwritten text recognition process. In this paper we propose a water reservoir concept-based scheme for segmentation of unconstrained Oriya handwritten text into individual characters. Here, at first, the text image is segmented into lines, and the lines are then segmented into individual words. For line segmentation, the document is divided into vertical stripes. Analysing the heights of the water reservoirs obtained from different components of the document, the width of a stripe is calculated. Stripe-wise horizontal histograms are then computed and the relationship of the peak–valley points of the histograms is used for line segmentation. Based on vertical projection profiles and structural features of Oriya characters, text lines are segmented into words. For character segmentation, at first, the isolated and connected (touching) characters in a word are detected. Using structural, topological and water reservoir concept-based features, characters of the word that touch are then segmented. From experiments we have observed that the proposed “touching character” segmentation module has 96·7% accuracy for two-character touching strings.

  17. La dimension diachronique des textes beckettiens

    Directory of Open Access Journals (Sweden)

    Carla Taban

    2007-07-01

    Full Text Available La présente discussion se propose de montrer que les aspects diachroniques du français et de l’anglais – entendues restrictivement comme évolutions sémantiques des lexèmes des deux idiomes et non pas comme évolutions syntaxiques ou phonétiques de ceux-ci – opèrent dans les textes de Beckett en tant que modalités po(ïétiques de différenciation de sens. Autrement dit, la manière dont les unités lexicales sont inscrites dans leurs environnements intra-textuel (d’un texte donné et intra-inter-textuel (d’une paire bilingue de textes correspondants permet, voire requiert de les actualiser simultanément avec plusieurs significations, dont certaines sont originaires ou historiques. La dimension diachronique dans les deux langues offre ainsi à Beckett un outil d’accroissement du potentiel signifiant de ses textes.

  18. A NOVEL MULTIDICTIONARY BASED TEXT COMPRESSION

    Directory of Open Access Journals (Sweden)

    Y. Venkataramani

    2012-01-01

    Full Text Available The amount of digital contents grows at a faster speed as a result does the demand for communicate them. On the other hand, the amount of storage and bandwidth increases at a slower rate. Thus powerful and efficient compression methods are required. The repetition of words and phrases cause the reordered text much more compressible than the original text. On the whole system is fast and achieves close to the best result on the test files. In this study a novel fast dictionary based text compression technique MBRH (Multidictionary with burrows wheeler transforms, Run length coding and Huffman coding is proposed for the purpose of obtaining improved performance on various document sizes. MBRH algorithm comprises of two stages, the first stage is concerned with the conversion of input text into dictionary based compression .The second stage deals mainly with reduction of the redundancy in multidictionary based compression by using BWT, RLE and Huffman coding. Bib test files of input size of 111, 261 bytes achieves compression ratio of 0.192, bit rate of 1.538 and high speed using MBRH algorithm. The algorithm has attained a good compression ratio, reduction of bit rate and the increase in execution speed.

  19. OMG! Texting in Class = U Fail :( Empirical Evidence That Text Messaging During Class Disrupts Comprehension

    Science.gov (United States)

    Gingerich, Amanda C.; Lineweaver, Tara T.

    2014-01-01

    In two experiments, we examined the effects of text messaging during lecture on comprehension of lecture material. Students (in Experiment 1) and randomly assigned participants (in Experiment 2) in a text message condition texted a prescribed conversation while listening to a brief lecture. Students and participants in the no-text condition…

  20. OMG! Texting in Class = U Fail :( Empirical Evidence That Text Messaging During Class Disrupts Comprehension

    Science.gov (United States)

    Gingerich, Amanda C.; Lineweaver, Tara T.

    2014-01-01

    In two experiments, we examined the effects of text messaging during lecture on comprehension of lecture material. Students (in Experiment 1) and randomly assigned participants (in Experiment 2) in a text message condition texted a prescribed conversation while listening to a brief lecture. Students and participants in the no-text condition…

  1. Putting Text Complexity in Context: Refocusing on Comprehension of Complex Text

    Science.gov (United States)

    Valencia, Sheila W.; Wixson, Karen K.; Pearson, P. David

    2014-01-01

    The Common Core State Standards for English Language Arts have prompted enormous attention to issues of text complexity. The purpose of this article is to put text complexity in perspective by moving from a primary focus on the text itself to a focus on the comprehension of complex text. We argue that a focus on comprehension is at the heart of…

  2. Exploring the Effect of Background Knowledge and Text Cohesion on Learning from Texts in Computer Science

    Science.gov (United States)

    Gasparinatou, Alexandra; Grigoriadou, Maria

    2013-01-01

    In this study, we examine the effect of background knowledge and local cohesion on learning from texts. The study is based on construction-integration model. Participants were 176 undergraduate students who read a Computer Science text. Half of the participants read a text of maximum local cohesion and the other a text of minimum local cohesion.…

  3. METACOGNITIVE READING STRATEGIES USED IN CHEMISTRY TEXTS

    Directory of Open Access Journals (Sweden)

    Wilmer Orlando López

    2008-07-01

    Full Text Available This research aimed at analyzing the metacognitive strategies of a group of junior-high school students when reading a chemistry text. The methodology used was of the type of descriptive and of the field. The information was gathered through a 13-item questionnaire subject to valuation and approval by a group of specialists in reading, evaluation, and chemistry teaching. The sample was composed by 27 ninth-grade students in a public institution in Mérida downtown, Venezuela. It is concluded that no conscious or reflective reading is shown, i.e. metacognitive reading strategies were not applied by the students, and this could have allowed them to get a whole comprehension of the text, which is vital for a significant learning.

  4. Word-Sized Graphics for Scientific Texts.

    Science.gov (United States)

    Beck, Fabian; Weiskopf, Daniel

    2017-02-24

    Generating visualizations at the size of a word creates dense information representations often called sparklines. The integration of word-sized graphics into text could avoid additional cognitive load caused by splitting the readers' attention between figures and text. In scientific publications, these graphics make statements easier to understand and verify because additional quantitative information is available where needed. In this work, we perform a literature review to find out how researchers have already applied such word-sized representations. Illustrating the versatility of the approach, we leverage these representations for reporting empirical and bibliographic data in three application examples. For interactive Web-based publications, we explore levels of interactivity and discuss interaction patterns to link visualization and text. We finally call the visualization community to be a pioneer in exploring new visualization-enriched and interactive publication formats.

  5. Tenosynovitis caused by texting: an emerging disease.

    Science.gov (United States)

    Ashurst, John V; Turco, Domenic A; Lieb, Brian E

    2010-05-01

    De Quervain tenosynovitis is characterized by pain that overlies the radial aspect of the wrist and that is aggravated by ulnar deviation of the hand. The most common cause of de Quervain tenosynovitis is overuse of the thumb musculature. The authors report a case of bilateral de Quervain tenosynovitis observed in a woman aged 48 years at a rural outpatient primary care office. The condition was induced by the patient's excessive use of the text messaging feature on her cellular telephone. Treatment, including naproxen, cock-up wrist splints, and limitation of texting, resulted in complete recovery of the patient. The authors urge physicians to be aware of the potential association between a patient's tenosynovitis symptoms and excessive texting.

  6. Text messaging is a useful reminder tool.

    Science.gov (United States)

    Balzer, Ben W R; Kelly, Patrick J; Hazell, Philip; Paxton, Karen; Hawke, Catherine; Steinbeck, Katharine S

    2014-07-01

    Longitudinal studies of adolescents must be 'adolescent-friendly', to collect data and to encourage maintenance in the study cohort. Text messaging may offer a feasible means to do both. Adolescents in the Adolescent Rural Cohort, Hormones and Health, Education, Environments and Relationships (ARCHER) study (n=342) are sent automated text messages every 3 months, prompting biological specimen collection. A total of 99.2% of participants (or their parents) owned a mobile phone, of which 89.1% of participants responded to text messages and 97.3% of intended urine samples were collected. The average time to provide a urine sample after prompting correlated with time to reply to Short Message Service (SMS). This study shows SMS can be used effectively in longitudinal research involving adolescents and is feasible and useful as a reminder tool for regular biological specimen collection.

  7. Text mining patents for biomedical knowledge.

    Science.gov (United States)

    Rodriguez-Esteban, Raul; Bundschus, Markus

    2016-06-01

    Biomedical text mining of scientific knowledge bases, such as Medline, has received much attention in recent years. Given that text mining is able to automatically extract biomedical facts that revolve around entities such as genes, proteins, and drugs, from unstructured text sources, it is seen as a major enabler to foster biomedical research and drug discovery. In contrast to the biomedical literature, research into the mining of biomedical patents has not reached the same level of maturity. Here, we review existing work and highlight the associated technical challenges that emerge from automatically extracting facts from patents. We conclude by outlining potential future directions in this domain that could help drive biomedical research and drug discovery. Copyright © 2016 Elsevier Ltd. All rights reserved.

  8. Mining free-text medical records.

    Science.gov (United States)

    Heinze, D T; Morsch, M L; Holbrook, J

    2001-01-01

    Text mining projects can be characterized along four parameters: 1) the demands of the market in terms of target domain and specificity and depth of queries; 2) the volume and quality of text in the target domain; 3) the text mining process requirements; and 4) the quality assurance process that validates the extracted data. In this paper, we provide lessons learned and results from a large-scale commercial project using Natural Language Processing (NLP) for mining the transcriptions of dictated clinical records in a variety of medical specialties. We conclude that the current state-of-the-art in NLP is suitable for mining information of moderate content depth across a diverse collection of medical settings and specialties.

  9. Text Classification Using Sentential Frequent Itemsets

    Institute of Scientific and Technical Information of China (English)

    Shi-Zhu Liu; He-Ping Hu

    2007-01-01

    Text classification techniques mostly rely on single term analysis of the document data set, while more concepts,especially the specific ones, are usually conveyed by set of terms. To achieve more accurate text classifier, more informative feature including frequent co-occurring words in the same sentence and their weights are particularly important in such scenarios. In this paper, we propose a novel approach using sentential frequent itemset, a concept comes from association rule mining, for text classification, which views a sentence rather than a document as a transaction, and uses a variable precision rough set based method to evaluate each sentential frequent itemset's contribution to the classification. Experiments over the Reuters and newsgroup corpus are carried out, which validate the practicability of the proposed system.

  10. Tagging and Morphological Disambiguation of Turkish Text

    CERN Document Server

    Oflazer, K; Oflazer, Kemal; Kuruoz, Ilker

    1994-01-01

    Automatic text tagging is an important component in higher level analysis of text corpora, and its output can be used in many natural language processing applications. In languages like Turkish or Finnish, with agglutinative morphology, morphological disambiguation is a very crucial process in tagging, as the structures of many lexical forms are morphologically ambiguous. This paper describes a POS tagger for Turkish text based on a full-scale two-level specification of Turkish morphology that is based on a lexicon of about 24,000 root words. This is augmented with a multi-word and idiomatic construct recognizer, and most importantly morphological disambiguator based on local neighborhood constraints, heuristics and limited amount of statistical information. The tagger also has functionality for statistics compilation and fine tuning of the morphological analyzer, such as logging erroneous morphological parses, commonly used roots, etc. Preliminary results indicate that the tagger can tag about 98-99\\% of the...

  11. Urdu Text Classification using Majority Voting

    Directory of Open Access Journals (Sweden)

    Muhammad Usman

    2016-08-01

    Full Text Available Text classification is a tool to assign the predefined categories to the text documents using supervised machine learning algorithms. It has various practical applications like spam detection, sentiment detection, and detection of a natural language. Based on the idea we applied five well-known classification techniques on Urdu language corpus and assigned a class to the documents using majority voting. The corpus contains 21769 news documents of seven categories (Business, Entertainment, Culture, Health, Sports, and Weird. The algorithms were not able to work directly on the data, so we applied the preprocessing techniques like tokenization, stop words removal and a rule-based stemmer. After preprocessing 93400 features are extracted from the data to apply machine learning algorithms. Furthermore, we achieved up to 94% precision and recall using majority voting.

  12. Preprocessing and Morphological Analysis in Text Mining

    Directory of Open Access Journals (Sweden)

    Krishna Kumar Mohbey Sachin Tiwari

    2011-12-01

    Full Text Available This paper is based on the preprocessing activities which is performed by the software or language translators before applying mining algorithms on the huge data. Text mining is an important area of Data mining and it plays a vital role for extracting useful information from the huge database or data ware house. But before applying the text mining or information extraction process, preprocessing is must because the given data or dataset have the noisy, incomplete, inconsistent, dirty and unformatted data. In this paper we try to collect the necessary requirements for preprocessing. When we complete the preprocess task then we can easily extract the knowledgful information using mining strategy. This paper also provides the information about the analysis of data like tokenization, stemming and semantic analysis like phrase recognition and parsing. This paper also collect the procedures for preprocessing data i.e. it describe that how the stemming, tokenization or parsing are applied.

  13. WYLBUR reference manual. [For interactive text editing

    Energy Technology Data Exchange (ETDEWEB)

    Krupp, R.F.; Messina, P.C.; Peavler, J.M.; Schustack, S.; Starai, T.

    1977-04-01

    WYLBUR is a system for manipulating various kinds of text, such as computer programs, manuscripts, letters, forms, articles, or reports. Its on-line interactive text-editing capabilities allow the user to create, change, and correct text, and to search and display it. WYLBUR also has facilities for job submission and retrieval from remote terminals that make it possible for a user to inquire about the status of any job in the system, cancel jobs that are executing or awaiting execution, reroute output, raise job priority, or get information on the backlog of batch jobs. WYLBUR also has excellent recovery capabilities and a fast response time. This manual describes the WYLBUR version currently used at ANL. It is intended primarily as a reference manual; thus, examples of WYLBUR commands are kept to a minimum. (RWR)

  14. Présentation des textes

    OpenAIRE

    Freitag, Michel

    2015-01-01

    Les textes choisis n’ont pas pour but la reconstitution ou le survol d’une carrière, mais la mise en valeur des étapes saillantes d’une double éclosion, celle d’Elizabeth Cady Stanton comme féministe et avec elle celle du mouvement de défense des droits des femmes aux États-Unis. Un tel objectif implique donc des limites temporelles en amont et en aval de l’événement fondateur que fut la Convention de Seneca Falls en 1848, origine du texte non moins fondateur de la Déclaration de sentiments r...

  15. Text Classification: A Sequential Reading Approach

    CERN Document Server

    Dulac-Arnold, Gabriel; Gallinari, Patrick

    2011-01-01

    We propose to model the text classification process as a sequential decision process. In this process, an agent learns to classify documents into topics while reading the document sentences sequentially and learns to stop as soon as enough information was read for deciding. The proposed algorithm is based on a modelisation of Text Classification as a Markov Decision Process and learns by using Reinforcement Learning. Experiments on four different classical mono-label corpora show that the proposed approach performs comparably to classical SVM approaches for large training sets, and better for small training sets. In addition, the model automatically adapts its reading process to the quantity of training information provided.

  16. Lidový text a grafika

    OpenAIRE

    Lukš, Jiří

    2015-01-01

    The dissertation "The Folk Text and Graphic Art" studies a song as a topic for graphic and book production. Within the praktical part of the dissertation the author works up a graphic design of a original song-book, which represent his former music band's texts. He surveys the clash of today's fashionable music trends with folk traditions in his region and asks a question about the character of the contemporary folk song. The author's song-book is one of answers. On the base of this effort he...

  17. Spatial Text Visualization Using Automatic Typographic Maps.

    Science.gov (United States)

    Afzal, S; Maciejewski, R; Jang, Yun; Elmqvist, N; Ebert, D S

    2012-12-01

    We present a method for automatically building typographic maps that merge text and spatial data into a visual representation where text alone forms the graphical features. We further show how to use this approach to visualize spatial data such as traffic density, crime rate, or demographic data. The technique accepts a vector representation of a geographic map and spatializes the textual labels in the space onto polylines and polygons based on user-defined visual attributes and constraints. Our sample implementation runs as a Web service, spatializing shape files from the OpenStreetMap project into typographic maps for any region.

  18. There is a Text in 'The Balloon'

    DEFF Research Database (Denmark)

    Elias, Camelia

    2009-01-01

    From the Introduction: Camelia Elias' "There is a Text in 'The Balloon': Donald Barthelme's Allegorical Flights" provides its reader with a much-need and useful distinction between fantasy and the fantastic: "whereas fantasy in critical discourse can be aligned with allegory, in which a supernatu......From the Introduction: Camelia Elias' "There is a Text in 'The Balloon': Donald Barthelme's Allegorical Flights" provides its reader with a much-need and useful distinction between fantasy and the fantastic: "whereas fantasy in critical discourse can be aligned with allegory, in which...

  19. A Sequential Algorithm for Training Text Classifiers

    CERN Document Server

    Lewis, D D; Lewis, David D.; Gale, William A.

    1994-01-01

    The ability to cheaply train text classifiers is critical to their use in information retrieval, content analysis, natural language processing, and other tasks involving data which is partly or fully textual. An algorithm for sequential sampling during machine learning of statistical classifiers was developed and tested on a newswire text categorization task. This method, which we call uncertainty sampling, reduced by as much as 500-fold the amount of training data that would have to be manually classified to achieve a given level of effectiveness.

  20. Multilingual Topic Models for Unaligned Text

    CERN Document Server

    Boyd-Graber, Jordan

    2012-01-01

    We develop the multilingual topic model for unaligned text (MuTo), a probabilistic model of text that is designed to analyze corpora composed of documents in two languages. From these documents, MuTo uses stochastic EM to simultaneously discover both a matching between the languages and multilingual latent topics. We demonstrate that MuTo is able to find shared topics on real-world multilingual corpora, successfully pairing related documents across languages. MuTo provides a new framework for creating multilingual topic models without needing carefully curated parallel corpora and allows applications built using the topic model formalism to be applied to a much wider class of corpora.