WorldWideScience

Sample records for include full text

  1. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    release of intracellular myocyte components. Clinical sequelae to rhabdomyolysis include hypovolemia, hyperkalemia, metabolic acidosis and acute renal failure which is the most serious complication. Renal failure is caused by renal vasoconstriction, myoglobin and heme protein toxicity. Usual explanations of the cause of.

  2. Managing nuclear knowledge: IAEA activities and international coordination. Including resource material full text CD-ROM

    International Nuclear Information System (INIS)

    2005-06-01

    The present CD-ROM summarizes some activities carried out by the Departments of Nuclear Energy and Nuclear Safety and Security in the area of nuclear knowledge management in the period 2003-2005. It comprises, as open resource, most of the relevant documents in full text, including policy level documents, reports, presentation material by Member States and meeting summaries. The collection starts with a reprint of the report to the IAEA General Conference 2004 on Nuclear Knowledge [GOV/2004/56-GC(48)/12] summarizing the developments in nuclear knowledge management since the 47th session of the General Conference in 2003 and covers Managing Nuclear Knowledge including safety issues and Information and Strengthening Education and Training for Capacity Building. It contains an excerpt on Nuclear Knowledge from the General Conference Resolution [GC(48)/RES/13] on Strengthening the Agency's Activities Related to Nuclear Science, Technology and Applications. On the CD-ROM itself, all documents can easily be accessed by clicking on their titles on the subject pages (also printed at the end of this Working Material). Part 1 of the CD-ROM covers the activities in the period 2003-2005 and part 2 presents a resource material full text CD-ROM on Managing Nuclear Knowledge issued in October 2003

  3. download full text

    African Journals Online (AJOL)

    In English, this class includes the particles how, too, so, and as (Ibid). (3) Mary is ...... Doctoral thesis (unpublished), University of Dar es Salaam. Goodness, D. .... manga. fat. corpulent. 36. mbindipindi. green. 37. mwalo. naughty. absurd. 38.

  4. FTP: Full-Text Publishing?

    Science.gov (United States)

    Jul, Erik

    1992-01-01

    Describes the use of file transfer protocol (FTP) on the INTERNET computer network and considers its use as an electronic publishing system. The differing electronic formats of text files are discussed; the preparation and access of documents are described; and problems are addressed, including a lack of consistency. (LRW)

  5. The Medline/full-text research project.

    Science.gov (United States)

    McKinin, E J; Sievert, M; Johnson, E D; Mitchell, J A

    1991-05-01

    This project was designed to test the relative efficacy of index terms and full-text for the retrieval of documents in those MEDLINE journals for which full-text searching was also available. The full-text files used were MEDIS from Mead Data Central and CCML from BRS Information Technologies. One hundred clinical medical topics were searched in these two files as well as the MEDLINE file to accumulate the necessary data. It was found that full-text identified significantly more relevant articles than did the indexed file, MEDLINE. The full-text searches, however, lacked the precision of searches done in the indexed file. Most relevant items missed in the full-text files, but identified in MEDLINE, were missed because the searcher failed to account for some aspect of natural language, used a logical or positional operator that was too restrictive, or included a concept which was implied, but not expressed in the natural language. Very few of the unique relevant full-text citations would have been retrieved by title or abstract alone. Finally, as of July, 1990 the more current issue of a journal was just as likely to appear in MEDLINE as in one of the full-text files.

  6. Academic Journal Embargoes and Full Text Databases.

    Science.gov (United States)

    Brooks, Sam

    2003-01-01

    Documents the reasons for embargoes of academic journals in full text databases (i.e., publisher-imposed delays on the availability of full text content) and provides insight regarding common misconceptions. Tables present data on selected journals covering a cross-section of subjects and publishers and comparing two full text business databases.…

  7. The Weaknesses of Full-Text Searching

    Science.gov (United States)

    Beall, Jeffrey

    2008-01-01

    This paper provides a theoretical critique of the deficiencies of full-text searching in academic library databases. Because full-text searching relies on matching words in a search query with words in online resources, it is an inefficient method of finding information in a database. This matching fails to retrieve synonyms, and it also retrieves…

  8. Where Full-Text Is Viable.

    Science.gov (United States)

    Cotton, P. L.

    1987-01-01

    Defines two types of online databases: source, referring to those intended to be complete in themselves, whether full-text or abstracts; and bibliographic, meaning those that are not complete. Predictions are made about the future growth rate of these two types of databases, as well as full-text versus abstract databases. (EM)

  9. Systematic characterizations of text similarity in full text biomedical publications.

    Science.gov (United States)

    Sun, Zhaohui; Errami, Mounir; Long, Tara; Renard, Chris; Choradia, Nishant; Garner, Harold

    2010-09-15

    Computational methods have been used to find duplicate biomedical publications in MEDLINE. Full text articles are becoming increasingly available, yet the similarities among them have not been systematically studied. Here, we quantitatively investigated the full text similarity of biomedical publications in PubMed Central. 72,011 full text articles from PubMed Central (PMC) were parsed to generate three different datasets: full texts, sections, and paragraphs. Text similarity comparisons were performed on these datasets using the text similarity algorithm eTBLAST. We measured the frequency of similar text pairs and compared it among different datasets. We found that high abstract similarity can be used to predict high full text similarity with a specificity of 20.1% (95% CI [17.3%, 23.1%]) and sensitivity of 99.999%. Abstract similarity and full text similarity have a moderate correlation (Pearson correlation coefficient: -0.423) when the similarity ratio is above 0.4. Among pairs of articles in PMC, method sections are found to be the most repetitive (frequency of similar pairs, methods: 0.029, introduction: 0.0076, results: 0.0043). In contrast, among a set of manually verified duplicate articles, results are the most repetitive sections (frequency of similar pairs, results: 0.94, methods: 0.89, introduction: 0.82). Repetition of introduction and methods sections is more likely to be committed by the same authors (odds of a highly similar pair having at least one shared author, introduction: 2.31, methods: 1.83, results: 1.03). There is also significantly more similarity in pairs of review articles than in pairs containing one review and one nonreview paper (frequency of similar pairs: 0.0167 and 0.0023, respectively). While quantifying abstract similarity is an effective approach for finding duplicate citations, a comprehensive full text analysis is necessary to uncover all potential duplicate citations in the scientific literature and is helpful when

  10. Selecting Full-Text Undergraduate Periodicals Databases.

    Science.gov (United States)

    Still, Julie M.; Kassabian, Vibiana

    1999-01-01

    Examines how libraries and librarians can compare full-text general periodical indices, using ProQuest Direct, Periodical Abstracts (via Ovid), and EBSCOhost as examples. Explores breadth and depth of coverage; manipulation of results (email/download/print); ease of use (searching); and indexing quirks. (AEF)

  11. Multilingual access to full text databases

    International Nuclear Information System (INIS)

    Fluhr, C.; Radwan, K.

    1990-05-01

    Many full text databases are available in only one language, or more, they may contain documents in different languages. Even if the user is able to understand the language of the documents in the database, it could be easier for him to express his need in his own language. For the case of databases containing documents in different languages, it is more simple to formulate the query in one language only and to retrieve documents in different languages. This paper present the developments and the first experiments of multilingual search, applied to french-english pair, for text data in nuclear field, based on the system SPIRIT. After reminding the general problems of full text databases search by queries formulated in natural language, we present the methods used to reformulate the queries and show how they can be expanded for multilingual search. The first results on data in nuclear field are presented (AFCEN norms and INIS abstracts). 4 refs

  12. Database citation in full text biomedical articles.

    Science.gov (United States)

    Kafkas, Şenay; Kim, Jee-Hyub; McEntyre, Johanna R

    2013-01-01

    Molecular biology and literature databases represent essential infrastructure for life science research. Effective integration of these data resources requires that there are structured cross-references at the level of individual articles and biological records. Here, we describe the current patterns of how database entries are cited in research articles, based on analysis of the full text Open Access articles available from Europe PMC. Focusing on citation of entries in the European Nucleotide Archive (ENA), UniProt and Protein Data Bank, Europe (PDBe), we demonstrate that text mining doubles the number of structured annotations of database record citations supplied in journal articles by publishers. Many thousands of new literature-database relationships are found by text mining, since these relationships are also not present in the set of articles cited by database records. We recommend that structured annotation of database records in articles is extended to other databases, such as ArrayExpress and Pfam, entries from which are also cited widely in the literature. The very high precision and high-throughput of this text-mining pipeline makes this activity possible both accurately and at low cost, which will allow the development of new integrated data services.

  13. SSRF-PDM and its full-text retrieval improvement

    International Nuclear Information System (INIS)

    Tong Xingfan; Deng Huiyu; Li Zhiming

    2011-01-01

    Project and data management is essential for Shanghai Synchrotron Radiation Facility (SSRF) which is a huge scientific platform for science research and technology development in China. With Product Data Management (PDM) system, SSRF improves its information service greatly. In this paper, we introduce the network structure, configuration modules and client terminals of the PDM system and the improvement in full-text retrieval subsystem, including its algorithms and details of implement in order to optimize the retrieval system.(authors)

  14. 48 CFR 2852.102-270 - Incorporation in full text.

    Science.gov (United States)

    2010-10-01

    ... 48 Federal Acquisition Regulations System 6 2010-10-01 2010-10-01 true Incorporation in full text... 2852.102-270 Incorporation in full text. JAR provisions or clauses shall be incorporated in solicitations and contracts in full text. ...

  15. 48 CFR 1952.102-2 - Incorporation in full text.

    Science.gov (United States)

    2010-10-01

    ... 48 Federal Acquisition Regulations System 6 2010-10-01 2010-10-01 true Incorporation in full text... Clauses 1952.102-2 Incorporation in full text. All IAAR provisions and clauses shall be incorporated in solicitations and/or contracts in full text. ...

  16. Is searching full text more effective than searching abstracts?

    Directory of Open Access Journals (Sweden)

    Lin Jimmy

    2009-02-01

    Full Text Available Abstract Background With the growing availability of full-text articles online, scientists and other consumers of the life sciences literature now have the ability to go beyond searching bibliographic records (title, abstract, metadata to directly access full-text content. Motivated by this emerging trend, I posed the following question: is searching full text more effective than searching abstracts? This question is answered by comparing text retrieval algorithms on MEDLINE® abstracts, full-text articles, and spans (paragraphs within full-text articles using data from the TREC 2007 genomics track evaluation. Two retrieval models are examined: bm25 and the ranking algorithm implemented in the open-source Lucene search engine. Results Experiments show that treating an entire article as an indexing unit does not consistently yield higher effectiveness compared to abstract-only search. However, retrieval based on spans, or paragraphs-sized segments of full-text articles, consistently outperforms abstract-only search. Results suggest that highest overall effectiveness may be achieved by combining evidence from spans and full articles. Conclusion Users searching full text are more likely to find relevant articles than searching only abstracts. This finding affirms the value of full text collections for text retrieval and provides a starting point for future work in exploring algorithms that take advantage of rapidly-growing digital archives. Experimental results also highlight the need to develop distributed text retrieval algorithms, since full-text articles are significantly longer than abstracts and may require the computational resources of multiple machines in a cluster. The MapReduce programming model provides a convenient framework for organizing such computations.

  17. UKPMC: a full text article resource for the life sciences.

    Science.gov (United States)

    McEntyre, Johanna R; Ananiadou, Sophia; Andrews, Stephen; Black, William J; Boulderstone, Richard; Buttery, Paula; Chaplin, David; Chevuru, Sandeepreddy; Cobley, Norman; Coleman, Lee-Ann; Davey, Paul; Gupta, Bharti; Haji-Gholam, Lesley; Hawkins, Craig; Horne, Alan; Hubbard, Simon J; Kim, Jee-Hyub; Lewin, Ian; Lyte, Vic; MacIntyre, Ross; Mansoor, Sami; Mason, Linda; McNaught, John; Newbold, Elizabeth; Nobata, Chikashi; Ong, Ernest; Pillai, Sharmila; Rebholz-Schuhmann, Dietrich; Rosie, Heather; Rowbotham, Rob; Rupp, C J; Stoehr, Peter; Vaughan, Philip

    2011-01-01

    UK PubMed Central (UKPMC) is a full-text article database that extends the functionality of the original PubMed Central (PMC) repository. The UKPMC project was launched as the first 'mirror' site to PMC, which in analogy to the International Nucleotide Sequence Database Collaboration, aims to provide international preservation of the open and free-access biomedical literature. UKPMC (http://ukpmc.ac.uk) has undergone considerable development since its inception in 2007 and now includes both a UKPMC and PubMed search, as well as access to other records such as Agricola, Patents and recent biomedical theses. UKPMC also differs from PubMed/PMC in that the full text and abstract information can be searched in an integrated manner from one input box. Furthermore, UKPMC contains 'Cited By' information as an alternative way to navigate the literature and has incorporated text-mining approaches to semantically enrich content and integrate it with related database resources. Finally, UKPMC also offers added-value services (UKPMC+) that enable grantees to deposit manuscripts, link papers to grants, publish online portfolios and view citation information on their papers. Here we describe UKPMC and clarify the relationship between PMC and UKPMC, providing historical context and future directions, 10 years on from when PMC was first launched.

  18. Is searching full text more effective than searching abstracts?

    Science.gov (United States)

    Lin, Jimmy

    2009-02-03

    With the growing availability of full-text articles online, scientists and other consumers of the life sciences literature now have the ability to go beyond searching bibliographic records (title, abstract, metadata) to directly access full-text content. Motivated by this emerging trend, I posed the following question: is searching full text more effective than searching abstracts? This question is answered by comparing text retrieval algorithms on MEDLINE abstracts, full-text articles, and spans (paragraphs) within full-text articles using data from the TREC 2007 genomics track evaluation. Two retrieval models are examined: bm25 and the ranking algorithm implemented in the open-source Lucene search engine. Experiments show that treating an entire article as an indexing unit does not consistently yield higher effectiveness compared to abstract-only search. However, retrieval based on spans, or paragraphs-sized segments of full-text articles, consistently outperforms abstract-only search. Results suggest that highest overall effectiveness may be achieved by combining evidence from spans and full articles. Users searching full text are more likely to find relevant articles than searching only abstracts. This finding affirms the value of full text collections for text retrieval and provides a starting point for future work in exploring algorithms that take advantage of rapidly-growing digital archives. Experimental results also highlight the need to develop distributed text retrieval algorithms, since full-text articles are significantly longer than abstracts and may require the computational resources of multiple machines in a cluster. The MapReduce programming model provides a convenient framework for organizing such computations.

  19. Full Text Psychology Journals Available from Popular Library Databases

    Science.gov (United States)

    Joswick, Kathleen E.

    2006-01-01

    The author identified 433 core journals in psychology and investigated their full text availability in popular databases. While 62 percent of the studied journals were available in at least one database, access from individual databases ranged from 1.4 percent to 38.1 percent of the titles. The full text of influential psychology journals is not…

  20. "Free full text articles": where to search for them?

    Science.gov (United States)

    Singh, Ashish; Singh, Manish; Singh, Ajai Kumar; Singh, Deepti; Singh, Pratibha; Sharma, Abhishek

    2011-07-01

    References form the backbone of any medical literature. Presently, because of high inflation, it is very difficult for any library/organization/college to purchase all journals. The condition is even worse for an individual person, such as private practitioners. The solution lies in the free availability of full-text articles. Here, the authors share their experiences about the accessibility of free full-text articles.

  1. Searching for Bill and Jane: Electronic Full-Text Literature.

    Science.gov (United States)

    Still, Julie; Kassabian, Vibiana

    1998-01-01

    Examines electronic full-text literature available on the World Wide Web and on CD-ROM. Discusses authors and genres, electronic texts, and fees. Highlights Shakespeare, Jane Austen, and nature writing. Provides a bibliography of Web guides, specialized Shakespeare pages, and pages dealing with the Shakespeare authorship debate and secondary…

  2. Mining biological networks from full-text articles.

    Science.gov (United States)

    Czarnecki, Jan; Shepherd, Adrian J

    2014-01-01

    The study of biological networks is playing an increasingly important role in the life sciences. Many different kinds of biological system can be modelled as networks; perhaps the most important examples are protein-protein interaction (PPI) networks, metabolic pathways, gene regulatory networks, and signalling networks. Although much useful information is easily accessible in publicly databases, a lot of extra relevant data lies scattered in numerous published papers. Hence there is a pressing need for automated text-mining methods capable of extracting such information from full-text articles. Here we present practical guidelines for constructing a text-mining pipeline from existing code and software components capable of extracting PPI networks from full-text articles. This approach can be adapted to tackle other types of biological network.

  3. Subject Retrieval from Full-Text Databases in the Humanities

    Science.gov (United States)

    East, John W.

    2007-01-01

    This paper examines the problems involved in subject retrieval from full-text databases of secondary materials in the humanities. Ten such databases were studied and their search functionality evaluated, focusing on factors such as Boolean operators, document surrogates, limiting by subject area, proximity operators, phrase searching, wildcards,…

  4. Full text and figure display improves bioscience literature search.

    Directory of Open Access Journals (Sweden)

    Anna Divoli

    Full Text Available When reading bioscience journal articles, many researchers focus attention on the figures and their captions. This observation led to the development of the BioText literature search engine, a freely available Web-based application that allows biologists to search over the contents of Open Access Journals, and see figures from the articles displayed directly in the search results. This article presents a qualitative assessment of this system in the form of a usability study with 20 biologist participants using and commenting on the system. 19 out of 20 participants expressed a desire to use a bioscience literature search engine that displays articles' figures alongside the full text search results. 15 out of 20 participants said they would use a caption search and figure display interface either frequently or sometimes, while 4 said rarely and 1 said undecided. 10 out of 20 participants said they would use a tool for searching the text of tables and their captions either frequently or sometimes, while 7 said they would use it rarely if at all, 2 said they would never use it, and 1 was undecided. This study found evidence, supporting results of an earlier study, that bioscience literature search systems such as PubMed should show figures from articles alongside search results. It also found evidence that full text and captions should be searched along with the article title, metadata, and abstract. Finally, for a subset of users and information needs, allowing for explicit search within captions for figures and tables is a useful function, but it is not entirely clear how to cleanly integrate this within a more general literature search interface. Such a facility supports Open Access publishing efforts, as it requires access to full text of documents and the lifting of restrictions in order to show figures in the search interface.

  5. Full text and figure display improves bioscience literature search.

    Science.gov (United States)

    Divoli, Anna; Wooldridge, Michael A; Hearst, Marti A

    2010-04-14

    When reading bioscience journal articles, many researchers focus attention on the figures and their captions. This observation led to the development of the BioText literature search engine, a freely available Web-based application that allows biologists to search over the contents of Open Access Journals, and see figures from the articles displayed directly in the search results. This article presents a qualitative assessment of this system in the form of a usability study with 20 biologist participants using and commenting on the system. 19 out of 20 participants expressed a desire to use a bioscience literature search engine that displays articles' figures alongside the full text search results. 15 out of 20 participants said they would use a caption search and figure display interface either frequently or sometimes, while 4 said rarely and 1 said undecided. 10 out of 20 participants said they would use a tool for searching the text of tables and their captions either frequently or sometimes, while 7 said they would use it rarely if at all, 2 said they would never use it, and 1 was undecided. This study found evidence, supporting results of an earlier study, that bioscience literature search systems such as PubMed should show figures from articles alongside search results. It also found evidence that full text and captions should be searched along with the article title, metadata, and abstract. Finally, for a subset of users and information needs, allowing for explicit search within captions for figures and tables is a useful function, but it is not entirely clear how to cleanly integrate this within a more general literature search interface. Such a facility supports Open Access publishing efforts, as it requires access to full text of documents and the lifting of restrictions in order to show figures in the search interface.

  6. Layout-aware text extraction from full-text PDF of scientific articles

    Directory of Open Access Journals (Sweden)

    Ramakrishnan Cartic

    2012-05-01

    Full Text Available Abstract Background The Portable Document Format (PDF is the most commonly used file format for online scientific publications. The absence of effective means to extract text from these PDF files in a layout-aware manner presents a significant challenge for developers of biomedical text mining or biocuration informatics systems that use published literature as an information source. In this paper we introduce the ‘Layout-Aware PDF Text Extraction’ (LA-PDFText system to facilitate accurate extraction of text from PDF files of research articles for use in text mining applications. Results Our paper describes the construction and performance of an open source system that extracts text blocks from PDF-formatted full-text research articles and classifies them into logical units based on rules that characterize specific sections. The LA-PDFText system focuses only on the textual content of the research articles and is meant as a baseline for further experiments into more advanced extraction methods that handle multi-modal content, such as images and graphs. The system works in a three-stage process: (1 Detecting contiguous text blocks using spatial layout processing to locate and identify blocks of contiguous text, (2 Classifying text blocks into rhetorical categories using a rule-based method and (3 Stitching classified text blocks together in the correct order resulting in the extraction of text from section-wise grouped blocks. We show that our system can identify text blocks and classify them into rhetorical categories with Precision1 = 0.96% Recall = 0.89% and F1 = 0.91%. We also present an evaluation of the accuracy of the block detection algorithm used in step 2. Additionally, we have compared the accuracy of the text extracted by LA-PDFText to the text from the Open Access subset of PubMed Central. We then compared this accuracy with that of the text extracted by the PDF2Text system, 2commonly used to extract text from PDF

  7. Layout-aware text extraction from full-text PDF of scientific articles.

    Science.gov (United States)

    Ramakrishnan, Cartic; Patnia, Abhishek; Hovy, Eduard; Burns, Gully Apc

    2012-05-28

    The Portable Document Format (PDF) is the most commonly used file format for online scientific publications. The absence of effective means to extract text from these PDF files in a layout-aware manner presents a significant challenge for developers of biomedical text mining or biocuration informatics systems that use published literature as an information source. In this paper we introduce the 'Layout-Aware PDF Text Extraction' (LA-PDFText) system to facilitate accurate extraction of text from PDF files of research articles for use in text mining applications. Our paper describes the construction and performance of an open source system that extracts text blocks from PDF-formatted full-text research articles and classifies them into logical units based on rules that characterize specific sections. The LA-PDFText system focuses only on the textual content of the research articles and is meant as a baseline for further experiments into more advanced extraction methods that handle multi-modal content, such as images and graphs. The system works in a three-stage process: (1) Detecting contiguous text blocks using spatial layout processing to locate and identify blocks of contiguous text, (2) Classifying text blocks into rhetorical categories using a rule-based method and (3) Stitching classified text blocks together in the correct order resulting in the extraction of text from section-wise grouped blocks. We show that our system can identify text blocks and classify them into rhetorical categories with Precision1 = 0.96% Recall = 0.89% and F1 = 0.91%. We also present an evaluation of the accuracy of the block detection algorithm used in step 2. Additionally, we have compared the accuracy of the text extracted by LA-PDFText to the text from the Open Access subset of PubMed Central. We then compared this accuracy with that of the text extracted by the PDF2Text system, 2commonly used to extract text from PDF. Finally, we discuss preliminary error analysis for

  8. Full text clustering and relationship network analysis of biomedical publications.

    Directory of Open Access Journals (Sweden)

    Renchu Guan

    Full Text Available Rapid developments in the biomedical sciences have increased the demand for automatic clustering of biomedical publications. In contrast to current approaches to text clustering, which focus exclusively on the contents of abstracts, a novel method is proposed for clustering and analysis of complete biomedical article texts. To reduce dimensionality, Cosine Coefficient is used on a sub-space of only two vectors, instead of computing the Euclidean distance within the space of all vectors. Then a strategy and algorithm is introduced for Semi-supervised Affinity Propagation (SSAP to improve analysis efficiency, using biomedical journal names as an evaluation background. Experimental results show that by avoiding high-dimensional sparse matrix computations, SSAP outperforms conventional k-means methods and improves upon the standard Affinity Propagation algorithm. In constructing a directed relationship network and distribution matrix for the clustering results, it can be noted that overlaps in scope and interests among BioMed publications can be easily identified, providing a valuable analytical tool for editors, authors and readers.

  9. Solar Energy Education. Renewable energy: a background text. [Includes glossary

    Energy Technology Data Exchange (ETDEWEB)

    1985-01-01

    Some of the most common forms of renewable energy are presented in this textbook for students. The topics include solar energy, wind power hydroelectric power, biomass ocean thermal energy, and tidal and geothermal energy. The main emphasis of the text is on the sun and the solar energy that it yields. Discussions on the sun's composition and the relationship between the earth, sun and atmosphere are provided. Insolation, active and passive solar systems, and solar collectors are the subtopics included under solar energy. (BCS)

  10. Multilingual access to full text databases; Acces multilingue aux bases de donnees en texte integral

    Energy Technology Data Exchange (ETDEWEB)

    Fluhr, C; Radwan, K [Institut National des Sciences et Techniques Nucleaires (INSTN), Centre d` Etudes de Saclay, 91 - Gif-sur-Yvette (France)

    1990-05-01

    Many full text databases are available in only one language, or more, they may contain documents in different languages. Even if the user is able to understand the language of the documents in the database, it could be easier for him to express his need in his own language. For the case of databases containing documents in different languages, it is more simple to formulate the query in one language only and to retrieve documents in different languages. This paper present the developments and the first experiments of multilingual search, applied to french-english pair, for text data in nuclear field, based on the system SPIRIT. After reminding the general problems of full text databases search by queries formulated in natural language, we present the methods used to reformulate the queries and show how they can be expanded for multilingual search. The first results on data in nuclear field are presented (AFCEN norms and INIS abstracts). 4 refs.

  11. Pharmspresso: a text mining tool for extraction of pharmacogenomic concepts and relationships from full text.

    Science.gov (United States)

    Garten, Yael; Altman, Russ B

    2009-02-05

    Pharmacogenomics studies the relationship between genetic variation and the variation in drug response phenotypes. The field is rapidly gaining importance: it promises drugs targeted to particular subpopulations based on genetic background. The pharmacogenomics literature has expanded rapidly, but is dispersed in many journals. It is challenging, therefore, to identify important associations between drugs and molecular entities--particularly genes and gene variants, and thus these critical connections are often lost. Text mining techniques can allow us to convert the free-style text to a computable, searchable format in which pharmacogenomic concepts (such as genes, drugs, polymorphisms, and diseases) are identified, and important links between these concepts are recorded. Availability of full text articles as input into text mining engines is key, as literature abstracts often do not contain sufficient information to identify these pharmacogenomic associations. Thus, building on a tool called Textpresso, we have created the Pharmspresso tool to assist in identifying important pharmacogenomic facts in full text articles. Pharmspresso parses text to find references to human genes, polymorphisms, drugs and diseases and their relationships. It presents these as a series of marked-up text fragments, in which key concepts are visually highlighted. To evaluate Pharmspresso, we used a gold standard of 45 human-curated articles. Pharmspresso identified 78%, 61%, and 74% of target gene, polymorphism, and drug concepts, respectively. Pharmspresso is a text analysis tool that extracts pharmacogenomic concepts from the literature automatically and thus captures our current understanding of gene-drug interactions in a computable form. We have made Pharmspresso available at http://pharmspresso.stanford.edu.

  12. Classification of protein-protein interaction full-text documents using text and citation network features.

    Science.gov (United States)

    Kolchinsky, Artemy; Abi-Haidar, Alaa; Kaur, Jasleen; Hamed, Ahmed Abdeen; Rocha, Luis M

    2010-01-01

    We participated (as Team 9) in the Article Classification Task of the Biocreative II.5 Challenge: binary classification of full-text documents relevant for protein-protein interaction. We used two distinct classifiers for the online and offline challenges: 1) the lightweight Variable Trigonometric Threshold (VTT) linear classifier we successfully introduced in BioCreative 2 for binary classification of abstracts and 2) a novel Naive Bayes classifier using features from the citation network of the relevant literature. We supplemented the supplied training data with full-text documents from the MIPS database. The lightweight VTT classifier was very competitive in this new full-text scenario: it was a top-performing submission in this task, taking into account the rank product of the Area Under the interpolated precision and recall Curve, Accuracy, Balanced F-Score, and Matthew's Correlation Coefficient performance measures. The novel citation network classifier for the biomedical text mining domain, while not a top performing classifier in the challenge, performed above the central tendency of all submissions, and therefore indicates a promising new avenue to investigate further in bibliome informatics.

  13. Full text clustering and relationship network analysis of biomedical publications.

    Science.gov (United States)

    Guan, Renchu; Yang, Chen; Marchese, Maurizio; Liang, Yanchun; Shi, Xiaohu

    2014-01-01

    Rapid developments in the biomedical sciences have increased the demand for automatic clustering of biomedical publications. In contrast to current approaches to text clustering, which focus exclusively on the contents of abstracts, a novel method is proposed for clustering and analysis of complete biomedical article texts. To reduce dimensionality, Cosine Coefficient is used on a sub-space of only two vectors, instead of computing the Euclidean distance within the space of all vectors. Then a strategy and algorithm is introduced for Semi-supervised Affinity Propagation (SSAP) to improve analysis efficiency, using biomedical journal names as an evaluation background. Experimental results show that by avoiding high-dimensional sparse matrix computations, SSAP outperforms conventional k-means methods and improves upon the standard Affinity Propagation algorithm. In constructing a directed relationship network and distribution matrix for the clustering results, it can be noted that overlaps in scope and interests among BioMed publications can be easily identified, providing a valuable analytical tool for editors, authors and readers.

  14. Extractive text summarization system to aid data extraction from full text in systematic review development.

    Science.gov (United States)

    Bui, Duy Duc An; Del Fiol, Guilherme; Hurdle, John F; Jonnalagadda, Siddhartha

    2016-12-01

    Extracting data from publication reports is a standard process in systematic review (SR) development. However, the data extraction process still relies too much on manual effort which is slow, costly, and subject to human error. In this study, we developed a text summarization system aimed at enhancing productivity and reducing errors in the traditional data extraction process. We developed a computer system that used machine learning and natural language processing approaches to automatically generate summaries of full-text scientific publications. The summaries at the sentence and fragment levels were evaluated in finding common clinical SR data elements such as sample size, group size, and PICO values. We compared the computer-generated summaries with human written summaries (title and abstract) in terms of the presence of necessary information for the data extraction as presented in the Cochrane review's study characteristics tables. At the sentence level, the computer-generated summaries covered more information than humans do for systematic reviews (recall 91.2% vs. 83.8%, p<0.001). They also had a better density of relevant sentences (precision 59% vs. 39%, p<0.001). At the fragment level, the ensemble approach combining rule-based, concept mapping, and dictionary-based methods performed better than individual methods alone, achieving an 84.7% F-measure. Computer-generated summaries are potential alternative information sources for data extraction in systematic review development. Machine learning and natural language processing are promising approaches to the development of such an extractive summarization system. Copyright © 2016 Elsevier Inc. All rights reserved.

  15. A comprehensive and quantitative comparison of text-mining in 15 million full-text articles versus their corresponding abstracts.

    Science.gov (United States)

    Westergaard, David; Stærfeldt, Hans-Henrik; Tønsberg, Christian; Jensen, Lars Juhl; Brunak, Søren

    2018-02-01

    Across academia and industry, text mining has become a popular strategy for keeping up with the rapid growth of the scientific literature. Text mining of the scientific literature has mostly been carried out on collections of abstracts, due to their availability. Here we present an analysis of 15 million English scientific full-text articles published during the period 1823-2016. We describe the development in article length and publication sub-topics during these nearly 250 years. We showcase the potential of text mining by extracting published protein-protein, disease-gene, and protein subcellular associations using a named entity recognition system, and quantitatively report on their accuracy using gold standard benchmark data sets. We subsequently compare the findings to corresponding results obtained on 16.5 million abstracts included in MEDLINE and show that text mining of full-text articles consistently outperforms using abstracts only.

  16. A comprehensive and quantitative comparison of text-mining in 15 million full-text articles versus their corresponding abstracts

    Science.gov (United States)

    Westergaard, David; Stærfeldt, Hans-Henrik

    2018-01-01

    Across academia and industry, text mining has become a popular strategy for keeping up with the rapid growth of the scientific literature. Text mining of the scientific literature has mostly been carried out on collections of abstracts, due to their availability. Here we present an analysis of 15 million English scientific full-text articles published during the period 1823–2016. We describe the development in article length and publication sub-topics during these nearly 250 years. We showcase the potential of text mining by extracting published protein–protein, disease–gene, and protein subcellular associations using a named entity recognition system, and quantitatively report on their accuracy using gold standard benchmark data sets. We subsequently compare the findings to corresponding results obtained on 16.5 million abstracts included in MEDLINE and show that text mining of full-text articles consistently outperforms using abstracts only. PMID:29447159

  17. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    Case presentation. A seventy five year old Libyan man was seen in the urology department of Tripoli Medical Centre, Tripoli, Libya with six month history of left loin pain. The patient noted a mass in the left loin two days before he was assessed in the hospital. Also he started to vomit. There was no history of haematuria.

  18. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    impact of using a robotic dispensing machine in community pharmacies was gathered using a structured questionnaire and analysed in ... dispensing time was also shorter and staff satisfaction increased. ... reference customers who were using a ROWA robotic .... Costs situation Purchase price Stock value Personnel costs.

  19. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    –60 years, in Al-Jala Women Hospital in. Tripoli, Libya. Haemoglobin concentration was measured using an automated haematology analyzer. ... i.e. by relatives and friends of the patient needing blood. A .... More attention should be given.

  20. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    2009-07-20

    Jul 20, 2009 ... Table 1: Influenza pandemics of the 20th and 21st century. Name of ... could be responsible for the rapid human -to- human transmission [21]. Using evolutionary analysis to estimate the timescale of the origins, Smith and his research team from The. University of ... The biology of influenza A viruses is very.

  1. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    2009-01-11

    Jan 11, 2009 ... battery, retail for approximately £30GBP although bulk buying ..... care to store them carefully. Electrode costs .... nerve stimulation does not relieve in labour pain: updated ... (Online : Update Software), 2003(3): p. CD003222.

  2. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    Abstract; The detection of single base mismatches in DNA is important for diagnostics, treatment of ... nucleic acid detectors, and show how such exciplexes can register the presence of .... Titration experiments were carried out using a stock.

  3. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    from Turkey, Mexico, Brazil, USA, and Spain determined some criteria in favor ... female gender, and higher level of education. [9-14]. ... teachers and/or workers in that facility. Then a random ..... Psychosocial profile in favor of organ donation.

  4. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    1Department of Surgery, Obafemi Awolowo, 2Department of Community Health,. Obafemi ... the use of mesh, either open or laparoscopic [15,21], but this ... recurrence. METHODS AND PATIENTS .... TAH-BSO* = Total abdominal hysterectomy and bilateral salpingoophorectomy. Recurrent I.H. # = Recurrent inguinal hernia.

  5. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    fishermen were enrolled at three Marine stations in Basra, Iraq. Demographic data, types .... that are used to sting and kill their prey or for defense. ... cardiotoxic, and dermatonecrotic toxins [1,6]. Figure 3: 4 ... May last a few weeks. The hands ...

  6. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    2009-05-03

    May 3, 2009 ... The patient made an uneventful recovery. The final histopathology report was consistent with metastatic renal carcinoma. The patient was referred to the oncologist but unfortunately defaulted further treatment. . She is currently well and disease free 24 months after metastatectomy. Electronic PDF security ...

  7. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    healing process when wound tensile strength is very low or absent (days 0-30). It is during this time, when .... and two (4.5%) were supraumbilical. Table 1: Age distribution and the outcome of surgery in the 44 women with incisional hernia. Variable. Frequency. Percentage. Age. 60. 9. 15. 10. 6. 4.

  8. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    Original Article. Effects of Gender and Seasonal Variation on the Prevalence of. Bacterial Septicaemia Among Young Children in Benin City,. Nigeria. Omoregie R1,2, Egbe CA2, Ogefere HO1,3, Igbarumah I2, Omijie RE2. 1School of Medical Laboratory Sciences, 2Department of Medical Microbiology, University of Benin.

  9. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    Figure 2 axial CT section with contrast media showing extension of lesion. Figure 3 Photomicrograph revealing many dilated cavernous lymphatic channels filled with eosinophilic coagulum. (Haematoxylin and Eosin section Orginal magnification 40 X). Discussion. Cystic hygroma, known as cystic lymphangioma is a.

  10. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    Abstract; The expression of EGFR and p53 has not been adequately studied as a prognostic tool in urinary bladder tumors. We analyzed 74 bladder cancer samples from Egypt for EGFR and p53 expression using immunohistochemistry. The tumors .... have some potential value in differential diagnosis of problem cases, but ...

  11. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    receptor; TNF-α: Tumor Necrosis Factor–alpha; TGF-β1: transforming growth factor-β1. INTRODUCTION .... hormones that mediate inflammatory and immune responses in a ..... score, lactate, and base deficit), as well as treatment with agents ...

  12. A comprehensive and quantitative comparison of text-mining in 15 million full-text articles versus their corresponding abstracts

    DEFF Research Database (Denmark)

    Westergaard, David; Stærfeldt, Hans Henrik; Tønsberg, Christian

    2018-01-01

    Across academia and industry, text mining has become a popular strategy for keeping up with the rapid growth of the scientific literature. Text mining of the scientific literature has mostly been carried out on collections of abstracts, due to their availability. Here we present an analysis of 15...... subcellular associations using a named entity recognition system, and quantitatively report on their accuracy using gold standard benchmark data sets. We subsequently compare the findings to corresponding results obtained on 16.5 million abstracts included in MEDLINE and show that text mining of full...... million English scientific full-text articles published during the period 1823-2016. We describe the development in article length and publication sub-topics during these nearly 250 years. We showcase the potential of text mining by extracting published protein-protein, disease-gene, and protein...

  13. A text-mining system for extracting metabolic reactions from full-text articles.

    Science.gov (United States)

    Czarnecki, Jan; Nobeli, Irene; Smith, Adrian M; Shepherd, Adrian J

    2012-07-23

    Increasingly biological text mining research is focusing on the extraction of complex relationships relevant to the construction and curation of biological networks and pathways. However, one important category of pathway - metabolic pathways - has been largely neglected.Here we present a relatively simple method for extracting metabolic reaction information from free text that scores different permutations of assigned entities (enzymes and metabolites) within a given sentence based on the presence and location of stemmed keywords. This method extends an approach that has proved effective in the context of the extraction of protein-protein interactions. When evaluated on a set of manually-curated metabolic pathways using standard performance criteria, our method performs surprisingly well. Precision and recall rates are comparable to those previously achieved for the well-known protein-protein interaction extraction task. We conclude that automated metabolic pathway construction is more tractable than has often been assumed, and that (as in the case of protein-protein interaction extraction) relatively simple text-mining approaches can prove surprisingly effective. It is hoped that these results will provide an impetus to further research and act as a useful benchmark for judging the performance of more sophisticated methods that are yet to be developed.

  14. Investigating and Annotating the Role of Citation in Biomedical Full-Text Articles.

    Science.gov (United States)

    Yu, Hong; Agarwal, Shashank; Frid, Nadya

    2009-11-01

    Citations are ubiquitous in scientific articles and play important roles for representing the semantic content of a full-text biomedical article. In this work, we manually examined full-text biomedical articles to analyze the semantic content of citations in full-text biomedical articles. After developing a citation relation schema and annotation guideline, our pilot annotation results show an overall agreement of 0.71, and here we report on the research challenges and the lessons we've learned while trying to overcome them. Our work is a first step toward automatic citation classification in full-text biomedical articles, which may contribute to many text mining tasks, including information retrieval, extraction, summarization, and question answering.

  15. Challenges for automatically extracting molecular interactions from full-text articles.

    Science.gov (United States)

    McIntosh, Tara; Curran, James R

    2009-09-24

    The increasing availability of full-text biomedical articles will allow more biomedical knowledge to be extracted automatically with greater reliability. However, most Information Retrieval (IR) and Extraction (IE) tools currently process only abstracts. The lack of corpora has limited the development of tools that are capable of exploiting the knowledge in full-text articles. As a result, there has been little investigation into the advantages of full-text document structure, and the challenges developers will face in processing full-text articles. We manually annotated passages from full-text articles that describe interactions summarised in a Molecular Interaction Map (MIM). Our corpus tracks the process of identifying facts to form the MIM summaries and captures any factual dependencies that must be resolved to extract the fact completely. For example, a fact in the results section may require a synonym defined in the introduction. The passages are also annotated with negated and coreference expressions that must be resolved.We describe the guidelines for identifying relevant passages and possible dependencies. The corpus includes 2162 sentences from 78 full-text articles. Our corpus analysis demonstrates the necessity of full-text processing; identifies the article sections where interactions are most commonly stated; and quantifies the proportion of interaction statements requiring coherent dependencies. Further, it allows us to report on the relative importance of identifying synonyms and resolving negated expressions. We also experiment with an oracle sentence retrieval system using the corpus as a gold-standard evaluation set. We introduce the MIM corpus, a unique resource that maps interaction facts in a MIM to annotated passages within full-text articles. It is an invaluable case study providing guidance to developers of biomedical IR and IE systems, and can be used as a gold-standard evaluation set for full-text IR tasks.

  16. Evaluating Open-Source Full-Text Search Engines for Matching ICD-10 Codes.

    Science.gov (United States)

    Jurcău, Daniel-Alexandru; Stoicu-Tivadar, Vasile

    2016-01-01

    This research presents the results of evaluating multiple free, open-source engines on matching ICD-10 diagnostic codes via full-text searches. The study investigates what it takes to get an accurate match when searching for a specific diagnostic code. For each code the evaluation starts by extracting the words that make up its text and continues with building full-text search queries from the combinations of these words. The queries are then run against all the ICD-10 codes until a match indicates the code in question as a match with the highest relative score. This method identifies the minimum number of words that must be provided in order for the search engines choose the desired entry. The engines analyzed include a popular Java-based full-text search engine, a lightweight engine written in JavaScript which can even execute on the user's browser, and two popular open-source relational database management systems.

  17. Design of an On-Line Query Language for Full Text Patent Search.

    Science.gov (United States)

    Glantz, Richard S.

    The design of an English-like query language and an interactive computer environment for searching the full text of the U.S. patent collection are discussed. Special attention is paid to achieving a transparent user interface, to providing extremely broad search capabilities (including nested substitution classes, Kleene star events, and domain…

  18. The Flip Sides of Full-Text: Superindex and the Harvard Business Review/Online.

    Science.gov (United States)

    Dadlez, Eva M.

    1984-01-01

    This article illustrates similarities between two different types of full-text databases--Superindex, Harvard Business Review/Online--and uses them as arena to demonstrate search and display applications of full-text. The selection of logical operators, full-text search strategies, and keywords and Bibliographic Retrieval Service's Occurrence…

  19. Searching Harvard Business Review Online. . . Lessons in Searching a Full Text Database.

    Science.gov (United States)

    Tenopir, Carol

    1985-01-01

    This article examines the Harvard Business Review Online (HBRO) database (bibliographic description fields, abstracts, extracted information, full text, subject descriptors) and reports on 31 sample HBRO searches conducted in Bibliographic Retrieval Services to test differences between searching full text and searching bibliographic record. Sample…

  20. Full-Text Linking: Affiliated versus Nonaffiliated Access in a Free Database.

    Science.gov (United States)

    Grogg, Jill E.; Andreadis, Debra K.; Kirk, Rachel A.

    2002-01-01

    Presents a comparison of access to full-text articles from a free bibliographic database (PubSCIENCE) for affiliated and unaffiliated users. Found that affiliated users had access to more full-text articles than unaffiliated users had, and that both types of users could increase their level of access through additional searching and greater…

  1. tagtog: interactive and text-mining-assisted annotation of gene mentions in PLOS full-text articles.

    Science.gov (United States)

    Cejuela, Juan Miguel; McQuilton, Peter; Ponting, Laura; Marygold, Steven J; Stefancsik, Raymund; Millburn, Gillian H; Rost, Burkhard

    2014-01-01

    The breadth and depth of biomedical literature are increasing year upon year. To keep abreast of these increases, FlyBase, a database for Drosophila genomic and genetic information, is constantly exploring new ways to mine the published literature to increase the efficiency and accuracy of manual curation and to automate some aspects, such as triaging and entity extraction. Toward this end, we present the 'tagtog' system, a web-based annotation framework that can be used to mark up biological entities (such as genes) and concepts (such as Gene Ontology terms) in full-text articles. tagtog leverages manual user annotation in combination with automatic machine-learned annotation to provide accurate identification of gene symbols and gene names. As part of the BioCreative IV Interactive Annotation Task, FlyBase has used tagtog to identify and extract mentions of Drosophila melanogaster gene symbols and names in full-text biomedical articles from the PLOS stable of journals. We show here the results of three experiments with different sized corpora and assess gene recognition performance and curation speed. We conclude that tagtog-named entity recognition improves with a larger corpus and that tagtog-assisted curation is quicker than manual curation. DATABASE URL: www.tagtog.net, www.flybase.org.

  2. Full-text publication of abstracts presented at European Orthodontic Society congresses.

    Science.gov (United States)

    Livas, Christos; Pandis, Nikolaos; Ren, Yijin

    2014-10-01

    Empirical evidence has indicated that only a subsample of studies conducted reach full-text publication and this phenomenon has become known as publication bias. A form of publication bias is the selectively delayed full publication of conference abstracts. The objective of this article was to examine the publication status of oral abstracts and poster-presentation abstracts, included in the scientific program of the 82nd and 83rd European Orthodontic Society (EOS) congresses, held in 2006 and 2007, and to identify factors associated with full-length publication. A systematic search of PubMed and Google Scholar databases was performed in April 2013 using author names and keywords from the abstract title to locate abstract and full-article publications. Information regarding mode of presentation, type of affiliation, geographical origin, statistical results, and publication details were collected and analyzed using univariable and multivariable logistic regression. Approximately 51 per cent of the EOS 2006 and 55 per cent of the EOS 2007 abstracts appeared in print more than 5 years post congress. A mean period of 1.32 years elapsed between conference and publication date. Mode of presentation (oral or poster), use of statistical analysis, and research subject area were significant predictors for publication success. Inherent discrepancies of abstract reporting, mainly related to presentation of preliminary results and incomplete description of methods, may be considered in analogous studies. On average 52.2 per cent of the abstracts presented at the two EOS conferences reached full publication. Abstracts presented orally, including statistical analysis, were more likely to get published. © The Author 2013. Published by Oxford University Press on behalf of the European Orthodontic Society. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  3. Efficient extraction of protein-protein interactions from full-text articles.

    Science.gov (United States)

    Hakenberg, Jörg; Leaman, Robert; Vo, Nguyen Ha; Jonnalagadda, Siddhartha; Sullivan, Ryan; Miller, Christopher; Tari, Luis; Baral, Chitta; Gonzalez, Graciela

    2010-01-01

    Proteins and their interactions govern virtually all cellular processes, such as regulation, signaling, metabolism, and structure. Most experimental findings pertaining to such interactions are discussed in research papers, which, in turn, get curated by protein interaction databases. Authors, editors, and publishers benefit from efforts to alleviate the tasks of searching for relevant papers, evidence for physical interactions, and proper identifiers for each protein involved. The BioCreative II.5 community challenge addressed these tasks in a competition-style assessment to evaluate and compare different methodologies, to make aware of the increasing accuracy of automated methods, and to guide future implementations. In this paper, we present our approaches for protein-named entity recognition, including normalization, and for extraction of protein-protein interactions from full text. Our overall goal is to identify efficient individual components, and we compare various compositions to handle a single full-text article in between 10 seconds and 2 minutes. We propose strategies to transfer document-level annotations to the sentence-level, which allows for the creation of a more fine-grained training corpus; we use this corpus to automatically derive around 5,000 patterns. We rank sentences by relevance to the task of finding novel interactions with physical evidence, using a sentence classifier built from this training corpus. Heuristics for paraphrasing sentences help to further remove unnecessary information that might interfere with patterns, such as additional adjectives, clauses, or bracketed expressions. In BioCreative II.5, we achieved an f-score of 22 percent for finding protein interactions, and 43 percent for mapping proteins to UniProt IDs; disregarding species, f-scores are 30 percent and 55 percent, respectively. On average, our best-performing setup required around 2 minutes per full text. All data and pattern sets as well as Java classes that

  4. Large-scale extraction of gene interactions from full-text literature using DeepDive.

    Science.gov (United States)

    Mallory, Emily K; Zhang, Ce; Ré, Christopher; Altman, Russ B

    2016-01-01

    A complete repository of gene-gene interactions is key for understanding cellular processes, human disease and drug response. These gene-gene interactions include both protein-protein interactions and transcription factor interactions. The majority of known interactions are found in the biomedical literature. Interaction databases, such as BioGRID and ChEA, annotate these gene-gene interactions; however, curation becomes difficult as the literature grows exponentially. DeepDive is a trained system for extracting information from a variety of sources, including text. In this work, we used DeepDive to extract both protein-protein and transcription factor interactions from over 100,000 full-text PLOS articles. We built an extractor for gene-gene interactions that identified candidate gene-gene relations within an input sentence. For each candidate relation, DeepDive computed a probability that the relation was a correct interaction. We evaluated this system against the Database of Interacting Proteins and against randomly curated extractions. Our system achieved 76% precision and 49% recall in extracting direct and indirect interactions involving gene symbols co-occurring in a sentence. For randomly curated extractions, the system achieved between 62% and 83% precision based on direct or indirect interactions, as well as sentence-level and document-level precision. Overall, our system extracted 3356 unique gene pairs using 724 features from over 100,000 full-text articles. Application source code is publicly available at https://github.com/edoughty/deepdive_genegene_app russ.altman@stanford.edu Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.

  5. Full-text publication of abstract-presented work in sport and exercise psychology.

    Science.gov (United States)

    Shue, Sarah; Warden, Stuart

    2018-01-01

    Meetings promote information sharing, but do not enable full dissemination of details. A systematic search was conducted for abstracts presented at the 2010 and 2011 Association of Applied Sport Psychology Annual Conferences to determine the full-text dissemination rate of work presented in abstract form and investigate factors influencing this rate. Systematic searches were sequentially conducted to determine whether the abstract-presented work had been published in full-text format in the 5 years following presentation. If a potential full-text publication was identified, information from the conference abstract (eg, results, number of participants in the sample(s), measurement tools used and so on) was compared with the full text to ensure the two entities represented the same body of work. Abstract factors of interest were assessed using logistic regression. Ninety-four out of 423 presented abstracts (22.2%) were published in full text. Odds of full-text publication increased if the abstract was from an international institution, presented in certain conference sections or presented as a lecture. Those attending professional conferences should be cautious when translating data presented at conferences into their applied work because of the low rate of peer-reviewed and full-text publication of the information.

  6. BC4GO: a full-text corpus for the BioCreative IV GO task.

    Science.gov (United States)

    Van Auken, Kimberly; Schaeffer, Mary L; McQuilton, Peter; Laulederkind, Stanley J F; Li, Donghui; Wang, Shur-Jen; Hayman, G Thomas; Tweedie, Susan; Arighi, Cecilia N; Done, James; Müller, Hans-Michael; Sternberg, Paul W; Mao, Yuqing; Wei, Chih-Hsuan; Lu, Zhiyong

    2014-01-01

    Gene function curation via Gene Ontology (GO) annotation is a common task among Model Organism Database groups. Owing to its manual nature, this task is considered one of the bottlenecks in literature curation. There have been many previous attempts at automatic identification of GO terms and supporting information from full text. However, few systems have delivered an accuracy that is comparable with humans. One recognized challenge in developing such systems is the lack of marked sentence-level evidence text that provides the basis for making GO annotations. We aim to create a corpus that includes the GO evidence text along with the three core elements of GO annotations: (i) a gene or gene product, (ii) a GO term and (iii) a GO evidence code. To ensure our results are consistent with real-life GO data, we recruited eight professional GO curators and asked them to follow their routine GO annotation protocols. Our annotators marked up more than 5000 text passages in 200 articles for 1356 distinct GO terms. For evidence sentence selection, the inter-annotator agreement (IAA) results are 9.3% (strict) and 42.7% (relaxed) in F1-measures. For GO term selection, the IAAs are 47% (strict) and 62.9% (hierarchical). Our corpus analysis further shows that abstracts contain ∼ 10% of relevant evidence sentences and 30% distinct GO terms, while the Results/Experiment section has nearly 60% relevant sentences and >70% GO terms. Further, of those evidence sentences found in abstracts, less than one-third contain enough experimental detail to fulfill the three core criteria of a GO annotation. This result demonstrates the need of using full-text articles for text mining GO annotations. Through its use at the BioCreative IV GO (BC4GO) task, we expect our corpus to become a valuable resource for the BioNLP research community. Database URL: http://www.biocreative.org/resources/corpora/bc-iv-go-task-corpus/. Published by Oxford University Press 2014. This work is written by US

  7. MeSH: a window into full text for document summarization.

    Science.gov (United States)

    Bhattacharya, Sanmitra; Ha-Thuc, Viet; Srinivasan, Padmini

    2011-07-01

    Previous research in the biomedical text-mining domain has historically been limited to titles, abstracts and metadata available in MEDLINE records. Recent research initiatives such as TREC Genomics and BioCreAtIvE strongly point to the merits of moving beyond abstracts and into the realm of full texts. Full texts are, however, more expensive to process not only in terms of resources needed but also in terms of accuracy. Since full texts contain embellishments that elaborate, contextualize, contrast, supplement, etc., there is greater risk for false positives. Motivated by this, we explore an approach that offers a compromise between the extremes of abstracts and full texts. Specifically, we create reduced versions of full text documents that contain only important portions. In the long-term, our goal is to explore the use of such summaries for functions such as document retrieval and information extraction. Here, we focus on designing summarization strategies. In particular, we explore the use of MeSH terms, manually assigned to documents by trained annotators, as clues to select important text segments from the full text documents. Our experiments confirm the ability of our approach to pick the important text portions. Using the ROUGE measures for evaluation, we were able to achieve maximum ROUGE-1, ROUGE-2 and ROUGE-SU4 F-scores of 0.4150, 0.1435 and 0.1782, respectively, for our MeSH term-based method versus the maximum baseline scores of 0.3815, 0.1353 and 0.1428, respectively. Using a MeSH profile-based strategy, we were able to achieve maximum ROUGE F-scores of 0.4320, 0.1497 and 0.1887, respectively. Human evaluation of the baselines and our proposed strategies further corroborates the ability of our method to select important sentences from the full texts. sanmitra-bhattacharya@uiowa.edu; padmini-srinivasan@uiowa.edu.

  8. Early Career Researchers Demand Full-text and Rely on Google to Find Scholarly Sources

    Directory of Open Access Journals (Sweden)

    Richard Hayman

    2017-12-01

    Full Text Available A Review of: Nicholas, D., Boukacem-Zeghmouri, C., Rodríguez-Bravo, B., Xu, J., Watkinson, A., Abrizah, A., Herman, E., & Świgoń, M. (2017. Where and how early career researchers find scholarly information. Learned Publishing, 30(1, 19-29. http://dx.doi.org/10.1002/leap.1087 Abstract Objective – To examine the attitudes and information behaviours of early career researchers (ECRs when locating scholarly information. Design – Qualitative longitudinal study. Setting – Research participants from the United Kingdom, United States of America, China, France, Malaysia, Poland, and Spain. Subjects – A total 116 participants from various disciplines, aged 35 and younger, who were holding or had previously held a research position, but not in a tenured position. All participants held a doctorate or were in the process of earning one. Methods – Using structured interviews of 60-90 minutes, researchers asked 60 questions of each participant via face-to-face, Skype, or telephone interviews. The interview format and questions were formed via focus groups. Main Results – As part of a longitudinal project, results reported are limited to the first year of the study, and focused on three primary questions identified by the authors: where do ECRs find scholarly information, whether they use their smartphones to locate and read scholarly information, and what social media do they use to find scholarly information. Researchers describe how ECRs themselves interpreted the phrase scholarly information to primarily mean journal articles, while the researchers themselves had a much expanded definition to include professional and “scholarly contacts, ideas, and data” (p. 22. This research shows that Google and Google Scholar are widely used by ECRs for locating scholarly information regardless of discipline, language, or geography. Their analysis by country points to currency and the combined breadth-and-depth search experience that Google provides as

  9. Building a protein name dictionary from full text: a machine learning term extraction approach

    Directory of Open Access Journals (Sweden)

    Campagne Fabien

    2005-04-01

    Full Text Available Abstract Background The majority of information in the biological literature resides in full text articles, instead of abstracts. Yet, abstracts remain the focus of many publicly available literature data mining tools. Most literature mining tools rely on pre-existing lexicons of biological names, often extracted from curated gene or protein databases. This is a limitation, because such databases have low coverage of the many name variants which are used to refer to biological entities in the literature. Results We present an approach to recognize named entities in full text. The approach collects high frequency terms in an article, and uses support vector machines (SVM to identify biological entity names. It is also computationally efficient and robust to noise commonly found in full text material. We use the method to create a protein name dictionary from a set of 80,528 full text articles. Only 8.3% of the names in this dictionary match SwissProt description lines. We assess the quality of the dictionary by studying its protein name recognition performance in full text. Conclusion This dictionary term lookup method compares favourably to other published methods, supporting the significance of our direct extraction approach. The method is strong in recognizing name variants not found in SwissProt.

  10. Full-text publication of abstracts presented at European Orthodontic Society congresses

    NARCIS (Netherlands)

    Livas, Christos; Pandis, Nikolaos; Ren, Yijin

    2014-01-01

    INTRODUCTION: Empirical evidence has indicated that only a subsample of studies conducted reach full-text publication and this phenomenon has become known as publication bias. A form of publication bias is the selectively delayed full publication of conference abstracts. The objective of this

  11. SERVICES OF FULL-TEXT SEARCHING IN A DISTRIBUTED INFORMATION ENVIRONMENT (PROJECT HUMANITARIANA

    Directory of Open Access Journals (Sweden)

    S. K. Lyapin

    2015-01-01

    Full Text Available Problem statement. We justify the possibility of full-text search services application in both universal and specialized (in terms of resource base digital libraries for the extraction and analysis of the context knowledge in the humanities. The architecture and services of virtual information and resource center for extracting knowledge from the humanitarian texts generated by «Humanitariana» project are described. The functional integration of the resources and services for a full-text search in a distributed decentralized environment, organized in the Internet / Intranet architecture under the control of the client (user browser accessing a variety of independent servers. An algorithm for a distributed full-text query implementation is described. Methods. Method of combining requency-ranked and paragraph-oriented full-text queries is used: the first are used for the preliminary analysis of the subject area or a combination product (explication of "vertical" context, or macro context, the second - for the explication of "horizontal" context, or micro context within copyright paragraph. The results of the frequency-ranked queries are used to compile paragraph-oriented queries. Results. The results of textual research are shown on the topics "The question of fact in Russian philosophy", "The question of loneliness in Russian philosophy and culture". About 50 pieces of context knowledge on the total resource base of about 2,500 full-text resources have been explicated and briefly described to their further expert investigating. Practical significance. The proposed technology (advanced full-text searching services in a distributed information environment can be used for the information support of humanitarian studies and education in the humanities, for functional integration of resources and services of various organizations, for carrying out interdisciplinary research.

  12. Extracting Characteristics of the Study Subjects from Full-Text Articles.

    Science.gov (United States)

    Demner-Fushman, Dina; Mork, James G

    Characteristics of the subjects of biomedical research are important in determining if a publication describing the research is relevant to a search. To facilitate finding relevant publications, MEDLINE citations provide Medical Subject Headings that describe the subjects' characteristics, such as their species, gender, and age. We seek to improve the recommendation of these headings by the Medical Text Indexer (MTI) that supports manual indexing of MEDLINE. To that end, we explore the potential of the full text of the publications. Using simple recall-oriented rule-based methods we determined that adding sentences extracted from the methods sections and captions to the abstracts prior to MTI processing significantly improved recall and F1 score with only a slight drop in precision. Improvements were also achieved in directly assigning several headings extracted from the full text. These results indicate the need for further development of automated methods capable of leveraging the full text for indexing.

  13. The structural and content aspects of abstracts versus bodies of full text journal articles are different.

    Science.gov (United States)

    Cohen, K Bretonnel; Johnson, Helen L; Verspoor, Karin; Roeder, Christophe; Hunter, Lawrence E

    2010-09-29

    An increase in work on the full text of journal articles and the growth of PubMedCentral have the opportunity to create a major paradigm shift in how biomedical text mining is done. However, until now there has been no comprehensive characterization of how the bodies of full text journal articles differ from the abstracts that until now have been the subject of most biomedical text mining research. We examined the structural and linguistic aspects of abstracts and bodies of full text articles, the performance of text mining tools on both, and the distribution of a variety of semantic classes of named entities between them. We found marked structural differences, with longer sentences in the article bodies and much heavier use of parenthesized material in the bodies than in the abstracts. We found content differences with respect to linguistic features. Three out of four of the linguistic features that we examined were statistically significantly differently distributed between the two genres. We also found content differences with respect to the distribution of semantic features. There were significantly different densities per thousand words for three out of four semantic classes, and clear differences in the extent to which they appeared in the two genres. With respect to the performance of text mining tools, we found that a mutation finder performed equally well in both genres, but that a wide variety of gene mention systems performed much worse on article bodies than they did on abstracts. POS tagging was also more accurate in abstracts than in article bodies. Aspects of structure and content differ markedly between article abstracts and article bodies. A number of these differences may pose problems as the text mining field moves more into the area of processing full-text articles. However, these differences also present a number of opportunities for the extraction of data types, particularly that found in parenthesized text, that is present in article bodies

  14. The Searchbench - Combining Sentence-semantic, Full-text and Bibliographic Search in Digital Libraries

    Directory of Open Access Journals (Sweden)

    Ulrich Schäfer

    2013-02-01

    Full Text Available We describe a novel approach to precise searching in the full content of digital libraries. The Searchbench (for search workbench is based on sentence-wise syntactic and semantic natural language processing (NLP of both born-digital and scanned publications in PDF format. The term born-digital means natively digital, i.e. prepared electronically using typesetting systems such as LaTeX, OpenOffice, and the like. In the Searchbench, queries can be formulated as (possibly underspecified statements, consisting of simple subject-predicate-object constructs such as ‘algorithm improves word alignment’. This reduces the number of false hits in large document collections when the search words happen to appear close to each other, but are not semantically related. The method also abstracts from passive voice and predicate synonyms. Moreover, negated statements can be excluded from the search results, and negated antonym predicates again count as synonyms (e.g. not include = exclude.In the Searchbench, a sentence-semantic search can be combined with search filters for classical full-text, bibliographic metadata and automatically computed domain terms. Auto-suggest fields facilitate text input. Queries can be bookmarked or emailed. Furthermore, a novel citation browser in the Searchbench allows graphical navigation in citation networks. These have been extracted automatically from metadata and paper texts. The citation browser displays short phrases from citation sentences at the edges in the citation graph and thus allows students and researchers to quickly browse publications and immerse into a new research field. By clicking on a citation edge, the original citation sentence is shown in context, and optionally also in the original PDF layout.To showcase the usefulness of our research, we have a applied it to a collection of currently approx. 25,000 open access research papers in the field of computational linguistics and language technology, the ACL

  15. Facilitating Full-text Access to Biomedical Literature Using Open Access Resources.

    Science.gov (United States)

    Kang, Hongyu; Hou, Zhen; Li, Jiao

    2015-01-01

    Open access (OA) resources and local libraries often have their own literature databases, especially in the field of biomedicine. We have developed a method of linking a local library to a biomedical OA resource facilitating researchers' full-text article access. The method uses a model based on vector space to measure similarities between two articles in local library and OA resources. The method achieved an F-score of 99.61%. This method of article linkage and mapping between local library and OA resources is available for use. Through this work, we have improved the full-text access of the biomedical OA resources.

  16. Full text publication rates of studies presented at an international emergency medicine scientific meeting.

    Science.gov (United States)

    Chan, Jannet W M; Graham, Colin A

    2011-09-01

    The publication rate of full text papers following an abstract presentation at a medical conference is variable, and few studies have examined the situation with respect to international emergency medicine conferences. This retrospective study aimed to identify the publication rate of abstracts presented at the 2006 International Conference on Emergency Medicine (ICEM) held in Halifax, Canada. The full text publication rate was 33.2%, similar to previous emergency medicine meetings. English language barriers may play a role in the low publication rate seen.

  17. Full-text publication of abstracts presented at meetings of a Latin American scientific society.

    Science.gov (United States)

    Dicembrino, Manuela; Anderson, Mariana; Vely, Ana Gabriela; Ossorio, María Fabiana; Ferrero, Fernando

    2014-12-01

    To estimate the proportion of abstracts presented at meetings of the Latin American Society for Pediatric Research that are fully-published, to describe the reasons for not publishing papers, and to assess the impact of funding on the publication rate. Abstracts presented at meetings held between 2005 and 2009 were included. Authors were contacted and invited to take a survey on the publication of their work or the reasons not to do it. Information was collected on 232 (71.4%) of the 325 abstracts presented. Of these, 58.6% were fully-published (136/232). Funded studies (40.0%) had more chances of publication (OR: 2.2; 95% CI: 1.2-3.9). "Lack of time" was the most common reason for failure to publish (35/96). 58.6% of abstracts presented at meetings of the Latin American Society for Pediatric Research, were published as full-text articles; lack of time was the most common reason for failure to publish. Funded research had more chances of being published.

  18. Full text publication rates of papers presented at the British Foot and Ankle Society.

    Science.gov (United States)

    Marsland, D; Mumith, A; Taylor, H P

    2017-07-13

    Techniques in foot and ankle surgery have expanded rapidly in recent years, often presented at national society meetings. It is important that research is published to guide evidence based practice. Many abstracts however do not go on to full text publication. A database was created of all abstracts presented at BOFAS meetings from 2009 to 2013. Computerised searches were performed using PubMed and Google search engines. In total 341 papers were presented, with an overall publication rate of 31.7%. Of 251 clinical papers, 200 were case series (79.6%). Factors associated with publication success included basic science studies, papers related to arthroscopic surgery and research performed outside the UK. A relatively low conversion rate from presentation to publication could be as a result of papers failing to pass the scrutiny of peer review, or that the work is never formally submitted for publication. The information from this study could be used to prioritise future research and promote higher quality research. Copyright © 2017 European Foot and Ankle Society. Published by Elsevier Ltd. All rights reserved.

  19. Improving e-book access via a library-developed full-text search tool.

    Science.gov (United States)

    Foust, Jill E; Bergen, Phillip; Maxeiner, Gretchen L; Pawlowski, Peter N

    2007-01-01

    This paper reports on the development of a tool for searching the contents of licensed full-text electronic book (e-book) collections. The Health Sciences Library System (HSLS) provides services to the University of Pittsburgh's medical programs and large academic health system. The HSLS has developed an innovative tool for federated searching of its e-book collections. Built using the XML-based Vivísimo development environment, the tool enables a user to perform a full-text search of over 2,500 titles from the library's seven most highly used e-book collections. From a single "Google-style" query, results are returned as an integrated set of links pointing directly to relevant sections of the full text. Results are also grouped into categories that enable more precise retrieval without reformulation of the search. A heuristic evaluation demonstrated the usability of the tool and a web server log analysis indicated an acceptable level of usage. Based on its success, there are plans to increase the number of online book collections searched. This library's first foray into federated searching has produced an effective tool for searching across large collections of full-text e-books and has provided a good foundation for the development of other library-based federated searching products.

  20. Improving e-book access via a library-developed full-text search tool*

    Science.gov (United States)

    Foust, Jill E.; Bergen, Phillip; Maxeiner, Gretchen L.; Pawlowski, Peter N.

    2007-01-01

    Purpose: This paper reports on the development of a tool for searching the contents of licensed full-text electronic book (e-book) collections. Setting: The Health Sciences Library System (HSLS) provides services to the University of Pittsburgh's medical programs and large academic health system. Brief Description: The HSLS has developed an innovative tool for federated searching of its e-book collections. Built using the XML-based Vivísimo development environment, the tool enables a user to perform a full-text search of over 2,500 titles from the library's seven most highly used e-book collections. From a single “Google-style” query, results are returned as an integrated set of links pointing directly to relevant sections of the full text. Results are also grouped into categories that enable more precise retrieval without reformulation of the search. Results/Evaluation: A heuristic evaluation demonstrated the usability of the tool and a web server log analysis indicated an acceptable level of usage. Based on its success, there are plans to increase the number of online book collections searched. Conclusion: This library's first foray into federated searching has produced an effective tool for searching across large collections of full-text e-books and has provided a good foundation for the development of other library-based federated searching products. PMID:17252065

  1. 10. National Nuclear Science and Technologies Congress Proceedings Full Texts Volume 1

    International Nuclear Information System (INIS)

    2009-01-01

    X. National Nuclear Science and Technologies Congress was held on 6-9 October 2009 in Mugla, Turkey in the course of collaborative organization undertaken by Turkish Atomic Energy Authority, Mugla University and Sitki Kocman Foundation. This first volume of Proceedings Book contains 75 submitted presentations and 36 of them are full texts on applications of nuclear techniques.

  2. Endnote Referencing Software: Importing references from an Ebsco database, attaching full text, organising your Endnote library

    OpenAIRE

    Turner, Susan

    2017-01-01

    This video demonstrates importing bibliographic references from EBSCO Discovery Service, the same method can be used for all EBSCO databases. \\ud The video also demonstrates how to attach full text files to the references and how to organise your references within the endnote library using groups.

  3. Preparing College Students To Search Full-Text Databases: Is Instruction Necessary?

    Science.gov (United States)

    Riley, Cheryl; Wales, Barbara

    Full-text databases allow Central Missouri State University's clients to access some of the serials that libraries have had to cancel due to escalating subscription costs; EbscoHost, the subject of this study, is one such database. The database is available free to all Missouri residents. A survey was designed consisting of 21 questions intended…

  4. Full Text or Abstract? : Examining Topic Coherence Scores Using Latent Dirichlet Allocation

    NARCIS (Netherlands)

    Syed, S.; Spruit, M.

    2017-01-01

    This paper assesses topic coherence and human topic ranking of uncovered latent topics from scientific publications when utilizing the topic model latent Dirichlet allocation (LDA) on abstract and full-text data. The coherence of a topic, used as a proxy for topic quality, is based on the

  5. Automatically classifying sentences in full-text biomedical articles into Introduction, Methods, Results and Discussion.

    Science.gov (United States)

    Agarwal, Shashank; Yu, Hong

    2009-12-01

    Biomedical texts can be typically represented by four rhetorical categories: Introduction, Methods, Results and Discussion (IMRAD). Classifying sentences into these categories can benefit many other text-mining tasks. Although many studies have applied different approaches for automatically classifying sentences in MEDLINE abstracts into the IMRAD categories, few have explored the classification of sentences that appear in full-text biomedical articles. We first evaluated whether sentences in full-text biomedical articles could be reliably annotated into the IMRAD format and then explored different approaches for automatically classifying these sentences into the IMRAD categories. Our results show an overall annotation agreement of 82.14% with a Kappa score of 0.756. The best classification system is a multinomial naïve Bayes classifier trained on manually annotated data that achieved 91.95% accuracy and an average F-score of 91.55%, which is significantly higher than baseline systems. A web version of this system is available online at-http://wood.ims.uwm.edu/full_text_classifier/.

  6. Full-text automated detection of surgical site infections secondary to neurosurgery in Rennes, France.

    Science.gov (United States)

    Campillo-Gimenez, Boris; Garcelon, Nicolas; Jarno, Pascal; Chapplain, Jean Marc; Cuggia, Marc

    2013-01-01

    The surveillance of Surgical Site Infections (SSI) contributes to the management of risk in French hospitals. Manual identification of infections is costly, time-consuming and limits the promotion of preventive procedures by the dedicated teams. The introduction of alternative methods using automated detection strategies is promising to improve this surveillance. The present study describes an automated detection strategy for SSI in neurosurgery, based on textual analysis of medical reports stored in a clinical data warehouse. The method consists firstly, of enrichment and concept extraction from full-text reports using NOMINDEX, and secondly, text similarity measurement using a vector space model. The text detection was compared to the conventional strategy based on self-declaration and to the automated detection using the diagnosis-related group database. The text-mining approach showed the best detection accuracy, with recall and precision equal to 92% and 40% respectively, and confirmed the interest of reusing full-text medical reports to perform automated detection of SSI.

  7. Beyond genes, proteins, and abstracts: Identifying scientific claims from full-text biomedical articles.

    Science.gov (United States)

    Blake, Catherine

    2010-04-01

    Massive increases in electronically available text have spurred a variety of natural language processing methods to automatically identify relationships from text; however, existing annotated collections comprise only bioinformatics (gene-protein) or clinical informatics (treatment-disease) relationships. This paper introduces the Claim Framework that reflects how authors across biomedical spectrum communicate findings in empirical studies. The Framework captures different levels of evidence by differentiating between explicit and implicit claims, and by capturing under-specified claims such as correlations, comparisons, and observations. The results from 29 full-text articles show that authors report fewer than 7.84% of scientific claims in an abstract, thus revealing the urgent need for text mining systems to consider the full-text of an article rather than just the abstract. The results also show that authors typically report explicit claims (77.12%) rather than an observations (9.23%), correlations (5.39%), comparisons (5.11%) or implicit claims (2.7%). Informed by the initial manual annotations, we introduce an automated approach that uses syntax and semantics to identify explicit claims automatically and measure the degree to which each feature contributes to the overall precision and recall. Results show that a combination of semantics and syntax is required to achieve the best system performance. 2009 Elsevier Inc. All rights reserved.

  8. Scholarly Electronic Full-Text Publications via the Internet: Issues and Impacts

    Science.gov (United States)

    Kosmin, Linda J.

    1999-01-01

    On-line access to complete texts of scholarly journal articles, conference papers, and books is facilitated by rapidly developing World-wide Web Internet access and capabilities. Meanwhile, print publications continue to be produced and read in spite of the proliferation of many networked electronic publications. The purpose of this presentation is to highlight fundamental issues impacting stakeholder groups, as the trend continues towards migration from paper to affordable ubiquitous networked full-text publications. Librarians, publishers, authors and end-users have various viewpoints, interests, and concerns. There are many issues challenging all stakeholder groups. For instance, all share concerns about administering copyright compliance and enforcing fair use. Uncontrollable electronic downstreaming could result in infringed copyright, while limiting a publisher's entitled revenue stream. Moreover, metered fee-based access may hamper scholarly information research. And, self-authoring on the Internet without peer filtering could lead to information clutter. Many related issues challenge librarians in particular. Among these are rising journal subscription prices, regardless if offered in print or electronic. Some electronic offerings are independent of print, others supplement or duplicate print; several publishers presently require subscribing to print in order to access electronic. Furthermore, numbers of publications are n'ow being marketed via the Internet directly to end-users, which can be viewed as encouraging users to bypass the traditional library. A key issue challenging publishers today is the rapidly expanding electronic user base that is demanding delivery of added-value full-text to desktop computers. Also of growing concern appears to be the decline in print sales to libraries, thereby reducing traditional revenue stream potential. Nowadays, publishers are more hesitant about investing in the production of publications geared toward small niche

  9. Full-text publication of abstract-presented work in physical therapy: do therapists publish what they preach?

    Science.gov (United States)

    Smith, Heather D; Bogenschutz, Elizabeth D; Bayliss, Amy J; Altenburger, Peter A; Warden, Stuart J

    2011-02-01

    Professional meetings, such as the American Physical Therapy Association's (APTA's) Combined Sections Meeting (CSM), provide forums for sharing information relevant to physical therapy. An indicator of whether therapists fully disseminate their work is the number of full-text peer-reviewed publications that result. The purposes of this study were: (1) to determine the full-text publication rate of work presented in abstract form at CSM and (2) to investigate factors influencing this rate. A systematic search was undertaken to locate full-text publications of work presented in abstract form within the Orthopaedic and Sports Physical Therapy sections at CSM between 2000 and 2004. Eligible publications were published within 5 years following abstract presentation. The influences of APTA section, year of abstract presentation, institution of origin, study design, sample size, study significance, reporting of a funding source, and presentation type on full-text publication rate were assessed. Characteristics of full-text publications were explored. Work presented in 1 out of 4 abstracts (25.4%) progressed to full-text publication. Odds of full-text publication increased if the abstract originated from a doctorate-granting or "other" institution, reported findings of an experimental study, reported a statistically significant finding, included a larger sample size, disclosed a funding source, or was presented as a platform presentation. More than one third (37.8%) of full-text publications were published in the Journal of Orthopaedic and Sports Physical Therapy or Physical Therapy, and 4 out of 10 full-text publications (39.2%) contained at least one major change from information presented in abstract form. The full-text publication rate for information presented in abstract form within the Orthopaedic and Sports Physical Therapy sections at CSM is low relative to comparative disciplines. Caution should be exercised when translating information presented at CSM into

  10. A Full-Text-Based Search Engine for Finding Highly Matched Documents Across Multiple Categories

    Science.gov (United States)

    Nguyen, Hung D.; Steele, Gynelle C.

    2016-01-01

    This report demonstrates the full-text-based search engine that works on any Web-based mobile application. The engine has the capability to search databases across multiple categories based on a user's queries and identify the most relevant or similar. The search results presented here were found using an Android (Google Co.) mobile device; however, it is also compatible with other mobile phones.

  11. Understanding disciplinary vocabularies using a full-text enabled domain-independent term extraction approach.

    Science.gov (United States)

    Yan, Erjia; Williams, Jake; Chen, Zheng

    2017-01-01

    Publication metadata help deliver rich analyses of scholarly communication. However, research concepts and ideas are more effectively expressed through unstructured fields such as full texts. Thus, the goals of this paper are to employ a full-text enabled method to extract terms relevant to disciplinary vocabularies, and through them, to understand the relationships between disciplines. This paper uses an efficient, domain-independent term extraction method to extract disciplinary vocabularies from a large multidisciplinary corpus of PLoS ONE publications. It finds a power-law pattern in the frequency distributions of terms present in each discipline, indicating a semantic richness potentially sufficient for further study and advanced analysis. The salient relationships amongst these vocabularies become apparent in application of a principal component analysis. For example, Mathematics and Computer and Information Sciences were found to have similar vocabulary use patterns along with Engineering and Physics; while Chemistry and the Social Sciences were found to exhibit contrasting vocabulary use patterns along with the Earth Sciences and Chemistry. These results have implications to studies of scholarly communication as scholars attempt to identify the epistemological cultures of disciplines, and as a full text-based methodology could lead to machine learning applications in the automated classification of scholarly work according to disciplinary vocabularies.

  12. Reported estimates of diagnostic accuracy in ophthalmology conference abstracts were not associated with full-text publication.

    Science.gov (United States)

    Korevaar, Daniël A; Cohen, Jérémie F; Spijker, René; Saldanha, Ian J; Dickersin, Kay; Virgili, Gianni; Hooft, Lotty; Bossuyt, Patrick M M

    2016-11-01

    To assess whether conference abstracts that report higher estimates of diagnostic accuracy are more likely to reach full-text publication in a peer-reviewed journal. We identified abstracts describing diagnostic accuracy studies, presented between 2007 and 2010 at the Association for Research in Vision and Ophthalmology (ARVO) Annual Meeting. We extracted reported estimates of sensitivity, specificity, area under the receiver operating characteristic curve (AUC), and diagnostic odds ratio (DOR). Between May and July 2015, we searched MEDLINE and EMBASE to identify corresponding full-text publications; if needed, we contacted abstract authors. Cox regression was performed to estimate associations with full-text publication, where sensitivity, specificity, and AUC were logit transformed, and DOR was log transformed. A full-text publication was found for 226/399 (57%) included abstracts. There was no association between reported estimates of sensitivity and full-text publication (hazard ratio [HR] 1.09 [95% confidence interval {CI} 0.98, 1.22]). The same applied to specificity (HR 1.00 [95% CI 0.88, 1.14]), AUC (HR 0.91 [95% CI 0.75, 1.09]), and DOR (HR 1.01 [95% CI 0.94, 1.09]). Almost half of the ARVO conference abstracts describing diagnostic accuracy studies did not reach full-text publication. Studies in abstracts that mentioned higher accuracy estimates were not more likely to be reported in a full-text publication. Copyright © 2016 Elsevier Inc. All rights reserved.

  13. Database citation in supplementary data linked to Europe PubMed Central full text biomedical articles.

    Science.gov (United States)

    Kafkas, Şenay; Kim, Jee-Hyub; Pi, Xingjun; McEntyre, Johanna R

    2015-01-01

    In this study, we present an analysis of data citation practices in full text research articles and their corresponding supplementary data files, made available in the Open Access set of articles from Europe PubMed Central. Our aim is to investigate whether supplementary data files should be considered as a source of information for integrating the literature with biomolecular databases. Using text-mining methods to identify and extract a variety of core biological database accession numbers, we found that the supplemental data files contain many more database citations than the body of the article, and that those citations often take the form of a relatively small number of articles citing large collections of accession numbers in text-based files. Moreover, citation of value-added databases derived from submission databases (such as Pfam, UniProt or Ensembl) is common, demonstrating the reuse of these resources as datasets in themselves. All the database accession numbers extracted from the supplementary data are publicly accessible from http://dx.doi.org/10.5281/zenodo.11771. Our study suggests that supplementary data should be considered when linking articles with data, in curation pipelines, and in information retrieval tasks in order to make full use of the entire research article. These observations highlight the need to improve the management of supplemental data in general, in order to make this information more discoverable and useful.

  14. Getting more out of biomedical documents with GATE's full lifecycle open source text analytics.

    Directory of Open Access Journals (Sweden)

    Hamish Cunningham

    Full Text Available This software article describes the GATE family of open source text analysis tools and processes. GATE is one of the most widely used systems of its type with yearly download rates of tens of thousands and many active users in both academic and industrial contexts. In this paper we report three examples of GATE-based systems operating in the life sciences and in medicine. First, in genome-wide association studies which have contributed to discovery of a head and neck cancer mutation association. Second, medical records analysis which has significantly increased the statistical power of treatment/outcome models in the UK's largest psychiatric patient cohort. Third, richer constructs in drug-related searching. We also explore the ways in which the GATE family supports the various stages of the lifecycle present in our examples. We conclude that the deployment of text mining for document abstraction or rich search and navigation is best thought of as a process, and that with the right computational tools and data collection strategies this process can be made defined and repeatable. The GATE research programme is now 20 years old and has grown from its roots as a specialist development tool for text processing to become a rather comprehensive ecosystem, bringing together software developers, language engineers and research staff from diverse fields. GATE now has a strong claim to cover a uniquely wide range of the lifecycle of text analysis systems. It forms a focal point for the integration and reuse of advances that have been made by many people (the majority outside of the authors' own group who work in text processing for biomedicine and other areas. GATE is available online under GNU open source licences and runs on all major operating systems. Support is available from an active user and developer community and also on a commercial basis.

  15. Full-text publication of abstracts in emergency medicine in Denmark

    DEFF Research Database (Denmark)

    Ravn, Anne Katrine; Petersen, Dan Brun; Folkestad, Lars

    2014-01-01

    INTRODUCTION: Abstracts presented at medical conferences or scientific meetings should ideally be published as full-text articles in peer-reviewed journals after initial presentation and feedback regardless of the findings. The aim of this survey was to determine the publication rate of papers...... similar publication rates. However, other more established specialties have higher publication levels. Knowledge of reasons for non-publication could lead to efforts to promote publication like funding; the possibility of discussion between authors and editors at conferences; "publication mentors"; and...

  16. The Establishment of the Chinese Full-text Electronic Periodical Database and Service System

    Directory of Open Access Journals (Sweden)

    Huei-Chu Chang

    2003-12-01

    Full Text Available A database covers important journals to critical mass, with powerful search interface, and easy for remote access is the most reasonable electronic resource for users. This article try to start from the project of digitizing bio-medical journals in Taiwan area to the CEPS, discuss the related issues about the selection of journals, the digitized of back issues, the copyright transfer from authors to database producers, the feedback to authors for payment from revenue. It also talks about the flow of journal publishing, marketing, function and the proposed cost-effectiveness in CEPS.[Article content in Chinese

  17. Using distant supervised learning to identify protein subcellular localizations from full-text scientific articles.

    Science.gov (United States)

    Zheng, Wu; Blake, Catherine

    2015-10-01

    Databases of curated biomedical knowledge, such as the protein-locations reflected in the UniProtKB database, provide an accurate and useful resource to researchers and decision makers. Our goal is to augment the manual efforts currently used to curate knowledge bases with automated approaches that leverage the increased availability of full-text scientific articles. This paper describes experiments that use distant supervised learning to identify protein subcellular localizations, which are important to understand protein function and to identify candidate drug targets. Experiments consider Swiss-Prot, the manually annotated subset of the UniProtKB protein knowledge base, and 43,000 full-text articles from the Journal of Biological Chemistry that contain just under 11.5 million sentences. The system achieves 0.81 precision and 0.49 recall at sentence level and an accuracy of 57% on held-out instances in a test set. Moreover, the approach identifies 8210 instances that are not in the UniProtKB knowledge base. Manual inspection of the 50 most likely relations showed that 41 (82%) were valid. These results have immediate benefit to researchers interested in protein function, and suggest that distant supervision should be explored to complement other manual data curation efforts. Copyright © 2015 Elsevier Inc. All rights reserved.

  18. Full-text publication of abstracts in emergency medicine in Denmark.

    Science.gov (United States)

    Ravn, Anne Katrine; Petersen, Dan Brun; Folkestad, Lars; Hallas, Peter; Brabrand, Mikkel

    2014-05-24

    Abstracts presented at medical conferences or scientific meetings should ideally be published as full-text articles in peer-reviewed journals after initial presentation and feedback regardless of the findings. The aim of this survey was to determine the publication rate of papers presented at the Danish Emergency Medicine Conferences in 2009, 2010 and 2011. Abstracts presented at the conferences were identified and authors contacted to obtain publication information. A further search was conducted using relevant databases. Publication rates for the 2009 and 2010 were approximately 30% (25-31.6%). The publication rate for the 2011 conference was 14.5% within 18 months with an additional 9% under review prior to publication. When comparing full-text publication rates from DEMC to previous international studies in EM Danish EM research community has similar publication rates. However, other more established specialties have higher publication levels. Knowledge of reasons for non-publication could lead to efforts to promote publication like funding; the possibility of discussion between authors and editors at conferences; "publication mentors"; and/or research courses provided by the Danish Society of Emergency Medicine.

  19. Accuracy of Intraoral Digital Impressions for Whole Upper Jaws, Including Full Dentitions and Palatal Soft Tissues.

    Directory of Open Access Journals (Sweden)

    Ning Gan

    Full Text Available Intraoral digital impressions have been stated to meet the clinical requirements for some teeth-supported restorations, though fewer evidences were proposed for larger scanning range. The aim of this study was to compare the accuracy (trueness and precision of intraoral digital impressions for whole upper jaws, including the full dentitions and palatal soft tissues, as well as to determine the effect of different palatal vault height or arch width on accuracy of intraoral digital impressions. Thirty-two volunteers were divided into three groups according to the palatal vault height or arch width. Each volunteer received three scans with TRIOS intraoral scanner and one conventional impression of whole upper jaw. Three-dimensional (3D images digitized from conventional gypsum casts by a laboratory scanner were chose as the reference models. All datasets were imported to a specific software program for 3D analysis by "best fit alignment" and "3D compare" process. Color-coded deviation maps showed qualitative visualization of the deviations. For the digital impressions for palatal soft tissues, trueness was (130.54±33.95μm and precision was (55.26±11.21μm. For the digital impressions for upper full dentitions, trueness was (80.01±17.78μm and precision was (59.52±11.29μm. Larger deviations were found between intraoral digital impressions and conventional impressions in the areas of palatal soft tissues than that in the areas of full dentitions (p0.05, but arch width was found to have a significant effect on precision of intraoral digital impressions for full dentitions (p = 0.016. A linear correlation was found between arch width and precision of digital impressions for whole upper jaws (r = 0.326, p = 0.034 for palatal soft tissues and r = 0.485, p = 0.002 for full dentitions. It was feasible to use the intraoral scanner to obtain digital impressions for whole upper jaws. Wider dental arch contributed to lower precision of an intraoral

  20. Full Text Searching and Customization in the NASA ADS Abstract Service

    Science.gov (United States)

    Eichhorn, G.; Accomazzi, A.; Grant, C. S.; Kurtz, M. J.; Henneken, E. A.; Thompson, D. M.; Murray, S. S.

    2004-01-01

    The NASA-ADS Abstract Service provides a sophisticated search capability for the literature in Astronomy, Planetary Sciences, Physics/Geophysics, and Space Instrumentation. The ADS is funded by NASA and access to the ADS services is free to anybody worldwide without restrictions. It allows the user to search the literature by author, title, and abstract text. The ADS database contains over 3.6 million references, with 965,000 in the Astronomy/Planetary Sciences database, and 1.6 million in the Physics/Geophysics database. 2/3 of the records have full abstracts, the rest are table of contents entries (titles and author lists only). The coverage for the Astronomy literature is better than 95% from 1975. Before that we cover all major journals and many smaller ones. Most of the journal literature is covered back to volume 1. We now get abstracts on a regular basis from most journals. Over the last year we have entered basically all conference proceedings tables of contents that are available at the Harvard Smithsonian Center for Astrophysics library. This has greatly increased the coverage of conference proceedings in the ADS. The ADS also covers the ArXiv Preprints. We download these preprints every night and index all the preprints. They can be searched either together with the other abstracts or separately. There are currently about 260,000 preprints in that database. In January 2004 we have introduced two new services, full text searching and a personal notification service called "myADS". As all other ADS services, these are free to use for anybody.

  1. E2FM: an encrypted and compressed full-text index for collections of genomic sequences.

    Science.gov (United States)

    Montecuollo, Ferdinando; Schmid, Giovannni; Tagliaferri, Roberto

    2017-09-15

    Next Generation Sequencing (NGS) platforms and, more generally, high-throughput technologies are giving rise to an exponential growth in the size of nucleotide sequence databases. Moreover, many emerging applications of nucleotide datasets-as those related to personalized medicine-require the compliance with regulations about the storage and processing of sensitive data. We have designed and carefully engineered E 2 FM -index, a new full-text index in minute space which was optimized for compressing and encrypting nucleotide sequence collections in FASTA format and for performing fast pattern-search queries. E 2 FM -index allows to build self-indexes which occupy till to 1/20 of the storage required by the input FASTA file, thus permitting to save about 95% of storage when indexing collections of highly similar sequences; moreover, it can exactly search the built indexes for patterns in times ranging from few milliseconds to a few hundreds milliseconds, depending on pattern length. Source code is available at https://github.com/montecuollo/E2FM . ferdinando.montecuollo@unicampania.it. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  2. Getting more out of biomedical documents with GATE's full lifecycle open source text analytics.

    Science.gov (United States)

    Cunningham, Hamish; Tablan, Valentin; Roberts, Angus; Bontcheva, Kalina

    2013-01-01

    This software article describes the GATE family of open source text analysis tools and processes. GATE is one of the most widely used systems of its type with yearly download rates of tens of thousands and many active users in both academic and industrial contexts. In this paper we report three examples of GATE-based systems operating in the life sciences and in medicine. First, in genome-wide association studies which have contributed to discovery of a head and neck cancer mutation association. Second, medical records analysis which has significantly increased the statistical power of treatment/outcome models in the UK's largest psychiatric patient cohort. Third, richer constructs in drug-related searching. We also explore the ways in which the GATE family supports the various stages of the lifecycle present in our examples. We conclude that the deployment of text mining for document abstraction or rich search and navigation is best thought of as a process, and that with the right computational tools and data collection strategies this process can be made defined and repeatable. The GATE research programme is now 20 years old and has grown from its roots as a specialist development tool for text processing to become a rather comprehensive ecosystem, bringing together software developers, language engineers and research staff from diverse fields. GATE now has a strong claim to cover a uniquely wide range of the lifecycle of text analysis systems. It forms a focal point for the integration and reuse of advances that have been made by many people (the majority outside of the authors' own group) who work in text processing for biomedicine and other areas. GATE is available online under GNU open source licences and runs on all major operating systems. Support is available from an active user and developer community and also on a commercial basis.

  3. [Full-text publication of abstracts presented at the 33th Argentinean pediatric meeting and non publication related factors].

    Science.gov (United States)

    Canosa, Daniela; Ferrero, Fernando; Melamud, Ariel; Otero, Paula D; Merech, Raúl S; Ceriani Cernadas, José M

    2011-02-01

    There is no information about non publication of research presented at scientific meetings in Argentina. We analyzed the full-text publication rate of abstracts presented at the 33° Argentinean Pediatric Congress (APC), time to achieve publication, and factors associated with publication or non-publication. Survey-based cross-sectional study, including authors of abstracts presented at the 33° APC. The survey included age, gender, specialty and sub-specialty, professional area and reason of publication or non-publication. We randomly selected 140/894 presented abstracts. Only 16 abstracts (11.4%) were subsequently published in full, requiring 27±15 months. There were no association between full-text publication and author's characteristics. "Oral presentations" were more likely to be subsequently published (p= 0.018). In non published abstracts, 95% were not submitted by the author, more frequently because of "lack of time" (35.9%). Only 11.4% of abstracts were subsequently published in full. Oral presentation was associated with a higher publication rate. Most frequent cause for non-publication was non submission due to lack of time.

  4. The consistency between scientific papers presented at the Orthopaedic Trauma Association and their subsequent full-text publication.

    Science.gov (United States)

    Preston, Charles F; Bhandari, Mohit; Fulkerson, Eric; Ginat, Danial; Egol, Kenneth A; Koval, Kenneth J

    2006-02-01

    To determine the consistency of conclusions/statements made in podium presentations at the annual meeting of the Orthopaedic Trauma Association (OTA) with those in subsequent full-text publications. Also, to evaluate the nature and consistency of study design, methods, sample sizes, results and assign a corresponding level of evidence. Abstracts of the scientific programs of the OTA from 1994 to 1997 (N = 254) were queried by using the PubMed database to identify those studies resulting in a peer-reviewed, full-text publication. Of the 169 articles retrieved, 137 studies were the basis of our study after the exclusion criteria were applied: non-English language, basic science studies, anatomic dissection studies, and articles published in non-peer-reviewed journals. Information was abstracted onto a data form: first from the abstract published in the final meeting program, and then from the published journal article. Information was recorded regarding study issues, including the study design, primary objective, sample size, and statistical methods. We provided descriptive statistics about the frequency of consistent results between abstracts and full-text publications. The results were recorded as percentages and a 95% confidence interval was applied to each value. Study results were recorded for the abstract and full-text publication comparing results and the overall conclusion. A level of scientific-based evidence was assigned to each full-text publication. The final conclusion of the study remained the same 93.4% of the time. The method of study was an observational case series 52% of the time and a statement regarding the rate of patient follow-up was reported 42% of the time. Of the studies published, 18.2% consisted of a sample size smaller than the previously presented abstract. When the published papers had their level of evidence graded, 11% were level I, 16% level II, 17% level III, and 56% level IV. Authors conclusions were consistent with those in full-text

  5. Analysis of Nature of Science Included in Recent Popular Writing Using Text Mining Techniques

    Science.gov (United States)

    Jiang, Feng; McComas, William F.

    2014-01-01

    This study examined the inclusion of nature of science (NOS) in popular science writing to determine whether it could serve supplementary resource for teaching NOS and to evaluate the accuracy of text mining and classification as a viable research tool in science education research. Four groups of documents published from 2001 to 2010 were…

  6. Full-wave calculation of fast-wave current drive in tokamaks including kparallel upshifts

    International Nuclear Information System (INIS)

    Jaeger, E.F.; Batchelor, D.B.

    1991-01-01

    Numerical calculations of fast-wave current drive (FWCD) efficiency have generally been of two types: ray tracing or global wave calculations. Ray tracing shows that the projection of the wave number (k parallel) along the magnetic field can vary greatly over a ray trajectory, particularly when the launch point is above or below the equatorial plane. As the wave penetrates toward the center of the plasma, k parallel increases, causing a decrease in the parallel phase speed and a corresponding decrease in the current drive efficiency, γ. But the assumptions of geometrical optics, namely short wavelength and strong single-pass absorption, are not greatly applicable in FWCD scenarios. Eigenmode structure, which is ignored in ray tracing, can play an important role in determining electric field strength and Landau damping rates. In such cases, a full-wave or global solution for the wave fields is desirable. In full-wave calculations such as ORION k parallel appear as a differential operator (rvec B·∇) in the argument of the plasma dispersion function. Since this leads to a differential system of infinite order, such codes of necessity assume k parallel ∼ k var-phi = const, where k var-phi is the toroidal wave number. Thus, it is not possible to correctly include effects of the poloidal magnetic field on k parallel. The problem can be alleviated by expressing the electric field as a superposition of poloidal modes, in which case k parallel is purely algebraic. This paper describes a new full-wave calculation, Poloidal Ion Cyclotron Expansion Solution, which uses poloidal and toroidal mode expansions to solve the wave equation in general flux coordinates. The calculation includes a full solution for E parallel and uses a reduced-order form of the plasma conductivity tensor to eliminate numerical problems associated with resolution of the very short wavelength ion Bernstein wave

  7. Publication trends of shared decision making in 15 high impact medical journals: a full-text review with bibliometric analysis.

    Science.gov (United States)

    Blanc, Xavier; Collet, Tinh-Hai; Auer, Reto; Fischer, Roland; Locatelli, Isabella; Iriarte, Pablo; Krause, Jan; Légaré, France; Cornuz, Jacques

    2014-08-09

    Shared Decision Making (SDM) is increasingly advocated as a model for medical decision making. However, there is still low use of SDM in clinical practice. High impact factor journals might represent an efficient way for its dissemination. We aimed to identify and characterize publication trends of SDM in 15 high impact medical journals. We selected the 15 general and internal medicine journals with the highest impact factor publishing original articles, letters and editorials. We retrieved publications from 1996 to 2011 through the full-text search function on each journal website and abstracted bibliometric data. We included publications of any type containing the phrase "shared decision making" or five other variants in their abstract or full text. These were referred to as SDM publications. A polynomial Poisson regression model with logarithmic link function was used to assess the evolution across the period of the number of SDM publications according to publication characteristics. We identified 1285 SDM publications out of 229,179 publications in 15 journals from 1996 to 2011. The absolute number of SDM publications by journal ranged from 2 to 273 over 16 years. SDM publications increased both in absolute and relative numbers per year, from 46 (0.32% relative to all publications from the 15 journals) in 1996 to 165 (1.17%) in 2011. This growth was exponential (P Full-text search retrieved ten times more SDM publications than a similar PubMed search (1285 vs. 119 respectively). This review in full-text showed that SDM publications increased exponentially in major medical journals from 1996 to 2011. This growth might reflect an increased dissemination of the SDM concept to the medical community.

  8. Conversion rates of abstracts presented at the Canadian Rheumatology Association Annual Meetings into full-text journal articles.

    Science.gov (United States)

    Yacyshyn, Elaine A; Soong, Laura C

    2017-06-01

    Dissemination of research studies is important for research ideas to be transformed from initial abstracts to full publications. Analyses of the scientific impact and publication record of the Canadian Rheumatology Association (CRA) Annual meeting have not been previously described. This study determines the publication rate of abstracts presented at the CRA Annual Meetings 2005-2013 to full-text journal articles and the factors associated with publication. Program records of previous CRA meetings from 2005 to 2013 were obtained. Abstracts were searched for corresponding full-text publication in Google Scholar and PubMed using a search algorithm. Abstracts and subsequent published articles were evaluated for type of abstract, time to publication, study type, publishing journal, and journal impact factor. A total of 1401 abstracts were included in the study, 567 of which were converted to full publications. The average time to publication was 19.7 months, with 89% of abstracts published within 3 years of being presented. Eighty-three percent of abstracts were clinical in nature, and 58% of published studies were observational in design. Articles were published in a wide range of journals, with the top publisher being the Journal of Rheumatology (31%). Average time to publication was 19.7 months. Eighty-six percent of articles had a Journal Impact Factor > 2. Overall, 40.5% of abstracts presented at the CRA Annual Meetings 2005-2013 were published. Further research is needed to determine barriers and reasons for abstracts not being published as full-text articles.

  9. Dynamic programming re-ranking for PPI interactor and pair extraction in full-text articles

    Science.gov (United States)

    2011-01-01

    Background Experimentally verified protein-protein interactions (PPIs) cannot be easily retrieved by researchers unless they are stored in PPI databases. The curation of such databases can be facilitated by employing text-mining systems to identify genes which play the interactor role in PPIs and to map these genes to unique database identifiers (interactor normalization task or INT) and then to return a list of interaction pairs for each article (interaction pair task or IPT). These two tasks are evaluated in terms of the area under curve of the interpolated precision/recall (AUC iP/R) score because the order of identifiers in the output list is important for ease of curation. Results Our INT system developed for the BioCreAtIvE II.5 INT challenge achieved a promising AUC iP/R of 43.5% by using a support vector machine (SVM)-based ranking procedure. Using our new re-ranking algorithm, we have been able to improve system performance (AUC iP/R) by 1.84%. Our experimental results also show that with the re-ranked INT results, our unsupervised IPT system can achieve a competitive AUC iP/R of 23.86%, which outperforms the best BC II.5 INT system by 1.64%. Compared to using only SVM ranked INT results, using re-ranked INT results boosts AUC iP/R by 7.84%. Statistical significance t-test results show that our INT/IPT system with re-ranking outperforms that without re-ranking by a statistically significant difference. Conclusions In this paper, we present a new re-ranking algorithm that considers co-occurrence among identifiers in an article to improve INT and IPT ranking results. Combining the re-ranked INT results with an unsupervised approach to find associations among interactors, the proposed method can boost the IPT performance. We also implement score computation using dynamic programming, which is faster and more efficient than traditional approaches. PMID:21342534

  10. Prospects and limitations of full-text index structures in genome analysis

    Science.gov (United States)

    Vyverman, Michaël; De Baets, Bernard; Fack, Veerle; Dawyndt, Peter

    2012-01-01

    The combination of incessant advances in sequencing technology producing large amounts of data and innovative bioinformatics approaches, designed to cope with this data flood, has led to new interesting results in the life sciences. Given the magnitude of sequence data to be processed, many bioinformatics tools rely on efficient solutions to a variety of complex string problems. These solutions include fast heuristic algorithms and advanced data structures, generally referred to as index structures. Although the importance of index structures is generally known to the bioinformatics community, the design and potency of these data structures, as well as their properties and limitations, are less understood. Moreover, the last decade has seen a boom in the number of variant index structures featuring complex and diverse memory-time trade-offs. This article brings a comprehensive state-of-the-art overview of the most popular index structures and their recently developed variants. Their features, interrelationships, the trade-offs they impose, but also their practical limitations, are explained and compared. PMID:22584621

  11. Students' Learning Experiences from Didactic Teaching Sessions Including Patient Case Examples as Either Text or Video

    DEFF Research Database (Denmark)

    Pedersen, Kamilla; Moeller, Martin Holdgaard; Paltved, Charlotte

    2017-01-01

    OBJECTIVES: The aim of this study was to explore medical students' learning experiences from the didactic teaching formats using either text-based patient cases or video-based patient cases with similar content. The authors explored how the two different patient case formats influenced students......' perceptions of psychiatric patients and students' reflections on meeting and communicating with psychiatric patients. METHODS: The authors conducted group interviews with 30 medical students who volunteered to participate in interviews and applied inductive thematic content analysis to the transcribed...

  12. Accuracy of Intraoral Digital Impressions for Whole Upper Jaws, Including Full Dentitions and Palatal Soft Tissues.

    Science.gov (United States)

    Gan, Ning; Xiong, Yaoyang; Jiao, Ting

    2016-01-01

    Intraoral digital impressions have been stated to meet the clinical requirements for some teeth-supported restorations, though fewer evidences were proposed for larger scanning range. The aim of this study was to compare the accuracy (trueness and precision) of intraoral digital impressions for whole upper jaws, including the full dentitions and palatal soft tissues, as well as to determine the effect of different palatal vault height or arch width on accuracy of intraoral digital impressions. Thirty-two volunteers were divided into three groups according to the palatal vault height or arch width. Each volunteer received three scans with TRIOS intraoral scanner and one conventional impression of whole upper jaw. Three-dimensional (3D) images digitized from conventional gypsum casts by a laboratory scanner were chose as the reference models. All datasets were imported to a specific software program for 3D analysis by "best fit alignment" and "3D compare" process. Color-coded deviation maps showed qualitative visualization of the deviations. For the digital impressions for palatal soft tissues, trueness was (130.54±33.95)μm and precision was (55.26±11.21)μm. For the digital impressions for upper full dentitions, trueness was (80.01±17.78)μm and precision was (59.52±11.29)μm. Larger deviations were found between intraoral digital impressions and conventional impressions in the areas of palatal soft tissues than that in the areas of full dentitions (pimpressions for palatal soft tissues was slightly better than that for full dentitions (p = 0.049). There was no significant effect of palatal vault height on accuracy of digital impressions for palatal soft tissues (p>0.05), but arch width was found to have a significant effect on precision of intraoral digital impressions for full dentitions (p = 0.016). A linear correlation was found between arch width and precision of digital impressions for whole upper jaws (r = 0.326, p = 0.034 for palatal soft tissues and r

  13. Accuracy of Intraoral Digital Impressions for Whole Upper Jaws, Including Full Dentitions and Palatal Soft Tissues

    Science.gov (United States)

    Gan, Ning; Xiong, Yaoyang; Jiao, Ting

    2016-01-01

    Intraoral digital impressions have been stated to meet the clinical requirements for some teeth-supported restorations, though fewer evidences were proposed for larger scanning range. The aim of this study was to compare the accuracy (trueness and precision) of intraoral digital impressions for whole upper jaws, including the full dentitions and palatal soft tissues, as well as to determine the effect of different palatal vault height or arch width on accuracy of intraoral digital impressions. Thirty-two volunteers were divided into three groups according to the palatal vault height or arch width. Each volunteer received three scans with TRIOS intraoral scanner and one conventional impression of whole upper jaw. Three-dimensional (3D) images digitized from conventional gypsum casts by a laboratory scanner were chose as the reference models. All datasets were imported to a specific software program for 3D analysis by "best fit alignment" and "3D compare" process. Color-coded deviation maps showed qualitative visualization of the deviations. For the digital impressions for palatal soft tissues, trueness was (130.54±33.95)μm and precision was (55.26±11.21)μm. For the digital impressions for upper full dentitions, trueness was (80.01±17.78)μm and precision was (59.52±11.29)μm. Larger deviations were found between intraoral digital impressions and conventional impressions in the areas of palatal soft tissues than that in the areas of full dentitions (pimpressions for palatal soft tissues was slightly better than that for full dentitions (p = 0.049). There was no significant effect of palatal vault height on accuracy of digital impressions for palatal soft tissues (p>0.05), but arch width was found to have a significant effect on precision of intraoral digital impressions for full dentitions (p = 0.016). A linear correlation was found between arch width and precision of digital impressions for whole upper jaws (r = 0.326, p = 0.034 for palatal soft tissues and r

  14. Commercial Database Design vs. Library Terminology Comprehension: Why Do Students Print Abstracts Instead of Full-Text Articles?

    Science.gov (United States)

    Imler, Bonnie; Eichelberger, Michelle

    2014-01-01

    When asked to print the full text of an article, many undergraduate college students print the abstract instead of the full text. This study seeks to determine the underlying cause(s) of this confusion. In this quantitative study, participants (n = 40) performed five usability tasks to assess ease of use and usefulness of five commercial library…

  15. download full text

    African Journals Online (AJOL)

    Hence, the main objective of the research was to carry out scientific studies on its ... The animals were sacrificed on day 30 after the NIB scoring and blood sample ... effect on locomotion and rearing activities when compared with the control.

  16. download full text

    African Journals Online (AJOL)

    The overshadowing of education policies in foreign language education at primary .... Cummins states that a threshold level of linguistics competence must be ..... language education planning is designed to accommodate these interests.

  17. download full text

    African Journals Online (AJOL)

    Keywords: Technology, French as a foreign language, Learners, Instruction ... This translates to an increase of 3.7 percent or 1.4 million new mobile subscriptions ... technology (ICT) in foreign language learning and the availability as well as capacities ..... In spite of the many benefits of creating an authentic French learning ...

  18. download full text

    African Journals Online (AJOL)

    Dale E. Zand (1997) argues that People once stood in awe of electricity, until ... in today's information-driven organizations: knowledge, trust, and power. ..... people's culture and resistance to anti-corruption efforts constitute the firmly fixed load.

  19. download full text

    African Journals Online (AJOL)

    Epidemiological study has shown that 2.5 million deaths occurred every year as a result of vaccine-preventable diseases, mainly in Africa and Asia among children less than 5 years old (GIVS, 2005). Immunization is the process of conferring increased resistance to an infectious disease by a means other than experiencing ...

  20. download full text

    African Journals Online (AJOL)

    UNIVERSITY OF BENIN

    By paying strict attention to the manipulation of action and dialogue, the short story ... through the workings of the human mind as he reacts to various predicaments. .... In “A Caring Man,” in A Forest of Flowers, Ken Saro-Wiwa illustrates the theme of .... until his small dirty pillow is thrown out of the window of the moving train.

  1. download full text

    African Journals Online (AJOL)

    ... country and mass migration of the farming communities to IDP camps in major cities ..... "Global Warming Impact: Flood Events, Wet-Dry Conditions and Changing ... Global Environmental Change, Vol. 16, pp. 268-281.Web. Adger, W. N. (1999). "Social Vulnerability to Climate Change and Extremes in Coastal Vietnam.

  2. download full text

    African Journals Online (AJOL)

    Pablo Rubio Gijon

    Hishongwa belongs to a generation of writers who created a new style of expression in .... authority, can turn this authority into something even more autocratic. ... leadership of that (liberation) struggle” (Haarhoff 224), Hishongwa's Marrying ...

  3. download full text

    African Journals Online (AJOL)

    Adopting a surveillance system for antibacterial use has therefore become a more realistic ..... Financial support was obtained from the African Poverty Related Infection ... classification and Defined Daily Dose system methodology in Canada.

  4. download full text

    African Journals Online (AJOL)

    Oita Etyang

    The concept democracy has been part of man's political life for ages. ... Taking the queue form Bratton and Mattes, we add that prospects of a stable democracy are ..... of the resulting instability that emanate from entrenched ethnic cleavages.

  5. download full text

    African Journals Online (AJOL)

    TAOFEEK YUSUF

    The data used were obtained through questionnaires administered to ... Keywords: academic performance, engineering education, undergraduate students, and .... and commitment to studies irrespective of any form of learning task Yusuf et al.

  6. download full text

    African Journals Online (AJOL)

    Njeri

    He took his children to St Marys, I could not afford to do so. ... place in universities should have been an important learning space for students. ... just us we are fascinated by Manchester football clubs and western movies as well as music.

  7. download full text

    African Journals Online (AJOL)

    Language and Meaning: A Syntactic Study of Wale Okediran's Strange Encounters ... own communication role, making assertions, asking questions, giving orders, ... I will go straight to the police with all the things you stole from the hospital.

  8. Full-text

    African Journals Online (AJOL)

    ADOWIE PERE

    consideration the needs of the current generation without risking ability of future generations to attain their needs. Evaluation of .... If an element or a number such as x and a collection such as A ... defined as definitive and accurate. This also ...

  9. download full text

    African Journals Online (AJOL)

    paula fiona mwikali

    The Portrayal of Masculinity in Dholuo Ohangla Music ... The Luo culture is built on patriarchy and the socialization of the children ..... A leader must be strong because those he/ she leads look up to him/her for direction, assistance and development. ... Being a loyal lieutenant of Orange Democratic Movement, Anyanga ...

  10. The Effect of Different Modes of English Captioning on EFL learners’ General Listening Comprehension: Full text Vs. Keyword Captions

    Directory of Open Access Journals (Sweden)

    Sorayya Behroozizad

    2015-08-01

    Full Text Available This study investigated the effect of different modes of English captioning on EFL learners’ general listening comprehension. To this end, forty five intermediate-level learners were selected based on their scores on a standardized English proficiency test (PET to carry out the study. Then, the selected participants were randomly assigned into two experimental groups (full-captions and keyword-captions and one control group (no-captions. Research instrumentation included a pre-test and a post-test following an experimental design. Participants took a pre-test and a post-test containing 50 multiple-choice questions (25question for pre-test and 25 question for post-test selected from a standard listening test PET, and also 15 treatment sessions. The findings showed significant differences among full-captions, keyword-captions, and no-captions in terms of their effect on learners’ general listening comprehension. This study provided some pedagogical implications for teaching listening through using different modes of captions. Keywords: Caption, full caption, keyword caption, listening comprehension

  11. Retrieval of publications addressing shared decision making: an evaluation of full-text searches on medical journal websites.

    Science.gov (United States)

    Blanc, Xavier; Collet, Tinh-Hai; Auer, Reto; Iriarte, Pablo; Krause, Jan; Légaré, France; Cornuz, Jacques; Clair, Carole

    2015-04-07

    Full-text searches of articles increase the recall, defined by the proportion of relevant publications that are retrieved. However, this method is rarely used in medical research due to resource constraints. For the purpose of a systematic review of publications addressing shared decision making, a full-text search method was required to retrieve publications where shared decision making does not appear in the title or abstract. The objective of our study was to assess the efficiency and reliability of full-text searches in major medical journals for identifying shared decision making publications. A full-text search was performed on the websites of 15 high-impact journals in general internal medicine to look up publications of any type from 1996-2011 containing the phrase "shared decision making". The search method was compared with a PubMed search of titles and abstracts only. The full-text search was further validated by requesting all publications from the same time period from the individual journal publishers and searching through the collected dataset. The full-text search for "shared decision making" on journal websites identified 1286 publications in 15 journals compared to 119 through the PubMed search. The search within the publisher-provided publications of 6 journals identified 613 publications compared to 646 with the full-text search on the respective journal websites. The concordance rate was 94.3% between both full-text searches. Full-text searching on medical journal websites is an efficient and reliable way to identify relevant articles in the field of shared decision making for review or other purposes. It may be more widely used in biomedical research in other fields in the future, with the collaboration of publishers and journals toward open-access data.

  12. The BioC-BioGRID corpus: full text articles annotated for curation of protein–protein and genetic interactions

    Science.gov (United States)

    Kim, Sun; Chatr-aryamontri, Andrew; Chang, Christie S.; Oughtred, Rose; Rust, Jennifer; Wilbur, W. John; Comeau, Donald C.; Dolinski, Kara; Tyers, Mike

    2017-01-01

    A great deal of information on the molecular genetics and biochemistry of model organisms has been reported in the scientific literature. However, this data is typically described in free text form and is not readily amenable to computational analyses. To this end, the BioGRID database systematically curates the biomedical literature for genetic and protein interaction data. This data is provided in a standardized computationally tractable format and includes structured annotation of experimental evidence. BioGRID curation necessarily involves substantial human effort by expert curators who must read each publication to extract the relevant information. Computational text-mining methods offer the potential to augment and accelerate manual curation. To facilitate the development of practical text-mining strategies, a new challenge was organized in BioCreative V for the BioC task, the collaborative Biocurator Assistant Task. This was a non-competitive, cooperative task in which the participants worked together to build BioC-compatible modules into an integrated pipeline to assist BioGRID curators. As an integral part of this task, a test collection of full text articles was developed that contained both biological entity annotations (gene/protein and organism/species) and molecular interaction annotations (protein–protein and genetic interactions (PPIs and GIs)). This collection, which we call the BioC-BioGRID corpus, was annotated by four BioGRID curators over three rounds of annotation and contains 120 full text articles curated in a dataset representing two major model organisms, namely budding yeast and human. The BioC-BioGRID corpus contains annotations for 6409 mentions of genes and their Entrez Gene IDs, 186 mentions of organism names and their NCBI Taxonomy IDs, 1867 mentions of PPIs and 701 annotations of PPI experimental evidence statements, 856 mentions of GIs and 399 annotations of GI evidence statements. The purpose, characteristics and possible future

  13. The BioC-BioGRID corpus: full text articles annotated for curation of protein-protein and genetic interactions.

    Science.gov (United States)

    Islamaj Dogan, Rezarta; Kim, Sun; Chatr-Aryamontri, Andrew; Chang, Christie S; Oughtred, Rose; Rust, Jennifer; Wilbur, W John; Comeau, Donald C; Dolinski, Kara; Tyers, Mike

    2017-01-01

    A great deal of information on the molecular genetics and biochemistry of model organisms has been reported in the scientific literature. However, this data is typically described in free text form and is not readily amenable to computational analyses. To this end, the BioGRID database systematically curates the biomedical literature for genetic and protein interaction data. This data is provided in a standardized computationally tractable format and includes structured annotation of experimental evidence. BioGRID curation necessarily involves substantial human effort by expert curators who must read each publication to extract the relevant information. Computational text-mining methods offer the potential to augment and accelerate manual curation. To facilitate the development of practical text-mining strategies, a new challenge was organized in BioCreative V for the BioC task, the collaborative Biocurator Assistant Task. This was a non-competitive, cooperative task in which the participants worked together to build BioC-compatible modules into an integrated pipeline to assist BioGRID curators. As an integral part of this task, a test collection of full text articles was developed that contained both biological entity annotations (gene/protein and organism/species) and molecular interaction annotations (protein-protein and genetic interactions (PPIs and GIs)). This collection, which we call the BioC-BioGRID corpus, was annotated by four BioGRID curators over three rounds of annotation and contains 120 full text articles curated in a dataset representing two major model organisms, namely budding yeast and human. The BioC-BioGRID corpus contains annotations for 6409 mentions of genes and their Entrez Gene IDs, 186 mentions of organism names and their NCBI Taxonomy IDs, 1867 mentions of PPIs and 701 annotations of PPI experimental evidence statements, 856 mentions of GIs and 399 annotations of GI evidence statements. The purpose, characteristics and possible future

  14. A full-text english database of testimonies of those exposed to radiation near the Semipalatinsk nuclear test site, Kazakhstan

    OpenAIRE

    Matsuo, Masatsugu; Kawano, Noriyuki; Hirabayashi, Kyoko; Tooka, Yasuyuki; Apsalikov, Kazbek Negamatovich; Hoshi, Masaharu

    2004-01-01

    The present paper is a sequel to the initial report (Kawano et al 2003a) of the project for a full-text Japanese database of the testimonies of those exposed to radiation near the nuclear test site of Semipalatinsk, Kazakhstan. 139 testimonies were gathered in four villages near Semipalatinsk in 2002. We translated them into English from Russian and Kazakh, and created a full-text database by using a Latin script text retrieval program, TERESA. The present paper attempts at essentially the sa...

  15. Impact of Metadata on Full-text Information Retrieval Performance: An Experimental Research on a Small Scale Turkish Corpus

    Directory of Open Access Journals (Sweden)

    Çağdaş Çapkın

    2016-12-01

    Full Text Available Information institutions use text-based information retrieval systems to store, index and retrieve metadata, full-text, or both metadata and full-text (hybrid contents. The aim of this research was to evaluate impact of these contents on information retrieval performance. For this purpose, metadata (MIR, full-text (FIR and hybrid (HIR content information retrieval systems were developed with default Lucene information retrieval model for a small scale Turkish corpus. In order to evaluate performance of this three systems, “precision - recall” and “normalized recall” tests were conducted. Experimental findings showed that there were no significant differences between MIR and FIR in mean average precision (MAP performance. On the other hand, MAP performance of HIR was significantly higher in comparison to MIR and FIR. When information retrieval performance was evaluated as user-centered, the “normalized recall” performances of MIR and HIR were significantly higher than FIR. Additionally, there were no significant differences between the systems in retrieved relevant document means. Processing different types of contents such as metadata and full-text had some advantages and disadvantages for information retrieval systems in terms of term management. The advantages brought together in hybrid content processing (HIR and information retrieval performance improved.

  16. Full Space Vectors Modulation for Nine-Switch Converters Including CF & DF Modes

    DEFF Research Database (Denmark)

    Dehghan Dehnavi, Seyed Mohammad; Mohamadian, Mustafa; Andersen, Michael A. E.

    2010-01-01

    converter. As a space vector modulation for DF mode has already been proposed by authors. This paper proposes a full space vector modulation (SVM) for both CF and DF modes. Also practical methods are presented for SVM proposed. In addition a special SVM is proposed that offers minimum total harmonic...... distortion (THD) in DF mode. The performance of the proposed SVM is verified by simulation results....

  17. Empirical investigations into full-text protein interaction Article Categorization Task (ACT) in the BioCreative II.5 Challenge.

    Science.gov (United States)

    Lan, Man; Su, Jian

    2010-01-01

    The selection of protein interaction documents is one important application for biology research and has a direct impact on the quality of downstream BioNLP applications, i.e., information extraction and retrieval, summarization, QA, etc. The BioCreative II.5 Challenge Article Categorization task (ACT) involves doing a binary text classification to determine whether a given structured full-text article contains protein interaction information. This may be the first attempt at classification of full-text protein interaction documents in wide community. In this paper, we compare and evaluate the effectiveness of different section types in full-text articles for text classification. Moreover, in practice, the less number of true-positive samples results in unstable performance and unreliable classifier trained on it. Previous research on learning with skewed class distributions has altered the class distribution using oversampling and downsampling. We also investigate the skewed protein interaction classification and analyze the effect of various issues related to the choice of external sources, oversampling training sets, classifiers, etc. We report on the various factors above to show that 1) a full-text biomedical article contains a wealth of scientific information important to users that may not be completely represented by abstracts and/or keywords, which improves the accuracy performance of classification and 2) reinforcing true-positive samples significantly increases the accuracy and stability performance of classification.

  18. Coreference annotation and resolution in the Colorado Richly Annotated Full Text (CRAFT) corpus of biomedical journal articles.

    Science.gov (United States)

    Cohen, K Bretonnel; Lanfranchi, Arrick; Choi, Miji Joo-Young; Bada, Michael; Baumgartner, William A; Panteleyeva, Natalya; Verspoor, Karin; Palmer, Martha; Hunter, Lawrence E

    2017-08-17

    Coreference resolution is the task of finding strings in text that have the same referent as other strings. Failures of coreference resolution are a common cause of false negatives in information extraction from the scientific literature. In order to better understand the nature of the phenomenon of coreference in biomedical publications and to increase performance on the task, we annotated the Colorado Richly Annotated Full Text (CRAFT) corpus with coreference relations. The corpus was manually annotated with coreference relations, including identity and appositives for all coreferring base noun phrases. The OntoNotes annotation guidelines, with minor adaptations, were used. Interannotator agreement ranges from 0.480 (entity-based CEAF) to 0.858 (Class-B3), depending on the metric that is used to assess it. The resulting corpus adds nearly 30,000 annotations to the previous release of the CRAFT corpus. Differences from related projects include a much broader definition of markables, connection to extensive annotation of several domain-relevant semantic classes, and connection to complete syntactic annotation. Tool performance was benchmarked on the data. A publicly available out-of-the-box, general-domain coreference resolution system achieved an F-measure of 0.14 (B3), while a simple domain-adapted rule-based system achieved an F-measure of 0.42. An ensemble of the two reached F of 0.46. Following the IDENTITY chains in the data would add 106,263 additional named entities in the full 97-paper corpus, for an increase of 76% percent in the semantic classes of the eight ontologies that have been annotated in earlier versions of the CRAFT corpus. The project produced a large data set for further investigation of coreference and coreference resolution in the scientific literature. The work raised issues in the phenomenon of reference in this domain and genre, and the paper proposes that many mentions that would be considered generic in the general domain are not

  19. Precise predictions of higgs boson decays including the full one-loop corrections in supersymmetry

    International Nuclear Information System (INIS)

    Frisch, W.

    2011-01-01

    The Standard Model of elementary particle physics is a highly successful theory, describing the electromagnetic, strong and weak interaction of matter particles up to energy scales to a few hundred giga electronvolt. Despite its great success in explaining experimental results correctly, there is hardly no doubt that the SM is an effective theory, which means that the theory loses its predictability at higher energies. Therefore, the Standard Model has to be extended in a proper way to describe physics at higher energies. A most promising concept for the extension of the SM is those of Supersymmetry, where for each particle of the SM one or more superpartner particles are introduced. The simplest and most attractive extension of the SM is called Minimal Supersymmetric Standard Model (MSSM). Minimal refers to the additional field content, which is kept as low as possible. In fact the MSSM consists of the fields of the SM and their corresponding supersymmetric partner fields, as well as one additional Higgs doublet. The presence of this additional Higgs doublet leads to the existence of five physical Higgs bosons in the MSSM. The search for supersymmetric particles and Higgs bosons is one of the primary goals of the Large Hadron Collider (LHC) at the CERN laboratory, producing collisions at sufficiently high energies to detect these particles. For the discovery of these new particles, precise pre- dictions of the corresponding decay widths and branching rations are utmost mandatory. To contribute with the precision of the LHC and the future ILC, Feynman amplitudes should be calculated at least to one-loop order. Since these calculations lead to so called UV- and IR- divergences, it is essential to perform a renormalization procedure, where the divergences are subtracted by a proper definition of counterterms. The goal of this work was to develop a program package, which calculates all MSSM two- body Higgs decay widths and corresponding branching ratios at full one

  20. 4He binding energy calculation including full tensor-force effects

    Science.gov (United States)

    Fonseca, A. C.

    1989-09-01

    The four-body equations of Alt, Grassberger, and Sandhas are solved in the version where the (2)+(2) subamplitudes are treated exactly by convolution, using one-term separable Yamaguchy nucleon-nucleon potentials in the 1S0 and 3S1-3D1 channels. The resulting jp=1/2+ and (3/2+ three-body subamplitudes are represented in a separable form using the energy-dependent pole expansion. Converged bound-state results are calculated for the first time using the full interaction, and are compared with those obtained from a simplified treatment of the tensor force. The Tjon line that correlates three-nucleon and four-nucleon binding energies is shown using different nucleon-nucleon potentials. In all calculations the Coulomb force has been neglected.

  1. Estimation of full moment tensors, including uncertainties, for earthquakes, volcanic events, and nuclear explosions

    Science.gov (United States)

    Alvizuri, Celso R.

    We present a catalog of full seismic moment tensors for 63 events from Uturuncu volcano in Bolivia. The events were recorded during 2011-2012 in the PLUTONS seismic array of 24 broadband stations. Most events had magnitudes between 0.5 and 2.0 and did not generate discernible surface waves; the largest event was Mw 2.8. For each event we computed the misfit between observed and synthetic waveforms, and we used first-motion polarity measurements to reduce the number of possible solutions. Each moment tensor solution was obtained using a grid search over the six-dimensional space of moment tensors. For each event we show the misfit function in eigenvalue space, represented by a lune. We identify three subsets of the catalog: (1) 6 isotropic events, (2) 5 tensional crack events, and (3) a swarm of 14 events southeast of the volcanic center that appear to be double couples. The occurrence of positively isotropic events is consistent with other published results from volcanic and geothermal regions. Several of these previous results, as well as our results, cannot be interpreted within the context of either an oblique opening crack or a crack-plus-double-couple model. Proper characterization of uncertainties for full moment tensors is critical for distinguishing among physical models of source processes. A seismic moment tensor is a 3x3 symmetric matrix that provides a compact representation of a seismic source. We develop an algorithm to estimate moment tensors and their uncertainties from observed seismic data. For a given event, the algorithm performs a grid search over the six-dimensional space of moment tensors by generating synthetic waveforms for each moment tensor and then evaluating a misfit function between the observed and synthetic waveforms. 'The' moment tensor M0 for the event is then the moment tensor with minimum misfit. To describe the uncertainty associated with M0, we first convert the misfit function to a probability function. The uncertainty, or

  2. On the Creation of Hypertext Links in Full-Text Documents: Measurement of Inter-Linker Consistency.

    Science.gov (United States)

    Ellis, David; And Others

    1994-01-01

    Describes a study in which several different sets of hypertext links are inserted by different people in full-text documents. The degree of similarity between the sets is measured using coefficients and topological indices. As in comparable studies of inter-indexer consistency, the sets of links used by different people showed little similarity.…

  3. Comparing data accuracy between structured abstracts and full-text journal articles: implications in their use for informing clinical decisions.

    Science.gov (United States)

    Fontelo, Paul; Gavino, Alex; Sarmiento, Raymond Francis

    2013-12-01

    The abstract is the most frequently read section of a research article. The use of 'Consensus Abstracts', a clinician-oriented web application formatted for mobile devices to search MEDLINE/PubMed, for informing clinical decisions was proposed recently; however, inaccuracies between abstracts and the full-text article have been shown. Efforts have been made to improve quality. We compared data in 60 recent-structured abstracts and full-text articles from six highly read medical journals. Data inaccuracies were identified and then classified as either clinically significant or not significant. Data inaccuracies were observed in 53.33% of articles ranging from 3.33% to 45% based on the IMRAD format sections. The Results section showed the highest discrepancies (45%) although these were deemed to be mostly not significant clinically except in one. The two most common discrepancies were mismatched numbers or percentages (11.67%) and numerical data or calculations found in structured abstracts but not mentioned in the full text (40%). There was no significant relationship between journals and the presence of discrepancies (Fisher's exact p value =0.3405). Although we found a high percentage of inaccuracy between structured abstracts and full-text articles, these were not significant clinically. The inaccuracies do not seem to affect the conclusion and interpretation overall. Structured abstracts appear to be informative and may be useful to practitioners as a resource for guiding clinical decisions.

  4. Desktop Access to Full-Text NACA and NASA Reports: Systems Developed by NASA Langley Technical Library

    Science.gov (United States)

    Ambur, Manjula Y.; Adams, David L.; Trinidad, P. Paul

    1997-01-01

    NASA Langley Technical Library has been involved in developing systems for full-text information delivery of NACA/NASA technical reports since 1991. This paper will describe the two prototypes it has developed and the present production system configuration. The prototype systems are a NACA CD-ROM of thirty-three classic paper NACA reports and a network-based Full-text Electronic Reports Documents System (FEDS) constructed from both paper and electronic formats of NACA and NASA reports. The production system is the DigiDoc System (DIGItal Documents) presently being developed based on the experiences gained from the two prototypes. DigiDoc configuration integrates the on-line catalog database World Wide Web interface and PDF technology to provide a powerful and flexible search and retrieval system. It describes in detail significant achievements and lessons learned in terms of data conversion, storage technologies, full-text searching and retrieval, and image databases. The conclusions from the experiences of digitization and full- text access and future plans for DigiDoc system implementation are discussed.

  5. Probing dark energy with cluster counts and cosmic shear power spectra: including the full covariance

    International Nuclear Information System (INIS)

    Takada, Masahiro; Bridle, Sarah

    2007-01-01

    Several dark energy experiments are available from a single large-area imaging survey and may be combined to improve cosmological parameter constraints and/or test inherent systematics. Two promising experiments are cosmic shear power spectra and counts of galaxy clusters. However, the two experiments probe the same cosmic mass density field in large-scale structure, therefore the combination may be less powerful than first thought. We investigate the cross-covariance between the cosmic shear power spectra and the cluster counts based on the halo model approach, where the cross-covariance arises from the three-point correlations of the underlying mass density field. Fully taking into account the cross-covariance, as well as non-Gaussian errors on the lensing power spectrum covariance, we find a significant cross-correlation between the lensing power spectrum signals at multipoles l∼10 3 and the cluster counts containing halos with masses M∼>10 14 M o-dot . Including the cross-covariance for the combined measurement degrades and in some cases improves the total signal-to-noise (S/N) ratios up to ∼±20% relative to when the two are independent. For cosmological parameter determination, the cross-covariance has a smaller effect as a result of working in a multi-dimensional parameter space, implying that the two observables can be considered independent to a good approximation. We also discuss the fact that cluster count experiments using lensing-selected mass peaks could be more complementary to cosmic shear tomography than mass-selected cluster counts of the corresponding mass threshold. Using lensing selected clusters with a realistic usable detection threshold ((S/N) cluster ∼6 for a ground-based survey), the uncertainty on each dark energy parameter may be roughly halved by the combined experiments, relative to using the power spectra alone

  6. Grammatical replacements in translation of German advertising texts of utomotive subject including participial constructions with attributive meaning

    Directory of Open Access Journals (Sweden)

    Артур Нарманович Мамедов

    2013-12-01

    Full Text Available Informative capacity of participial construction of source and target languages contributes to a more complex and multi aspect image of an expensive car. Dangling participles and attributive clauses placed after the determined word are being used in translation of extended adjectives with participles I and II. These grammatical transformations connected with reconstruction of semantic structure remain logically rational argumentation of an advertising text of the source language.

  7. Combining automatic table classification and relationship extraction in extracting anticancer drug-side effect pairs from full-text articles.

    Science.gov (United States)

    Xu, Rong; Wang, QuanQiu

    2015-02-01

    Anticancer drug-associated side effect knowledge often exists in multiple heterogeneous and complementary data sources. A comprehensive anticancer drug-side effect (drug-SE) relationship knowledge base is important for computation-based drug target discovery, drug toxicity predication and drug repositioning. In this study, we present a two-step approach by combining table classification and relationship extraction to extract drug-SE pairs from a large number of high-profile oncological full-text articles. The data consists of 31,255 tables downloaded from the Journal of Oncology (JCO). We first trained a statistical classifier to classify tables into SE-related and -unrelated categories. We then extracted drug-SE pairs from SE-related tables. We compared drug side effect knowledge extracted from JCO tables to that derived from FDA drug labels. Finally, we systematically analyzed relationships between anti-cancer drug-associated side effects and drug-associated gene targets, metabolism genes, and disease indications. The statistical table classifier is effective in classifying tables into SE-related and -unrelated (precision: 0.711; recall: 0.941; F1: 0.810). We extracted a total of 26,918 drug-SE pairs from SE-related tables with a precision of 0.605, a recall of 0.460, and a F1 of 0.520. Drug-SE pairs extracted from JCO tables is largely complementary to those derived from FDA drug labels; as many as 84.7% of the pairs extracted from JCO tables have not been included a side effect database constructed from FDA drug labels. Side effects associated with anticancer drugs positively correlate with drug target genes, drug metabolism genes, and disease indications. Copyright © 2014 Elsevier Inc. All rights reserved.

  8. [Exploration and construction of the full-text database of acupuncture literature in the Republic of China].

    Science.gov (United States)

    Fei, Lin; Zhao, Jing; Leng, Jiahao; Zhang, Shujian

    2017-10-12

    The ALIPORC full-text database is targeted at a specific full-text database of acupuncture literature in the Republic of China. Starting in 2015, till now, the database has been getting completed, focusing on books relevant with acupuncture, articles and advertising documents, accomplished or published in the Republic of China. The construction of this database aims to achieve the source sharing of acupuncture medical literature in the Republic of China through the retrieval approaches to diversity and accurate content presentation, contributes to the exchange of scholars, reduces the paper damage caused by paging and simplify the retrieval of the rare literature. The writers have made the explanation of the database in light of sources, characteristics and current situation of construction; and have discussed on improving the efficiency and integrity of the database and deepening the development of acupuncture literature in the Republic of China.

  9. How accessibility influences citation counts: The case of citations to the full text articles available from ResearchGate

    Directory of Open Access Journals (Sweden)

    Mohammad Sababi

    2017-08-01

    Full Text Available It is generally believed that the number of citations to an article can positively be correlated to its free online availability. In the present study, we investigated the possible impact of academic social networks on the number of citations. We chose the social web service “ResearchGate” as a case. This website acts both as a social network to connect researchers, and at the same time, as an open access repository to publish post-print version of the accepted manuscripts and final versions of open access articles. We collected the data of 1823 articles published by the authors from four different universities. By analyzing these data, we showed that although different levels of full text availability are observed for the four universities, there is always a significant positive correlation between full text availability and the citation count. Moreover, we showed that both post-print version and publisher’s version (i.e., final published version of the archived manuscripts receive more citations than non-OA articles, and the difference in the citation counts of post-print manuscripts and publisher’s version articles is nonsignificant.

  10. Full-Length Sequence of Mouse Acupuncture-Induced 1-L (Aig1l Gene Including Its Transcriptional Start Site

    Directory of Open Access Journals (Sweden)

    Mika Ohta

    2011-01-01

    Full Text Available We have been investigating the molecular efficacy of electroacupuncture (EA, which is one type of acupuncture therapy. In our previous molecular biological study of acupuncture, we found an EA-induced gene, named acupuncture-induced 1-L (Aig1l, in mouse skeletal muscle. The aims of this study consisted of identification of the full-length cDNA sequence of Aig1l including the transcriptional start site, determination of the tissue distribution of Aig1l and analysis of the effect of EA on Aig1l gene expression. We determined the complete cDNA sequence including the transcriptional start site via cDNA cloning with the cap site hunting method. We then analyzed the tissue distribution of Aig1l by means of northern blot analysis and real-time quantitative polymerase chain reaction. We used the semiquantitative reverse transcriptase-polymerase chain reaction to examine the effect of EA on Aig1l gene expression. Our results showed that the complete cDNA sequence of Aig1l was 6073 bp long, and the putative protein consisted of 962 amino acids. All seven tissues that we analyzed expressed the Aig1l gene. In skeletal muscle, EA induced expression of the Aig1l gene, with high expression observed after 3 hours of EA. Our findings thus suggest that the Aig1l gene may play a key role in the molecular mechanisms of EA efficacy.

  11. Identifying Scientific Project-generated Data Citation from Full-text Articles: An Investigation of TCGA Data Citation

    Directory of Open Access Journals (Sweden)

    Jiao Li

    2016-06-01

    Full Text Available Purpose: In the open science era, it is typical to share project-generated scientific data by depositing it in an open and accessible database. Moreover, scientific publications are preserved in a digital library archive. It is challenging to identify the data usage that is mentioned in literature and associate it with its source. Here, we investigated the data usage of a government-funded cancer genomics project, The Cancer Genome Atlas (TCGA, via a full-text literature analysis. Design/methodology/approach: We focused on identifying articles using the TCGA dataset and constructing linkages between the articles and the specific TCGA dataset. First, we collected 5,372 TCGA-related articles from PubMed Central (PMC. Second, we constructed a benchmark set with 25 full-text articles that truly used the TCGA data in their studies, and we summarized the key features of the benchmark set. Third, the key features were applied to the remaining PMC full-text articles that were collected from PMC. Findings: The amount of publications that use TCGA data has increased significantly since 2011, although the TCGA project was launched in 2005. Additionally, we found that the critical areas of focus in the studies that use the TCGA data were glioblastoma multiforme, lung cancer, and breast cancer; meanwhile, data from the RNA-sequencing (RNA-seq platform is the most preferable for use. Research limitations: The current workflow to identify articles that truly used TCGA data is labor-intensive. An automatic method is expected to improve the performance. Practical implications: This study will help cancer genomics researchers determine the latest advancements in cancer molecular therapy, and it will promote data sharing and data-intensive scientific discovery. Originality/value: Few studies have been conducted to investigate data usage by government-funded projects/programs since their launch. In this preliminary study, we extracted articles that use TCGA data

  12. FULIR Full-text Institutional Repository of the Ruđer Bošković Institute

    Directory of Open Access Journals (Sweden)

    Macan, B.

    2014-11-01

    Full Text Available Institutional repositories are online platforms for gathering, archiving, dissemination and long-term preservation of an institution’s scientific output. Archiving full-text papers into the institutional and other open access repositories (OAR is one way of achieving open access to scientific informa tion, the so-called “green” OA. In 2006 the idea of implementing an institutional repository at the Ruđer Bošković Institute Library was initiated and one year later, the project had started, parallel with the subproject of digitizing RBI’s documentary materials. 4 Various open source solutions were tested, among which EPrints software was chosen as the most suitable to RBI’s needs. The first documents were archived in March 2011, and on October 18, 2012, the official version of the repository was published under the name Full-text Institutional Repository of the Ruđer Bošković Institute – FULIR. The goal of FULIR is to gather, archive, disseminate and preserve, for the long term, the whole scientific production and documentary materials of the Institute in digital form, as well as provide open access to archived materials where possible. It is possible to archive all kinds of materials in FULIR, such as articles published in journals, conference proceedings, books, book chapters, dissertations, but also various kinds of unpublished materials such as posters and presentations from conferences or lectures, reports, audio and video records and datasets. FULIR is based on the principle that the scientists themselves archive their materials, but in this initial phase, the librarians are also assisting the scientists in archiving new items. The repository allows different access rights to archived materials and a depositor may define to whom (all users/only registered users (RBI staff/only repository administrators, and when (immediately or after a cer tain embargo period full-texts of archived materials will be available. Repository

  13. Large-scale automatic extraction of side effects associated with targeted anticancer drugs from full-text oncological articles.

    Science.gov (United States)

    Xu, Rong; Wang, QuanQiu

    2015-06-01

    Targeted anticancer drugs such as imatinib, trastuzumab and erlotinib dramatically improved treatment outcomes in cancer patients, however, these innovative agents are often associated with unexpected side effects. The pathophysiological mechanisms underlying these side effects are not well understood. The availability of a comprehensive knowledge base of side effects associated with targeted anticancer drugs has the potential to illuminate complex pathways underlying toxicities induced by these innovative drugs. While side effect association knowledge for targeted drugs exists in multiple heterogeneous data sources, published full-text oncological articles represent an important source of pivotal, investigational, and even failed trials in a variety of patient populations. In this study, we present an automatic process to extract targeted anticancer drug-associated side effects (drug-SE pairs) from a large number of high profile full-text oncological articles. We downloaded 13,855 full-text articles from the Journal of Oncology (JCO) published between 1983 and 2013. We developed text classification, relationship extraction, signaling filtering, and signal prioritization algorithms to extract drug-SE pairs from downloaded articles. We extracted a total of 26,264 drug-SE pairs with an average precision of 0.405, a recall of 0.899, and an F1 score of 0.465. We show that side effect knowledge from JCO articles is largely complementary to that from the US Food and Drug Administration (FDA) drug labels. Through integrative correlation analysis, we show that targeted drug-associated side effects positively correlate with their gene targets and disease indications. In conclusion, this unique database that we built from a large number of high-profile oncological articles could facilitate the development of computational models to understand toxic effects associated with targeted anticancer drugs. Copyright © 2015 Elsevier Inc. All rights reserved.

  14. Factors Affecting Subsequent Full-text Publication of Papers Presented at the Annual Conference of the Indian Academy of Pediatrics.

    Science.gov (United States)

    Khalil, Sumaira; Mishra, Devendra; Mishra, Ruchi; Gupta, Shalu

    2017-02-15

    To study the factors associated with the subsequent (over next 9 years) full-text publication of papers presented at the 44th National Conference of Indian Academy of Pediatrics (PEDICON), 2007. All papers presented at PEDICON 2007 were searched for subsequent full-text publication over the next 9 years in English-language journals by an internet-based search. The published papers were compared with the conference-abstracts. 74 (16%) of the 450 abstracts presented were subsequently published; 61 (82.4%) in Medline-indexed journals. Majority (50, 67.6%) of the papers was published within the first 36 mo in journals with mean (SD) impact factor of 2.62 (1.63). The factors significantly associated with subsequent publication were papers presented as award papers (Pfull-papers, 55% had a change in title; authors were changed in 65%, and participants' numbers were dissimilar in 8.6%. There is a need to identify the factors responsible for this low rate of subsequent publication, and interventions to improve it both at institutional and researchers' level.

  15. Acoustic measurements of a full-scale rotor with four tip shapes. Volume 1: Text, appendices A and B

    Science.gov (United States)

    Mosher, M.

    1984-01-01

    A full-scale helicopter with four different blade-tip geometries was tested in the 40- by 80-foot wind tunnel at Ames Research Center. Performance, loads, and noise were measured. The four tip shapes tested were rectangular, tapered, swept, and swept-tapered. Noise measurements from that test are presented in the form of tables and plots. The noise data include measurements of the sound pressure level in dB, dBA, and tone-corrected PNdB, for all of the conditions tested. Detailed measurements, 1/3-octave spectra and time-histories for some selected data are included as well as plots of dBA as function of test condition. Some performance measurements are given to aid interpretation of the noise data.

  16. Students' Learning Experiences from Didactic Teaching Sessions Including Patient Case Examples as Either Text or Video: A Qualitative Study.

    Science.gov (United States)

    Pedersen, Kamilla; Moeller, Martin Holdgaard; Paltved, Charlotte; Mors, Ole; Ringsted, Charlotte; Morcke, Anne Mette

    2017-10-06

    The aim of this study was to explore medical students' learning experiences from the didactic teaching formats using either text-based patient cases or video-based patient cases with similar content. The authors explored how the two different patient case formats influenced students' perceptions of psychiatric patients and students' reflections on meeting and communicating with psychiatric patients. The authors conducted group interviews with 30 medical students who volunteered to participate in interviews and applied inductive thematic content analysis to the transcribed interviews. Students taught with text-based patient cases emphasized excitement and drama towards the personal clinical narratives presented by the teachers during the course, but never referred to the patient cases. Authority and boundary setting were regarded as important in managing patients. Students taught with video-based patient cases, in contrast, often referred to the patient cases when highlighting new insights, including the importance of patient perspectives when communicating with patients. The format of patient cases included in teaching may have a substantial impact on students' patient-centeredness. Video-based patient cases are probably more effective than text-based patient cases in fostering patient-centered perspectives in medical students. Teachers sharing stories from their own clinical experiences stimulates both engagement and excitement, but may also provoke unintended stigma and influence an authoritative approach in medical students towards managing patients in clinical psychiatry.

  17. BioC-compatible full-text passage detection for protein-protein interactions using extended dependency graph.

    Science.gov (United States)

    Peng, Yifan; Arighi, Cecilia; Wu, Cathy H; Vijay-Shanker, K

    2016-01-01

    There has been a large growth in the number of biomedical publications that report experimental results. Many of these results concern detection of protein-protein interactions (PPI). In BioCreative V, we participated in the BioC task and developed a PPI system to detect text passages with PPIs in the full-text articles. By adopting the BioC format, the output of the system can be seamlessly added to the biocuration pipeline with little effort required for the system integration. A distinctive feature of our PPI system is that it utilizes extended dependency graph, an intermediate level of representation that attempts to abstract away syntactic variations in text. As a result, we are able to use only a limited set of rules to extract PPI pairs in the sentences, and additional rules to detect additional passages for PPI pairs. For evaluation, we used the 95 articles that were provided for the BioC annotation task. We retrieved the unique PPIs from the BioGRID database for these articles and show that our system achieves a recall of 83.5%. In order to evaluate the detection of passages with PPIs, we further annotated Abstract and Results sections of 20 documents from the dataset and show that an f-value of 80.5% was obtained. To evaluate the generalizability of the system, we also conducted experiments on AIMed, a well-known PPI corpus. We achieved an f-value of 76.1% for sentence detection and an f-value of 64.7% for unique PPI detection.Database URL: http://proteininformationresource.org/iprolink/corpora. © The Author(s) 2016. Published by Oxford University Press.

  18. The film’s the thing: film translation and its effect on a silent, edited and full text Hamlet The film’s the thing: film translation and its effect on a silent, edited and full text Hamlet

    Directory of Open Access Journals (Sweden)

    Janete R. Costa

    2008-04-01

    Full Text Available Translation is, at its best, a difficult path to tred, especially in a global, multicultural society. A word that defines an object may be in need of careful consideration and modification, not only to convey its individual meaning, but also to place it in the concept or intent when linked with others words forming a thought. The process is particularly complex when pairing a word with an image as is done in film. In the 1960’s, the American television classic, Star Trek, added new words as well as additional meaning to old words in the English lexicon. The definition of these words was clearly given in visual images that can still be recalled today. A typical exchange of dialogue may read: Captain, according to my tricorder, there is no intelligent life on this planet. Beam him up, Scotty. Energise. Translation is, at its best, a difficult path to tred, especially in a global, multicultural society. A word that defines an object may be in need of careful consideration and modification, not only to convey its individual meaning, but also to place it in the concept or intent when linked with others words forming a thought. The process is particularly complex when pairing a word with an image as is done in film. In the 1960’s, the American television classic, Star Trek, added new words as well as additional meaning to old words in the English lexicon. The definition of these words was clearly given in visual images that can still be recalled today. A typical exchange of dialogue may read: Captain, according to my tricorder, there is no intelligent life on this planet. Beam him up, Scotty. Energise.

  19. Construction of phosphorylation interaction networks by text mining of full-length articles using the eFIP system.

    Science.gov (United States)

    Tudor, Catalina O; Ross, Karen E; Li, Gang; Vijay-Shanker, K; Wu, Cathy H; Arighi, Cecilia N

    2015-01-01

    Protein phosphorylation is a reversible post-translational modification where a protein kinase adds a phosphate group to a protein, potentially regulating its function, localization and/or activity. Phosphorylation can affect protein-protein interactions (PPIs), abolishing interaction with previous binding partners or enabling new interactions. Extracting phosphorylation information coupled with PPI information from the scientific literature will facilitate the creation of phosphorylation interaction networks of kinases, substrates and interacting partners, toward knowledge discovery of functional outcomes of protein phosphorylation. Increasingly, PPI databases are interested in capturing the phosphorylation state of interacting partners. We have previously developed the eFIP (Extracting Functional Impact of Phosphorylation) text mining system, which identifies phosphorylated proteins and phosphorylation-dependent PPIs. In this work, we present several enhancements for the eFIP system: (i) text mining for full-length articles from the PubMed Central open-access collection; (ii) the integration of the RLIMS-P 2.0 system for the extraction of phosphorylation events with kinase, substrate and site information; (iii) the extension of the PPI module with new trigger words/phrases describing interactions and (iv) the addition of the iSimp tool for sentence simplification to aid in the matching of syntactic patterns. We enhance the website functionality to: (i) support searches based on protein roles (kinases, substrates, interacting partners) or using keywords; (ii) link protein entities to their corresponding UniProt identifiers if mapped and (iii) support visual exploration of phosphorylation interaction networks using Cytoscape. The evaluation of eFIP on full-length articles achieved 92.4% precision, 76.5% recall and 83.7% F-measure on 100 article sections. To demonstrate eFIP for knowledge extraction and discovery, we constructed phosphorylation-dependent interaction

  20. An exploratory analysis of PubMed's free full-text limit on citation retrieval for clinical questions.

    Science.gov (United States)

    Krieger, Mary M; Richter, Randy R; Austin, Tricia M

    2008-10-01

    The research sought to determine (1) how use of the PubMed free full-text (FFT) limit affects citation retrieval and (2) how use of the FFT limit impacts the types of articles and levels of evidence retrieved. Four clinical questions based on a research agenda for physical therapy were searched in PubMed both with and without the use of the FFT limit. Retrieved citations were examined for relevancy to each question. Abstracts of relevant citations were reviewed to determine the types of articles and levels of evidence. Descriptive analysis was used to compare the total number of citations, number of relevant citations, types of articles, and levels of evidence both with and without the use of the FFT limit. Across all 4 questions, the FFT limit reduced the number of citations to 11.1% of the total number of citations retrieved without the FFT limit. Additionally, high-quality evidence such as systematic reviews and randomized controlled trials were missed when the FFT limit was used. Health sciences librarians play a key role in educating users about the potential impact the FFT limit has on the number of citations, types of articles, and levels of evidence retrieved.

  1. Publication rates of full-text journal articles converted from abstracts presented during the 22(nd) Turkish National Urology Congress.

    Science.gov (United States)

    Kocaaslan, Ramazan; Kayalı, Yunus; Tok, Adem; Tepeler, Abdulkadir

    2016-03-01

    To analyze the publication rates of full-text journal articles converted from the abstracts presented in the 22(nd) Turkish National Urology Congress in 2012. A total of 576 abstracts accepted for presentation at the 22(nd) Turkish National Urology Association Meeting were identified from the published abstract book. The abstracts were categorized into subsections such as endourology and pediatric urology. The subsequent publication rate for the studies was evaluated by scanning PubMed Medline. Abstracts published before the proceedings were excluded from the study. The abstracts were categorized as being presented orally (n=155), by poster (n=421), or by video (n=78). Of the 28 (18.3%) of 155 oral and 34 (8.15%) of 421 poster presentations, were subsequently published in several journals until March 2015. The publication rates of the abstracts based on urology subsections were as follows: neurology (25%), andrology (18.6%), endourology (17.2%), urolithiasis (15.3%), general urology (12.5%), infectious diseases (7.14%), pediatric urology (6.25%), uro-gynecology (6.06%), reconstructive urology (5.8%), and urooncology (3.8%). The average time to publication was 11.77 (0-33) months. This is the first study assessing the publication rates of abstracts presented at a Turkish National Urology Congress. It reveals that more qualified randomized studies need to be done to improve the rate of publication.

  2. Saddle Slow Manifolds and Canard Orbits in [Formula: see text] and Application to the Full Hodgkin-Huxley Model.

    Science.gov (United States)

    Hasan, Cris R; Krauskopf, Bernd; Osinga, Hinke M

    2018-04-19

    Many physiological phenomena have the property that some variables evolve much faster than others. For example, neuron models typically involve observable differences in time scales. The Hodgkin-Huxley model is well known for explaining the ionic mechanism that generates the action potential in the squid giant axon. Rubin and Wechselberger (Biol. Cybern. 97:5-32, 2007) nondimensionalized this model and obtained a singularly perturbed system with two fast, two slow variables, and an explicit time-scale ratio ε. The dynamics of this system are complex and feature periodic orbits with a series of action potentials separated by small-amplitude oscillations (SAOs); also referred to as mixed-mode oscillations (MMOs). The slow dynamics of this system are organized by two-dimensional locally invariant manifolds called slow manifolds which can be either attracting or of saddle type.In this paper, we introduce a general approach for computing two-dimensional saddle slow manifolds and their stable and unstable fast manifolds. We also develop a technique for detecting and continuing associated canard orbits, which arise from the interaction between attracting and saddle slow manifolds, and provide a mechanism for the organization of SAOs in [Formula: see text]. We first test our approach with an extended four-dimensional normal form of a folded node. Our results demonstrate that our computations give reliable approximations of slow manifolds and canard orbits of this model. Our computational approach is then utilized to investigate the role of saddle slow manifolds and associated canard orbits of the full Hodgkin-Huxley model in organizing MMOs and determining the firing rates of action potentials. For ε sufficiently large, canard orbits are arranged in pairs of twin canard orbits with the same number of SAOs. We illustrate how twin canard orbits partition the attracting slow manifold into a number of ribbons that play the role of sectors of rotations. The upshot is that we

  3. ScienceCentral: open access full-text archive of scientific journals based on Journal Article Tag Suite regardless of their languages.

    Science.gov (United States)

    Huh, Sun

    2013-01-01

    ScienceCentral, a free or open access, full-text archive of scientific journal literature at the Korean Federation of Science and Technology Societies, was under test in September 2013. Since it is a Journal Article Tag Suite-based full text database, extensible markup language files of all languages can be presented, according to Unicode Transformation Format 8-bit encoding. It is comparable to PubMed Central: however, there are two distinct differences. First, its scope comprises all science fields; second, it accepts all language journals. Launching ScienceCentral is the first step for free access or open access academic scientific journals of all languages to leap to the world, including scientific journals from Croatia.

  4. Open-Source Tools for Enhancing Full-Text Searching of OPACs: Use of Koha, Greenstone and Fedora

    Science.gov (United States)

    Anuradha, K. T.; Sivakaminathan, R.; Kumar, P. Arun

    2011-01-01

    Purpose: There are many library automation packages available as open-source software, comprising two modules: staff-client module and online public access catalogue (OPAC). Although the OPAC of these library automation packages provides advanced features of searching and retrieval of bibliographic records, none of them facilitate full-text…

  5. Students’ Learning Experiences from Didactic Teaching Sessions Including Patient Case Examples as Either Text or Video: A Qualitative Study

    DEFF Research Database (Denmark)

    Pedersen, Kamilla; Holdgaard, Martin Møller; Paltved, Charlotte

    2017-01-01

    ' perceptions of psychiatric patients and students' reflections on meeting and communicating with psychiatric patients. METHODS: The authors conducted group interviews with 30 medical students who volunteered to participate in interviews and applied inductive thematic content analysis to the transcribed....... Students taught with video-based patient cases, in contrast, often referred to the patient cases when highlighting new insights, including the importance of patient perspectives when communicating with patients. CONCLUSION: The format of patient cases included in teaching may have a substantial impact...... unintended stigma and influence an authoritative approach in medical students towards managing patients in clinical psychiatry....

  6. A full-angle Monte-Carlo scattering technique including cumulative and single-event Rutherford scattering in plasmas

    Science.gov (United States)

    Higginson, Drew P.

    2017-11-01

    We describe and justify a full-angle scattering (FAS) method to faithfully reproduce the accumulated differential angular Rutherford scattering probability distribution function (pdf) of particles in a plasma. The FAS method splits the scattering events into two regions. At small angles it is described by cumulative scattering events resulting, via the central limit theorem, in a Gaussian-like pdf; at larger angles it is described by single-event scatters and retains a pdf that follows the form of the Rutherford differential cross-section. The FAS method is verified using discrete Monte-Carlo scattering simulations run at small timesteps to include each individual scattering event. We identify the FAS regime of interest as where the ratio of temporal/spatial scale-of-interest to slowing-down time/length is from 10-3 to 0.3-0.7; the upper limit corresponds to Coulomb logarithm of 20-2, respectively. Two test problems, high-velocity interpenetrating plasma flows and keV-temperature ion equilibration, are used to highlight systems where including FAS is important to capture relevant physics.

  7. Lithospheric layering in the North American craton revealed by including Short Period Constraints in Full Waveform Tomography

    Science.gov (United States)

    Roy, C.; Calo, M.; Bodin, T.; Romanowicz, B. A.

    2017-12-01

    Recent receiver function studies of the North American craton suggest the presence of significant layering within the cratonic lithosphere, with significant lateral variations in the depth of the velocity discontinuities. These structural boundaries have been confirmed recently using a transdimensional Markov Chain Monte Carlo approach (TMCMC), inverting surface wave dispersion data and converted phases simultaneously (Calò et al., 2016; Roy and Romanowicz 2017). The lateral resolution of upper mantle structure can be improved with a high density of broadband seismic stations, or with a sparse network using full waveform inversion based on numerical wavefield computation methods such as the Spectral Element Method (SEM). However, inverting for discontinuities with strong topography such as MLDS's or LAB, presents challenges in an inversion framework, both computationally, due to the short periods required, and from the point of view of stability of the inversion. To overcome these limitations, and to improve resolution of layering in the upper mantle, we are developing a methodology that combines full waveform inversion tomography and information provided by short period seismic observables. We have extended the 30 1D radially anisotropic shear velocity profiles of Calò et al. 2016 to several other stations, for which we used a recent shear velocity model (Clouzet et al., 2017) as constraint in the modeling. These 1D profiles, including both isotropic and anisotropic discontinuities in the upper mantle (above 300 km depth) are then used to build a 3D starting model for the full waveform tomographic inversion. This model is built after 1) homogenization of the layered 1D models and 2) interpolation between the 1D smooth profiles and the model of Clouzet et al. 2017, resulting in a smooth 3D starting model. Waveforms used in the inversion are filtered at periods longer than 30s. We use the SEM code "RegSEM" for forward computations and a quasi-Newton inversion

  8. The Dayton Agenda: Full Text

    Science.gov (United States)

    Journal of Research on Christian Education, 2009

    2009-01-01

    In November 1997, 140 researchers, administrators, and others interested in the support of nonpublic schools gathered at the University of Dayton to develop a research agenda for American private education. What developed over the several hours of intense sessions was an agenda that has given direction to researchers well into the 21st century.…

  9. An Observational Study of Abstracts Presented at the American College of Veterinary Surgeon Annual Meetings (2001-2008) and Their Subsequent Full-Text Publication.

    Science.gov (United States)

    Meyers, Katherine E; Lindem, Margaret J; Giuffrida, Michelle A

    2016-07-01

    To determine the frequency of abstracts presented at American College of Veterinary Surgeons (ACVS) meetings from 2001 to 2008 that were published as complete articles, to identify abstract characteristics associated with final full-text publication, and to examine consistency of information between abstracts and final full-text publications. Observational bibliographic study. Abstracts were retrieved from published proceedings. Published articles were retrieved from bibliographic databases. Features of abstract and article authorship, design, and content were recorded. Regression analysis identified abstract features associated with article publication, and evaluated consistency between abstracts and final publications. Seven hundred eighty-two of 1078 (73%) abstracts were published as complete articles. Median time to publication was 1 year; 90% were published within 3 years. Abstracts originating from academic institutions were published more often than abstracts from practice or industry sites (odds ratio 2.61, 95% confidence interval 1.68-4.05). Compared to their conference abstracts, 49% of articles contained major inconsistences including changes in study design, interventions, outcomes, sample size, and results. For each year elapsed between presentation and publication, the odds of major inconsistency increased 2.4 times (odds ratio 2.36, 95% confidence interval 1.57-3.55) for retrospective studies and 1.4 times (odds ratio 1.35, 95% confidence interval 1.17-1.56) for other study designs. Changes in study title and authorship were frequent, particularly in publications that contained major inconsistencies. ACVS abstracts were promptly and reliably published, but final full-text publications often differed substantially from the original abstracts. © Copyright 2016 by The American College of Veterinary Surgeons.

  10. The contribution of CT to perinatal intracranial hemorrhage including that accompanying apparently uncomplicated delivery at full term

    International Nuclear Information System (INIS)

    Baleriaux, D.; Ticket, L.; Dony, D.; Jeanmart, L.

    1980-01-01

    The clinical and CT evolution of intracranial bleeding in six neonates (four full-term and two premature) has been followed. Clinical symptoms, risk factors, evolution, sequellae and prognostic factors of intracranial hemorrhage in the newborn are reviewed. The routine use of CT for detection and survey of perinatal hemorrhage and its sequellae is recommended. (orig.)

  11. A comparison of the accuracy of clinical decisions based on full-text articles and on journal abstracts alone: a study among residents in a tertiary care hospital.

    Science.gov (United States)

    Marcelo, Alvin; Gavino, Alex; Isip-Tan, Iris Thiele; Apostol-Nicodemus, Leilanie; Mesa-Gaerlan, Faith Joan; Firaza, Paul Nimrod; Faustorilla, John Francis; Callaghan, Fiona M; Fontelo, Paul

    2013-04-01

    Many clinicians depend solely on journal abstracts to guide clinical decisions. This study aims to determine if there are differences in the accuracy of responses to simulated cases between resident physicians provided with an abstract only and those with full-text articles. It also attempts to describe their information-seeking behaviour. Seventy-seven resident physicians from four specialty departments of a tertiary care hospital completed a paper-based questionnaire with clinical simulation cases, then randomly assigned to two intervention groups-access to abstracts-only and access to both abstracts and full-text. While having access to medical literature, they completed an online version of the same questionnaire. The average improvement across departments was not significantly different between the abstracts-only group and the full-text group (p=0.44), but when accounting for an interaction between intervention and department, the effect was significant (p=0.049) with improvement greater with full-text in the surgery department. Overall, the accuracy of responses was greater after the provision of either abstracts-only or full-text (pfull-text articles were more accurate than those guided by abstracts alone, but the results seem to be driven by a significant difference in one department.

  12. Progress in the Full-Text Publication Rate of Orthopaedic and Sport Physical Therapy Abstracts Presented at the American Physical Therapy Association's Combined Sections Meeting.

    Science.gov (United States)

    Warden, Stuart J; Fletcher, Jacquelyn M; Barker, Rick G; Guildenbecher, Elizabeth A; Gorkis, Colleen E; Thompson, William R

    2017-10-07

    Study Design Descriptive study. Background Professional meetings, such as the American Physical Therapy Association's (APTA's) Combined Sections Meeting (CSM), provide forums for sharing information. However, it was reported that only one-quarter of orthopaedic and sports physical therapy abstracts presented at the CSM between 2000 and 2004 went on to full-text publication. This low conversion rate raises a number of concerns regarding the full dissemination of work within the profession. Objectives The purpose of this study was to determine the full-text publication rate of work presented in abstract form at subsequent CSMs and investigate factors influencing the rate. Methods A systematic search was undertaken to locate full-text publications of orthopaedic and sports physical therapy abstracts presented at CSMs between 2005 and 2011. Eligible publications were published within 5 years following abstract presentation. The influences of year of abstract presentation, APTA section, presentation type, institution of origin, study design, and study significance were assessed. Results Over one-third (38.6%) of presented abstracts progressed to full-text publication. Odds of full-text publication increased if the abstract was presented as a platform presentation, originated from a doctorate-granting institution, reported findings of an experimental study, or reported a statistically significant finding. Conclusion The full-text publication rate for orthopaedic and sports physical therapy abstracts presented at recent CSMs has increased by over 50% compared to that reported for the preceding period. The rate is now in the range of that reported in comparable clinical disciplines, demonstrating important progress in the full dissemination of work within the profession. J Orthop Sports Phys Ther, Epub 7 Oct 2017. doi:10.2519/jospt.2018.7581.

  13. The effect of coverings, including plastic bags and wraps, on mortality and morbidity in preterm and full-term neonates.

    Science.gov (United States)

    Oatley, H K; Blencowe, H; Lawn, J E

    2016-05-01

    Neonatal hypothermia is an important risk factor for mortality and morbidity, and is common even in temperate climates. We conducted a systematic review to determine whether plastic coverings, used immediately following delivery, were effective in reducing the incidence of mortality, hypothermia and morbidity. A total of 26 studies (2271 preterm and 1003 term neonates) were included. Meta-analyses were conducted as appropriate. Plastic wraps were associated with a reduction in hypothermia in preterm (⩽29 weeks; risk ratio (RR)=0.57; 95% confidence interval (CI) 0.46 to 0.71) and term neonates (RR=0.76; 95% CI 0.60 to 0.96). No significant reduction in neonatal mortality or morbidity was found; however, the studies were underpowered for these outcomes. For neonates, especially preterm, plastic wraps combined with other environmental heat sources are effective in reducing hypothermia during stabilization and transfer within hospital. Further research is needed to quantify the effects on mortality or morbidity, and investigate the use of plastic coverings outside hospital settings or without additional heat sources.

  14. The Protein-Protein Interaction tasks of BioCreative III: classification/ranking of articles and linking bio-ontology concepts to full text.

    Science.gov (United States)

    Krallinger, Martin; Vazquez, Miguel; Leitner, Florian; Salgado, David; Chatr-Aryamontri, Andrew; Winter, Andrew; Perfetto, Livia; Briganti, Leonardo; Licata, Luana; Iannuccelli, Marta; Castagnoli, Luisa; Cesareni, Gianni; Tyers, Mike; Schneider, Gerold; Rinaldi, Fabio; Leaman, Robert; Gonzalez, Graciela; Matos, Sergio; Kim, Sun; Wilbur, W John; Rocha, Luis; Shatkay, Hagit; Tendulkar, Ashish V; Agarwal, Shashank; Liu, Feifan; Wang, Xinglong; Rak, Rafal; Noto, Keith; Elkan, Charles; Lu, Zhiyong; Dogan, Rezarta Islamaj; Fontaine, Jean-Fred; Andrade-Navarro, Miguel A; Valencia, Alfonso

    2011-10-03

    Determining usefulness of biomedical text mining systems requires realistic task definition and data selection criteria without artificial constraints, measuring performance aspects that go beyond traditional metrics. The BioCreative III Protein-Protein Interaction (PPI) tasks were motivated by such considerations, trying to address aspects including how the end user would oversee the generated output, for instance by providing ranked results, textual evidence for human interpretation or measuring time savings by using automated systems. Detecting articles describing complex biological events like PPIs was addressed in the Article Classification Task (ACT), where participants were asked to implement tools for detecting PPI-describing abstracts. Therefore the BCIII-ACT corpus was provided, which includes a training, development and test set of over 12,000 PPI relevant and non-relevant PubMed abstracts labeled manually by domain experts and recording also the human classification times. The Interaction Method Task (IMT) went beyond abstracts and required mining for associations between more than 3,500 full text articles and interaction detection method ontology concepts that had been applied to detect the PPIs reported in them. A total of 11 teams participated in at least one of the two PPI tasks (10 in ACT and 8 in the IMT) and a total of 62 persons were involved either as participants or in preparing data sets/evaluating these tasks. Per task, each team was allowed to submit five runs offline and another five online via the BioCreative Meta-Server. From the 52 runs submitted for the ACT, the highest Matthew's Correlation Coefficient (MCC) score measured was 0.55 at an accuracy of 89% and the best AUC iP/R was 68%. Most ACT teams explored machine learning methods, some of them also used lexical resources like MeSH terms, PSI-MI concepts or particular lists of verbs and nouns, some integrated NER approaches. For the IMT, a total of 42 runs were evaluated by comparing

  15. The Protein-Protein Interaction tasks of BioCreative III: classification/ranking of articles and linking bio-ontology concepts to full text

    Science.gov (United States)

    2011-01-01

    Background Determining usefulness of biomedical text mining systems requires realistic task definition and data selection criteria without artificial constraints, measuring performance aspects that go beyond traditional metrics. The BioCreative III Protein-Protein Interaction (PPI) tasks were motivated by such considerations, trying to address aspects including how the end user would oversee the generated output, for instance by providing ranked results, textual evidence for human interpretation or measuring time savings by using automated systems. Detecting articles describing complex biological events like PPIs was addressed in the Article Classification Task (ACT), where participants were asked to implement tools for detecting PPI-describing abstracts. Therefore the BCIII-ACT corpus was provided, which includes a training, development and test set of over 12,000 PPI relevant and non-relevant PubMed abstracts labeled manually by domain experts and recording also the human classification times. The Interaction Method Task (IMT) went beyond abstracts and required mining for associations between more than 3,500 full text articles and interaction detection method ontology concepts that had been applied to detect the PPIs reported in them. Results A total of 11 teams participated in at least one of the two PPI tasks (10 in ACT and 8 in the IMT) and a total of 62 persons were involved either as participants or in preparing data sets/evaluating these tasks. Per task, each team was allowed to submit five runs offline and another five online via the BioCreative Meta-Server. From the 52 runs submitted for the ACT, the highest Matthew's Correlation Coefficient (MCC) score measured was 0.55 at an accuracy of 89% and the best AUC iP/R was 68%. Most ACT teams explored machine learning methods, some of them also used lexical resources like MeSH terms, PSI-MI concepts or particular lists of verbs and nouns, some integrated NER approaches. For the IMT, a total of 42 runs were

  16. Full text publication rates of research abstracts presented at the European Society of Endodontology (ESE) Congresses in the last 20 years.

    Science.gov (United States)

    Tzanetakis, G N; Tzimpoulas, N; Floratos, S; Agrafioti, A; Kontakiotis, E G; Shemesh, H

    2017-06-26

    To evaluate the full-text publication rates of scientific research abstracts presented at the European Society of Endodontology (ESE) Congresses held between 1993 and 2013 (a total of 11 occasions) and to determine factors associated with the manuscripts. An electronic database search was conducted from January 2015 to December 2016 to identify full text English written publications of the research abstracts presented at the last 11 ESE Biennial Congresses from 1993 to 2013. For each occasion, research abstract information were retrieved from the International Endodontic Journal (IEJ) through the official website of the ESE and the following parameters for each abstract presentation were recorded: Year of presentation, first author's affiliation, geographic origin, and type of study. Following full-text article identification, additional information was recorded such as: Year and journal of publication, elapsed time until full publication and number of authors per presentation and publication. A total of 1165 research abstracts were presented, of which 401 (34.4%) were finally published as full-length articles. Overall 235 articles (58.6%) were published either in the International Endodontic Journal (IEJ, 35.7%) or Journal of Endodontics (JOE, 22.9%). The mean time between abstract presentation and full-text publication was 18.95 months. Munich (2001) had the highest publication rate (44%) whereas Lisbon (2013) had the highest number of published articles (77). Turkey was the country with the highest number of published abstracts (56). However, the Netherlands was the country with the highest number of publications related to the number of presentations (21/26) (80.7%). Differences in authorship between presentation and full publication were found in 179 (44.6%) articles. A substantial number of research abstracts presented at ESE congresses were not published in peer reviewed journals. Authors prefer to publish their research papers in international journals with

  17. A corpus of full-text journal articles is a robust evaluation tool for revealing differences in performance of biomedical natural language processing tools.

    Science.gov (United States)

    Verspoor, Karin; Cohen, Kevin Bretonnel; Lanfranchi, Arrick; Warner, Colin; Johnson, Helen L; Roeder, Christophe; Choi, Jinho D; Funk, Christopher; Malenkiy, Yuriy; Eckert, Miriam; Xue, Nianwen; Baumgartner, William A; Bada, Michael; Palmer, Martha; Hunter, Lawrence E

    2012-08-17

    We introduce the linguistic annotation of a corpus of 97 full-text biomedical publications, known as the Colorado Richly Annotated Full Text (CRAFT) corpus. We further assess the performance of existing tools for performing sentence splitting, tokenization, syntactic parsing, and named entity recognition on this corpus. Many biomedical natural language processing systems demonstrated large differences between their previously published results and their performance on the CRAFT corpus when tested with the publicly available models or rule sets. Trainable systems differed widely with respect to their ability to build high-performing models based on this data. The finding that some systems were able to train high-performing models based on this corpus is additional evidence, beyond high inter-annotator agreement, that the quality of the CRAFT corpus is high. The overall poor performance of various systems indicates that considerable work needs to be done to enable natural language processing systems to work well when the input is full-text journal articles. The CRAFT corpus provides a valuable resource to the biomedical natural language processing community for evaluation and training of new models for biomedical full text publications.

  18. Fate of abstracts presented at a National Turkish Orthopedics and Traumatology Congress: publication rates and consistency of abstracts compared with their subsequent full-text publications.

    Science.gov (United States)

    Yalçınkaya, Merter; Bagatur, Erdem

    2013-01-01

    The aim of this study was to evaluate the publication rates of full-text articles after presentation of abstracts at a Turkish National Orthopaedics and Traumatology Congress, determine the time lag from the congress date to publication of full-text articles and assess the consistency between abstracts and the subsequent publications. All abstracts from the scientific program of the 20th Turkish National Orthopaedics and Traumatology Congress (2007) were identified and computerized PubMed searches were conducted to determine whether an abstract had been followed by publication of a full-text article and key features were compared to evaluate their consistency. The time lag to publication and the impact factors of the journals where the articles were published were noted. Of the 770 abstracts (264 oral, 506 poster presentations), 227 (29.5%) were followed by a full-text and 116 (44%) of the 264 oral and 111 (22%) of the 506 poster presentations were published. The mean time to publication was 14.9±16.075 (range: 33 to 55) months. Thirty-three (14.5%) were published prior to the presentation at the congress. The likelihood of publication decreased after the third year (26 of 227, 11.5%). A total of 182 (80.2%) articles showed inconsistencies with the abstract; 74 (32.6%) minor, 14 (6.2%) major, and 94 (41.4%) minor and major inconsistencies. The mean impact factor of the journals was 1.152±0.858. The vast majority of abstracts presented at this congress were not followed by publication of a full-text article. Additionally, frequent inconsistencies between the final published article and the original abstract indicated the inadequacy of quality of reporting in abstracts.

  19. Multi-stage gene normalization for full-text articles with context-based species filtering for dynamic dictionary entry selection.

    Science.gov (United States)

    Tsai, Richard Tzong-Han; Lai, Po-Ting

    2011-10-03

    Gene normalization (GN) is the task of identifying the unique database IDs of genes and proteins in literature. The best-known public competition of GN systems is the GN task of the BioCreative challenge, which has been held four times since 2003. The last two BioCreatives, II.5 & III, had two significant differences from earlier tasks: firstly, they provided full-length articles in addition to abstracts; and secondly, they included multiple species without providing species ID information. Full papers introduce more complex targets for GN processing, while the inclusion of multiple species vastly increases the potential size of dictionaries needed for GN. BioCreative III GN uses Threshold Average Precision at a median of k errors per query (TAP-k), a new measure closely related to the well-known average precision, but also reflecting the reliability of the score provided by each GN system. To use full-paper text, we employed a multi-stage GN algorithm and a ranking method which exploit information in different sections and parts of a paper. To handle the inclusion of multiple unknown species, we developed two context-based dynamic strategies to select dictionary entries related to the species that appear in the paper-section-wide and article-wide context. Our originally submitted BioCreative III system uses a static dictionary containing only the most common species entries. It already exceeds the BioCreative III average team performance by at least 24% in every evaluation. However, using our proposed dynamic dictionary strategies, we were able to further improve TAP-5, TAP-10, and TAP-20 by 16.47%, 13.57% and 6.01%, respectively in the Gold 50 test set. Our best dynamic strategy outperforms the best BioCreative III systems in TAP-10 on the Silver 50 test set and in TAP-5 on the Silver 507 set. Our experimental results demonstrate the superiority of our proposed dynamic dictionary selection strategies over our original static strategy and most BioCreative III

  20. Discrepancies between Abstracts Presented at International Association for Dental Research Annual Sessions from 2004 to 2005 and Full-Text Publication.

    Science.gov (United States)

    Prasad, Soni; Lee, Damian J; Yuan, Judy Chia-Chun; Barao, Valentim A R; Shyamsunder, Nodesh; Sukotjo, Cortino

    2012-01-01

    Purpose. The purpose of this study was to evaluate the discrepancies between abstracts presented at the IADR meeting (2004-2005) and their full-text publication. Material and Methods. Abstracts from the Prosthodontic Section of IADR meeting were obtained. The following information was collected: abstract title, number of authors, study design, statistical analysis, outcome, and funding source. PubMed was used to identify the full-text publication of the abstracts. The discrepancies between the abstract and the full-text publication were examined, categorized as major and minor discrepancies, and quantified. The data were collected and analyzed using descriptive analysis. Frequency and percentage of major and minor discrepancies were calculated. Results. A total of 109 (95.6%) articles showed changes from their abstracts. Seventy-four (65.0%) and 105 (92.0%) publications had at least one major and one minor discrepancies, respectively. Minor discrepancies were more prevalent (92.0%) than major discrepancies (65.0%). The most common minor discrepancy was observed in the title (80.7%), and most common major discrepancies were seen in results (48.2%). Conclusion. Minor discrepancies were more prevalent than major discrepancies. The data presented in this study may be useful to establish a more comprehensive structured abstract requirement for future meetings.

  1. Analysis of full-text publication and publishing predictors of abstracts presented at an Italian public health meeting (2005-2007).

    Science.gov (United States)

    Castaldi, S; Giacometti, M; Toigo, W; Bert, F; Siliquini, R

    2015-09-29

    In Public Health, a thorough review of abstract quality evaluations and the publication history of studies presented at scientific meetings has never been conducted. To analyse the long-term outcome of quality abstracts submitted to conferences of Italian Society of Hygiene and Public Health (SItI) from 2005 to 2007, we conducted a second analysis of previously published material aiming to estimate full-text publication rate of high quality abstract presented at Italian public health meetings, and to identify predictors of full-text publication. The search was undertaken through scientific databases and search engines and through the web sites of the major Italian journals of Public Health. For each publication confirmed as a full text paper, the journal name, impact factor, year of publication, gender of the first author, type of study design, characteristics of the results and sample size were collected. The overall publication rate of the abstracts presented is 23.5%; most of the papers were published in Public Health journals (average impact factor: 3.007). Non universitary affiliation had resulted in a lower probability of publication, while some of the Conference topics had predisposed the studies to an increased likelihood of publication as well as poster form presentation. The method presented in this study provides a good framework for the evaluation of the scientific evidence. The findings achieved should be taken into consideration by the Scientific Societies during the contributions selection phase, with the aim of achieving a continuous improvement of work quality. In the future, it would be interesting to survey the abstract authors to identify reasons for unpublished data.

  2. Spelling is just a click away – a user-centered brain-computer interface including auto-calibration and predictive text entry

    Directory of Open Access Journals (Sweden)

    Tobias eKaufmann

    2012-05-01

    Full Text Available Brain Computer Interfaces (BCI based on event-related potentials (ERP allow for selection of characters from a visually presented character-matrix and thus provide a communication channel for users with neurodegenerative disease. Although they have been topic of research for more than 20 years and were multiply proven to be a reliable communication method, BCIs are almost exclusively used in experimental settings, handled by qualified experts. This study investigates if ERP-BCIs can be handled independently by laymen without expert interference, which is inevitable for establishing BCIs in end-user’s daily life situations. Furthermore we compared the classic character-by-character text entry against a predictive text entry (PTE that directly incorporates predictive text into the character matrix. N=19 BCI novices handled a user-centred ERP-BCI application on their own without expert interference. The software individually adjusted classifier weights and control parameters in the background, invisible to the user (auto-calibration. All participants were able to operate the software on their own and to twice correctly spell a sentence with the auto-calibrated classifier (once with PTE, once without. Our PTE increased spelling speed and importantly did not reduce accuracy. In sum, this study demonstrates feasibility of auto-calibrating ERP-BCI use, independently by laymen and the strong benefit of integrating predictive text directly into the character-matrix.

  3. http://www.isarder.org/isardercom/2013vol5issue1/vol5_issue1_article06full_text.PDF

    Directory of Open Access Journals (Sweden)

    Suat TEKER

    2013-03-01

    Full Text Available This study has pointed out that a new version of Build-Operate-Transfer (BOT financing model generated out from classical BOT model can be used for highway financing. The classical BOT, one of the most popular PPP models has been oftenly employed by various countries for financing of large scale public projects. Over the last 20 year period a number of infrastructure projects in Turkey such as natural gas plants, airports and hydro electric power plants were constructed by using BOT model. In this study, the new version of BOT model is implemented on the projected Ankara-İzmir Highway Project. This highway project can be constructed at a lower project cost by using the suggested BOT model compared to the classical BOT model. Therefore, the lower project cost results in a lower toll rate.

  4. Conversion rates of abstracts presented at the Urological Society of Australia and New Zealand (USANZ) Annual Scientific Meeting into full-text journal articles.

    Science.gov (United States)

    Yoon, Peter D; Chalasani, Venu; Woo, Henry H

    2012-08-01

    What's known on the subject? and What does the study add? It is well known that the transition of a presented abstract in a scientific meeting to a journal article improves the quality of the meeting and prevents an abstract being incorporated into meta-analyses or practice guidelines without proper appraisal. This is the first analysis of USANZ Annual Scientific Meeting abstracts' conversion to full publication. With relatively low publication rates compared to other international meetings, this review identifies the need for mechanisms to encourage USANZ researchers to convert their abstracts into published articles. The numbers and characteristics of the abstracts presented at the Annual Scientific Meetings (ASM) of the Urological Society of Australia and New Zealand (USANZ) that are converted to peer-reviewed publications have not previously been analysed and published. We undertook a review of all abstracts presented at the USANZ ASM from 2005 to 2009. A PubMed search was performed between 15 June and 15 July 2012, using a search algorithm to identify the full-text publications of the presented abstracts. Correlation between abstract characteristics and publication rate was then examined to distinguish the predictors for publications. Of 614 abstracts that were presented at USANZ ASM between 2005 and 2009, 183 papers were published, giving a publication rate of 29.80%. The papers were predominantly published in urological journals and were more likely to be published if they were presented by an international author or were retrospective studies or if basic science research. The mean (SD) time to publication was 14.46 (13.89) months and the mean Impact Factor of journals where papers were published was 2.90. The overall publication rate was relatively low compared with other urological meetings held in America and Europe. USANZ has a challenge of encouraging higher-quality research from the authors to further enhance its publication rate and consequently the

  5. Molecular comparisons of full length metapneumovirus (MPV genomes, including newly determined French AMPV-C and -D isolates, further supports possible subclassification within the MPV Genus.

    Directory of Open Access Journals (Sweden)

    Paul A Brown

    Full Text Available Four avian metapneumovirus (AMPV subgroups (A-D have been reported previously based on genetic and antigenic differences. However, until now full length sequences of the only known isolates of European subgroup C and subgroup D viruses (duck and turkey origin, respectively have been unavailable. These full length sequences were determined and compared with other full length AMPV and human metapneumoviruses (HMPV sequences reported previously, using phylogenetics, comparisons of nucleic and amino acid sequences and study of codon usage bias. Results confirmed that subgroup C viruses were more closely related to HMPV than they were to the other AMPV subgroups in the study. This was consistent with previous findings using partial genome sequences. Closer relationships between AMPV-A, B and D were also evident throughout the majority of results. Three metapneumovirus "clusters" HMPV, AMPV-C and AMPV-A, B and D were further supported by codon bias and phylogenetics. The data presented here together with those of previous studies describing antigenic relationships also between AMPV-A, B and D and between AMPV-C and HMPV may call for a subclassification of metapneumoviruses similar to that used for avian paramyxoviruses, grouping AMPV-A, B and D as type I metapneumoviruses and AMPV-C and HMPV as type II.

  6. An RNA-Seq strategy to detect the complete coding and non-coding transcriptome including full-length imprinted macro ncRNAs.

    Directory of Open Access Journals (Sweden)

    Ru Huang

    Full Text Available Imprinted macro non-protein-coding (nc RNAs are cis-repressor transcripts that silence multiple genes in at least three imprinted gene clusters in the mouse genome. Similar macro or long ncRNAs are abundant in the mammalian genome. Here we present the full coding and non-coding transcriptome of two mouse tissues: differentiated ES cells and fetal head using an optimized RNA-Seq strategy. The data produced is highly reproducible in different sequencing locations and is able to detect the full length of imprinted macro ncRNAs such as Airn and Kcnq1ot1, whose length ranges between 80-118 kb. Transcripts show a more uniform read coverage when RNA is fragmented with RNA hydrolysis compared with cDNA fragmentation by shearing. Irrespective of the fragmentation method, all coding and non-coding transcripts longer than 8 kb show a gradual loss of sequencing tags towards the 3' end. Comparisons to published RNA-Seq datasets show that the strategy presented here is more efficient in detecting known functional imprinted macro ncRNAs and also indicate that standardization of RNA preparation protocols would increase the comparability of the transcriptome between different RNA-Seq datasets.

  7. Spelling is Just a Click Away - A User-Centered Brain-Computer Interface Including Auto-Calibration and Predictive Text Entry.

    Science.gov (United States)

    Kaufmann, Tobias; Völker, Stefan; Gunesch, Laura; Kübler, Andrea

    2012-01-01

    Brain-computer interfaces (BCI) based on event-related potentials (ERP) allow for selection of characters from a visually presented character-matrix and thus provide a communication channel for users with neurodegenerative disease. Although they have been topic of research for more than 20 years and were multiply proven to be a reliable communication method, BCIs are almost exclusively used in experimental settings, handled by qualified experts. This study investigates if ERP-BCIs can be handled independently by laymen without expert support, which is inevitable for establishing BCIs in end-user's daily life situations. Furthermore we compared the classic character-by-character text entry against a predictive text entry (PTE) that directly incorporates predictive text into the character-matrix. N = 19 BCI novices handled a user-centered ERP-BCI application on their own without expert support. The software individually adjusted classifier weights and control parameters in the background, invisible to the user (auto-calibration). All participants were able to operate the software on their own and to twice correctly spell a sentence with the auto-calibrated classifier (once with PTE, once without). Our PTE increased spelling speed and, importantly, did not reduce accuracy. In sum, this study demonstrates feasibility of auto-calibrating ERP-BCI use, independently by laymen and the strong benefit of integrating predictive text directly into the character-matrix.

  8. Systematic review finds that study data not published in full text articles have unclear impact on meta-analyses results in medical research.

    Science.gov (United States)

    Schmucker, Christine M; Blümle, Anette; Schell, Lisa K; Schwarzer, Guido; Oeller, Patrick; Cabrera, Laura; von Elm, Erik; Briel, Matthias; Meerpohl, Joerg J

    2017-01-01

    A meta-analysis as part of a systematic review aims to provide a thorough, comprehensive and unbiased statistical summary of data from the literature. However, relevant study results could be missing from a meta-analysis because of selective publication and inadequate dissemination. If missing outcome data differ systematically from published ones, a meta-analysis will be biased with an inaccurate assessment of the intervention effect. As part of the EU-funded OPEN project (www.open-project.eu) we conducted a systematic review that assessed whether the inclusion of data that were not published at all and/or published only in the grey literature influences pooled effect estimates in meta-analyses and leads to different interpretation. Systematic review of published literature (methodological research projects). Four bibliographic databases were searched up to February 2016 without restriction of publication year or language. Methodological research projects were considered eligible for inclusion if they reviewed a cohort of meta-analyses which (i) compared pooled effect estimates of meta-analyses of health care interventions according to publication status of data or (ii) examined whether the inclusion of unpublished or grey literature data impacts the result of a meta-analysis. Seven methodological research projects including 187 meta-analyses comparing pooled treatment effect estimates according to different publication status were identified. Two research projects showed that published data showed larger pooled treatment effects in favour of the intervention than unpublished or grey literature data (Ratio of ORs 1.15, 95% CI 1.04-1.28 and 1.34, 95% CI 1.09-1.66). In the remaining research projects pooled effect estimates and/or overall findings were not significantly changed by the inclusion of unpublished and/or grey literature data. The precision of the pooled estimate was increased with narrower 95% confidence interval. Although we may anticipate that

  9. Text Mining.

    Science.gov (United States)

    Trybula, Walter J.

    1999-01-01

    Reviews the state of research in text mining, focusing on newer developments. The intent is to describe the disparate investigations currently included under the term text mining and provide a cohesive structure for these efforts. A summary of research identifies key organizations responsible for pushing the development of text mining. A section…

  10. A new equation of state for core-collapse supernovae based on realistic nuclear forces and including a full nuclear ensemble

    International Nuclear Information System (INIS)

    Furusawa, S; Togashi, H; Nagakura, H; Sumiyoshi, K; Yamada, S; Suzuki, H; Takano, M

    2017-01-01

    We have constructed a nuclear equation of state (EOS) that includes a full nuclear ensemble for use in core-collapse supernova simulations. It is based on the EOS for uniform nuclear matter that two of the authors derived recently, applying a variational method to realistic two- and three-body nuclear forces. We have extended the liquid drop model of heavy nuclei, utilizing the mass formula that accounts for the dependences of bulk, surface, Coulomb and shell energies on density and/or temperature. As for light nuclei, we employ a quantum-theoretical mass evaluation, which incorporates the Pauli- and self-energy shifts. In addition to realistic nuclear forces, the inclusion of in-medium effects on the full ensemble of nuclei makes the new EOS one of the most realistic EOSs, which covers a wide range of density, temperature and proton fraction that supernova simulations normally encounter. We make comparisons with the FYSS EOS, which is based on the same formulation for the nuclear ensemble but adopts the relativistic mean field theory with the TM1 parameter set for uniform nuclear matter. The new EOS is softer than the FYSS EOS around and above nuclear saturation densities. We find that neutron-rich nuclei with small mass numbers are more abundant in the new EOS than in the FYSS EOS because of the larger saturation densities and smaller symmetry energy of nuclei in the former. We apply the two EOSs to 1D supernova simulations and find that the new EOS gives lower electron fractions and higher temperatures in the collapse phase owing to the smaller symmetry energy. As a result, the inner core has smaller masses for the new EOS. It is more compact, on the other hand, due to the softness of the new EOS and bounces at higher densities. It turns out that the shock wave generated by core bounce is a bit stronger initially in the simulation with the new EOS. The ensuing outward propagations of the shock wave in the outer core are very similar in the two simulations, which

  11. Molecular Comparisons of Full Length Metapneumovirus (MPV) Genomes, Including Newly Determined French AMPV-C and –D Isolates, Further Supports Possible Subclassification within the MPV Genus

    Science.gov (United States)

    Brown, Paul A.; Lemaitre, Evelyne; Briand, François-Xavier; Courtillon, Céline; Guionie, Olivier; Allée, Chantal; Toquin, Didier; Bayon-Auboyer, Marie-Hélène; Jestin, Véronique; Eterradossi, Nicolas

    2014-01-01

    Four avian metapneumovirus (AMPV) subgroups (A–D) have been reported previously based on genetic and antigenic differences. However, until now full length sequences of the only known isolates of European subgroup C and subgroup D viruses (duck and turkey origin, respectively) have been unavailable. These full length sequences were determined and compared with other full length AMPV and human metapneumoviruses (HMPV) sequences reported previously, using phylogenetics, comparisons of nucleic and amino acid sequences and study of codon usage bias. Results confirmed that subgroup C viruses were more closely related to HMPV than they were to the other AMPV subgroups in the study. This was consistent with previous findings using partial genome sequences. Closer relationships between AMPV-A, B and D were also evident throughout the majority of results. Three metapneumovirus “clusters” HMPV, AMPV-C and AMPV-A, B and D were further supported by codon bias and phylogenetics. The data presented here together with those of previous studies describing antigenic relationships also between AMPV-A, B and D and between AMPV-C and HMPV may call for a subclassification of metapneumoviruses similar to that used for avian paramyxoviruses, grouping AMPV-A, B and D as type I metapneumoviruses and AMPV-C and HMPV as type II. PMID:25036224

  12. Boring of full scale deposition holes at the Aespoe Hard Rock Laboratory. Operational experiences including boring performance and a work time analysis

    Energy Technology Data Exchange (ETDEWEB)

    Andersson, Christer [Swedish Nuclear Fuel and Waste Management Co., Stockholm (Sweden); Johansson, Aasa [SWECO, Stockholm (Sweden)

    2002-12-01

    Thirteen experimental deposition holes similar to those in the present KBS-3 design have been bored at the Aespoe Hard Rock Laboratory, Oskarshamn, Sweden. The objective with the boring program was to test and demonstrate the current technique for boring of large vertical holes in granitic rock. Conclusions and results from this project is used in the planning process for the deposition holes that will be bored in the real repository for spent nuclear fuel. The boreholes are also important for three major projects. The Prototype Repository, the Canister Retrieval Test and the Demonstration project will all need full-scale deposition holes for their commissioning. The holes are bored in full scale and have a radius of 1.75 m and a depth of 8.5 m. To bore the holes an existing TBM design was modified to produce a novel type Shaft Boring Machine (SBM) suitable for boring 1.75 m diameter holes from a relatively small tunnel. The cutter head was equipped with two types of roller cutters: two row carbide button cutters and disc cutters. Removal of the cuttings was made with a vacuum suction system. The boring was monitored and boring parameters recorded by a computerised system for the evaluation of the boring performance. During boring of four of the holes temperature, stress and strain measurements were performed. Acoustic emission measurements were also performed during boring of these four holes. The results of these activities will not be discussed in this report since they are reported separately. Criteria regarding nominal borehole diameter, deviation of start and end centre point, surface roughness and performance of the machine were set up according to the KBS-3 design and were fulfilled with a fair margin. The average total time for boring one deposition hole during this project was 105 hours.

  13. Boring of full scale deposition holes at the Aespoe Hard Rock Laboratory. Operational experiences including boring performance and a work time analysis

    International Nuclear Information System (INIS)

    Andersson, Christer; Johansson, Aasa

    2002-12-01

    Thirteen experimental deposition holes similar to those in the present KBS-3 design have been bored at the Aespoe Hard Rock Laboratory, Oskarshamn, Sweden. The objective with the boring program was to test and demonstrate the current technique for boring of large vertical holes in granitic rock. Conclusions and results from this project is used in the planning process for the deposition holes that will be bored in the real repository for spent nuclear fuel. The boreholes are also important for three major projects. The Prototype Repository, the Canister Retrieval Test and the Demonstration project will all need full-scale deposition holes for their commissioning. The holes are bored in full scale and have a radius of 1.75 m and a depth of 8.5 m. To bore the holes an existing TBM design was modified to produce a novel type Shaft Boring Machine (SBM) suitable for boring 1.75 m diameter holes from a relatively small tunnel. The cutter head was equipped with two types of roller cutters: two row carbide button cutters and disc cutters. Removal of the cuttings was made with a vacuum suction system. The boring was monitored and boring parameters recorded by a computerised system for the evaluation of the boring performance. During boring of four of the holes temperature, stress and strain measurements were performed. Acoustic emission measurements were also performed during boring of these four holes. The results of these activities will not be discussed in this report since they are reported separately. Criteria regarding nominal borehole diameter, deviation of start and end centre point, surface roughness and performance of the machine were set up according to the KBS-3 design and were fulfilled with a fair margin. The average total time for boring one deposition hole during this project was 105 hours

  14. Plagiarism in Academic Texts

    Directory of Open Access Journals (Sweden)

    Marta Eugenia Rojas-Porras

    2012-08-01

    Full Text Available The ethical and social responsibility of citing the sources in a scientific or artistic work is undeniable. This paper explores, in a preliminary way, academic plagiarism in its various forms. It includes findings based on a forensic analysis. The purpose of this paper is to raise awareness on the importance of considering these details when writing and publishing a text. Hopefully, this analysis may put the issue under discussion.

  15. Conditions governing the acceptance of radioactive wastes by the Hauptabteilung Dekontaminationsbetriebe (HDB). Full text of legal provisions, issue no.6 of July 1, 1991, as amended until January 1, 1995

    International Nuclear Information System (INIS)

    1995-01-01

    The conditions apply to the acceptance of radwaste by the Main Decontamination Dept. (HDB) of Karlsruhe Research Center, including radioactive remnants, contaminated plant components, and primary waste from the following waste generators: Institutes of the Karlsruhe Research Center, facilities located within the Center but run by other organisations, other outside facilities not linked with the Center, as e.g. waste generators in Baden-Wuerttemberg obliged to deliver their radwaste to the Radwaste Collecting Site of the Land of Baden-Wuerttemberg. Amendments are marked at the right-hand margin of the text

  16. A Customizable Text Classifier for Text Mining

    Directory of Open Access Journals (Sweden)

    Yun-liang Zhang

    2007-12-01

    Full Text Available Text mining deals with complex and unstructured texts. Usually a particular collection of texts that is specified to one or more domains is necessary. We have developed a customizable text classifier for users to mine the collection automatically. It derives from the sentence category of the HNC theory and corresponding techniques. It can start with a few texts, and it can adjust automatically or be adjusted by user. The user can also control the number of domains chosen and decide the standard with which to choose the texts based on demand and abundance of materials. The performance of the classifier varies with the user's choice.

  17. Contrast-induced nephropathy in patients with diabetes mellitus between iso- and low-osmolar contrast media: A meta-analysis of full-text prospective, randomized controlled trials.

    Science.gov (United States)

    Han, Xiao-Fang; Zhang, Xin-Xiu; Liu, Ke-Mei; Tan, Hua; Zhang, Qiu

    2018-01-01

    This study was conducted to compare iso-osmolar contrast medium, iodixanol, with low-osmolar contrast media (LOCM) for assessing contrast-induced nephropathy (CIN) incidence, exclusively in the diabetic population. A systematic search was conducted for full-text, prospective, randomized controlled trials (RCTs). The primary outcome was incidence of CIN. Medline, Cochrane Central Register of Controlled Trials, and other sources were searched until May 31, 2017. Twelve RCTs finally met the search criteria. Iodixanol did not significantly reduce the risk of CIN (risk ratio [RR]: 0.72, 95% confidence interval (CI): [0.49, 1.04], p = 0.08). However, there was significantly reduced risk of CIN when iodixanol was compared to a LOCM agent iohexol (RR: 0.32, 95% CI [0.12, 0.89]). There were no differences between iodixanol and the other non-iohexol LOCM (RR: 0.92, 95% CI [0.68, 1.25]). In diabetic populations, iodixanol is not associated with a significant reduction of CIN risk. Iodixanol is associated with a reduced risk of CIN compared with iohexol, whereas no significant difference between iodixanol and other LOCM could be found.

  18. Zum Bildungspotenzial biblischer Texte

    Directory of Open Access Journals (Sweden)

    Theis, Joachim

    2017-11-01

    Full Text Available Biblical education as a holistic process goes far beyond biblical learning. It must be understood as a lifelong process, in which both biblical texts and their understanders operate appropriating their counterpart in a dialogical way. – Neither does the recipient’s horizon of understanding appear as an empty room, which had to be filled with the text only, nor is the latter a dead material one could only examine cognitively. The recipient discovers the meaning of the biblical text recomposing it by existential appropriation. So the text is brought to live in each individual reality. Both scientific insights and subjective structures as well as the understanders’ community must be included to avoid potential one-sidednesses. Unfortunately, a special negative association obscures the approach of the bible very often: Still biblical work as part of religious education appears in a cognitively oriented habit, which is neither regarding the vitality and sovereignty of the biblical texts nor the students’ desire for meaning. Moreover, the bible is getting misused for teaching moral terms or pontifications. Such downfalls can be disrupted by biblical didactics which are empowerment didactics. Regarding the sovereignty of biblical texts, these didactics assist the understander with his/her individuation by opening the texts with focus on the understander’s otherness. Thus each the text and the recipient become subjects in a dialogue. The approach of the Biblical-Enabling-Didactics leads the Bible to become always new a book of life. Understanding them from within their hermeneutics, empowerment didactics could be raised to the principle of biblical didactics in general and grow into an essential element of holistic education.

  19. A new international convention against terrorism: the International Convention for the Suppression of Acts of Nuclear Terrorism. Full text in French, English and Spanish. Introduction to the main elements of the convention

    International Nuclear Information System (INIS)

    Laborde, Jean-Paul; )

    2005-01-01

    The International Convention for the Suppression of Acts of Nuclear Terrorism is a 2005 United Nations treaty designed to criminalize acts of nuclear terrorism and to promote police and judicial cooperation to prevent, investigate and punish those acts. As of September 2016, the convention has 115 signatories and 106 state parties, including the nuclear powers China, France, India, Russia, the United Kingdom, and the United States. The Convention covers a broad range of acts and possible targets, including nuclear power plants and nuclear reactors; covers threats and attempts to commit such crimes or to participate in them, as an accomplice; stipulates that offenders shall be either extradited or prosecuted; encourages States to cooperate in preventing terrorist attacks by sharing information and assisting each other in connection with criminal investigations and extradition proceedings; and, deals with both crisis situations, assisting States to solve the situations and post-crisis situations by rendering nuclear material safe through the International Atomic Energy Agency (IAEA)

  20. E-text

    DEFF Research Database (Denmark)

    Finnemann, Niels Ole

    2018-01-01

    text can be defined by taking as point of departure the digital format in which everything is represented in the binary alphabet. While the notion of text, in most cases, lends itself to be independent of medium and embodiment, it is also often tacitly assumed that it is, in fact, modeled around...... the print medium, rather than written text or speech. In late 20th century, the notion of text was subject to increasing criticism as in the question raised within literary text theory: is there a text in this class? At the same time, the notion was expanded by including extra linguistic sign modalities...

  1. Abstracts to be Delivered at the 2014 Annual Conference of the Association of Medical Microbiology and Infectious Disease Canada, April 3 to 5, Victoria, British Columbia, Alphabetized According to the Surname of the First Author. Full-text Abstracts Can be Accessed at www.pulsus.com

    Directory of Open Access Journals (Sweden)

    2014-01-01

    Full Text Available This document presents the titles of the abstracts to be presented at the 2014 Annual Conference of the Association of Medical Microbiology and Infectious Disease Canada (April 3 to 5, Victoria, British Columbia. The full-text abstracts are available online.

  2. Context and Structure in Automated Full-Text Information Access

    Science.gov (United States)

    1994-04-29

    Meisei, Makayo, Nitsuko and Tamura, all of Japan; Goldstar, Samsung and OPC of South Korea, and Sun Moon Star of Taiwan; AT&T says the practices have...IN MALAYSIA [ ... ] Another example topic description is shown below: Topic 034 <dom> Domain: Science and Technology <title>Topic: Entities Involved In

  3. HighWire Free Online Full-text Articles

    Science.gov (United States)

    Journal of Lipid Research all articles after 12 months Journal of Medical Ethics all articles 1 Jan 1975 Anticancer Research all articles after 2 years every Jan. Antimicrobial Agents and Chemotherapy all articles BMJ Open Diabetes Research & Care free site BMJ Open Gastroenterology free site BMJ Open

  4. Vocabulary Constraint on Texts

    Directory of Open Access Journals (Sweden)

    C. Sutarsyah

    2008-01-01

    Full Text Available This case study was carried out in the English Education Department of State University of Malang. The aim of the study was to identify and describe the vocabulary in the reading text and to seek if the text is useful for reading skill development. A descriptive qualitative design was applied to obtain the data. For this purpose, some available computer programs were used to find the description of vocabulary in the texts. It was found that the 20 texts containing 7,945 words are dominated by low frequency words which account for 16.97% of the words in the texts. The high frequency words occurring in the texts were dominated by function words. In the case of word levels, it was found that the texts have very limited number of words from GSL (General Service List of English Words (West, 1953. The proportion of the first 1,000 words of GSL only accounts for 44.6%. The data also show that the texts contain too large proportion of words which are not in the three levels (the first 2,000 and UWL. These words account for 26.44% of the running words in the texts.  It is believed that the constraints are due to the selection of the texts which are made of a series of short-unrelated texts. This kind of text is subject to the accumulation of low frequency words especially those of content words and limited of words from GSL. It could also defeat the development of students' reading skills and vocabulary enrichment.

  5. Directed Activities Related to Text: Text Analysis and Text Reconstruction.

    Science.gov (United States)

    Davies, Florence; Greene, Terry

    This paper describes Directed Activities Related to Text (DART), procedures that were developed and are used in the Reading for Learning Project at the University of Nottingham (England) to enhance learning from texts and that fall into two broad categories: (1) text analysis procedures, which require students to engage in some form of analysis of…

  6. The cover letter should include the corresponding author's full ...

    African Journals Online (AJOL)

    uaf

    2Ayub Agricultural Research Institute, Faisalabad, Pakistan. 3Department of ... countries. In Chile, from 1995 to 1997, it was seen as a ... Thus, owing to the lack of information on this side, the ..... (In English; summary in Spanish). Bol. del ...

  7. Library Users Expect Link Resolvers to Provide Full Text While Librarians Expect Accurate Results. A review of: Wakimoto, Jina Choi, David S. Walker, and Katherine S. Dabbour. “The Myths and Realities of SFX in Academic Libraries.” The Journal of Academic Librarianship 32.2 (Mar. 2006: 127‐ 36.

    Directory of Open Access Journals (Sweden)

    Wendy Furlan

    2006-12-01

    Full Text Available Objective – To determine how successfulthe link resolver, SFX, is in meeting the expectations of library users and librarians.Design – Analysis of an online user survey, library staff focus groups, retrospective analysis of system statistics, and test searches.Setting – Two California State University campus libraries in the United States: Northbridge, with over 31,000 students on campus, and San Marcos, with over 7,300 students on campus.Subjects – A total of 453 online survey responses were submitted from library users, 421 from Northbridge and 32 from SanMarcos. Twenty librarians took part in the focus groups conducted with library staff consisting of 14 of the 23 librarians from Northbridge (2 from technical services and 12 from public services, and 6 of the 10 San Marcos librarians (3 from technical services and 3 from public services. No further information was provided on the characteristics of the subjects.Methods – An online survey was offered to users of the two campus libraries for a two week period in May 2004. The survey consisted of 8 questions, 7 fixed response and 1 free text. Survey distribution was enabled via a different mechanism at each campus. The Northbridge library offered the survey to users via a pop‐up window each time the SFX service was clicked on, while the San Marcos library presented the survey as a link from the library’s home page. Survey responses from both campuses were combined and analysed together. Focus groups were conducted with librarians from each campus library on April 20th, 21st, and 29th, 2004. Librarians attended focus groups only with others from their own campus. Statistics were gathered from each campus’ local SFX system for the 3‐month period from September 14, 2004, to December 14,2004. Statistics from each campus were combined for analysis. The authors also conducted 224 test searches over the 3‐month period from July to September, 2004.Main results – Analysis of the

  8. Üstverinin Tam-Metin Bilgi Erişim Performansı Üzerindeki Etkisi: Küçük Ölçekli Türkçe Külliyat Üzerinde Deneysel Bir Araştırma / Impact of Metadata on Full-text Information Retrieval Performance: An Experimental Research on a Small Scale Turkish Corpus

    OpenAIRE

    Çapkın, Çağdaş

    2016-01-01

    Information institutions use text-based information retrieval systems to store, index and retrieve metadata, full-text, or both metadata and full-text (hybrid) contents. The aim of this research was to evaluate impact of these contents on information retrieval performance. For this purpose, metadata (MIR), full-text (FIR) and hybrid (HIR) content information retrieval systems were developed with default Lucene information retrieval model for a small scale Turkish corpus. In order to evaluate ...

  9. Text Maps: Helping Students Navigate Informational Texts.

    Science.gov (United States)

    Spencer, Brenda H.

    2003-01-01

    Notes that a text map is an instructional approach designed to help students gain fluency in reading content area materials. Discusses how the goal is to teach students about the important features of the material and how the maps can be used to build new understandings. Presents the procedures for preparing and using a text map. (SG)

  10. A Proposed Arabic Handwritten Text Normalization Method

    Directory of Open Access Journals (Sweden)

    Tarik Abu-Ain

    2014-11-01

    Full Text Available Text normalization is an important technique in document image analysis and recognition. It consists of many preprocessing stages, which include slope correction, text padding, skew correction, and straight the writing line. In this side, text normalization has an important role in many procedures such as text segmentation, feature extraction and characters recognition. In the present article, a new method for text baseline detection, straightening, and slant correction for Arabic handwritten texts is proposed. The method comprises a set of sequential steps: first components segmentation is done followed by components text thinning; then, the direction features of the skeletons are extracted, and the candidate baseline regions are determined. After that, selection of the correct baseline region is done, and finally, the baselines of all components are aligned with the writing line.  The experiments are conducted on IFN/ENIT benchmark Arabic dataset. The results show that the proposed method has a promising and encouraging performance.

  11. From Text to Political Positions: Text analysis across disciplines

    NARCIS (Netherlands)

    Kaal, A.R.; Maks, I.; van Elfrinkhof, A.M.E.

    2014-01-01

    ABSTRACT From Text to Political Positions addresses cross-disciplinary innovation in political text analysis for party positioning. Drawing on political science, computational methods and discourse analysis, it presents a diverse collection of analytical models including pure quantitative and

  12. Text analysis methods, text analysis apparatuses, and articles of manufacture

    Science.gov (United States)

    Whitney, Paul D; Willse, Alan R; Lopresti, Charles A; White, Amanda M

    2014-10-28

    Text analysis methods, text analysis apparatuses, and articles of manufacture are described according to some aspects. In one aspect, a text analysis method includes accessing information indicative of data content of a collection of text comprising a plurality of different topics, using a computing device, analyzing the information indicative of the data content, and using results of the analysis, identifying a presence of a new topic in the collection of text.

  13. Weitere Texte physiognomischen Inhalts

    Directory of Open Access Journals (Sweden)

    Böck, Barbara

    2004-12-01

    Full Text Available The present article offers the edition of three cuneiform texts belonging to the Akkadian handbook of omens drawn from the physical appearance as well as the morals and behaviour of man. The book comprising up to 27 chapters with more than 100 omens each was entitled in antiquity Alamdimmû. The edition of the three cuneiform tablets completes, thus, the author's monographic study on the ancient Mesopotamian divinatory discipline of physiognomy (Die babylonisch-assyrische Morphoskopie (Wien 2000 [=AfO Beih. 27].

    En este artículo se presenta la editio princeps de tres textos cuneiformes conservados en el British Museum (Londres y el Vorderasiatisches Museum (Berlín, que pertenecen al libro asirio-babilonio de presagios fisiognómicos. Este libro, titulado originalmente Alamdimmû ('forma, figura', consta de 27 capítulos, cada uno con más de cien presagios escritos en lengua acadia. Los tres textos completan así el estudio monográfico de la autora sobre la disciplina adivinatoria de la fisiognomía en el antiguo Oriente (Die babylonisch-assyrische Morphoskopie (Wien 2000 [=AfO Beih. 27].

  14. Working with text tools, techniques and approaches for text mining

    CERN Document Server

    Tourte, Gregory J L

    2016-01-01

    Text mining tools and technologies have long been a part of the repository world, where they have been applied to a variety of purposes, from pragmatic aims to support tools. Research areas as diverse as biology, chemistry, sociology and criminology have seen effective use made of text mining technologies. Working With Text collects a subset of the best contributions from the 'Working with text: Tools, techniques and approaches for text mining' workshop, alongside contributions from experts in the area. Text mining tools and technologies in support of academic research include supporting research on the basis of a large body of documents, facilitating access to and reuse of extant work, and bridging between the formal academic world and areas such as traditional and social media. Jisc have funded a number of projects, including NaCTem (the National Centre for Text Mining) and the ResDis programme. Contents are developed from workshop submissions and invited contributions, including: Legal considerations in te...

  15. Utah Text Retrieval Project

    Energy Technology Data Exchange (ETDEWEB)

    Hollaar, L A

    1983-10-01

    The Utah Text Retrieval project seeks well-engineered solutions to the implementation of large, inexpensive, rapid text information retrieval systems. The project has three major components. Perhaps the best known is the work on the specialized processors, particularly search engines, necessary to achieve the desired performance and cost. The other two concern the user interface to the system and the system's internal structure. The work on user interface development is not only concentrating on the syntax and semantics of the query language, but also on the overall environment the system presents to the user. Environmental enhancements include convenient ways to browse through retrieved documents, access to other information retrieval systems through gateways supporting a common command interface, and interfaces to word processing systems. The system's internal structure is based on a high-level data communications protocol linking the user interface, index processor, search processor, and other system modules. This allows them to be easily distributed in a multi- or specialized-processor configuration. It also allows new modules, such as a knowledge-based query reformulator, to be added. 15 references.

  16. Informational Text and the CCSS

    Science.gov (United States)

    Aspen Institute, 2012

    2012-01-01

    What constitutes an informational text covers a broad swath of different types of texts. Biographies & memoirs, speeches, opinion pieces & argumentative essays, and historical, scientific or technical accounts of a non-narrative nature are all included in what the Common Core State Standards (CCSS) envisions as informational text. Also included…

  17. Text-Fabric

    NARCIS (Netherlands)

    Roorda, Dirk

    2016-01-01

    Text-Fabric is a Python3 package for Text plus Annotations. It provides a data model, a text file format, and a binary format for (ancient) text plus (linguistic) annotations. The emphasis of this all is on: data processing; sharing data; and contributing modules. A defining characteristic is that

  18. Contextual Text Mining

    Science.gov (United States)

    Mei, Qiaozhu

    2009-01-01

    With the dramatic growth of text information, there is an increasing need for powerful text mining systems that can automatically discover useful knowledge from text. Text is generally associated with all kinds of contextual information. Those contexts can be explicit, such as the time and the location where a blog article is written, and the…

  19. XML and Free Text.

    Science.gov (United States)

    Riggs, Ken Roger

    2002-01-01

    Discusses problems with marking free text, text that is either natural language or semigrammatical but unstructured, that prevent well-formed XML from marking text for readily available meaning. Proposes a solution to mark meaning in free text that is consistent with the intended simplicity of XML versus SGML. (Author/LRW)

  20. Texting on the Move

    Science.gov (United States)

    ... text. What's the Big Deal? The problem is multitasking. No matter how young and agile we are, ... on something other than the road. In fact, driving while texting (DWT) can be more dangerous than ...

  1. Text Coherence in Translation

    Science.gov (United States)

    Zheng, Yanping

    2009-01-01

    In the thesis a coherent text is defined as a continuity of senses of the outcome of combining concepts and relations into a network composed of knowledge space centered around main topics. And the author maintains that in order to obtain the coherence of a target language text from a source text during the process of translation, a translator can…

  2. TEXT DEIXIS IN NARRATIVE SEQUENCES

    Directory of Open Access Journals (Sweden)

    Josep Rivera

    2007-06-01

    Full Text Available This study looks at demonstrative descriptions, regarding them as text-deictic procedures which contribute to weave discourse reference. Text deixis is thought of as a metaphorical referential device which maps the ground of utterance onto the text itself. Demonstrative expressions with textual antecedent-triggers, considered as the most important text-deictic units, are identified in a narrative corpus consisting of J. M. Barrie’s Peter Pan and its translation into Catalan. Some linguistic and discourse variables related to DemNPs are analysed to characterise adequately text deixis. It is shown that this referential device is usually combined with abstract nouns, thus categorising and encapsulating (non-nominal complex discourse entities as nouns, while performing a referential cohesive function by means of the text deixis + general noun type of lexical cohesion.

  3. A Typed Text Retrieval Query Language for XML Documents.

    Science.gov (United States)

    Colazzo, Dario; Sartiani, Carlo; Albano, Antonio; Manghi, Paolo; Ghelli, Giorgio; Lini, Luca; Paoli, Michele

    2002-01-01

    Discussion of XML focuses on a description of Tequyla-TX, a typed text retrieval query language for XML documents that can search on both content and structures. Highlights include motivations; numerous examples; word-based and char-based searches; tag-dependent full-text searches; text normalization; query algebra; data models and term language;…

  4. EXPLORING STUDENTS‟ DIFFICULTIES IN READING ACADEMIC TEXTS

    Directory of Open Access Journals (Sweden)

    Ira Ernawati

    2017-04-01

    Full Text Available Academic texts play an important role for university students. However, those texts are considered difficult. This study is intended to investigate students‘ difficulties in reading academic texts. The qualitative approach was employed in this study. The design was a case study. The participants were ten students from fifth semester of CLS: EE (Classroom Language and Strategy: Explaining and Exemplifying class who were selected by using purposive sampling. The data were gathered from students‘ journal reflections, observation, and interview. The finding shows that the students encountered reading difficulties in area of textual factors, namely vocabulary, comprehending specific information, text organization, and grammar and human factors including background knowledge, mood, laziness, and time constraint.

  5. Interconnectedness und digitale Texte

    Directory of Open Access Journals (Sweden)

    Detlev Doherr

    2013-04-01

    Full Text Available Zusammenfassung Die multimedialen Informationsdienste im Internet werden immer umfangreicher und umfassender, wobei auch die nur in gedruckter Form vorliegenden Dokumente von den Bibliotheken digitalisiert und ins Netz gestellt werden. Über Online-Dokumentenverwaltungen oder Suchmaschinen können diese Dokumente gefunden und dann in gängigen Formaten wie z.B. PDF bereitgestellt werden. Dieser Artikel beleuchtet die Funktionsweise der Humboldt Digital Library, die seit mehr als zehn Jahren Dokumente von Alexander von Humboldt in englischer Übersetzung im Web als HDL (Humboldt Digital Library kostenfrei zur Verfügung stellt. Anders als eine digitale Bibliothek werden dabei allerdings nicht nur digitalisierte Dokumente als Scan oder PDF bereitgestellt, sondern der Text als solcher und in vernetzter Form verfügbar gemacht. Das System gleicht damit eher einem Informationssystem als einer digitalen Bibliothek, was sich auch in den verfügbaren Funktionen zur Auffindung von Texten in unterschiedlichen Versionen und Übersetzungen, Vergleichen von Absätzen verschiedener Dokumente oder der Darstellung von Bilden in ihrem Kontext widerspiegelt. Die Entwicklung von dynamischen Hyperlinks auf der Basis der einzelnen Textabsätze der Humboldt‘schen Werke in Form von Media Assets ermöglicht eine Nutzung der Programmierschnittstelle von Google Maps zur geographischen wie auch textinhaltlichen Navigation. Über den Service einer digitalen Bibliothek hinausgehend, bietet die HDL den Prototypen eines mehrdimensionalen Informationssystems, das mit dynamischen Strukturen arbeitet und umfangreiche thematische Auswertungen und Vergleiche ermöglicht. Summary The multimedia information services on Internet are becoming more and more comprehensive, even the printed documents are digitized and republished as digital Web documents by the libraries. Those digital files can be found by search engines or management tools and provided as files in usual formats as

  6. Dictionaries for text production

    DEFF Research Database (Denmark)

    Fuertes-Olivera, Pedro; Bergenholtz, Henning

    2018-01-01

    Dictionaries for Text Production are information tools that are designed and constructed for helping users to produce (i.e. encode) texts, both oral and written texts. These can be broadly divided into two groups: (a) specialized text production dictionaries, i.e., dictionaries that only offer...... a small amount of lexicographic data, most or all of which are typically used in a production situation, e.g. synonym dictionaries, grammar and spelling dictionaries, collocation dictionaries, concept dictionaries such as the Longman Language Activator, which is advertised as the World’s First Production...... Dictionary; (b) general text production dictionaries, i.e., dictionaries that offer all or most of the lexicographic data that are typically used in a production situation. A review of existing production dictionaries reveals that there are many specialized text production dictionaries but only a few general...

  7. SparkText: Biomedical Text Mining on Big Data Framework.

    Directory of Open Access Journals (Sweden)

    Zhan Ye

    Full Text Available Many new biomedical research articles are published every day, accumulating rich information, such as genetic variants, genes, diseases, and treatments. Rapid yet accurate text mining on large-scale scientific literature can discover novel knowledge to better understand human diseases and to improve the quality of disease diagnosis, prevention, and treatment.In this study, we designed and developed an efficient text mining framework called SparkText on a Big Data infrastructure, which is composed of Apache Spark data streaming and machine learning methods, combined with a Cassandra NoSQL database. To demonstrate its performance for classifying cancer types, we extracted information (e.g., breast, prostate, and lung cancers from tens of thousands of articles downloaded from PubMed, and then employed Naïve Bayes, Support Vector Machine (SVM, and Logistic Regression to build prediction models to mine the articles. The accuracy of predicting a cancer type by SVM using the 29,437 full-text articles was 93.81%. While competing text-mining tools took more than 11 hours, SparkText mined the dataset in approximately 6 minutes.This study demonstrates the potential for mining large-scale scientific articles on a Big Data infrastructure, with real-time update from new articles published daily. SparkText can be extended to other areas of biomedical research.

  8. Instant Sublime Text starter

    CERN Document Server

    Haughee, Eric

    2013-01-01

    A starter which teaches the basic tasks to be performed with Sublime Text with the necessary practical examples and screenshots. This book requires only basic knowledge of the Internet and basic familiarity with any one of the three major operating systems, Windows, Linux, or Mac OS X. However, as Sublime Text 2 is primarily a text editor for writing software, many of the topics discussed will be specifically relevant to software development. That being said, the Sublime Text 2 Starter is also suitable for someone without a programming background who may be looking to learn one of the tools of

  9. Text mining for the biocuration workflow.

    Science.gov (United States)

    Hirschman, Lynette; Burns, Gully A P C; Krallinger, Martin; Arighi, Cecilia; Cohen, K Bretonnel; Valencia, Alfonso; Wu, Cathy H; Chatr-Aryamontri, Andrew; Dowell, Karen G; Huala, Eva; Lourenço, Anália; Nash, Robert; Veuthey, Anne-Lise; Wiegers, Thomas; Winter, Andrew G

    2012-01-01

    Molecular biology has become heavily dependent on biological knowledge encoded in expert curated biological databases. As the volume of biological literature increases, biocurators need help in keeping up with the literature; (semi-) automated aids for biocuration would seem to be an ideal application for natural language processing and text mining. However, to date, there have been few documented successes for improving biocuration throughput using text mining. Our initial investigations took place for the workshop on 'Text Mining for the BioCuration Workflow' at the third International Biocuration Conference (Berlin, 2009). We interviewed biocurators to obtain workflows from eight biological databases. This initial study revealed high-level commonalities, including (i) selection of documents for curation; (ii) indexing of documents with biologically relevant entities (e.g. genes); and (iii) detailed curation of specific relations (e.g. interactions); however, the detailed workflows also showed many variabilities. Following the workshop, we conducted a survey of biocurators. The survey identified biocurator priorities, including the handling of full text indexed with biological entities and support for the identification and prioritization of documents for curation. It also indicated that two-thirds of the biocuration teams had experimented with text mining and almost half were using text mining at that time. Analysis of our interviews and survey provide a set of requirements for the integration of text mining into the biocuration workflow. These can guide the identification of common needs across curated databases and encourage joint experimentation involving biocurators, text mining developers and the larger biomedical research community.

  10. Social Studies: Texts and Supplements.

    Science.gov (United States)

    Curriculum Review, 1979

    1979-01-01

    This review of selected social studies texts, series, and supplements, mainly for the secondary level, includes a special section examining eight titles on warfare and terrorism for grades 4-12. (SJL)

  11. Linguistics in Text Interpretation

    DEFF Research Database (Denmark)

    Togeby, Ole

    2011-01-01

    A model for how text interpretation proceeds from what is pronounced, through what is said to what is comunicated, and definition of the concepts 'presupposition' and 'implicature'.......A model for how text interpretation proceeds from what is pronounced, through what is said to what is comunicated, and definition of the concepts 'presupposition' and 'implicature'....

  12. LocText

    DEFF Research Database (Denmark)

    Cejuela, Juan Miguel; Vinchurkar, Shrikant; Goldberg, Tatyana

    2018-01-01

    trees and was trained and evaluated on a newly improved LocTextCorpus. Combined with an automatic named-entity recognizer, LocText achieved high precision (P = 86%±4). After completing development, we mined the latest research publications for three organisms: human (Homo sapiens), budding yeast...

  13. Systematic text condensation

    DEFF Research Database (Denmark)

    Malterud, Kirsti

    2012-01-01

    To present background, principles, and procedures for a strategy for qualitative analysis called systematic text condensation and discuss this approach compared with related strategies.......To present background, principles, and procedures for a strategy for qualitative analysis called systematic text condensation and discuss this approach compared with related strategies....

  14. The Perfect Text.

    Science.gov (United States)

    Russo, Ruth

    1998-01-01

    A chemistry teacher describes the elements of the ideal chemistry textbook. The perfect text is focused and helps students draw a coherent whole out of the myriad fragments of information and interpretation. The text would show chemistry as the central science necessary for understanding other sciences and would also root chemistry firmly in the…

  15. Text 2 Mind Map

    OpenAIRE

    Iona, John

    2017-01-01

    This is a review of the web resource 'Text 2 Mind Map' www.Text2MindMap.com. It covers what the resource is, and how it might be used in Library and education context, in particular for School Librarians.

  16. Text File Comparator

    Science.gov (United States)

    Kotler, R. S.

    1983-01-01

    File Comparator program IFCOMP, is text file comparator for IBM OS/VScompatable systems. IFCOMP accepts as input two text files and produces listing of differences in pseudo-update form. IFCOMP is very useful in monitoring changes made to software at the source code level.

  17. Monitoring interaction and collective text production through text mining

    Directory of Open Access Journals (Sweden)

    Macedo, Alexandra Lorandi

    2014-04-01

    Full Text Available This article presents the Concepts Network tool, developed using text mining technology. The main objective of this tool is to extract and relate terms of greatest incidence from a text and exhibit the results in the form of a graph. The Network was implemented in the Collective Text Editor (CTE which is an online tool that allows the production of texts in synchronized or non-synchronized forms. This article describes the application of the Network both in texts produced collectively and texts produced in a forum. The purpose of the tool is to offer support to the teacher in managing the high volume of data generated in the process of interaction amongst students and in the construction of the text. Specifically, the aim is to facilitate the teacher’s job by allowing him/her to process data in a shorter time than is currently demanded. The results suggest that the Concepts Network can aid the teacher, as it provides indicators of the quality of the text produced. Moreover, messages posted in forums can be analyzed without their content necessarily having to be pre-read.

  18. EST: Evading Scientific Text.

    Science.gov (United States)

    Ward, Jeremy

    2001-01-01

    Examines chemical engineering students' attitudes to text and other parts of English language textbooks. A questionnaire was administered to a group of undergraduates. Results reveal one way students get around the problem of textbook reading. (Author/VWL)

  19. nal Sesotho texts

    African Journals Online (AJOL)

    with literary texts written in indigenous South African languages. The project ... Homi Bhabha uses the words of Salman Rushdie to underline the fact that new .... I could not conceptualise an African-language-to-African-language dictionary. An.

  20. Machine Translation from Text

    Science.gov (United States)

    Habash, Nizar; Olive, Joseph; Christianson, Caitlin; McCary, John

    Machine translation (MT) from text, the topic of this chapter, is perhaps the heart of the GALE project. Beyond being a well defined application that stands on its own, MT from text is the link between the automatic speech recognition component and the distillation component. The focus of MT in GALE is on translating from Arabic or Chinese to English. The three languages represent a wide range of linguistic diversity and make the GALE MT task rather challenging and exciting.

  1. Biomarker Identification Using Text Mining

    Directory of Open Access Journals (Sweden)

    Hui Li

    2012-01-01

    Full Text Available Identifying molecular biomarkers has become one of the important tasks for scientists to assess the different phenotypic states of cells or organisms correlated to the genotypes of diseases from large-scale biological data. In this paper, we proposed a text-mining-based method to discover biomarkers from PubMed. First, we construct a database based on a dictionary, and then we used a finite state machine to identify the biomarkers. Our method of text mining provides a highly reliable approach to discover the biomarkers in the PubMed database.

  2. TEXT Energy Storage System

    International Nuclear Information System (INIS)

    Weldon, W.F.; Rylander, H.G.; Woodson, H.H.

    1977-01-01

    The Texas Experimental Tokamak (TEXT) Enery Storage System, designed by the Center for Electromechanics (CEM), consists of four 50 MJ, 125 V homopolar generators and their auxiliaries and is designed to power the toroidal and poloidal field coils of TEXT on a two-minute duty cycle. The four 50 MJ generators connected in series were chosen because they represent the minimum cost configuration and also represent a minimal scale up from the successful 5.0 MJ homopolar generator designed, built, and operated by the CEM

  3. Text and ideology: text-oriented discourse analysis

    Directory of Open Access Journals (Sweden)

    Maria Eduarda Gonçalves Peixoto

    2018-04-01

    Full Text Available The article aims to contribute to the understanding of the connection between text and ideology articulated by the text-oriented analysis of discourse (ADTO. Based on the reflections of Fairclough (1989, 2001, 2003 and Fairclough and Chouliaraki (1999, the debate presents the social ontology that ADTO uses to base its conception of social life as an open system and textually mediated; the article then explains the chronological-narrative development of the main critical theories of ideology, by virtue of which ADTO organizes the assumptions that underpin the particular use it makes of the term. Finally, the discussion presents the main aspects of the connection between text and ideology, offering a conceptual framework that can contribute to the domain of the theme according to a critical discourse analysis approach.

  4. New mathematical cuneiform texts

    CERN Document Server

    Friberg, Jöran

    2016-01-01

    This monograph presents in great detail a large number of both unpublished and previously published Babylonian mathematical texts in the cuneiform script. It is a continuation of the work A Remarkable Collection of Babylonian Mathematical Texts (Springer 2007) written by Jöran Friberg, the leading expert on Babylonian mathematics. Focussing on the big picture, Friberg explores in this book several Late Babylonian arithmetical and metro-mathematical table texts from the sites of Babylon, Uruk and Sippar, collections of mathematical exercises from four Old Babylonian sites, as well as a new text from Early Dynastic/Early Sargonic Umma, which is the oldest known collection of mathematical exercises. A table of reciprocals from the end of the third millennium BC, differing radically from well-documented but younger tables of reciprocals from the Neo-Sumerian and Old-Babylonian periods, as well as a fragment of a Neo-Sumerian clay tablet showing a new type of a labyrinth are also discussed. The material is presen...

  5. The Emar Lexical Texts

    NARCIS (Netherlands)

    Gantzert, Merijn

    2011-01-01

    This four-part work provides a philological analysis and a theoretical interpretation of the cuneiform lexical texts found in the Late Bronze Age city of Emar, in present-day Syria. These word and sign lists, commonly dated to around 1100 BC, were almost all found in the archive of a single school.

  6. Text Induced Spelling Correction

    NARCIS (Netherlands)

    Reynaert, M.W.C.

    2004-01-01

    We present TISC, a language-independent and context-sensitive spelling checking and correction system designed to facilitate the automatic removal of non-word spelling errors in large corpora. Its lexicon is derived from a very large corpus of raw text, without supervision, and contains word

  7. Texts and Readers.

    Science.gov (United States)

    Iser, Wolfgang

    1980-01-01

    Notes that, since fictional discourse need not reflect prevailing systems of meaning and norms or values, readers gain detachment from their own presuppositions; by constituting and formulating text-sense, readers are constituting and formulating their own cognition and becoming aware of the operations for doing so. (FL)

  8. Documents and legal texts

    International Nuclear Information System (INIS)

    2017-01-01

    This section treats of the following documents and legal texts: 1 - Belgium 29 June 2014 - Act amending the Act of 22 July 1985 on Third-Party Liability in the Field of Nuclear Energy; 2 - Belgium, 7 December 2016. - Act amending the Act of 22 July 1985 on Third-Party Liability in the Field of Nuclear Energy

  9. SparkText: Biomedical Text Mining on Big Data Framework

    Science.gov (United States)

    He, Karen Y.; Wang, Kai

    2016-01-01

    Background Many new biomedical research articles are published every day, accumulating rich information, such as genetic variants, genes, diseases, and treatments. Rapid yet accurate text mining on large-scale scientific literature can discover novel knowledge to better understand human diseases and to improve the quality of disease diagnosis, prevention, and treatment. Results In this study, we designed and developed an efficient text mining framework called SparkText on a Big Data infrastructure, which is composed of Apache Spark data streaming and machine learning methods, combined with a Cassandra NoSQL database. To demonstrate its performance for classifying cancer types, we extracted information (e.g., breast, prostate, and lung cancers) from tens of thousands of articles downloaded from PubMed, and then employed Naïve Bayes, Support Vector Machine (SVM), and Logistic Regression to build prediction models to mine the articles. The accuracy of predicting a cancer type by SVM using the 29,437 full-text articles was 93.81%. While competing text-mining tools took more than 11 hours, SparkText mined the dataset in approximately 6 minutes. Conclusions This study demonstrates the potential for mining large-scale scientific articles on a Big Data infrastructure, with real-time update from new articles published daily. SparkText can be extended to other areas of biomedical research. PMID:27685652

  10. SparkText: Biomedical Text Mining on Big Data Framework.

    Science.gov (United States)

    Ye, Zhan; Tafti, Ahmad P; He, Karen Y; Wang, Kai; He, Max M

    Many new biomedical research articles are published every day, accumulating rich information, such as genetic variants, genes, diseases, and treatments. Rapid yet accurate text mining on large-scale scientific literature can discover novel knowledge to better understand human diseases and to improve the quality of disease diagnosis, prevention, and treatment. In this study, we designed and developed an efficient text mining framework called SparkText on a Big Data infrastructure, which is composed of Apache Spark data streaming and machine learning methods, combined with a Cassandra NoSQL database. To demonstrate its performance for classifying cancer types, we extracted information (e.g., breast, prostate, and lung cancers) from tens of thousands of articles downloaded from PubMed, and then employed Naïve Bayes, Support Vector Machine (SVM), and Logistic Regression to build prediction models to mine the articles. The accuracy of predicting a cancer type by SVM using the 29,437 full-text articles was 93.81%. While competing text-mining tools took more than 11 hours, SparkText mined the dataset in approximately 6 minutes. This study demonstrates the potential for mining large-scale scientific articles on a Big Data infrastructure, with real-time update from new articles published daily. SparkText can be extended to other areas of biomedical research.

  11. Strategies for Translating Vocative Texts

    Directory of Open Access Journals (Sweden)

    Olga COJOCARU

    2014-12-01

    Full Text Available The paper deals with the linguistic and cultural elements of vocative texts and the techniques used in translating them by giving some examples of texts that are typically vocative (i.e. advertisements and instructions for use. Semantic and communicative strategies are popular in translation studies and each of them has its own advantages and disadvantages in translating vocative texts. The advantage of semantic translation is that it takes more account of the aesthetic value of the SL text, while communicative translation attempts to render the exact contextual meaning of the original text in such a way that both content and language are readily acceptable and comprehensible to the readership. Focus is laid on the strategies used in translating vocative texts, strategies that highlight and introduce a cultural context to the target audience, in order to achieve their overall purpose, that is to sell or persuade the reader to behave in a certain way. Thus, in order to do that, a number of advertisements from the field of cosmetics industry and electronic gadgets were selected for analysis. The aim is to gather insights into vocative text translation and to create new perspectives on this field of research, now considered a process of innovation and diversion, especially in areas as important as economy and marketing.

  12. Strategy as Texts

    DEFF Research Database (Denmark)

    Obed Madsen, Søren

    of the strategy into four categories. Second, the managers produce new texts based on the original strategy document by using four different ways of translation models. The study’s findings contribute to three areas. Firstly, it shows that translation is more than a sociological process. It is also...... a craftsmanship that requires knowledge and skills, which unfortunately seems to be overlooked in both the literature and in practice. Secondly, it shows that even though a strategy text is in singular, the translation makes strategy plural. Thirdly, the article proposes a way to open up the black box of what......This article shows empirically how managers translate a strategy plan at an individual level. By analysing how managers in three organizations translate strategies, it identifies that the translation happens in two steps: First, the managers decipher the strategy by coding the different parts...

  13. Application of LSP texts in translator training

    Directory of Open Access Journals (Sweden)

    Larisa Ilynska

    2017-06-01

    Full Text Available The paper presents discussion of the results of extensive empirical research into efficient methods of educating and training translators of LSP (language for special purposes texts. The methodology is based on using popular LSP texts in the respective fields as one of the main media for translator training. The aim of the paper is to investigate the efficiency of this methodology in developing thematic, linguistic and cultural competences of the students, following Bloom’s revised taxonomy and European Master in Translation Network (EMT translator training competences. The methodology has been tested on the students of a professional Master study programme called Technical Translation implemented by the Institute of Applied Linguistics, Riga Technical University, Latvia. The group of students included representatives of different nationalities, translating from English into Latvian, Russian and French. Analysis of popular LSP texts provides an opportunity to structure student background knowledge and expand it to account for linguistic innovation. Application of popular LSP texts instead of purely technical or scientific texts characterised by neutral style and rigid genre conventions provides an opportunity for student translators to develop advanced text processing and decoding skills, to develop awareness of expressive resources of the source and target languages and to develop understanding of socio-pragmatic language use.

  14. English Metafunction Analysis in Chemistry Text: Characterization of Scientific Text

    Directory of Open Access Journals (Sweden)

    Ahmad Amin Dalimunte, M.Hum

    2013-09-01

    Full Text Available The objectives of this research are to identify what Metafunctions are applied in chemistry text and how they characterize a scientific text. It was conducted by applying content analysis. The data for this research was a twelve-paragraph chemistry text. The data were collected by applying a documentary technique. The document was read and analyzed to find out the Metafunction. The data were analyzed by some procedures: identifying the types of process, counting up the number of the processes, categorizing and counting up the cohesion devices, classifying the types of modulation and determining modality value, finally counting up the number of sentences and clauses, then scoring the grammatical intricacy index. The findings of the research show that Material process (71of 100 is mostly used, circumstance of spatial location (26 of 56 is more dominant than the others. Modality (5 is less used in order to avoid from subjectivity. Impersonality is implied through less use of reference either pronouns (7 or demonstrative (7, conjunctions (60 are applied to develop ideas, and the total number of the clauses are found much more dominant (109 than the total number of the sentences (40 which results high grammatical intricacy index. The Metafunction found indicate that the chemistry text has fulfilled the characteristics of scientific or academic text which truly reflects it as a natural science.

  15. Reading Authentic Texts

    DEFF Research Database (Denmark)

    Balling, Laura Winther

    2013-01-01

    Most research on cognates has focused on words presented in isolation that are easily defined as cognate between L1 and L2. In contrast, this study investigates what counts as cognate in authentic texts and how such cognates are read. Participants with L1 Danish read news articles in their highly...... proficient L2, English, while their eye-movements were monitored. The experiment shows a cognate advantage for morphologically simple words, but only when cognateness is defined relative to translation equivalents that are appropriate in the context. For morphologically complex words, a cognate disadvantage...... word predictability indexed by the conditional probability of each word....

  16. Documents and legal texts

    International Nuclear Information System (INIS)

    2016-01-01

    This section treats of the following documents and legal texts: 1 - Brazil: Law No. 13,260 of 16 March 2016 (To regulate the provisions of item XLIII of Article 5 of the Federal Constitution on terrorism, dealing with investigative and procedural provisions and redefining the concept of a terrorist organisation; and amends Laws No. 7,960 of 21 December 1989 and No. 12,850 of 2 August 2013); 2 - India: The Atomic Energy (Amendment) Act, 2015; Department Of Atomic Energy Notification (Civil Liability for Nuclear Damage); 3 - Japan: Act on Subsidisation, etc. for Nuclear Damage Compensation Funds following the implementation of the Convention on Supplementary Compensation for Nuclear Damage

  17. Journalistic Text Production

    DEFF Research Database (Denmark)

    Haugaard, Rikke Hartmann

    , a multiple case study investigated three professional text producers’ practices as they unfolded in their natural setting at the Spanish newspaper, El Mundo. • Results indicate that journalists’ revisions are related to form markedly more often than to content. • Results suggest two writing phases serving...... at the Spanish newspaper, El Mundo, in Madrid. The study applied a combination of quantitative and qualitative methods, i.e. keystroke logging, participant observation and retrospective interview. Results indicate that journalists’ revisions are related to form markedly more often than to content (approx. three...

  18. Identifying issue frames in text.

    Directory of Open Access Journals (Sweden)

    Eyal Sagi

    Full Text Available Framing, the effect of context on cognitive processes, is a prominent topic of research in psychology and public opinion research. Research on framing has traditionally relied on controlled experiments and manually annotated document collections. In this paper we present a method that allows for quantifying the relative strengths of competing linguistic frames based on corpus analysis. This method requires little human intervention and can therefore be efficiently applied to large bodies of text. We demonstrate its effectiveness by tracking changes in the framing of terror over time and comparing the framing of abortion by Democrats and Republicans in the U.S.

  19. Text-based language identification of multilingual names

    CSIR Research Space (South Africa)

    Giwa, O

    2015-11-01

    Full Text Available Text-based language identification (T-LID) of isolated words has been shown to be useful for various speech processing tasks, including pronunciation modelling and data categorisation. When the words to be categorised are proper names, the task...

  20. Doing Mathematics with Purpose: Mathematical Text Types

    Science.gov (United States)

    Dostal, Hannah M.; Robinson, Richard

    2018-01-01

    Mathematical literacy includes learning to read and write different types of mathematical texts as part of purposeful mathematical meaning making. Thus in this article, we describe how learning to read and write mathematical texts (proof text, algorithmic text, algebraic/symbolic text, and visual text) supports the development of students'…

  1. Documents and legal texts

    International Nuclear Information System (INIS)

    2013-01-01

    This section reprints a selection of recently published legislative texts and documents: - Russian Federation: Federal Law No.170 of 21 November 1995 on the use of atomic energy, Adopted by the State Duma on 20 October 1995; - Uruguay: Law No.19.056 On the Radiological Protection and Safety of Persons, Property and the Environment (4 January 2013); - Japan: Third Supplement to Interim Guidelines on Determination of the Scope of Nuclear Damage resulting from the Accident at the Tokyo Electric Power Company Fukushima Daiichi and Daini Nuclear Power Plants (concerning Damages related to Rumour-Related Damage in the Agriculture, Forestry, Fishery and Food Industries), 30 January 2013; - France and the United States: Joint Statement on Liability for Nuclear Damage (Aug 2013); - Franco-Russian Nuclear Power Declaration (1 November 2013)

  2. Text mining for the biocuration workflow

    Science.gov (United States)

    Hirschman, Lynette; Burns, Gully A. P. C; Krallinger, Martin; Arighi, Cecilia; Cohen, K. Bretonnel; Valencia, Alfonso; Wu, Cathy H.; Chatr-Aryamontri, Andrew; Dowell, Karen G.; Huala, Eva; Lourenço, Anália; Nash, Robert; Veuthey, Anne-Lise; Wiegers, Thomas; Winter, Andrew G.

    2012-01-01

    Molecular biology has become heavily dependent on biological knowledge encoded in expert curated biological databases. As the volume of biological literature increases, biocurators need help in keeping up with the literature; (semi-) automated aids for biocuration would seem to be an ideal application for natural language processing and text mining. However, to date, there have been few documented successes for improving biocuration throughput using text mining. Our initial investigations took place for the workshop on ‘Text Mining for the BioCuration Workflow’ at the third International Biocuration Conference (Berlin, 2009). We interviewed biocurators to obtain workflows from eight biological databases. This initial study revealed high-level commonalities, including (i) selection of documents for curation; (ii) indexing of documents with biologically relevant entities (e.g. genes); and (iii) detailed curation of specific relations (e.g. interactions); however, the detailed workflows also showed many variabilities. Following the workshop, we conducted a survey of biocurators. The survey identified biocurator priorities, including the handling of full text indexed with biological entities and support for the identification and prioritization of documents for curation. It also indicated that two-thirds of the biocuration teams had experimented with text mining and almost half were using text mining at that time. Analysis of our interviews and survey provide a set of requirements for the integration of text mining into the biocuration workflow. These can guide the identification of common needs across curated databases and encourage joint experimentation involving biocurators, text mining developers and the larger biomedical research community. PMID:22513129

  3. Automatic text summarization

    CERN Document Server

    Torres Moreno, Juan Manuel

    2014-01-01

    This new textbook examines the motivations and the different algorithms for automatic document summarization (ADS). We performed a recent state of the art. The book shows the main problems of ADS, difficulties and the solutions provided by the community. It presents recent advances in ADS, as well as current applications and trends. The approaches are statistical, linguistic and symbolic. Several exemples are included in order to clarify the theoretical concepts.  The books currently available in the area of Automatic Document Summarization are not recent. Powerful algorithms have been develop

  4. Documents and legal texts

    International Nuclear Information System (INIS)

    2015-01-01

    This section treats of the following Documents and legal texts: 1 - Canada: Nuclear Liability and Compensation Act (An Act respecting civil liability and compensation for damage in case of a nuclear incident, repealing the Nuclear Liability Act and making consequential amendments to other acts); 2 - Japan: Act on Compensation for Nuclear Damage (The purpose of this act is to protect persons suffering from nuclear damage and to contribute to the sound development of the nuclear industry by establishing a basic system regarding compensation in case of nuclear damage caused by reactor operation etc.); Act on Indemnity Agreements for Compensation of Nuclear Damage; 3 - Slovak Republic: Act on Civil Liability for Nuclear Damage and on its Financial Coverage and on Changes and Amendments to Certain Laws (This Act regulates: a) The civil liability for nuclear damage incurred in the causation of a nuclear incident, b) The scope of powers of the Nuclear Regulatory Authority (hereinafter only as the 'Authority') in relation to the application of this Act, c) The competence of the National Bank of Slovakia in relation to the supervised financial market entities in the financial coverage of liability for nuclear damage; and d) The penalties for violation of this Act)

  5. Documents and legal texts

    International Nuclear Information System (INIS)

    2014-01-01

    This section of the Bulletin presents the recently published documents and legal texts sorted by country: - Brazil: Resolution No. 169 of 30 April 2014. - Japan: Act Concerning Exceptions to Interruption of Prescription Pertaining to Use of Settlement Mediation Procedures by the Dispute Reconciliation Committee for Nuclear Damage Compensation in relation to Nuclear Damage Compensation Disputes Pertaining to the Great East Japan Earthquake (Act No. 32 of 5 June 2013); Act Concerning Measures to Achieve Prompt and Assured Compensation for Nuclear Damage Arising from the Nuclear Plant Accident following the Great East Japan Earthquake and Exceptions to the Extinctive Prescription, etc. of the Right to Claim Compensation for Nuclear Damage (Act No. 97 of 11 December 2013); Fourth Supplement to Interim Guidelines on Determination of the Scope of Nuclear Damage Resulting from the Accident at the Tokyo Electric Power Company Fukushima Daiichi and Daini Nuclear Power Plants (Concerning Damages Associated with the Prolongation of Evacuation Orders, etc.); Outline of 'Fourth Supplement to Interim Guidelines (Concerning Damages Associated with the Prolongation of Evacuation Orders, etc.)'. - OECD Nuclear Energy Agency: Decision and Recommendation of the Steering Committee Concerning the Application of the Paris Convention to Nuclear Installations in the Process of Being Decommissioned; Joint Declaration on the Security of Supply of Medical Radioisotopes. - United Arab Emirates: Federal Decree No. (51) of 2014 Ratifying the Convention on Supplementary Compensation for Nuclear Damage; Ratification of the Federal Supreme Council of Federal Decree No. (51) of 2014 Ratifying the Convention on Supplementary Compensation for Nuclear Damage

  6. Monolingual accounting dictionaries for EFL text production

    Directory of Open Access Journals (Sweden)

    Sandro Nielsen

    2006-10-01

    Full Text Available Monolingual accounting dictionaries are important for producing financial reporting texts in English in an international setting, because of the lack of specialised bilingual dictionaries. As the intended user groups have different factual and linguistic competences, they require specific types of information. By identifying and analysing the users' factual and linguistic competences, user needs, use-situations and the stages involved in producing accounting texts in English as a foreign language, lexicographers will have a sound basis for designing the optimal English accounting dictionary for EFL text production. The monolingual accounting dictionary needs to include information about UK, US and international accounting terms, their grammatical properties, their potential for being combined with other words in collocations, phrases and sentences in order to meet user requirements. Data items that deal with these aspects are necessary for the international user group as they produce subject-field specific and register-specific texts in a foreign language, and the data items are relevant for the various stages in text production: draft writing, copyediting, stylistic editing and proofreading.

  7. Comprehending text in literature class

    Directory of Open Access Journals (Sweden)

    Purić Daliborka S.

    2016-01-01

    Full Text Available The paper discusses the problem of understanding a text and the contribution of methodological apparatus in the reader book to comprehension of a text being read in junior classes of elementary school. By using the technique of content analysis from methodological apparatuses in eight reader books for the fourth grade of elementary school, approved for usage in 2014/2015 academic year, and surveying 350 teachers in 33 elementary schools and 11 administrative districts in the Republic of Serbia we examined: (a to what extent the Serbian language text book contents enable junior students to understand a literary text; (b to what extent teachers accept the suggestions offered in the textbook for preparing literature teaching. The results show that a large number of suggestions relate to reading comprehension, but some of categories of understanding are unevenly distributed in the methodological apparatus. On the other hand, the majority of teachers use the methodological apparatus given in a textbook for preparing classes, not only the textbook he or she selected for teaching but also other textbooks for the same grade.

  8. Full closure strategic analysis.

    Science.gov (United States)

    2014-07-01

    The full closure strategic analysis was conducted to create a decision process whereby full roadway : closures for construction and maintenance activities can be evaluated and approved or denied by CDOT : Traffic personnel. The study reviewed current...

  9. Methodological Details and Full Bibliography

    Data.gov (United States)

    U.S. Environmental Protection Agency — This dataset has several components, The first part describes fully our literature review, providing details not included in the text. The second part provides all...

  10. A Guide Text or Many Texts? "That is the Question”

    Directory of Open Access Journals (Sweden)

    Delgado de Valencia Sonia

    2001-08-01

    Full Text Available The use of supplementary materials in the classroom has always been an essential part of the teaching and learning process. To restrict our teaching to the scope of one single textbook means to stand behind the advances of knowledge, in any area and context. Young learners appreciate any new and varied support that expands their knowledge of the world: diaries, letters, panels, free texts, magazines, short stories, poems or literary excerpts, and articles taken from Internet are materials that will allow learnersto share more and work more collaboratively. In this article we are going to deal with some of these materials, with the criteria to select, adapt, and create them that may be of interest to the learner and that may promote reading and writing processes. Since no text can entirely satisfy the needs of students and teachers, the creativity of both parties will be necessary to improve the quality of teaching through the adequate use and adaptation of supplementary materials.

  11. Text

    International Nuclear Information System (INIS)

    Anon.

    2009-01-01

    The purpose of this act is to safeguard against the dangers and harmful effects of radioactive waste and to contribute to public safety and environmental protection by laying down requirements for the safe and efficient management of radioactive waste. We will find definitions, interrelation with other legislation, responsibilities of the state and local governments, responsibilities of radioactive waste management companies and generators, formulation of the basic plan for the control of radioactive waste, radioactive waste management ( with public information, financing and part of spent fuel management), Korea radioactive waste management corporation ( business activities, budget), establishment of a radioactive waste fund in order to secure the financial resources required for radioactive waste management, and penalties in case of improper operation of radioactive waste management. (N.C.)

  12. Full page insight

    DEFF Research Database (Denmark)

    Cortsen, Rikke Platz

    2014-01-01

    Alan Moore and his collaborating artists often manipulate time and space by drawing upon the formal elements of comics and making alternative constellations. This article looks at an element that is used frequently in comics of all kinds – the full page – and discusses how it helps shape spatio......, something that it shares with the full page in comics. Through an analysis of several full pages from Moore titles like Swamp Thing, From Hell, Watchmen and Promethea, it is made clear why the full page provides an apt vehicle for an apocalypse in comics....

  13. Teaching Text Structure: Examining the Affordances of Children's Informational Texts

    Science.gov (United States)

    Jones, Cindy D.; Clark, Sarah K.; Reutzel, D. Ray

    2016-01-01

    This study investigated the affordances of informational texts to serve as model texts for teaching text structure to elementary school children. Content analysis of a random sampling of children's informational texts from top publishers was conducted on text structure organization and on the inclusion of text features as signals of text…

  14. Prayer in Qumran texts. A brief introduction

    Directory of Open Access Journals (Sweden)

    Zdzisław J. Kapera

    2011-03-01

    Full Text Available Of some three hundred literary texts found in the caves of the Judaean Desert and those close to Khirbet Qumran, 56 are various pieces of poetry and liturgy. Seven specific groups have been distinguished among them: 1. Liturgy on sunshine and sunset and on specific days; 2. Liturgy on specific ceremonies of the community; 3. Eschatological prayers; 4. Magic texts; 5. Collections of psalms (including pseudepigrapha; 6. Thanksgiving hymns; 7. Prose prayers. The issue of how the Qumranians were praying is here briefly touched upon. Then there is a description of morning and evening prayers, Sabbath prayers, specific liturgy of the annual ceremony of entering the New Covenant, the Hodayot (Thanksgiving Hymns, pseudepigraphic Psalms (like Ps 151, and the eschatological prayers. The introduction ends with a summary evaluation of the role of the texts in reconstructing the historical development of the Jewish prayer of the late Second Temple period. The need to study the relationship of the Qumran prayers with the early Christian prayers is also briefly discussed.

  15. Environmental law. Important laws and ordinances for environmental protection. As of July 1, 1989. [Full text edition with detailed index and an introduction by Prof. Dr. P. C. Storm]. Umweltrecht. Wichtige Gesetze und Verordnungen zum Schutz der Umwelt. Stand 1. Juli 1989

    Energy Technology Data Exchange (ETDEWEB)

    1989-01-01

    This pocketbook contains major federal regulations on environmental protection. They serve to protect and cultivate mankind's natural foundations of life, to preserve the environment.The environmental law is devided as follows: Constitutional law on the environment, common administrative law on the environment, special administrative law on the environment including conservation of nature and preservation of rural amenities, protection of waters, waste management, protection against nuisances, nuclear energy and radiation protection, energy conservation, protection against dangerous substances, private law relating to the environment, criminal law relating to the environment. (orig.).

  16. Remedial investigation work plan for Bear Creek Valley Operable Unit 1 (S-3 Ponds, Boneyard/Burnyard, Oil Landfarm, Sanitary Landfill I, and the Burial Grounds, including Oil Retention Ponds 1 and 2) at the Oak Ridge Y-12 Plant, Oak Ridge, Tennessee. Volume 1, Main text

    International Nuclear Information System (INIS)

    1994-03-01

    This document refers to data concerning the Environmental Restoration Program implemented at the Oak Ridge Y-12 plant. Topics discussed include: Remediation plans for the burial grounds, sanitary landfill I, oil retention ponds, S-3 ponds, and the boneyard/burnyard at Y-12. This document also contains information about the environmental policies regulating the remediation

  17. A quick survey of text categorization algorithms

    Directory of Open Access Journals (Sweden)

    Dan MUNTEANU

    2007-12-01

    Full Text Available This paper contains an overview of basic formulations and approaches to text classification. This paper surveys the algorithms used in text categorization: handcrafted rules, decision trees, decision rules, on-line learning, linear classifier, Rocchio’s algorithm, k Nearest Neighbor (kNN, Support Vector Machines (SVM.

  18. Full Service Leasing

    OpenAIRE

    Richter, Ján

    2009-01-01

    Aim of this master thesis is to describe the service of Full Service Leasing, as a modern form of financing and management of assets, primarily automobile fleet. Description of full service leasing is designed as a comprehensive and complete guide to support reader's position when deciding to finance and manage a fleet by this service. Whether the reader is an entrepreneur, CFO, fleet manager, new employee of leasing company, or anyone who is interested in this service, this master thesis wil...

  19. Important Text Characteristics for Early-Grades Text Complexity

    Science.gov (United States)

    Fitzgerald, Jill; Elmore, Jeff; Koons, Heather; Hiebert, Elfrieda H.; Bowen, Kimberly; Sanford-Moore, Eleanor E.; Stenner, A. Jackson

    2015-01-01

    The Common Core set a standard for all children to read increasingly complex texts throughout schooling. The purpose of the present study was to explore text characteristics specifically in relation to early-grades text complexity. Three hundred fifty primary-grades texts were selected and digitized. Twenty-two text characteristics were identified…

  20. Text Mining for Protein Docking.

    Directory of Open Access Journals (Sweden)

    Varsha D Badal

    2015-12-01

    Full Text Available The rapidly growing amount of publicly available information from biomedical research is readily accessible on the Internet, providing a powerful resource for predictive biomolecular modeling. The accumulated data on experimentally determined structures transformed structure prediction of proteins and protein complexes. Instead of exploring the enormous search space, predictive tools can simply proceed to the solution based on similarity to the existing, previously determined structures. A similar major paradigm shift is emerging due to the rapidly expanding amount of information, other than experimentally determined structures, which still can be used as constraints in biomolecular structure prediction. Automated text mining has been widely used in recreating protein interaction networks, as well as in detecting small ligand binding sites on protein structures. Combining and expanding these two well-developed areas of research, we applied the text mining to structural modeling of protein-protein complexes (protein docking. Protein docking can be significantly improved when constraints on the docking mode are available. We developed a procedure that retrieves published abstracts on a specific protein-protein interaction and extracts information relevant to docking. The procedure was assessed on protein complexes from Dockground (http://dockground.compbio.ku.edu. The results show that correct information on binding residues can be extracted for about half of the complexes. The amount of irrelevant information was reduced by conceptual analysis of a subset of the retrieved abstracts, based on the bag-of-words (features approach. Support Vector Machine models were trained and validated on the subset. The remaining abstracts were filtered by the best-performing models, which decreased the irrelevant information for ~ 25% complexes in the dataset. The extracted constraints were incorporated in the docking protocol and tested on the Dockground unbound

  1. The Balinese Unicode Text Processing

    Directory of Open Access Journals (Sweden)

    Imam Habibi

    2009-06-01

    Full Text Available In principal, the computer only recognizes numbers as the representation of a character. Therefore, there are many encoding systems to allocate these numbers although not all characters are covered. In Europe, every single language even needs more than one encoding system. Hence, a new encoding system known as Unicode has been established to overcome this problem. Unicode provides unique id for each different characters which does not depend on platform, program, and language. Unicode standard has been applied in a number of industries, such as Apple, HP, IBM, JustSystem, Microsoft, Oracle, SAP, Sun, Sybase, and Unisys. In addition, language standards and modern information exchanges such as XML, Java, ECMA Script (JavaScript, LDAP, CORBA 3.0, and WML make use of Unicode as an official tool for implementing ISO/IEC 10646. There are four things to do according to Balinese script: the algorithm of transliteration, searching, sorting, and word boundary analysis (spell checking. To verify the truth of algorithm, some applications are made. These applications can run on Linux/Windows OS platform using J2SDK 1.5 and J2ME WTK2 library. The input and output of the algorithm/application are character sequence that is obtained from keyboard punch and external file. This research produces a module or a library which is able to process the Balinese text based on Unicode standard. The output of this research is the ability, skill, and mastering of 1. Unicode standard (21-bit as a substitution to ASCII (7-bit and ISO8859-1 (8-bit as the former default character set in many applications. 2. The Balinese Unicode text processing algorithm. 3. An experience of working with and learning from an international team that consists of the foremost experts in the area: Michael Everson (Ireland, Peter Constable (Microsoft US, I Made Suatjana, and Ida Bagus Adi Sudewa.

  2. Compressive full waveform lidar

    Science.gov (United States)

    Yang, Weiyi; Ke, Jun

    2017-05-01

    To avoid high bandwidth detector, fast speed A/D converter, and large size memory disk, a compressive full waveform LIDAR system, which uses a temporally modulated laser instead of a pulsed laser, is studied in this paper. Full waveform data from NEON (National Ecological Observatory Network) are used. Random binary patterns are used to modulate the source. To achieve 0.15 m ranging resolution, a 100 MSPS A/D converter is assumed to make measurements. SPIRAL algorithm with canonical basis is employed when Poisson noise is considered in the low illuminated condition.

  3. Multimodal Diversity of Postmodernist Fiction Text

    Directory of Open Access Journals (Sweden)

    U. I. Tykha

    2016-12-01

    Full Text Available The article is devoted to the analysis of structural and functional manifestations of multimodal diversity in postmodernist fiction texts. Multimodality is defined as the coexistence of more than one semiotic mode within a certain context. Multimodal texts feature a diversity of semiotic modes in the communication and development of their narrative. Such experimental texts subvert conventional patterns by introducing various semiotic resources – verbal or non-verbal.

  4. Tagging narrator's names in Hadith text | Rahman | Journal of ...

    African Journals Online (AJOL)

    N.A. Rahman, N.K. Ismail, Z.M. Nor, M.N. Alias, M.S. Kamis, N Alias. Abstract. No Abstract. Keywords: tagging; hadith text; name. Full Text: EMAIL FREE FULL TEXT EMAIL FREE FULL TEXT · DOWNLOAD FULL TEXT DOWNLOAD FULL TEXT · http://dx.doi.org/10.4314/jfas.v9i5s.21 · AJOL African Journals Online. HOW TO ...

  5. Science and Technology Text Mining Basic Concepts

    National Research Council Canada - National Science Library

    Losiewicz, Paul

    2003-01-01

    ...). It then presents some of the most widely used data and text mining techniques, including clustering and classification methods, such as nearest neighbor, relational learning models, and genetic...

  6. Full faith in myself

    Indian Academy of Sciences (India)

    Lawrence

    Full faith in myself. Meenakshi Banerjee. 12. Ihad my schooling at the Irish Convent, Loreto, in Asansol,. West Bengal. Perhaps the earliest memories I have are of myself as a very determined child with a deep appreciation of and inquisitiveness regarding nature although not understanding most of it at that tender age.

  7. Diet - full liquid

    Science.gov (United States)

    ... Water Fruit juices, including nectars and juices with pulp Butter, margarine, oil, cream, custard, and pudding Plain ... LJ, eds. Sleisenger and Fordtran's Gastrointestinal and Liver Disease: Pathophysiology/Diagnosis/Management . 10th ed. Philadelphia, PA: Elsevier ...

  8. New Historicism: Text and Context

    Directory of Open Access Journals (Sweden)

    Violeta M. Vesić

    2016-02-01

    Full Text Available During most of the twentieth century history was seen as a phenomenon outside of literature that guaranteed the veracity of literary interpretation. History was unique and it functioned as a basis for reading literary works. During the seventies of the twentieth century there occurred a change of attitude towards history in American literary theory, and there appeared a new theoretical approach which soon became known as New Historicism. Since its inception, New Historicism has been identified with the study of Renaissance and Romanticism, but nowadays it has been increasingly involved in other literary trends. Although there are great differences in the arguments and practices at various representatives of this school, New Historicism has clearly recognizable features and many new historicists will agree with the statement of Walter Cohen that New Historicism, when it appeared in the eighties, represented something quite new in reference to the studies of theory, criticism and history (Cohen 1987, 33. Theoretical connection with Bakhtin, Foucault and Marx is clear, as well as a kind of uneasy tie with deconstruction and the work of Paul de Man. At the center of this approach is a renewed interest in the study of literary works in the light of historical and political circumstances in which they were created. Foucault encouraged readers to begin to move literary texts and to link them with discourses and representations that are not literary, as well as to examine the sociological aspects of the texts in order to take part in the social struggles of today. The study of literary works using New Historicism is the study of politics, history, culture and circumstances in which these works were created. With regard to one of the main fact which is located in the center of the criticism, that history cannot be viewed objectively and that reality can only be understood through a cultural context that reveals the work, re-reading and interpretation of

  9. Remedial investigation work plan for Bear Creek Valley Operable Unit 1 (S-3 Ponds, Boneyard/Burnyard, Oil Landfarm, Sanitary Landfill 1, and the Burial Grounds, including Oil Retention Ponds 1 and 2) at the Oak Ridge Y-12 Plant, Oak Ridge, Tennessee. Volume 1, Main text

    Energy Technology Data Exchange (ETDEWEB)

    1993-09-01

    The intent and scope of the work plan are to assemble all data necessary to facilitate selection of remediation alternatives for the sites in Bear Creek Valley Operable Unit 1 (BCV OU 1) such that the risk to human health and the environment is reduced to acceptable levels based on agreements with regulators. The ultimate goal is to develop a final Record Of Decision (ROD) for all of the OUs in BCV, including the integrator OU. However, the initial aim of the source OUs is to develop a ROD for interim measures. For source OUs such as BCV OU 1, data acquisition will not be carried out in a single event, but will be carried out in three stages that accommodate the schedule for developing a ROD for interim measures and the final site-wide ROD. The three stages are as follows: Stage 1, Assemble sufficient data to support decisions such as the need for removal actions, whether to continue with the remedial investigation (RI) process, or whether no further action is required. If the decision is made to continue the RI/FS process, then: Stage 2, Assemble sufficient data to allow for a ROD for interim measures that reduce risks to the human health and the environment. Stage 3, Provide input from the source OU that allows a final ROD to be issued for all OUs in the BCV hydrologic regime. One goal of the RI work plan will be to ensure that sampling operations required for the initial stage are not repeated at later stages. The overall goals of this RI are to define the nature and extent of contamination so that the impact of leachate, surface water runoff, and sediment from the OU I sites on the integrator OU can be evaluated, the risk to human health and the environment can be defined, and the general physical characteristics of the subsurface can be determined such that remedial alternatives can be screened.

  10. Classroom Texting in College Students

    Science.gov (United States)

    Pettijohn, Terry F.; Frazier, Erik; Rieser, Elizabeth; Vaughn, Nicholas; Hupp-Wilds, Bobbi

    2015-01-01

    A 21-item survey on texting in the classroom was given to 235 college students. Overall, 99.6% of students owned a cellphone and 98% texted daily. Of the 138 students who texted in the classroom, most texted friends or significant others, and indicate the reason for classroom texting is boredom or work. Students who texted sent a mean of 12.21…

  11. Plate Full of Color

    Centers for Disease Control (CDC) Podcasts

    The Eagle Books are a series of four books that are brought to life by wise animal characters - Mr. Eagle, Miss Rabbit, and Coyote - who engage Rain That Dances and his young friends in the joy of physical activity, eating healthy foods, and learning from their elders about health and diabetes prevention. Plate Full of Color teaches the value of eating a variety of colorful and healthy foods.

  12. Partial index replicated and distributed scheme for full-text search on ...

    Indian Academy of Sciences (India)

    2Department of Computer Application, Krishna Institute of Engineering & .... In the wireless data broadcast system, many researchers have used the ..... Graph 3 depicts decrement of tuning time as the repeatability increases in the proposed.

  13. Sådan kombinerer du søgninger i SocINDEX with Full Text

    DEFF Research Database (Denmark)

    2017-01-01

    Valg af materiale/medie/form: YouTube Valg af arbejdsform: E-læring Begrundelse for valg af materiale/medie/form/arbejdsform: Flipped Classroom......Valg af materiale/medie/form: YouTube Valg af arbejdsform: E-læring Begrundelse for valg af materiale/medie/form/arbejdsform: Flipped Classroom...

  14. On-line access to the full-texts of non periodical documents

    International Nuclear Information System (INIS)

    Svrsek, L.

    2004-01-01

    This article describes several options how electronic books (technical handbooks, scientific books, reference works, etc.) are published and available on-line on the Internet. There is a short description of some of the major services provided by worldwide publishers. As a part of the presentation there will be a live demonstration of selected services and work slices of the most interested systems. (author)

  15. Europe PMC: a full-text literature database for the life sciences and platform for innovation

    Science.gov (United States)

    2015-01-01

    This article describes recent developments of Europe PMC (http://europepmc.org), the leading database for life science literature. Formerly known as UKPMC, the service was rebranded in November 2012 as Europe PMC to reflect the scope of the funding agencies that support it. Several new developments have enriched Europe PMC considerably since then. Europe PMC now offers RESTful web services to access both articles and grants, powerful search tools such as citation-count sort order and data citation features, a service to add publications to your ORCID, a variety of export formats, and an External Links service that enables any related resource to be linked from Europe PMC content. PMID:25378340

  16. Electronic books. On-line access to the full-texts of non periodical documents

    International Nuclear Information System (INIS)

    Svrsek, L.

    2004-01-01

    This presentation describes several options how electronic books (technical handbooks, scientific books, reference works, etc.) are published and available on-line on the Internet. There is a short description of some of the major services provided by worldwide publishers. As a part of the presentation there will be a live demonstration of selected services and work slices of the most interested systems. (author)

  17. Full Inclusion: Understanding the Role of Gay and Lesbian Texts and Films in Teacher Education Classrooms

    Science.gov (United States)

    Hermann-Wilmarth, Jill M.

    2007-01-01

    This paper identifies some of the resources the author has found and used to help future teachers become fully inclusive teachers, particularly of early elementary students. Through sharing these resources--children's literature, a children's literature textbook, edited books for teacher educators and pre- and inservice teachers, and a video--the…

  18. Early Career Researchers Demand Full-text and Rely on Google to Find Scholarly Sources

    OpenAIRE

    Richard Hayman

    2017-01-01

    A Review of: Nicholas, D., Boukacem-Zeghmouri, C., Rodríguez-Bravo, B., Xu, J., Watkinson, A., Abrizah, A., Herman, E., & Świgoń, M. (2017). Where and how early career researchers find scholarly information. Learned Publishing, 30(1), 19-29. http://dx.doi.org/10.1002/leap.1087 Abstract Objective – To examine the attitudes and information behaviours of early career researchers (ECRs) when locating scholarly information. Design – Qualitative longitudinal study. Setting – R...

  19. Comprehension challenges in the fourth grade: The roles of text cohesion, text genre, and readers’ prior knowledge

    Directory of Open Access Journals (Sweden)

    Danielle S. McNamara

    2011-07-01

    Full Text Available We examined young readers’ comprehension as a function of text genre (narrative, science, text cohesion (high, low, and readers’ abilities (reading decoding skills and world knowledge. The overarching purpose of this study was to contribute to our understanding of the fourth grade slump. Children in grade 4 read four texts, including one high and one low cohesion text from each genre. Comprehension of each text was assessed with 12 multiple-choice questions and free and cued recall. Comprehension was enhanced by increased knowledge: high knowledge readers showed better comprehension than low knowledge readers and narratives were comprehended better than science texts. Interactions between readers’ knowledge levels and text characteristics indicated that the children showed larger effects of knowledge for science than for narrative texts, and those with more knowledge better understood the low cohesion, narrative texts, showing a reverse cohesion effect. Decoding skill benefited comprehension, but effects of text genre and cohesion depended less on decoding skill than prior knowledge. Overall, the study indicates that the fourth grade slump is at least partially attributable to the emergence of complex dependencies between the nature of the text and the reader’s prior knowledge. The results also suggested that simply adding cohesion cues, and not explanatory information, is not likely to be sufficient for young readers as an approach to improving comprehension of challenging texts.

  20. Comprehension challenges in the fourth grade: The roles of text cohesion, text genre, and readers’ prior knowledge

    Directory of Open Access Journals (Sweden)

    Danielle S. McNAMARA

    2011-11-01

    Full Text Available We examined young readers’ comprehension as a function of text genre (narrative, science, text cohesion (high, low, and readers’ abilities (reading decoding skills and world knowledge. The overarching purpose of this study was to contribute to our understanding of the fourth grade slump. Children in grade 4 read four texts, including one high and one low cohesion text from each genre. Comprehension of each text was assessed with 12 multiple-choice questions and free and cued recall. Comprehension was enhanced by increased knowledge: high knowledge readers showed bettercomprehension than low knowledge readers and narratives were comprehended better than science texts. Interactions between readers’ knowledge levels and text characteristics indicated that thechildren showed larger effects of knowledge for science than for narrative texts, and those with more knowledge better understood the low cohesion, narrative texts, showing a reverse cohesion effect.Decoding skill benefited comprehension, but effects of text genre and cohesion depended less on decoding skill than prior knowledge. Overall, the study indicates that the fourth grade slump is at leastpartially attributable to the emergence of complex dependencies between the nature of the text and the reader’s prior knowledge. The results also suggested that simply adding cohesion cues, and notexplanatory information, is not likely to be sufficient for young readers as an approach to improving comprehension of challenging texts.

  1. Observation of [Formula: see text] and [Formula: see text] decays.

    Science.gov (United States)

    Aaij, R; Adeva, B; Adinolfi, M; Ajaltouni, Z; Akar, S; Albrecht, J; Alessio, F; Alexander, M; Ali, S; Alkhazov, G; Alvarez Cartelle, P; Alves, A A; Amato, S; Amerio, S; Amhis, Y; An, L; Anderlini, L; Andreassi, G; Andreotti, M; Andrews, J E; Appleby, R B; Archilli, F; d'Argent, P; Arnau Romeu, J; Artamonov, A; Artuso, M; Aslanides, E; Auriemma, G; Baalouch, M; Babuschkin, I; Bachmann, S; Back, J J; Badalov, A; Baesso, C; Baker, S; Baldini, W; Barlow, R J; Barschel, C; Barsuk, S; Barter, W; Baszczyk, M; Batozskaya, V; Batsukh, B; Battista, V; Bay, A; Beaucourt, L; Beddow, J; Bedeschi, F; Bediaga, I; Bel, L J; Bellee, V; Belloli, N; Belous, K; Belyaev, I; Ben-Haim, E; Bencivenni, G; Benson, S; Benton, J; Berezhnoy, A; Bernet, R; Bertolin, A; Betancourt, C; Betti, F; Bettler, M-O; van Beuzekom, M; Bezshyiko, Ia; Bifani, S; Billoir, P; Bird, T; Birnkraut, A; Bitadze, A; Bizzeti, A; Blake, T; Blanc, F; Blouw, J; Blusk, S; Bocci, V; Boettcher, T; Bondar, A; Bondar, N; Bonivento, W; Bordyuzhin, I; Borgheresi, A; Borghi, S; Borisyak, M; Borsato, M; Bossu, F; Boubdir, M; Bowcock, T J V; Bowen, E; Bozzi, C; Braun, S; Britsch, M; Britton, T; Brodzicka, J; Buchanan, E; Burr, C; Bursche, A; Buytaert, J; Cadeddu, S; Calabrese, R; Calvi, M; Calvo Gomez, M; Camboni, A; Campana, P; Campora Perez, D H; Capriotti, L; Carbone, A; Carboni, G; Cardinale, R; Cardini, A; Carniti, P; Carson, L; Carvalho Akiba, K; Casse, G; Cassina, L; Castillo Garcia, L; Cattaneo, M; Cauet, Ch; Cavallero, G; Cenci, R; Charles, M; Charpentier, Ph; Chatzikonstantinidis, G; Chefdeville, M; Chen, S; Cheung, S-F; Chobanova, V; Chrzaszcz, M; Cid Vidal, X; Ciezarek, G; Clarke, P E L; Clemencic, M; Cliff, H V; Closier, J; Coco, V; Cogan, J; Cogneras, E; Cogoni, V; Cojocariu, L; Collazuol, G; Collins, P; Comerma-Montells, A; Contu, A; Cook, A; Coombs, G; Coquereau, S; Corti, G; Corvo, M; Costa Sobral, C M; Couturier, B; Cowan, G A; Craik, D C; Crocombe, A; Cruz Torres, M; Cunliffe, S; Currie, R; D'Ambrosio, C; Da Cunha Marinho, F; Dall'Occo, E; Dalseno, J; David, P N Y; Davis, A; De Aguiar Francisco, O; De Bruyn, K; De Capua, S; De Cian, M; De Miranda, J M; De Paula, L; De Serio, M; De Simone, P; Dean, C-T; Decamp, D; Deckenhoff, M; Del Buono, L; Demmer, M; Dendek, A; Derkach, D; Deschamps, O; Dettori, F; Dey, B; Di Canto, A; Dijkstra, H; Dordei, F; Dorigo, M; Dosil Suárez, A; Dovbnya, A; Dreimanis, K; Dufour, L; Dujany, G; Dungs, K; Durante, P; Dzhelyadin, R; Dziurda, A; Dzyuba, A; Déléage, N; Easo, S; Ebert, M; Egede, U; Egorychev, V; Eidelman, S; Eisenhardt, S; Eitschberger, U; Ekelhof, R; Eklund, L; Ely, S; Esen, S; Evans, H M; Evans, T; Falabella, A; Farley, N; Farry, S; Fay, R; Fazzini, D; Ferguson, D; Fernandez Prieto, A; Ferrari, F; Ferreira Rodrigues, F; Ferro-Luzzi, M; Filippov, S; Fini, R A; Fiore, M; Fiorini, M; Firlej, M; Fitzpatrick, C; Fiutowski, T; Fleuret, F; Fohl, K; Fontana, M; Fontanelli, F; Forshaw, D C; Forty, R; Franco Lima, V; Frank, M; Frei, C; Fu, J; Furfaro, E; Färber, C; Gallas Torreira, A; Galli, D; Gallorini, S; Gambetta, S; Gandelman, M; Gandini, P; Gao, Y; Garcia Martin, L M; García Pardiñas, J; Garra Tico, J; Garrido, L; Garsed, P J; Gascon, D; Gaspar, C; Gavardi, L; Gazzoni, G; Gerick, D; Gersabeck, E; Gersabeck, M; Gershon, T; Ghez, Ph; Gianì, S; Gibson, V; Girard, O G; Giubega, L; Gizdov, K; Gligorov, V V; Golubkov, D; Golutvin, A; Gomes, A; Gorelov, I V; Gotti, C; Govorkova, E; Grabalosa Gándara, M; Graciani Diaz, R; Granado Cardoso, L A; Graugés, E; Graverini, E; Graziani, G; Grecu, A; Griffith, P; Grillo, L; Gruberg Cazon, B R; Grünberg, O; Gushchin, E; Guz, Yu; Gys, T; Göbel, C; Hadavizadeh, T; Hadjivasiliou, C; Haefeli, G; Haen, C; Haines, S C; Hall, S; Hamilton, B; Han, X; Hansmann-Menzemer, S; Harnew, N; Harnew, S T; Harrison, J; Hatch, M; He, J; Head, T; Heister, A; Hennessy, K; Henrard, P; Henry, L; Hernando Morata, J A; van Herwijnen, E; Heß, M; Hicheur, A; Hill, D; Hombach, C; Hopchev, H; Hulsbergen, W; Humair, T; Hushchyn, M; Hussain, N; Hutchcroft, D; Idzik, M; Ilten, P; Jacobsson, R; Jaeger, A; Jalocha, J; Jans, E; Jawahery, A; Jiang, F; John, M; Johnson, D; Jones, C R; Joram, C; Jost, B; Jurik, N; Kandybei, S; Kanso, W; Karacson, M; Kariuki, J M; Karodia, S; Kecke, M; Kelsey, M; Kenyon, I R; Kenzie, M; Ketel, T; Khairullin, E; Khanji, B; Khurewathanakul, C; Kirn, T; Klaver, S; Klimaszewski, K; Koliiev, S; Kolpin, M; Komarov, I; Koopman, R F; Koppenburg, P; Kosmyntseva, A; Kozachuk, A; Kozeiha, M; Kravchuk, L; Kreplin, K; Kreps, M; Krokovny, P; Kruse, F; Krzemien, W; Kucewicz, W; Kucharczyk, M; Kudryavtsev, V; Kuonen, A K; Kurek, K; Kvaratskheliya, T; Lacarrere, D; Lafferty, G; Lai, A; Lanfranchi, G; Langenbruch, C; Latham, T; Lazzeroni, C; Le Gac, R; van Leerdam, J; Lees, J-P; Leflat, A; Lefrançois, J; Lefèvre, R; Lemaitre, F; Lemos Cid, E; Leroy, O; Lesiak, T; Leverington, B; Li, Y; Likhomanenko, T; Lindner, R; Linn, C; Lionetto, F; Liu, B; Liu, X; Loh, D; Longstaff, I; Lopes, J H; Lucchesi, D; Lucio Martinez, M; Luo, H; Lupato, A; Luppi, E; Lupton, O; Lusiani, A; Lyu, X; Machefert, F; Maciuc, F; Maev, O; Maguire, K; Malde, S; Malinin, A; Maltsev, T; Manca, G; Mancinelli, G; Manning, P; Maratas, J; Marchand, J F; Marconi, U; Marin Benito, C; Marino, P; Marks, J; Martellotti, G; Martin, M; Martinelli, M; Martinez Santos, D; Martinez Vidal, F; Martins Tostes, D; Massacrier, L M; Massafferri, A; Matev, R; Mathad, A; Mathe, Z; Matteuzzi, C; Mauri, A; Maurin, B; Mazurov, A; McCann, M; McCarthy, J; McNab, A; McNulty, R; Meadows, B; Meier, F; Meissner, M; Melnychuk, D; Merk, M; Merli, A; Michielin, E; Milanes, D A; Minard, M-N; Mitzel, D S; Mogini, A; Molina Rodriguez, J; Monroy, I A; Monteil, S; Morandin, M; Morawski, P; Mordà, A; Morello, M J; Moron, J; Morris, A B; Mountain, R; Muheim, F; Mulder, M; Mussini, M; Müller, D; Müller, J; Müller, K; Müller, V; Naik, P; Nakada, T; Nandakumar, R; Nandi, A; Nasteva, I; Needham, M; Neri, N; Neubert, S; Neufeld, N; Neuner, M; Nguyen, A D; Nguyen, T D; Nguyen-Mau, C; Nieswand, S; Niet, R; Nikitin, N; Nikodem, T; Novoselov, A; O'Hanlon, D P; Oblakowska-Mucha, A; Obraztsov, V; Ogilvy, S; Oldeman, R; Onderwater, C J G; Otalora Goicochea, J M; Otto, A; Owen, P; Oyanguren, A; Pais, P R; Palano, A; Palombo, F; Palutan, M; Panman, J; Papanestis, A; Pappagallo, M; Pappalardo, L L; Parker, W; Parkes, C; Passaleva, G; Pastore, A; Patel, G D; Patel, M; Patrignani, C; Pearce, A; Pellegrino, A; Penso, G; Pepe Altarelli, M; Perazzini, S; Perret, P; Pescatore, L; Petridis, K; Petrolini, A; Petrov, A; Petruzzo, M; Picatoste Olloqui, E; Pietrzyk, B; Pikies, M; Pinci, D; Pistone, A; Piucci, A; Playfer, S; Plo Casasus, M; Poikela, T; Polci, F; Poluektov, A; Polyakov, I; Polycarpo, E; Pomery, G J; Popov, A; Popov, D; Popovici, B; Poslavskii, S; Potterat, C; Price, E; Price, J D; Prisciandaro, J; Pritchard, A; Prouve, C; Pugatch, V; Puig Navarro, A; Punzi, G; Qian, W; Quagliani, R; Rachwal, B; Rademacker, J H; Rama, M; Ramos Pernas, M; Rangel, M S; Raniuk, I; Ratnikov, F; Raven, G; Redi, F; Reichert, S; Dos Reis, A C; Remon Alepuz, C; Renaudin, V; Ricciardi, S; Richards, S; Rihl, M; Rinnert, K; Rives Molina, V; Robbe, P; Rodrigues, A B; Rodrigues, E; Rodriguez Lopez, J A; Rodriguez Perez, P; Rogozhnikov, A; Roiser, S; Rollings, A; Romanovskiy, V; Romero Vidal, A; Ronayne, J W; Rotondo, M; Rudolph, M S; Ruf, T; Ruiz Valls, P; Saborido Silva, J J; Sadykhov, E; Sagidova, N; Saitta, B; Salustino Guimaraes, V; Sanchez Mayordomo, C; Sanmartin Sedes, B; Santacesaria, R; Santamarina Rios, C; Santimaria, M; Santovetti, E; Sarti, A; Satriano, C; Satta, A; Saunders, D M; Savrina, D; Schael, S; Schellenberg, M; Schiller, M; Schindler, H; Schlupp, M; Schmelling, M; Schmelzer, T; Schmidt, B; Schneider, O; Schopper, A; Schubert, K; Schubiger, M; Schune, M-H; Schwemmer, R; Sciascia, B; Sciubba, A; Semennikov, A; Sergi, A; Serra, N; Serrano, J; Sestini, L; Seyfert, P; Shapkin, M; Shapoval, I; Shcheglov, Y; Shears, T; Shekhtman, L; Shevchenko, V; Siddi, B G; Silva Coutinho, R; Silva de Oliveira, L; Simi, G; Simone, S; Sirendi, M; Skidmore, N; Skwarnicki, T; Smith, E; Smith, I T; Smith, J; Smith, M; Snoek, H; Sokoloff, M D; Soler, F J P; Souza De Paula, B; Spaan, B; Spradlin, P; Sridharan, S; Stagni, F; Stahl, M; Stahl, S; Stefko, P; Stefkova, S; Steinkamp, O; Stemmle, S; Stenyakin, O; Stevenson, S; Stoica, S; Stone, S; Storaci, B; Stracka, S; Straticiuc, M; Straumann, U; Sun, L; Sutcliffe, W; Swientek, K; Syropoulos, V; Szczekowski, M; Szumlak, T; T'Jampens, S; Tayduganov, A; Tekampe, T; Tellarini, G; Teubert, F; Thomas, E; van Tilburg, J; Tilley, M J; Tisserand, V; Tobin, M; Tolk, S; Tomassetti, L; Tonelli, D; Topp-Joergensen, S; Toriello, F; Tournefier, E; Tourneur, S; Trabelsi, K; Traill, M; Tran, M T; Tresch, M; Trisovic, A; Tsaregorodtsev, A; Tsopelas, P; Tully, A; Tuning, N; Ukleja, A; Ustyuzhanin, A; Uwer, U; Vacca, C; Vagnoni, V; Valassi, A; Valat, S; Valenti, G; Vallier, A; Vazquez Gomez, R; Vazquez Regueiro, P; Vecchi, S; van Veghel, M; Velthuis, J J; Veltri, M; Veneziano, G; Venkateswaran, A; Vernet, M; Vesterinen, M; Viaud, B; Vieira, D; Vieites Diaz, M; Viemann, H; Vilasis-Cardona, X; Vitti, M; Volkov, V; Vollhardt, A; Voneki, B; Vorobyev, A; Vorobyev, V; Voß, C; de Vries, J A; Vázquez Sierra, C; Waldi, R; Wallace, C; Wallace, R; Walsh, J; Wang, J; Ward, D R; Wark, H M; Watson, N K; Websdale, D; Weiden, A; Whitehead, M; Wicht, J; Wilkinson, G; Wilkinson, M; Williams, M; Williams, M P; Williams, M; Williams, T; Wilson, F F; Wimberley, J; Wishahi, J; Wislicki, W; Witek, M; Wormser, G; Wotton, S A; Wraight, K; Wyllie, K; Xie, Y; Xing, Z; Xu, Z; Yang, Z; Yin, H; Yu, J; Yuan, X; Yushchenko, O; Zarebski, K A; Zavertyaev, M; Zhang, L; Zhang, Y; Zhang, Y; Zhelezov, A; Zheng, Y; Zhokhov, A; Zhu, X; Zhukov, V; Zucchelli, S

    2017-01-01

    The decays [Formula: see text] and [Formula: see text] are observed for the first time using a data sample corresponding to an integrated luminosity of 3.0 fb[Formula: see text], collected by the LHCb experiment in proton-proton collisions at the centre-of-mass energies of 7 and 8[Formula: see text]. The branching fractions relative to that of [Formula: see text] are measured to be [Formula: see text]where the first uncertainties are statistical and the second are systematic.

  2. Mining the Text: 34 Text Features that Can Ease or Obstruct Text Comprehension and Use

    Science.gov (United States)

    White, Sheida

    2012-01-01

    This article presents 34 characteristics of texts and tasks ("text features") that can make continuous (prose), noncontinuous (document), and quantitative texts easier or more difficult for adolescents and adults to comprehend and use. The text features were identified by examining the assessment tasks and associated texts in the national…

  3. Plate Full of Color

    Centers for Disease Control (CDC) Podcasts

    2008-08-04

    The Eagle Books are a series of four books that are brought to life by wise animal characters - Mr. Eagle, Miss Rabbit, and Coyote - who engage Rain That Dances and his young friends in the joy of physical activity, eating healthy foods, and learning from their elders about health and diabetes prevention. Plate Full of Color teaches the value of eating a variety of colorful and healthy foods.  Created: 8/4/2008 by National Center for Chronic Disease Prevention and Health Promotion (NCCDPHP).   Date Released: 8/5/2008.

  4. Figure text extraction in biomedical literature.

    Directory of Open Access Journals (Sweden)

    Daehyun Kim

    2011-01-01

    Full Text Available Figures are ubiquitous in biomedical full-text articles, and they represent important biomedical knowledge. However, the sheer volume of biomedical publications has made it necessary to develop computational approaches for accessing figures. Therefore, we are developing the Biomedical Figure Search engine (http://figuresearch.askHERMES.org to allow bioscientists to access figures efficiently. Since text frequently appears in figures, automatically extracting such text may assist the task of mining information from figures. Little research, however, has been conducted exploring text extraction from biomedical figures.We first evaluated an off-the-shelf Optical Character Recognition (OCR tool on its ability to extract text from figures appearing in biomedical full-text articles. We then developed a Figure Text Extraction Tool (FigTExT to improve the performance of the OCR tool for figure text extraction through the use of three innovative components: image preprocessing, character recognition, and text correction. We first developed image preprocessing to enhance image quality and to improve text localization. Then we adapted the off-the-shelf OCR tool on the improved text localization for character recognition. Finally, we developed and evaluated a novel text correction framework by taking advantage of figure-specific lexicons.The evaluation on 382 figures (9,643 figure texts in total randomly selected from PubMed Central full-text articles shows that FigTExT performed with 84% precision, 98% recall, and 90% F1-score for text localization and with 62.5% precision, 51.0% recall and 56.2% F1-score for figure text extraction. When limiting figure texts to those judged by domain experts to be important content, FigTExT performed with 87.3% precision, 68.8% recall, and 77% F1-score. FigTExT significantly improved the performance of the off-the-shelf OCR tool we used, which on its own performed with 36.6% precision, 19.3% recall, and 25.3% F1-score for

  5. Gender Analysis On Islamic Texts: A Study On Its Accuracy

    Directory of Open Access Journals (Sweden)

    Muchammad Ichsan

    2014-06-01

    Full Text Available Gender equality movement is spreading all over the world, including in Indonesia where Muslim gender activists have made hard efforts to ensure gender fairness and equality among people. One of their efforts is emphasizing the urgency of reinterpreting Islamic texts. They insist on the reinterpretation of Islamic texts based on gender perspective and analysis due to the existence of many Islamic texts that trespass the principles of gender equality and fairness they have been fighting for. This paper aims at assuring and examining the accuracy of using gender perspective as a tool for analyzing the Islamic text. It is found that using gender perspective and analysis for reinterpreting Islamic texts is not in line with the Islamic principles and will only produce laws and points of views which deviate from Islamic teachings. To reach the goals of this study, a descriptive-analytical approach is employed.

  6. Qualitative Features of Written Summary Texts Produced by Teachers

    Directory of Open Access Journals (Sweden)

    Hülya YAZICI OKUYAN

    2011-12-01

    Full Text Available This research aimed to find an answer to the question: "Do summary texts produced by teachers have the characteristics that a summary text is supposed to have?” Descriptive method was used in the research. The study group consisted of 55 teachers who work as Turkish Language and Literature teachers at central primary and secondary schools in Burdur. During the research, the essay “Kitap Az Yaşamayı Önler” by Çetin Altan was used as the source text and the summary texts produced by teachers were evaluated using a criteria-based and gradual analysis instrument. At the end of the study, it was determined that the teachers only managed to reach the sufficient level in terms of reconstructing the summary texts through authentic sentences and reflecting the main idea of the source text in the summary texts. However, according to the research results regarding the teachers’ competence in creating a new title for the summary texts, including the source text’s all supporting ideas and important information in the summary texts and providing the summary texts with the capacity of reflecting the source text, it has been observed that the teachers lack the required knowledge and skill

  7. Toward full MOX core design

    International Nuclear Information System (INIS)

    Rouviere, G.; Guillet, J.L.; Bruna, G.B.; Pelet, J.

    1999-01-01

    This paper presents a selection of the main preliminary results of a study program sponsored by COGEMA and currently carried out by FRAMATOME. The objective of this study is to investigate the feasibility of full MOX core loading in a French 1300 MWe PWR, a recent and widespread standard nuclear power plant. The investigation includes core nuclear design, thermal hydraulic and systems aspects. (authors)

  8. Full spectrum water

    International Nuclear Information System (INIS)

    Brannock, Matthew; Fergus, Ian; Griffiths, David

    2011-01-01

    Coal Seam Gas water (CSG) in Australia is typically brackish with high alkalinity and pH, and if not managed correctly may adversely affect the whole environment. To achieve a sustainable and holistic outcome for CSG associated water, an integrated approach is required where CSG producers working in close cooperation with all stakeholders, including the state and federal governments, regulators, community and land owners.

  9. An Embedded Application for Degraded Text Recognition

    Directory of Open Access Journals (Sweden)

    Thillou Céline

    2005-01-01

    Full Text Available This paper describes a mobile device which tries to give the blind or visually impaired access to text information. Three key technologies are required for this system: text detection, optical character recognition, and speech synthesis. Blind users and the mobile environment imply two strong constraints. First, pictures will be taken without control on camera settings and a priori information on text (font or size and background. The second issue is to link several techniques together with an optimal compromise between computational constraints and recognition efficiency. We will present the overall description of the system from text detection to OCR error correction.

  10. The Instructional Text like a Textual Genre

    Directory of Open Access Journals (Sweden)

    Adiane Fogali Marinello

    2011-07-01

    Full Text Available This article analyses the instructional text as a textual genre and is part of the research called Reading and text production from the textual genre perspective, done at Universidade de Caxias do Sul, Campus Universitário da Região dos Vinhedos. Firstly, some theoretical assumptions about textual genre are presented, then, the instructional text is characterized. After that an instructional text is analyzed and, finally, some activities related to reading and writing of the mentioned genre directed to High School and University students are suggested.

  11. Emptiness and Fullness

    DEFF Research Database (Denmark)

    Bregnbæk, Susanne; Bunkenborg, Mikkel

    As critical voices question the quality, authenticity, and value of people, goods, and words in post-Mao China, accusations of emptiness render things open to new investments of meaning, substance, and value. Exploring the production of lack and desire through fine-grained ethnography, this volume...... examines how diagnoses of emptiness operate in a range of very different domains in contemporary China: In the ostensibly meritocratic exam system and the rhetoric of officials, in underground churches, housing bubbles, and nationalist fantasies, in bodies possessed by spirits and evaluations of jade......, there is a pervasive concern with states of lack and emptiness and the contributions suggest that this play of emptiness and fullness is crucial to ongoing constructions of quality, value, and subjectivity in China....

  12. Arabic text classification using Polynomial Networks

    Directory of Open Access Journals (Sweden)

    Mayy M. Al-Tahrawi

    2015-10-01

    Full Text Available In this paper, an Arabic statistical learning-based text classification system has been developed using Polynomial Neural Networks. Polynomial Networks have been recently applied to English text classification, but they were never used for Arabic text classification. In this research, we investigate the performance of Polynomial Networks in classifying Arabic texts. Experiments are conducted on a widely used Arabic dataset in text classification: Al-Jazeera News dataset. We chose this dataset to enable direct comparisons of the performance of Polynomial Networks classifier versus other well-known classifiers on this dataset in the literature of Arabic text classification. Results of experiments show that Polynomial Networks classifier is a competitive algorithm to the state-of-the-art ones in the field of Arabic text classification.

  13. Text mining from ontology learning to automated text processing applications

    CERN Document Server

    Biemann, Chris

    2014-01-01

    This book comprises a set of articles that specify the methodology of text mining, describe the creation of lexical resources in the framework of text mining and use text mining for various tasks in natural language processing (NLP). The analysis of large amounts of textual data is a prerequisite to build lexical resources such as dictionaries and ontologies and also has direct applications in automated text processing in fields such as history, healthcare and mobile applications, just to name a few. This volume gives an update in terms of the recent gains in text mining methods and reflects

  14. Flexible frontiers for text division into rows

    Directory of Open Access Journals (Sweden)

    Dan L. Lacrămă

    2009-01-01

    Full Text Available This paper presents an original solution for flexible hand-written text division into rows. Unlike the standard procedure, the proposed method avoids the isolated characters extensions amputation and reduces the recognition error rate in the final stage.

  15. Ontology Assisted Formal Specification Extraction from Text

    Directory of Open Access Journals (Sweden)

    Andreea Mihis

    2010-12-01

    Full Text Available In the field of knowledge processing, the ontologies are the most important mean. They make possible for the computer to understand better the natural language and to make judgments. In this paper, a method which use ontologies in the semi-automatic extraction of formal specifications from a natural language text is proposed.

  16. Full metal jacket!

    CERN Document Server

    Laëtitia Pedroso

    2011-01-01

    Ten years ago, standard issue clothing only gave CERN firemen partial protection but today our fire-fighters are equipped with state-of-the-art, full personal protective equipment.   CERN's Fire Brigade team. For many years, the members of CERN's Fire Brigade went on call-outs clad in their work trousers and fire-rescue coats, which only afforded them partial protection. Today, textile manufacturing techniques have moved on a long way and CERN's firemen are now kitted out with state-of-the-art personal protective equipment. The coat and trousers are three-layered, comprising fire-resistant aramide, a protective membrane and a thermal lining. The CERN Fire Brigade' new state-of-the-art personal protection equipment. "This equipment is fully compliant with the standards in force and is therefore resistant to cuts, abrasion, electrical arcs with thermal effects and, of course, fire," explains Patrick Berlinghi, the CERN Fire Brigade's Logistics Officer. You might think that su...

  17. Policies for full employment

    DEFF Research Database (Denmark)

    de Koning, Jaap; Layard, Richard; Nickel, Stephen

    European unemployment is too high, and employment is too low. Over 7½ per cent of Europe's workforce is unemployed, and only two thirds of people aged 15-64 are in work. At the Lisbon summit two years ago the heads of government set the target that by 2010 the employment rate should rise from 64...... per cent to at least 70 per cent. And for older workers between 55 and 64 the employment rate should rise from 38 per cent to at least one half. These are ambitious targets. They will require two big changes: more people must seek work, and among those seeking work a higher proportion must get a job....... So we need higher participation, and (for full employment) we need a much lower unemployment rate. Can it be done? A mere glance at the experience of different European countries shows that it can. As Table 1 shows, four E.U. countries already exceed the overall target for 2010 (Britain, Denmark...

  18. The Only Safe SMS Texting Is No SMS Texting.

    Science.gov (United States)

    Toth, Cheryl; Sacopulos, Michael J

    2015-01-01

    Many physicians and practice staff use short messaging service (SMS) text messaging to communicate with patients. But SMS text messaging is unencrypted, insecure, and does not meet HIPAA requirements. In addition, the short and abbreviated nature of text messages creates opportunities for misinterpretation, and can negatively impact patient safety and care. Until recently, asking patients to sign a statement that they understand and accept these risks--as well as having policies, device encryption, and cyber insurance in place--would have been enough to mitigate the risk of using SMS text in a medical practice. But new trends and policies have made SMS text messaging unsafe under any circumstance. This article explains these trends and policies, as well as why only secure texting or secure messaging should be used for physician-patient communication.

  19. Predicting Prosody from Text for Text-to-Speech Synthesis

    CERN Document Server

    Rao, K Sreenivasa

    2012-01-01

    Predicting Prosody from Text for Text-to-Speech Synthesis covers the specific aspects of prosody, mainly focusing on how to predict the prosodic information from linguistic text, and then how to exploit the predicted prosodic knowledge for various speech applications. Author K. Sreenivasa Rao discusses proposed methods along with state-of-the-art techniques for the acquisition and incorporation of prosodic knowledge for developing speech systems. Positional, contextual and phonological features are proposed for representing the linguistic and production constraints of the sound units present in the text. This book is intended for graduate students and researchers working in the area of speech processing.

  20. Text recycling: acceptable or misconduct?

    Science.gov (United States)

    Harriman, Stephanie; Patel, Jigisha

    2014-08-16

    Text recycling, also referred to as self-plagiarism, is the reproduction of an author's own text from a previous publication in a new publication. Opinions on the acceptability of this practice vary, with some viewing it as acceptable and efficient, and others as misleading and unacceptable. In light of the lack of consensus, journal editors often have difficulty deciding how to act upon the discovery of text recycling. In response to these difficulties, we have created a set of guidelines for journal editors on how to deal with text recycling. In this editorial, we discuss some of the challenges of developing these guidelines, and how authors can avoid undisclosed text recycling.

  1. Text against Text: Counterbalancing the Hegemony of Assessment.

    Science.gov (United States)

    Cosgrove, Cornelius

    A study examined whether composition specialists can counterbalance the potential privileging of the assessment perspective, or of self-appointed interpreters of that perspective, through the study of assessment discourse as text. Fourteen assessment texts were examined, most of them journal articles and most of them featuring the common…

  2. New seismograph includes filters

    Energy Technology Data Exchange (ETDEWEB)

    1979-11-02

    The new Nimbus ES-1210 multichannel signal enhancement seismograph from EG and G geometrics has recently been redesigned to include multimode signal fillers on each amplifier. The ES-1210F is a shallow exploration seismograph for near subsurface exploration such as in depth-to-bedrock, geological hazard location, mineral exploration, and landslide investigations.

  3. Effects of Text Messaging on Academic Performance

    Directory of Open Access Journals (Sweden)

    Barks Amanda

    2011-12-01

    Full Text Available University students frequently send and receive cellular phone text messages during classroominstruction. Cognitive psychology research indicates that multi-tasking is frequently associatedwith performance cost. However, university students often have considerable experience withelectronic multi-tasking and may believe that they can devote necessary attention to a classroomlecture while sending and receiving text messages. In the current study, university students whoused text messaging were randomly assigned to one of two conditions: 1. a group that sent andreceived text messages during a lecture or, 2. a group that did not engage in text messagingduring the lecture. Participants who engaged in text messaging demonstrated significantlypoorer performance on a test covering lecture content compared with the group that did notsend and receive text messages. Participants exhibiting higher levels of text messaging skill hadsignificantly lower test scores than participants who were less proficient at text messaging. It ishypothesized that in terms of retention of lecture material, more frequent task shifting by thosewith greater text messaging proficiency contributed to poorer performance. Overall, the findingsdo not support the view, held by many university students, that this form of multitasking has littleeffect on the acquisition of lecture content. Results provide empirical support for teachers andprofessors who ban text messaging in the classroom.

  4. Knowledge Representation in Travelling Texts

    DEFF Research Database (Denmark)

    Mousten, Birthe; Locmele, Gunta

    2014-01-01

    Today, information travels fast. Texts travel, too. In a corporate context, the question is how to manage which knowledge elements should travel to a new language area or market and in which form? The decision to let knowledge elements travel or not travel highly depends on the limitation...... and the purpose of the text in a new context as well as on predefined parameters for text travel. For texts used in marketing and in technology, the question is whether culture-bound knowledge representation should be domesticated or kept as foreign elements, or should be mirrored or moulded—or should not travel...... at all! When should semantic and pragmatic elements in a text be replaced and by which other elements? The empirical basis of our work is marketing and technical texts in English, which travel into the Latvian and Danish markets, respectively....

  5. Texting while driving: is speech-based text entry less risky than handheld text entry?

    Science.gov (United States)

    He, J; Chaparro, A; Nguyen, B; Burge, R J; Crandall, J; Chaparro, B; Ni, R; Cao, S

    2014-11-01

    Research indicates that using a cell phone to talk or text while maneuvering a vehicle impairs driving performance. However, few published studies directly compare the distracting effects of texting using a hands-free (i.e., speech-based interface) versus handheld cell phone, which is an important issue for legislation, automotive interface design and driving safety training. This study compared the effect of speech-based versus handheld text entries on simulated driving performance by asking participants to perform a car following task while controlling the duration of a secondary text-entry task. Results showed that both speech-based and handheld text entries impaired driving performance relative to the drive-only condition by causing more variation in speed and lane position. Handheld text entry also increased the brake response time and increased variation in headway distance. Text entry using a speech-based cell phone was less detrimental to driving performance than handheld text entry. Nevertheless, the speech-based text entry task still significantly impaired driving compared to the drive-only condition. These results suggest that speech-based text entry disrupts driving, but reduces the level of performance interference compared to text entry with a handheld device. In addition, the difference in the distraction effect caused by speech-based and handheld text entry is not simply due to the difference in task duration. Copyright © 2014 Elsevier Ltd. All rights reserved.

  6. Analytic device including nanostructures

    KAUST Repository

    Di Fabrizio, Enzo M.; Fratalocchi, Andrea; Totero Gongora, Juan Sebastian; Coluccio, Maria Laura; Candeloro, Patrizio; Cuda, Gianni

    2015-01-01

    A device for detecting an analyte in a sample comprising: an array including a plurality of pixels, each pixel including a nanochain comprising: a first nanostructure, a second nanostructure, and a third nanostructure, wherein size of the first nanostructure is larger than that of the second nanostructure, and size of the second nanostructure is larger than that of the third nanostructure, and wherein the first nanostructure, the second nanostructure, and the third nanostructure are positioned on a substrate such that when the nanochain is excited by an energy, an optical field between the second nanostructure and the third nanostructure is stronger than an optical field between the first nanostructure and the second nanostructure, wherein the array is configured to receive a sample; and a detector arranged to collect spectral data from a plurality of pixels of the array.

  7. Active Learning for Text Classification

    OpenAIRE

    Hu, Rong

    2011-01-01

    Text classification approaches are used extensively to solve real-world challenges. The success or failure of text classification systems hangs on the datasets used to train them, without a good dataset it is impossible to build a quality system. This thesis examines the applicability of active learning in text classification for the rapid and economical creation of labelled training data. Four main contributions are made in this thesis. First, we present two novel selection strategies to cho...

  8. Text Mining Applications and Theory

    CERN Document Server

    Berry, Michael W

    2010-01-01

    Text Mining: Applications and Theory presents the state-of-the-art algorithms for text mining from both the academic and industrial perspectives.  The contributors span several countries and scientific domains: universities, industrial corporations, and government laboratories, and demonstrate the use of techniques from machine learning, knowledge discovery, natural language processing and information retrieval to design computational models for automated text analysis and mining. This volume demonstrates how advancements in the fields of applied mathematics, computer science, machine learning

  9. Saskatchewan resources. [including uranium

    Energy Technology Data Exchange (ETDEWEB)

    1979-09-01

    The production of chemicals and minerals for the chemical industry in Saskatchewan are featured, with some discussion of resource taxation. The commodities mentioned include potash, fatty amines, uranium, heavy oil, sodium sulfate, chlorine, sodium hydroxide, sodium chlorate and bentonite. Following the successful outcome of the Cluff Lake inquiry, the uranium industry is booming. Some developments and production figures for Gulf Minerals, Amok, Cenex and Eldorado are mentioned.

  10. Figure-associated text summarization and evaluation.

    Directory of Open Access Journals (Sweden)

    Balaji Polepalli Ramesh

    Full Text Available Biomedical literature incorporates millions of figures, which are a rich and important knowledge resource for biomedical researchers. Scientists need access to the figures and the knowledge they represent in order to validate research findings and to generate new hypotheses. By themselves, these figures are nearly always incomprehensible to both humans and machines and their associated texts are therefore essential for full comprehension. The associated text of a figure, however, is scattered throughout its full-text article and contains redundant information content. In this paper, we report the continued development and evaluation of several figure summarization systems, the FigSum+ systems, that automatically identify associated texts, remove redundant information, and generate a text summary for every figure in an article. Using a set of 94 annotated figures selected from 19 different journals, we conducted an intrinsic evaluation of FigSum+. We evaluate the performance by precision, recall, F1, and ROUGE scores. The best FigSum+ system is based on an unsupervised method, achieving F1 score of 0.66 and ROUGE-1 score of 0.97. The annotated data is available at figshare.com (http://figshare.com/articles/Figure_Associated_Text_Summarization_and_Evaluation/858903.

  11. Text segmentation in degraded historical document images

    Directory of Open Access Journals (Sweden)

    A.S. Kavitha

    2016-07-01

    Full Text Available Text segmentation from degraded Historical Indus script images helps Optical Character Recognizer (OCR to achieve good recognition rates for Hindus scripts; however, it is challenging due to complex background in such images. In this paper, we present a new method for segmenting text and non-text in Indus documents based on the fact that text components are less cursive compared to non-text ones. To achieve this, we propose a new combination of Sobel and Laplacian for enhancing degraded low contrast pixels. Then the proposed method generates skeletons for text components in enhanced images to reduce computational burdens, which in turn helps in studying component structures efficiently. We propose to study the cursiveness of components based on branch information to remove false text components. The proposed method introduces the nearest neighbor criterion for grouping components in the same line, which results in clusters. Furthermore, the proposed method classifies these clusters into text and non-text cluster based on characteristics of text components. We evaluate the proposed method on a large dataset containing varieties of images. The results are compared with the existing methods to show that the proposed method is effective in terms of recall and precision.

  12. Text Genres in Information Organization

    Science.gov (United States)

    Nahotko, Marek

    2016-01-01

    Introduction: Text genres used by so-called information organizers in the processes of information organization in information systems were explored in this research. Method: The research employed text genre socio-functional analysis. Five genre groups in information organization were distinguished. Every genre group used in information…

  13. Teachers' Texts in Culturally Responsive Teaching

    Science.gov (United States)

    Kesler, Ted

    2011-01-01

    In this paper, the author shares three teaching stories that demonstrate the social, cultural, political, and historical factors of all texts in specific interpretive communities. The author shows how the texts that comprised his curriculum constructed particular subject positions that inevitably included some students but marginalized and…

  14. Being Included and Excluded

    DEFF Research Database (Denmark)

    Korzenevica, Marina

    2016-01-01

    Following the civil war of 1996–2006, there was a dramatic increase in the labor mobility of young men and the inclusion of young women in formal education, which led to the transformation of the political landscape of rural Nepal. Mobility and schooling represent a level of prestige that rural...... politics. It analyzes how formal education and mobility either challenge or reinforce traditional gendered norms which dictate a lowly position for young married women in the household and their absence from community politics. The article concludes that women are simultaneously excluded and included from...... community politics. On the one hand, their mobility and decision-making powers decrease with the increase in the labor mobility of men and their newly gained education is politically devalued when compared to the informal education that men gain through mobility, but on the other hand, schooling strengthens...

  15. Linguistic Dating of Biblical Texts

    DEFF Research Database (Denmark)

    Ehrensvärd, Martin Gustaf

    2003-01-01

    For two centuries, scholars have pointed to consistent differences in the Hebrew of certain biblical texts and interpreted these differences as reflecting the date of composition of the texts. Until the 1980s, this was quite uncontroversial as the linguistic findings largely confirmed the chronol......For two centuries, scholars have pointed to consistent differences in the Hebrew of certain biblical texts and interpreted these differences as reflecting the date of composition of the texts. Until the 1980s, this was quite uncontroversial as the linguistic findings largely confirmed...... the chronology of the texts established by other means: the Hebrew of Genesis-2 Kings was judged to be early and that of Esther, Daniel, Ezra, Nehemiah, and Chronicles to be late. In the current debate where revisionists have questioned the traditional dating, linguistic arguments in the dating of texts have...... come more into focus. The study critically examines some linguistic arguments adduced to support the traditional position, and reviewing the arguments it points to weaknesses in the linguistic dating of EBH texts to pre-exilic times. When viewing the linguistic evidence in isolation it will be clear...

  16. BioNames: linking taxonomy, texts, and trees

    Directory of Open Access Journals (Sweden)

    Roderic D.M. Page

    2013-10-01

    Full Text Available BioNames is a web database of taxonomic names for animals, linked to the primary literature and, wherever possible, to phylogenetic trees. It aims to provide a taxonomic “dashboard” where at a glance we can see a summary of the taxonomic and phylogenetic information we have for a given taxon and hence provide a quick answer to the basic question “what is this taxon?” BioNames combines classifications from the Global Biodiversity Information Facility (GBIF and GenBank, images from the Encyclopedia of Life (EOL, animal names from the Index of Organism Names (ION, and bibliographic data from multiple sources including the Biodiversity Heritage Library (BHL and CrossRef. The user interface includes display of full text articles, interactive timelines of taxonomic publications, and zoomable phylogenies. It is available at http://bionames.org.

  17. Full system decontamination feasibility studies

    International Nuclear Information System (INIS)

    Denault, R.P.; LeSurf, J.E.; Walschot, F.W.

    1988-01-01

    Many chemical decontaminations have been performed on subsystems in light water reactors (BWRs and PWRs) but none on the full system (including the fuel) of large, (>500 MWe) investor owned reactors. Full system decontaminations on pressure-tubed reactors have been shown to facilitate maintenance, inspection, repair and replacement of reactor components. Further advantages are increased reactor availability and plant life extension. A conceptual study has been performed for EPRI (for PWRs) and Commonwealth Edison Co (for BWRs) into the applicability and cost benefit of full system decontaminations (FSD). The joint study showed that FSDs in both PWRs and BWRs, with or without the fuel included in the decontamination, are feasible and cost beneficial provided a large amount of work is to be done following the decontamination. The large amounts of radioactive waste generated can be managed using current technologies. Considerable improvements in waste handling, and consequent cost savings, can be obtained if new techniques which are now reaching commercial application are used. (author)

  18. Stemming Malay Text and Its Application in Automatic Text Categorization

    Science.gov (United States)

    Yasukawa, Michiko; Lim, Hui Tian; Yokoo, Hidetoshi

    In Malay language, there are no conjugations and declensions and affixes have important grammatical functions. In Malay, the same word may function as a noun, an adjective, an adverb, or, a verb, depending on its position in the sentence. Although extensively simple root words are used in informal conversations, it is essential to use the precise words in formal speech or written texts. In Malay, to make sentences clear, derivative words are used. Derivation is achieved mainly by the use of affixes. There are approximately a hundred possible derivative forms of a root word in written language of the educated Malay. Therefore, the composition of Malay words may be complicated. Although there are several types of stemming algorithms available for text processing in English and some other languages, they cannot be used to overcome the difficulties in Malay word stemming. Stemming is the process of reducing various words to their root forms in order to improve the effectiveness of text processing in information systems. It is essential to avoid both over-stemming and under-stemming errors. We have developed a new Malay stemmer (stemming algorithm) for removing inflectional and derivational affixes. Our stemmer uses a set of affix rules and two types of dictionaries: a root-word dictionary and a derivative-word dictionary. The use of set of rules is aimed at reducing the occurrence of under-stemming errors, while that of the dictionaries is believed to reduce the occurrence of over-stemming errors. We performed an experiment to evaluate the application of our stemmer in text mining software. For the experiment, text data used were actual web pages collected from the World Wide Web to demonstrate the effectiveness of our Malay stemming algorithm. The experimental results showed that our stemmer can effectively increase the precision of the extracted Boolean expressions for text categorization.

  19. Anomaly Detection with Text Mining

    Data.gov (United States)

    National Aeronautics and Space Administration — Many existing complex space systems have a significant amount of historical maintenance and problem data bases that are stored in unstructured text forms. The...

  20. Text Mining in Organizational Research.

    Science.gov (United States)

    Kobayashi, Vladimer B; Mol, Stefan T; Berkers, Hannah A; Kismihók, Gábor; Den Hartog, Deanne N

    2018-07-01

    Despite the ubiquity of textual data, so far few researchers have applied text mining to answer organizational research questions. Text mining, which essentially entails a quantitative approach to the analysis of (usually) voluminous textual data, helps accelerate knowledge discovery by radically increasing the amount data that can be analyzed. This article aims to acquaint organizational researchers with the fundamental logic underpinning text mining, the analytical stages involved, and contemporary techniques that may be used to achieve different types of objectives. The specific analytical techniques reviewed are (a) dimensionality reduction, (b) distance and similarity computing, (c) clustering, (d) topic modeling, and (e) classification. We describe how text mining may extend contemporary organizational research by allowing the testing of existing or new research questions with data that are likely to be rich, contextualized, and ecologically valid. After an exploration of how evidence for the validity of text mining output may be generated, we conclude the article by illustrating the text mining process in a job analysis setting using a dataset composed of job vacancies.

  1. LINGUISTIC DATABASE FOR AUTOMATIC GENERATION SYSTEM OF ENGLISH ADVERTISING TEXTS

    Directory of Open Access Journals (Sweden)

    N. A. Metlitskaya

    2017-01-01

    Full Text Available The article deals with the linguistic database for the system of automatic generation of English advertising texts on cosmetics and perfumery. The database for such a system includes two main blocks: automatic dictionary (that contains semantic and morphological information for each word, and semantic-syntactical formulas of the texts in a special formal language SEMSINT. The database is built on the result of the analysis of 30 English advertising texts on cosmetics and perfumery. First, each word was given a unique code. For example, N stands for nouns, A – for adjectives, V – for verbs, etc. Then all the lexicon of the analyzed texts was distributed into different semantic categories. According to this semantic classification each word was given a special semantic code. For example, the record N01 that is attributed to the word «lip» in the dictionary means that this word refers to nouns of the semantic category «part of a human’s body».The second block of the database includes the semantic-syntactical formulas of the analyzed advertising texts written in a special formal language SEMSINT. The author gives a brief description of this language, presenting its essence and structure. Also, an example of one formalized advertising text in SEMSINT is provided.

  2. Current Writing: Text and Reception in Southern Africa - Vol 18, No 1 ...

    African Journals Online (AJOL)

    Lions, leopards and liminal spaces:Representations of Biosociality in the Writings of Katy Payne, Linda Tucker and Gillian van Houten · EMAIL FULL TEXT EMAIL FULL TEXT DOWNLOAD FULL TEXT DOWNLOAD FULL TEXT. W Woodward ...

  3. NOTICING AND TEXT-BASED CHAT

    Directory of Open Access Journals (Sweden)

    Chun Lai

    2006-09-01

    Full Text Available This study examined the capacity of text-based online chat to promote learners’ noticing of their problematic language productions and of the interactional feedback from their interlocutors. In this study, twelve ESL learners formed six mixed-proficiency dyads. The same dyads worked on two spot-the-difference tasks, one via online chat and the other through face-to-face conversation. Stimulated recall sessions were held subsequently to identify instances of noticing. It was found that text-based online chat promotes noticing more than face-to-face conversations, especially in terms of learners’ noticing of their own linguistic mistakes.

  4. A new multidimensional model with text dimensions: definition and implementation

    Directory of Open Access Journals (Sweden)

    MariaJ. Martin-Bautista

    2013-02-01

    Full Text Available We present a new multidimensional model with textual dimensions based on a knowledge structure extracted from the texts, where any textual attribute in a database can be processed, and not only XML texts. This dimension allows to treat the textual data in the same way as the non-textual one in an automatic way, without user's intervention, so all the classical operations in the multidimensional model can been defined for this textual dimension. While most of the models dealing with texts that can be found in the literature are not implemented, in this proposal, the multidimensional model and the OLAP system have been implemented in a software tool, so it can be tested on real data. A case study with medical data is included in this work.

  5. Text collections for evaluation of Russian morphological taggers

    Directory of Open Access Journals (Sweden)

    Lyashevskaya Olga

    2017-12-01

    Full Text Available The paper describes the preparation and development of the text collections within the framework of MorphoRuEval-2017 shared task, an evaluation campaign designed to stimulate development of the automatic morphological processing technologies for Russian. The main challenge for the organizers was to standardize all available Russian corpora with the manually verified high-quality tagging to a single format (Universal Dependencies CONLL-U. The sources of the data were the disambiguated subcorpus of the Russian National Corpus, SynTagRus, OpenCorpora.org data and GICR corpus with the resolved homonymy, all exhibiting different tagsets, rules for lemmatization, pipeline architecture, technical solutions and error systematicity. The collections includes both normative texts (the news and modern literature and more informal discourse (social media and spoken data, the texts are available under CC BY-NC-SA 3.0 license.

  6. NOTICING HYBRID RECASTS IN TEXT CHAT

    Directory of Open Access Journals (Sweden)

    Mark J. Oliver

    2016-12-01

    Full Text Available This study examined ten EFL learners’ noticing of the corrective nature of a form of text-based SCMC (text chat feedback that combined a recast of a grammatical error with metalinguistic information. The feedback, termed a hybrid recast, was provided by a native-speaker interlocutor during two text chat activities: a spot-the-difference and picture-ordering task. Data was collected in two ways: analysis of task-based dyadic text chat interaction in which uptake was used as an indicator of learner noticing, and a post-task questionnaire containing questions that identified evidence of learner noticing. Interaction analysis showed that learners responded to almost two thirds of the hybrid recasts with uptake. In addition, every learner provided evidence that they had correctly perceived at least some of the hybrid recasts as corrective in their post-task questionnaire responses.

  7. Some Notes and Critiques of Selected Lexicographic Texts in Bikol

    Directory of Open Access Journals (Sweden)

    Louward Allen Zubiri

    2014-12-01

    Full Text Available This study revolves around notes and critiques of Bikol lexicography. Based on a selection of six dictionaries, the structure and diachronic development of Bikol lexicography were analyzed. Half of the selected dictionaries were authored in full or part by Malcolm Mintz, a renowned Bikol expert. These enabled a linear analysis of changes in lexicography that have occurred within the span of four decades. The dictionaries studied include the earliest and most influential Bikol dictionary of Lisboa (1865. The study presents a preliminary description of lexicographic work done in Bikol and traces the shift in the paradigm of dictionary making from the Spanish era to the present.

  8. Text Character Extraction Implementation from Captured Handwritten Image to Text Conversionusing Template Matching Technique

    Directory of Open Access Journals (Sweden)

    Barate Seema

    2016-01-01

    Full Text Available Images contain various types of useful information that should be extracted whenever required. A various algorithms and methods are proposed to extract text from the given image, and by using that user will be able to access the text from any image. Variations in text may occur because of differences in size, style,orientation, alignment of text, and low image contrast, composite backgrounds make the problem during extraction of text. If we develop an application that extracts and recognizes those texts accurately in real time, then it can be applied to many important applications like document analysis, vehicle license plate extraction, text- based image indexing, etc and many applications have become realities in recent years. To overcome the above problems we develop such application that will convert the image into text by using algorithms, such as bounding box, HSV model, blob analysis,template matching, template generation.

  9. A validity study of self-reported daily texting frequency, cell phone characteristics, and texting styles among young adults.

    Science.gov (United States)

    Gold, Judith E; Rauscher, Kimberly J; Zhu, Motao

    2015-04-02

    Texting is associated with adverse health effects including musculoskeletal disorders, sleep disturbances, and traffic crashes. Many studies have relied on self-reported texting frequency, yet the validity of self-reports is unknown. Our objective was to provide some of the first data on the validity of self-reported texting frequency, cell phone characteristics including input device (e.g. touchscreen), key configuration (e.g., QWERTY), and texting styles including phone orientation (e.g., horizontal) and hands holding the phone while texting. Data were collected using a self-administered questionnaire and observation of a texting task among college students ages 18 to 24. To gauge agreement between self-reported and phone bill-derived categorical number of daily text messages sent, we calculated percent of agreement, Spearman correlation coefficient, and a linear weighted kappa statistic. For agreement between self-reported and observed cell phone characteristics and texting styles we calculated percentages of agreement. We used chi-square tests to detect significant differences (α = 0.05) by gender and study protocol. There were 106 participants; 87 of which had complete data for texting frequency analyses. Among these 87, there was 26% (95% CI: 21-31) agreement between self-reported and phone bill-derived number of daily text messages sent with a Spearman's rho of 0.48 and a weighted kappa of 0.17 (95% CI: 0.06-0.27). Among those who did not accurately report the number of daily texts sent, 81% overestimated this number. Among the full sample (n = 106), there was high agreement between self-reported and observed texting input device (96%, 95% CI: 91-99), key configuration (89%, 95% CI: 81-94), and phone orientation while texting (93%, 95% CI: 86-97). No differences were found by gender or study protocol among any items. While young adults correctly reported their cell phone's characteristics and phone orientation while texting, most incorrectly

  10. GPU-Accelerated Text Mining

    International Nuclear Information System (INIS)

    Cui, X.; Mueller, F.; Zhang, Y.; Potok, Thomas E.

    2009-01-01

    Accelerating hardware devices represent a novel promise for improving the performance for many problem domains but it is not clear for which domains what accelerators are suitable. While there is no room in general-purpose processor design to significantly increase the processor frequency, developers are instead resorting to multi-core chips duplicating conventional computing capabilities on a single die. Yet, accelerators offer more radical designs with a much higher level of parallelism and novel programming environments. This present work assesses the viability of text mining on CUDA. Text mining is one of the key concepts that has become prominent as an effective means to index the Internet, but its applications range beyond this scope and extend to providing document similarity metrics, the subject of this work. We have developed and optimized text search algorithms for GPUs to exploit their potential for massive data processing. We discuss the algorithmic challenges of parallelization for text search problems on GPUs and demonstrate the potential of these devices in experiments by reporting significant speedups. Our study may be one of the first to assess more complex text search problems for suitability for GPU devices, and it may also be one of the first to exploit and report on atomic instruction usage that have recently become available in NVIDIA devices

  11. Powerful Vocabulary Acquisition through Texts Comparison

    Directory of Open Access Journals (Sweden)

    Mohammad Reza Hasannejad

    2015-03-01

    Full Text Available This study aimed to investigate if dual version reading comprehension had a positive effect on Intermediate EFL students’ general vocabulary acquisition, receptive and productive knowledge of vocabulary and students’ synonymous power of words. Two groups were selected - the experimental group and the control group. The study included: (1 four pretests (2 the dual version reading comprehension, and (3 four posttests. It was found that there was no significant difference between the two groups of students on the pretests. However there was a significant difference between the two groups of the students on the posttests. Overall, the dual version reading comprehension vocabulary-learning made the experimental group learners outperformed the control groups in terms of their performance on four types of vocabulary tests. This indicates that students following dual version reading comprehension were more successful in vocabulary acquisition, and developing their receptive knowledge of vocabulary, transferring their receptive knowledge in to the productive knowledge and enhancing the memorization of the synonymous words.

  12. Supernova equations of state including full nuclear ensemble with in-medium effects

    Science.gov (United States)

    Furusawa, Shun; Sumiyoshi, Kohsuke; Yamada, Shoichi; Suzuki, Hideyuki

    2017-01-01

    We construct new equations of state for baryons at sub-nuclear densities for the use in core-collapse supernova simulations. The abundance of various nuclei is obtained together with thermodynamic quantities. The formulation is an extension of the previous model, in which we adopted the relativistic mean field theory with the TM1 parameter set for nucleons, the quantum approach for d, t, h and α as well as the liquid drop model for the other nuclei under the nuclear statistical equilibrium. We reformulate the model of the light nuclei other than d, t, h and α based on the quasi-particle description. Furthermore, we modify the model so that the temperature dependences of surface and shell energies of heavy nuclei could be taken into account. The pasta phases for heavy nuclei and the Pauli- and self-energy shifts for d, t, h and α are taken into account in the same way as in the previous model. We find that nuclear composition is considerably affected by the modifications in this work, whereas thermodynamical quantities are not changed much. In particular, the washout of shell effect has a great impact on the mass distribution above T ∼ 1 MeV. This improvement may have an important effect on the rates of electron captures and coherent neutrino scatterings on nuclei in supernova cores.

  13. Gyrokinetic Vlasov code including full three-dimensional geometry of experiments

    International Nuclear Information System (INIS)

    Nunami, Masanori; Watanabe, Tomohiko; Sugama, Hideo

    2010-03-01

    A new gyrokinetic Vlasov simulation code, GKV-X, is developed for investigating the turbulent transport in magnetic confinement devices with non-axisymmetric configurations. Effects of the magnetic surface shapes in a three-dimensional equilibrium obtained from the VMEC code are accurately incorporated. Linear simulations of the ion temperature gradient instabilities and the zonal flows in the Large Helical Device (LHD) configuration are carried out by the GKV-X code for a benchmark test against the GKV code. The frequency, the growth rate, and the mode structure of the ion temperature gradient instability are influenced by the VMEC geometrical data such as the metric tensor components of the Boozer coordinates for high poloidal wave numbers, while the difference between the zonal flow responses obtained by the GKV and GKV-X codes is found to be small in the core LHD region. (author)

  14. Sacred texts and mystic meaning: An inquiry into Christian ...

    African Journals Online (AJOL)

    EMAIL FREE FULL TEXT EMAIL FREE FULL TEXT · DOWNLOAD FULL TEXT DOWNLOAD FULL TEXT · http://dx.doi.org/10.4314/actat.v31i2.3 · AJOL African Journals Online. HOW TO USE AJOL... for Researchers · for Librarians · for Authors · FAQ's · More about AJOL · AJOL's Partners · Terms and Conditions of Use ...

  15. Individual Profiling Using Text Analysis

    Science.gov (United States)

    2016-04-15

    AFRL-AFOSR-UK-TR-2016-0011 Individual Profiling using Text Analysis 140333 Mark Stevenson UNIVERSITY OF SHEFFIELD, DEPARTMENT OF PSYCHOLOGY Final...REPORT TYPE      Final 3.  DATES COVERED (From - To)      15 Sep 2014 to 14 Sep 2015 4.  TITLE AND SUBTITLE Individual Profiling using Text Analysis ...consisted of collections of tweets for a number of Twitter users whose gender, age and personality scores are known. The task was to construct some system

  16. Finding text in color images

    Science.gov (United States)

    Zhou, Jiangying; Lopresti, Daniel P.; Tasdizen, Tolga

    1998-04-01

    In this paper, we consider the problem of locating and extracting text from WWW images. A previous algorithm based on color clustering and connected components analysis works well as long as the color of each character is relatively uniform and the typography is fairly simple. It breaks down quickly, however, when these assumptions are violated. In this paper, we describe more robust techniques for dealing with this challenging problem. We present an improved color clustering algorithm that measures similarity based on both RGB and spatial proximity. Layout analysis is also incorporated to handle more complex typography. THese changes significantly enhance the performance of our text detection procedure.

  17. Unsteady panel method for complex configurations including wake modeling

    CSIR Research Space (South Africa)

    Van Zyl, Lourens H

    2008-01-01

    Full Text Available implementations of the DLM are however not very versatile in terms of geometries that can be modeled. The ZONA6 code offers a versatile surface panel body model including a separated wake model, but uses a pressure panel method for lifting surfaces. This paper...

  18. Current Writing: Text and Reception in Southern Africa: Advanced ...

    African Journals Online (AJOL)

    Current Writing: Text and Reception in Southern Africa: Advanced Search. Journal Home > Current Writing: Text and Reception in Southern Africa: Advanced Search. Log in or Register to get access to full text downloads.

  19. Intertextuality within the linguistic analysis of a literary text

    Directory of Open Access Journals (Sweden)

    Л Н Лунькова

    2008-12-01

    Full Text Available The article is devoted to the phenomenon of precedent texts in fiction, the ways they are introduced into it and the possibilities of their linguistic interpretation within secondary texts.

  20. Global Prospects for Full Employment

    Directory of Open Access Journals (Sweden)

    Ivo Šlaus

    2011-04-01

    Full Text Available The recent international financial crisis highlights the crucial role of employment in human welfare and social stability. Access to remunerative employment opportunities is essential for economic security in a market-based economic system. As the rise of democracy compelled nations to extend the voting right to all citizens, employment must be recognized as a fundamental human right. In total defiance of conventional wisdom, since 1950 job growth has outpaced the explosive growth of population, the rapid adoption of labor-saving technologies, the manifold expansion of world trade, and the dramatic shift from manual labor to white collar work. In an increasingly globalized labor market, current nation-centric theories and models of employment need to be replaced with a human-centered global perspective complemented by new indicators that recognize the central and essential contribution of employment to human economic welfare. Employment and economy are subsets of society and their growth is driven by the more fundamental process of social development. A vast array of unmet social needs combined with an enormous reservoir of underutilized social resources – technological, scientific, educational, organizational, cultural and psychological – can be harnessed to dramatically expand employment opportunities and achieve full employment on a global basis. This paper examines the theoretical basis, policy issues and strategies required to eradicate unemployment nationally and globally.

  1. Multilingual text induced spelling correction

    NARCIS (Netherlands)

    Reynaert, M.W.C.

    2004-01-01

    We present TISC, a multilingual, language-independent and context-sensitive spelling checking and correction system designed to facilitate the automatic removal of non-word spelling errors in large corpora. Its lexicon is derived from raw text corpora, without supervision, and contains word unigrams

  2. Automated analysis of instructional text

    Energy Technology Data Exchange (ETDEWEB)

    Norton, L.M.

    1983-05-01

    The development of a capability for automated processing of natural language text is a long-range goal of artificial intelligence. This paper discusses an investigation into the issues involved in the comprehension of descriptive, as opposed to illustrative, textual material. The comprehension process is viewed as the conversion of knowledge from one representation into another. The proposed target representation consists of statements of the prolog language, which can be interpreted both declaratively and procedurally, much like production rules. A computer program has been written to model in detail some ideas about this process. The program successfully analyzes several heavily edited paragraphs adapted from an elementary textbook on programming, automatically synthesizing as a result of the analysis a working Prolog program which, when executed, can parse and interpret let commands in the basic language. The paper discusses the motivations and philosophy of the project, the many kinds of prerequisite knowledge which are necessary, and the structure of the text analysis program. A sentence-by-sentence account of the analysis of the sample text is presented, describing the syntactic and semantic processing which is involved. The paper closes with a discussion of lessons learned from the project, possible alternative approaches, and possible extensions for future work. The entire project is presented as illustrative of the nature and complexity of the text analysis process, rather than as providing definitive or optimal solutions to any aspects of the task. 12 references.

  3. Solar Concepts: A Background Text.

    Science.gov (United States)

    Gorham, Jonathan W.

    This text is designed to provide teachers, students, and the general public with an overview of key solar energy concepts. Various energy terms are defined and explained. Basic thermodynamic laws are discussed. Alternative energy production is described in the context of the present energy situation. Described are the principal contemporary solar…

  4. Quality Inspection of Printed Texts

    DEFF Research Database (Denmark)

    Pedersen, Jesper Ballisager; Nasrollahi, Kamal; Moeslund, Thomas B.

    2016-01-01

    -folded: for costumers of the printing and verification system, the overall grade used to verify if the text is of sufficient quality, while for printer's manufacturer, the detailed character/symbols grades and quality measurements are used for the improvement and optimization of the printing task. The proposed system...

  5. MANAGING THE TRANSLATION OF ECONOMIC TEXTS

    Directory of Open Access Journals (Sweden)

    Pop Anamaria Mirabela

    2012-12-01

    Full Text Available Theoretically, translation may pass as science; practically, it seems closer to art. Translation is a challenging activity requiring a set of abilities and posing few difficulties that appear during the translation process. This paper investigates the extent to which sub-technical vocabulary can constitute a problem to Romanian students of economics reading in English, by looking at the translations produced as independent or pair work during English classes and analyzing the various errors which may appeared. The exigencies required by the efficient business communication have increased in the past few decades because of rising international trade, increased migration, globalization, the recognition of linguistic minorities, and the expansion of the mass media and technology. All these led us to approach the topic of translation which is actually a job that requires skills, stages of research necessary for disclosure of transfer characteristic into the target language, training, experience and a good sense of languages. The paper defines the theoretical issues and terminology: translation, types of translation, economic texts and then focuses on the presentation of the practical work carried out throughout the academic year of second year students. Considering that only 28% of the entire European population can read English, and even less people in South America and Asia can, it is obvious that an effective communication of business matters relies on an accurate understanding of terminology. Economics is a field of knowledge in accelerated scientific and technological development. As there is a permanent and ever increasing need to quickly update their knowledge, economists read and learn directly in the original language of the publication and stick to it in daily usage, including conferences, scientific events and articles written in Romanian. Besides researching properly the markets, finding distribution channels, and dealing with legal

  6. Figure-associated text summarization and evaluation.

    Science.gov (United States)

    Polepalli Ramesh, Balaji; Sethi, Ricky J; Yu, Hong

    2015-01-01

    Biomedical literature incorporates millions of figures, which are a rich and important knowledge resource for biomedical researchers. Scientists need access to the figures and the knowledge they represent in order to validate research findings and to generate new hypotheses. By themselves, these figures are nearly always incomprehensible to both humans and machines and their associated texts are therefore essential for full comprehension. The associated text of a figure, however, is scattered throughout its full-text article and contains redundant information content. In this paper, we report the continued development and evaluation of several figure summarization systems, the FigSum+ systems, that automatically identify associated texts, remove redundant information, and generate a text summary for every figure in an article. Using a set of 94 annotated figures selected from 19 different journals, we conducted an intrinsic evaluation of FigSum+. We evaluate the performance by precision, recall, F1, and ROUGE scores. The best FigSum+ system is based on an unsupervised method, achieving F1 score of 0.66 and ROUGE-1 score of 0.97. The annotated data is available at figshare.com (http://figshare.com/articles/Figure_Associated_Text_Summarization_and_Evaluation/858903).

  7. Indonesian Text-To-Speech System Using Diphone Concatenative Synthesis

    Directory of Open Access Journals (Sweden)

    Sutarman

    2015-02-01

    Full Text Available In this paper, we describe the design and develop a database of Indonesian diphone synthesis using speech segment of recorded voice to be converted from text to speech and save it as audio file like WAV or MP3. In designing and develop a database of Indonesian diphone there are several steps to follow; First, developed Diphone database includes: create a list of sample of words consisting of diphones organized by prioritizing looking diphone located in the middle of a word if not at the beginning or end; recording the samples of words by segmentation. ;create diphones made with a tool Diphone Studio 1.3. Second, develop system using Microsoft Visual Delphi 6.0, includes: the conversion system from the input of numbers, acronyms, words, and sentences into representations diphone. There are two kinds of conversion (process alleged in analyzing the Indonesian text-to-speech system. One is to convert the text to be sounded to phonem and two, to convert the phonem to speech. Method used in this research is called Diphone Concatenative synthesis, in which recorded sound segments are collected. Every segment consists of a diphone (2 phonems. This synthesizer may produce voice with high level of naturalness. The Indonesian Text to Speech system can differentiate special phonemes like in ‘Beda’ and ‘Bedak’ but sample of other spesific words is necessary to put into the system. This Indonesia TTS system can handle texts with abbreviation, there is the facility to add such words.

  8. Cluster Based Text Classification Model

    DEFF Research Database (Denmark)

    Nizamani, Sarwat; Memon, Nasrullah; Wiil, Uffe Kock

    2011-01-01

    We propose a cluster based classification model for suspicious email detection and other text classification tasks. The text classification tasks comprise many training examples that require a complex classification model. Using clusters for classification makes the model simpler and increases...... the accuracy at the same time. The test example is classified using simpler and smaller model. The training examples in a particular cluster share the common vocabulary. At the time of clustering, we do not take into account the labels of the training examples. After the clusters have been created......, the classifier is trained on each cluster having reduced dimensionality and less number of examples. The experimental results show that the proposed model outperforms the existing classification models for the task of suspicious email detection and topic categorization on the Reuters-21578 and 20 Newsgroups...

  9. Linguistic dating of biblical texts

    DEFF Research Database (Denmark)

    Young, Ian; Rezetko, Robert; Ehrensvärd, Martin Gustaf

    Since the beginning of critical scholarship biblical texts have been dated using linguistic evidence. In recent years this has become a controversial topic, especially with the publication of Ian Young (ed.), Biblical Hebrew: Studies in Chronology and Typology (2003). However, until now there has...... been no introduction and comprehensive study of the field. Volume 1 introduces the field of linguistic dating of biblical texts, particularly to intermediate and advanced students of biblical Hebrew who have a reasonable background in the language, having completed at least an introductory course...... in this volume are: What is it that makes Archaic Biblical Hebrew archaic , Early Biblical Hebrew early , and Late Biblical Hebrew late ? Does linguistic typology, i.e. different linguistic characteristics, convert easily and neatly into linguistic chronology, i.e. different historical origins? A large amount...

  10. Nigel: A Systemic Grammar for Text Generation.

    Science.gov (United States)

    1983-02-01

    presumed. Basic references on the systemic framework include [Berry 75, Berry 77, Halliday 76a, Halliday 76b, Hudson 76, Halliday 81, de Joia 80...Edinburgh, 1979. [do Joia 80] de Joia , A., and A. Stanton, Terms in Systemic Linguistics, Batsford Academic and Educational, Ltd., London, 1980. -’C...1 A Grammar for Text Generation- -The Challenge ................................. 1 *1.2 A Grammar for Text Generation--The Design

  11. Text as an Autopoietic System

    DEFF Research Database (Denmark)

    Nicolaisen, Maria Skou

    2016-01-01

    The aim of the present research article is to discuss the possibilities and limitations in addressing text as an autopoietic system. The theory of autopoiesis originated in the field of biology in order to explain the dynamic processes entailed in sustaining living organisms at cellular level. Th....... By comparing the biological with the textual account of autopoietic agency, the end conclusion is that a newly derived concept of sociopoiesis might be better suited for discussing the architecture of textual systems....

  12. The TEXT upgrade vertical interferometer

    International Nuclear Information System (INIS)

    Hallock, G.A.; Gartman, M.L.; Li, W.; Chiang, K.; Shin, S.; Castles, R.L.; Chatterjee, R.; Rahman, A.S.

    1992-01-01

    A far-infrared interferometer has been installed on TEXT upgrade to obtain electron density profiles. The primary system views the plasma vertically through a set of large (60-cm radialx7.62-cm toroidal) diagnostic ports. A 1-cm channel spacing (59 channels total) and fast electronic time response is used, to provide high resolution for radial profiles and perturbation experiments. Initial operation of the vertical system was obtained late in 1991, with six operating channels

  13. Reasoning with Annotations of Texts

    OpenAIRE

    Ma , Yue; Lévy , François; Ghimire , Sudeep

    2011-01-01

    International audience; Linguistic and semantic annotations are important features for text-based applications. However, achieving and maintaining a good quality of a set of annotations is known to be a complex task. Many ad hoc approaches have been developed to produce various types of annotations, while comparing those annotations to improve their quality is still rare. In this paper, we propose a framework in which both linguistic and domain information can cooperate to reason with annotat...

  14. The Relationship between Paraphrasing and Text Analysis

    Directory of Open Access Journals (Sweden)

    María Luisa Cepeda Islas

    2013-04-01

    Full Text Available Given the importance of paraphrasing in the process of comprehension for college students, this study assessed the level of implementation of text analysis and paraphrases the response of a sample of senior students of the career psychology. We selected a group of freshmen to the Psychology course, which was asked to answer a questionnaire and carry out the summary of an empirical article. The results showed that participants have a low level of text analysis, at the same time had low levels of paraphrasing. It was seen that the predominant textual copy. They envision some possibilities for the structure of a training workshop not only paraphrasing but on the analysis of text.

  15. Text-Filled Stacked Area Graphs

    DEFF Research Database (Denmark)

    Kraus, Martin

    2011-01-01

    -filled stacked area graphs; i.e., graphs that feature stacked areas that are filled with small-typed text. Since these graphs allow for computing the text layout automatically, it is possible to include large amounts of textual detail with very little effort. We discuss the most important challenges and some...... solutions for the design of text-filled stacked area graphs with the help of an exemplary visualization of the genres, publication years, and titles of a database of several thousand PC games....

  16. Quantum mechanics a comprehensive text for chemistry

    CERN Document Server

    Arora, Kishor

    2010-01-01

    This book contains 14 chapters. The text includes the inadequacy of classical mechanics and covers basic and fundamental concepts of quantum mechanics including concepts of transitional, vibration rotation and electronic energies, introduction to concepts of angular momenta, approximatemethods and their application concepts related to electron spin, symmetery concepts and quantum mechanics and ultimately the book features the theories of chemical bonding and use of softwares in quantum mechanics. the text of the book is presented in a lucid manner with ample examples and illustrations wherever

  17. CONVERSATIONS -- AND NEGOTIATED INTERACTION -- IN TEXT AND VOICE CHAT ROOMS

    Directory of Open Access Journals (Sweden)

    Kevin Jepson

    2005-09-01

    Full Text Available Despite the expanded use of the Internet for language learning and practice, little attention if any has been given to the quality of interaction among English L2 speakers in conversational text or voice chat rooms. This study explored the patterns of repair moves in synchronous non-native speaker (NNS text chat rooms in comparison to voice chat rooms on the Internet. The following questions were posed: (a Which types of repair moves occur in text and voice chats; and (b what are the differences, if any, between the repair moves in text chats and voice chats when time is held constant? Repair moves made by anonymous NNSs in 10, 5-minute, synchronous chat room sessions (5 text-chat sessions, 5 voice-chat sessions were counted and analyzed using chi-square with alpha set at .05. Significant differences were found between the higher number of total repair moves made in voice chats and the smaller number in text chats. Qualitative data analysis showed that repair work in voice chats was often pronunciation-related. The study includes discussion that may affect teachers' and learners' considerations of the value of NNS chat room interaction for second language development.

  18. LITURGICAL TEXT IN RUSSIAN LITERATURE. PROBLEM STATEMENT

    Directory of Open Access Journals (Sweden)

    Avetis Serezhaevich Seropyan

    2012-11-01

    Full Text Available The article analyses artistic expressions of liturgical language in the literary text and its interaction of the Holy Tradition. Many Russian authors knew the liturgical text well. Studying it reveals the crucial meaning of the Gospel and liturgical texts (as part of the Holy Tradition for Russian literature. Authors saw the essence of every phenomenon in the word for it, and the nature of God in His name. Some ideas and sayings of the authors and their characters find their sources in liturgical texts. The article focuses on liturgical sources of some characters' commemorations and invocations, as well as poetical topics of the symbolists, Dostoevsky's famous dictum on beauty which will save the world (The Idiot, etc. De-cyphering this liturgical code will help us learn and comprehend the hidden endless meaning of a literary text. The specific feature of Russian literature is its pursuit of the spiritual liturgical exploration of the world, an exploration when truth takes shape and thus becomes real in both literary text and history.

  19. Text mining by Tsallis entropy

    Science.gov (United States)

    Jamaati, Maryam; Mehri, Ali

    2018-01-01

    Long-range correlations between the elements of natural languages enable them to convey very complex information. Complex structure of human language, as a manifestation of natural languages, motivates us to apply nonextensive statistical mechanics in text mining. Tsallis entropy appropriately ranks the terms' relevance to document subject, taking advantage of their spatial correlation length. We apply this statistical concept as a new powerful word ranking metric in order to extract keywords of a single document. We carry out an experimental evaluation, which shows capability of the presented method in keyword extraction. We find that, Tsallis entropy has reliable word ranking performance, at the same level of the best previous ranking methods.

  20. Biased limiter experiments on text

    International Nuclear Information System (INIS)

    Phillips, P.E.; Wootton, A.J.; Rowan, W.L.; Ritz, C.P.; Rhodes, T.L.; Bengtson, R.D.; Hodge, W.L.; Durst, R.D.; McCool, S.C.; Richards, B.; Gentle, K.W.; Schoch, P.; Forster, J.C.; Hickok, R.L.; Evans, T.E.

    1987-01-01

    Experiments using an electrically biased limiter have been performed on the Texas Experimental Tokamak (TEXT). A small movable limiter is inserted past the main poloidal ring limiter (which is electrically connected to the vacuum vessel) and biased at V Lim with respect to it. The floating potential, plasma potential and shear layer position can be controlled. With vertical strokeV Lim vertical stroke ≥ 50 V the plasma density increases. For V Lim Lim > 0 the results obtained are inconclusive. Variation of V Lim changes the electrostatic turbulence which may explain the observed total flux changes. (orig.)

  1. Achieving and sustaining full employment.

    Science.gov (United States)

    Rosen, S M

    1995-01-01

    Human rights and public health considerations provide strong support for policies that maximize employment. Ample historical and conceptual evidence supports the feasibility of full employment policies. New factors affecting the labor force, the rate of technological change, and the globalization of economic activity require appropriate policies--international as well as national--but do not invalidate the ability of modern states to apply the measures needed. Among these the most important include: (I) systematic reduction in working time with no loss of income, (2) active labor market policies, (3) use of fiscal and monetary measures to sustain the needed level of aggregate demand, (4) restoration of equal bargaining power between labor and capital, (5) social investment in neglected and outmoded infrastructure, (6) accountability of corporations for decisions to shift or reduce capital investment, (7) major reductions in military spending, to be replaced by socially needed and economically productive expenditures, (8) direct public sector job creation, (9) reform of monetary policy to restore emphasis on minimizing unemployment and promoting full employment. None are without precedent in modern economies. The obstacles are ideological and political. To overcome them will require intellectual clarity and effective advocacy.

  2. Grammar Texts and Consumerist Subtexts

    Science.gov (United States)

    Sokolik, M. E.

    2007-01-01

    While several checklists exist for the evaluation of ESL/EFL textbooks, none includes suggestions for looking for specific biases, especially those found in the content of examples and sample sentences. Growing awareness in publishing has reduced problems in the presentation of gender-based and racial biases in most ESL/EFL grammar textbooks, but…

  3. Chinese legal texts – Quantitative Description

    Directory of Open Access Journals (Sweden)

    Ľuboš GAJDOŠ

    2017-06-01

    Full Text Available The aim of the paper is to provide a quantitative description of legal Chinese. This study adopts the approach of corpus-based analyses and it shows basic statistical parameters of legal texts in Chinese, namely the length of a sentence, the proportion of part of speech etc. The research is conducted on the Chinese monolingual corpus Hanku. The paper also discusses the issues of statistical data processing from various corpora, e.g. the tokenisation and part of speech tagging and their relevance to study of registers variation.

  4. Data-Model Relationship in Text-Independent Speaker Recognition

    Directory of Open Access Journals (Sweden)

    Stapert Robert

    2005-01-01

    Full Text Available Text-independent speaker recognition systems such as those based on Gaussian mixture models (GMMs do not include time sequence information (TSI within the model itself. The level of importance of TSI in speaker recognition is an interesting question and one addressed in this paper. Recent works has shown that the utilisation of higher-level information such as idiolect, pronunciation, and prosodics can be useful in reducing speaker recognition error rates. In accordance with these developments, the aim of this paper is to show that as more data becomes available, the basic GMM can be enhanced by utilising TSI, even in a text-independent mode. This paper presents experimental work incorporating TSI into the conventional GMM. The resulting system, known as the segmental mixture model (SMM, embeds dynamic time warping (DTW into a GMM framework. Results are presented on the 2000-speaker SpeechDat Welsh database which show improved speaker recognition performance with the SMM.

  5. Authentic texts in teaching French as a foreign language

    Directory of Open Access Journals (Sweden)

    Meta Lah

    2010-12-01

    Full Text Available The present paper is aimed at providing a ref lection on the use of authentic texts in French as a foreign language classroom. The author bases herself on an analysis of texts taken from four textbook sets (Le nouveau sans fronti`eres, Panorama, Campus and Rond point, which were or are still used in teaching French as a foreign language. Initially, a definition of authenticity and a survey of authentic material usage through history are provided. In the overview of the texts forming the corpus the texts are divided into authentic, adapted, apparently authentic and those for which no assumption can be made as to their authenticity. The authenticity analysis is also carried out by taking into account the analysis of/categorisation into text types (according to Adam. The author proceeds from two premises, i.e. firstly she foresees that authentic texts will be present in all text books analysed and secondly, considering the greater accessibility of materials, that their presence will be more pronounced in recent textbooks. However, none of the two hypo theses is confirmed, as authentic texts are found in the first three textbook sets, but not in the most recent one, while their presence is most pronounced in the oldest textbook set, i.e. in Le nouveau sans fronti`eres. The result of the analysis is thus somehow surprising given the overall accessibility of all kinds of authentic materials. In the author's opinion more authentic texts should be included into textbooks to thus enhance the purposeful ness of the foreign language classroom.

  6. Transfer Learning beyond Text Classification

    Science.gov (United States)

    Yang, Qiang

    Transfer learning is a new machine learning and data mining framework that allows the training and test data to come from different distributions or feature spaces. We can find many novel applications of machine learning and data mining where transfer learning is necessary. While much has been done in transfer learning in text classification and reinforcement learning, there has been a lack of documented success stories of novel applications of transfer learning in other areas. In this invited article, I will argue that transfer learning is in fact quite ubiquitous in many real world applications. In this article, I will illustrate this point through an overview of a broad spectrum of applications of transfer learning that range from collaborative filtering to sensor based location estimation and logical action model learning for AI planning. I will also discuss some potential future directions of transfer learning.

  7. Chapter 16: text mining for translational bioinformatics.

    Science.gov (United States)

    Cohen, K Bretonnel; Hunter, Lawrence E

    2013-04-01

    Text mining for translational bioinformatics is a new field with tremendous research potential. It is a subfield of biomedical natural language processing that concerns itself directly with the problem of relating basic biomedical research to clinical practice, and vice versa. Applications of text mining fall both into the category of T1 translational research-translating basic science results into new interventions-and T2 translational research, or translational research for public health. Potential use cases include better phenotyping of research subjects, and pharmacogenomic research. A variety of methods for evaluating text mining applications exist, including corpora, structured test suites, and post hoc judging. Two basic principles of linguistic structure are relevant for building text mining applications. One is that linguistic structure consists of multiple levels. The other is that every level of linguistic structure is characterized by ambiguity. There are two basic approaches to text mining: rule-based, also known as knowledge-based; and machine-learning-based, also known as statistical. Many systems are hybrids of the two approaches. Shared tasks have had a strong effect on the direction of the field. Like all translational bioinformatics software, text mining software for translational bioinformatics can be considered health-critical and should be subject to the strictest standards of quality assurance and software testing.

  8. Some aspects of gender inequality in selected African literary texts

    Directory of Open Access Journals (Sweden)

    L.L. Kwatsha

    2009-07-01

    Full Text Available This article focuses on gender inequalities in selected African literary texts. I explore the way in which some African writers view gender inequalities and stereotypes in their characters. We will also be able to see who is involved and affected by these gender inequalities and how. Gender theory will be used as a framework. The aspects of gender that are discussed, include gender stereotypes, gender roles, gender identity, the superiority of men, inequality in polygamous marriages, sex roles, the sexual division of labour and arranged marriage. This study will also include the views of writers from other part of Africa. These views have a lot in common but sometimes they vary because of the influence and different ideologies of the society concerned.

  9. The Pelindaba text and its previous

    International Nuclear Information System (INIS)

    Adeniji, O.

    1996-01-01

    The main body of the Treaty, the preamble, articles 1-22, and the map are reproduced in this issue in the section ''Documentation Relating to Disarmament and International Security''. The complete text, including annexes and protocols, is contained in document A/50/426

  10. Studies of electron cyclotron emission on text

    International Nuclear Information System (INIS)

    Gandy, R.F.

    1990-07-01

    The Auburn University electron cyclotron emission (ECE) system has made many significant contributions to the TEXT experimental program during the past five years. Contributions include electron temperature information used in the following areas of study: electron cyclotron heating (ECH), pellet injection, and impurity/energy transport. Details of the role which the Auburn ECE system has played will now be discussed

  11. Text Entry by Gazing and Smiling

    Directory of Open Access Journals (Sweden)

    Outi Tuisku

    2013-01-01

    Full Text Available Face Interface is a wearable prototype that combines the use of voluntary gaze direction and facial activations, for pointing and selecting objects on a computer screen, respectively. The aim was to investigate the functionality of the prototype for entering text. First, three on-screen keyboard layout designs were developed and tested (n=10 to find a layout that would be more suitable for text entry with the prototype than traditional QWERTY layout. The task was to enter one word ten times with each of the layouts by pointing letters with gaze and select them by smiling. Subjective ratings showed that a layout with large keys on the edge and small keys near the center of the keyboard was rated as the most enjoyable, clearest, and most functional. Second, using this layout, the aim of the second experiment (n=12 was to compare entering text with Face Interface to entering text with mouse. The results showed that text entry rate for Face Interface was 20 characters per minute (cpm and 27 cpm for the mouse. For Face Interface, keystrokes per character (KSPC value was 1.1 and minimum string distance (MSD error rate was 0.12. These values compare especially well with other similar techniques.

  12. Inspiration and the Texts of the Bible

    Directory of Open Access Journals (Sweden)

    Dirk Buchner

    1997-12-01

    Full Text Available This article seeks to explore what the inspired text of the Old Testament was as it existed for the New Testament authors, particularly for the author of the book of Hebrews. A quick look at the facts makes. it clear that there was, at the time, more than one 'inspired' text, among these were the Septuagint and the Masoretic Text 'to name but two'. The latter eventually gained ascendancy which is why it forms the basis of our translated Old Testament today. Yet we have to ask: what do we make of that other text that was the inspired Bible to the early Church, especially to the writer of the book of Hebrews, who ignored the Masoretic text? This article will take a brief look at some suggestions for a doctrine of inspiration that keeps up with the facts of Scripture. Allied to this, the article is something of a bibliographical study of recent developments in textual research following the discovery of the Dead Sea scrolls.

  13. Ancient medical texts, modern reading problems

    Directory of Open Access Journals (Sweden)

    Maria Carlota Rosa

    2006-12-01

    Full Text Available The word tradition has a very specific meaning in linguistics: the passing down of a text, which may have been completed or corrected by different copyists at different times, when the concept of authorship was not the same as it is today. When reading an ancient text the word tradition must be in the reader's mind. To discuss one of the problems an ancient text poses to its modern readers, this work deals with one of the first printed medical texts in Portuguese, the Regimento proueytoso contra ha pestenença, and draws a parallel between it and two related texts, A moche profitable treatise against the pestilence, and the Recopilaçam das cousas que conuem guardar se no modo de preseruar à Cidade de Lixboa E os sãos, & curar os que esteuerem enfermos de Peste. The problems which arise out of the textual structure of those books show how difficult is to establish a tradition of another type, the medical tradition. The linguistic study of the innumerable medieval plague treatises may throw light on the continuities and on the disruptions of the so-called hippocratic-galenical medical tradition.

  14. Reading an ESL Writer’s Text

    Directory of Open Access Journals (Sweden)

    Paul Kei Matsuda

    2011-03-01

    Full Text Available This paper focuses on reading as a central act of communication in the tutorial session. Writing center tutors without extensive experience reading writing by second language writers may have difficulty getting past the many differences in surface-level features, organization, and rhetorical moves. After exploring some of the sources of these differences in writing, the authors present strategies that writing tutors can use to work effectively with second language writers.

  15. Full-scope training simulators

    International Nuclear Information System (INIS)

    Ugedo, E.

    1986-01-01

    The following topics to be covered in this report are: Reasons justifying the use of full-scope simulators for operator qualification. Full-scope simulator description: the control room, the physical models, the computer complex, the instructor's console. Main features of full-scope simulators. Merits of simulator training. The role of full-scope simulators in the training programs. The process of ordering and acquiring a full-scope simulator. Maintaining and updating simulator capabilities. (orig./GL)

  16. No More Provincialism: Art and Text

    Directory of Open Access Journals (Sweden)

    Heather Barker

    2010-11-01

    Full Text Available This essay discusses the writing and personalities surrounding the 1981 establishment of the Australian art magazine, Art & Text, and traces its progression under Paul Taylor’s editorship up to his relocation to New York. During this period, Art & Text published Taylor’s own essays and, more importantly, those of other writers and artists — Meaghan Morris, Paul Foss, Philip Brophy, Imants Tillers, Rex Butler, Edward Colless — all articulating a consistent and complex postmodern position. The magazine’s founder and editor, Paul Taylor, personified the shattering impact of postmodernism upon the Australian art world as well as postmodernism’s limitations. Taylor facilitated a new theoretical framework for the discussion of Australian art, one that continues to dominate the internationalist aspirations of Australian art writers. He produced temporarily convincing solutions to problems that earlier critics had wrestled with unsuccessfully, in particular the twin problems of provincialism, and the relationship of Australian to international art.

  17. A programmed text in statistics

    CERN Document Server

    Hine, J

    1975-01-01

    Exercises for Section 2 42 Physical sciences and engineering 42 43 Biological sciences 45 Social sciences Solutions to Exercises, Section 1 47 Physical sciences and engineering 47 49 Biological sciences 49 Social sciences Solutions to Exercises, Section 2 51 51 PhYSical sciences and engineering 55 Biological sciences 58 Social sciences 62 Tables 2 62 x - tests involving variances 2 63,64 x - one tailed tests 2 65 x - two tailed tests F-distribution 66-69 Preface This project started some years ago when the Nuffield Foundation kindly gave a grant for writing a pro­ grammed text to use with service courses in statistics. The work carried out by Mrs. Joan Hine and Professor G. B. Wetherill at Bath University, together with some other help from time to time by colleagues at Bath University and elsewhere. Testing was done at various colleges and universities, and some helpful comments were received, but we particularly mention King Edwards School, Bath, who provided some sixth formers as 'guinea pigs' for the fir...

  18. The Impact of Texting on Comprehension

    Directory of Open Access Journals (Sweden)

    Jamal K. M. Ali

    2015-07-01

    Full Text Available This paper presents a study of the effects of texting on English language comprehension. The authors believe that English used in texting causes a lack of comprehension for English speakers, learners, and texters. Wei, Xian-hai and Jiang (2008:3 declare “In Netspeak, there are some newly-created vocabularies, which people cannot comprehend them either from their partial pronunciation or from their figures.” Crystal (2007:23 claims; “variation causes problems of comprehension and acceptability. If you speak or write differently from the way I do, we may fail to understand each other.”  In this paper, the authors conducted a questionnaire at Aligarh Muslim University to ninety respondents from five different Faculties and four different levels. To measure respondents’ comprehension of English texting, the authors gave the respondents abbreviations used by texters and asked them to write the full forms of the abbreviations. The authors found that many abbreviations were not understood, which suggested that most of the respondents did not understand and did not use these abbreviations.

  19. Training for Auditing (Listening of Foreign Texts: Methodology and Experience

    Directory of Open Access Journals (Sweden)

    Anzhelika S. Boutousova

    2017-10-01

    Full Text Available Auditing is considered systematically as a psychophysiological and cognitive process, on the one hand, and as a type of speech activity, on the other. The levels and stages of learning to listen to foreign language texts with their inherent difficulties are singled out. There are elementary, intermediate and advanced levels of learning listening. The stages of training are divided into pretext, text and post-text. Based on the analysis of scientific literature and personal observations, language, cognitive and socio-cultural difficulties in listening have been discovered. A system of exercises aimed at forming an auditory skills is described. Audience skills include segmentation of speech into parts, anticipation of the meaning of parts of words and sentences, forecasting of form and meaning at the text level, skills related to the development of the mechanism of memory; compression and interpretation of the text. The auditory skills are interpreted as listening recognition and understanding of individual words and expressions and grammatical structures.

  20. An Analysis on Reading Texts in Teaching Turkish to Foreigners

    Directory of Open Access Journals (Sweden)

    Adem İŞCAN

    2017-09-01

    Full Text Available Being one of the four basic language skills, reading has a great importance in teaching Turkish to foreigners. It is required to develop reading skills to develop vocabulary. There have been some problems in teaching Turkish as second language. These problems are generally related to difference in alphabet, inadequacy of the sources used in teaching Turkish, methods and techniques used and the texts used. The basic sources used in teaching Turkish to foreigners are texts. This study aims at determination of the opinions of students in Gaziosmanpaşa University and Ondokuz Mayıs University Turkish Education and Application Center (TOMER concerning Turkish reading texts. General browsing method was used in the study. The questionnaire comprising of 24 items was applied to 25 students in beginner level and 7 students in advanced level. With this study, it is foreseen to arrange the texts being the key stone according to the wishes of and in compliance with the levels of students; giving importance to pre-reading, reading and post-reading activities and including questions with short-answer about the text as well as questions to develop high level skills.

  1. Enhancing biomedical text summarization using semantic relation extraction.

    Directory of Open Access Journals (Sweden)

    Yue Shang

    Full Text Available Automatic text summarization for a biomedical concept can help researchers to get the key points of a certain topic from large amount of biomedical literature efficiently. In this paper, we present a method for generating text summary for a given biomedical concept, e.g., H1N1 disease, from multiple documents based on semantic relation extraction. Our approach includes three stages: 1 We extract semantic relations in each sentence using the semantic knowledge representation tool SemRep. 2 We develop a relation-level retrieval method to select the relations most relevant to each query concept and visualize them in a graphic representation. 3 For relations in the relevant set, we extract informative sentences that can interpret them from the document collection to generate text summary using an information retrieval based method. Our major focus in this work is to investigate the contribution of semantic relation extraction to the task of biomedical text summarization. The experimental results on summarization for a set of diseases show that the introduction of semantic knowledge improves the performance and our results are better than the MEAD system, a well-known tool for text summarization.

  2. Mathematical Model of Thyristor Inverter Including a Series-parallel Resonant Circuit

    Directory of Open Access Journals (Sweden)

    Miroslaw Luft

    2008-01-01

    Full Text Available The article presents a mathematical model of thyristor inverter including a series-parallel resonant circuit with theaid of state variable method. Maple procedures are used to compute current and voltage waveforms in the inverter.

  3. ERRORS AND DIFFICULTIES IN TRANSLATING LEGAL TEXTS

    Directory of Open Access Journals (Sweden)

    Camelia, CHIRILA

    2014-11-01

    Full Text Available Nowadays the accurate translation of legal texts has become highly important as the mistranslation of a passage in a contract, for example, could lead to lawsuits and loss of money. Consequently, the translation of legal texts to other languages faces many difficulties and only professional translators specialised in legal translation should deal with the translation of legal documents and scholarly writings. The purpose of this paper is to analyze translation from three perspectives: translation quality, errors and difficulties encountered in translating legal texts and consequences of such errors in professional translation. First of all, the paper points out the importance of performing a good and correct translation, which is one of the most important elements to be considered when discussing translation. Furthermore, the paper presents an overview of the errors and difficulties in translating texts and of the consequences of errors in professional translation, with applications to the field of law. The paper is also an approach to the differences between languages (English and Romanian that can hinder comprehension for those who have embarked upon the difficult task of translation. The research method that I have used to achieve the objectives of the paper was the content analysis of various Romanian and foreign authors' works.

  4. n-Gram-Based Text Compression

    Directory of Open Access Journals (Sweden)

    Vu H. Nguyen

    2016-01-01

    Full Text Available We propose an efficient method for compressing Vietnamese text using n-gram dictionaries. It has a significant compression ratio in comparison with those of state-of-the-art methods on the same dataset. Given a text, first, the proposed method splits it into n-grams and then encodes them based on n-gram dictionaries. In the encoding phase, we use a sliding window with a size that ranges from bigram to five grams to obtain the best encoding stream. Each n-gram is encoded by two to four bytes accordingly based on its corresponding n-gram dictionary. We collected 2.5 GB text corpus from some Vietnamese news agencies to build n-gram dictionaries from unigram to five grams and achieve dictionaries with a size of 12 GB in total. In order to evaluate our method, we collected a testing set of 10 different text files with different sizes. The experimental results indicate that our method achieves compression ratio around 90% and outperforms state-of-the-art methods.

  5. Full Employment in Industrialized Countries.

    Science.gov (United States)

    Britton, Andrew

    1997-01-01

    Argues that full employment must be acceptable on both social and economic grounds. Examines profound changes in industrialized economies since the 1970s and the diversity of employment contracts. Suggests that difficult policy decisions surround full employment. (SK)

  6. Text mining resources for the life sciences.

    Science.gov (United States)

    Przybyła, Piotr; Shardlow, Matthew; Aubin, Sophie; Bossy, Robert; Eckart de Castilho, Richard; Piperidis, Stelios; McNaught, John; Ananiadou, Sophia

    2016-01-01

    Text mining is a powerful technology for quickly distilling key information from vast quantities of biomedical literature. However, to harness this power the researcher must be well versed in the availability, suitability, adaptability, interoperability and comparative accuracy of current text mining resources. In this survey, we give an overview of the text mining resources that exist in the life sciences to help researchers, especially those employed in biocuration, to engage with text mining in their own work. We categorize the various resources under three sections: Content Discovery looks at where and how to find biomedical publications for text mining; Knowledge Encoding describes the formats used to represent the different levels of information associated with content that enable text mining, including those formats used to carry such information between processes; Tools and Services gives an overview of workflow management systems that can be used to rapidly configure and compare domain- and task-specific processes, via access to a wide range of pre-built tools. We also provide links to relevant repositories in each section to enable the reader to find resources relevant to their own area of interest. Throughout this work we give a special focus to resources that are interoperable-those that have the crucial ability to share information, enabling smooth integration and reusability. © The Author(s) 2016. Published by Oxford University Press.

  7. Text mining resources for the life sciences

    Science.gov (United States)

    Shardlow, Matthew; Aubin, Sophie; Bossy, Robert; Eckart de Castilho, Richard; Piperidis, Stelios; McNaught, John; Ananiadou, Sophia

    2016-01-01

    Text mining is a powerful technology for quickly distilling key information from vast quantities of biomedical literature. However, to harness this power the researcher must be well versed in the availability, suitability, adaptability, interoperability and comparative accuracy of current text mining resources. In this survey, we give an overview of the text mining resources that exist in the life sciences to help researchers, especially those employed in biocuration, to engage with text mining in their own work. We categorize the various resources under three sections: Content Discovery looks at where and how to find biomedical publications for text mining; Knowledge Encoding describes the formats used to represent the different levels of information associated with content that enable text mining, including those formats used to carry such information between processes; Tools and Services gives an overview of workflow management systems that can be used to rapidly configure and compare domain- and task-specific processes, via access to a wide range of pre-built tools. We also provide links to relevant repositories in each section to enable the reader to find resources relevant to their own area of interest. Throughout this work we give a special focus to resources that are interoperable—those that have the crucial ability to share information, enabling smooth integration and reusability. PMID:27888231

  8. Annotated chemical patent corpus: a gold standard for text mining.

    Directory of Open Access Journals (Sweden)

    Saber A Akhondi

    Full Text Available Exploring the chemical and biological space covered by patent applications is crucial in early-stage medicinal chemistry activities. Patent analysis can provide understanding of compound prior art, novelty checking, validation of biological assays, and identification of new starting points for chemical exploration. Extracting chemical and biological entities from patents through manual extraction by expert curators can take substantial amount of time and resources. Text mining methods can help to ease this process. To validate the performance of such methods, a manually annotated patent corpus is essential. In this study we have produced a large gold standard chemical patent corpus. We developed annotation guidelines and selected 200 full patents from the World Intellectual Property Organization, United States Patent and Trademark Office, and European Patent Office. The patents were pre-annotated automatically and made available to four independent annotator groups each consisting of two to ten annotators. The annotators marked chemicals in different subclasses, diseases, targets, and modes of action. Spelling mistakes and spurious line break due to optical character recognition errors were also annotated. A subset of 47 patents was annotated by at least three annotator groups, from which harmonized annotations and inter-annotator agreement scores were derived. One group annotated the full set. The patent corpus includes 400,125 annotations for the full set and 36,537 annotations for the harmonized set. All patents and annotated entities are publicly available at www.biosemantics.org.

  9. Basic Test Framework for the Evaluation of Text Line Segmentation and Text Parameter Extraction

    Directory of Open Access Journals (Sweden)

    Darko Brodić

    2010-05-01

    Full Text Available Text line segmentation is an essential stage in off-line optical character recognition (OCR systems. It is a key because inaccurately segmented text lines will lead to OCR failure. Text line segmentation of handwritten documents is a complex and diverse problem, complicated by the nature of handwriting. Hence, text line segmentation is a leading challenge in handwritten document image processing. Due to inconsistencies in measurement and evaluation of text segmentation algorithm quality, some basic set of measurement methods is required. Currently, there is no commonly accepted one and all algorithm evaluation is custom oriented. In this paper, a basic test framework for the evaluation of text feature extraction algorithms is proposed. This test framework consists of a few experiments primarily linked to text line segmentation, skew rate and reference text line evaluation. Although they are mutually independent, the results obtained are strongly cross linked. In the end, its suitability for different types of letters and languages as well as its adaptability are its main advantages. Thus, the paper presents an efficient evaluation method for text analysis algorithms.

  10. Difficulties in translation of socio-political texts

    Directory of Open Access Journals (Sweden)

    Артур Нарманович Мамедов

    2013-12-01

    Full Text Available Belonging of Russian socio-political texts to publicistic style assumes being guided by functional approach in order to find most adequate linguistic means by transfer of pragmatic meaning of the source text. Intralinguistic meaning can slightly remain by the interpretation of German texts. Lexical and grammatical transformations help preserving semantic-syntactic structure of the target text which means achievement of the same communicative effect by the translate which is being achieved by the source text.

  11. Digital Rights Management for a Chinese XML Text Centre

    Institute of Scientific and Technical Information of China (English)

    2002-01-01

    The Electronic Text Centre of the OpenUniversity of Hong Kong(OUHK)has been in full operationsince early 2001.It currently houses 7,300+electronictexts,including free electronic titles,electronic titlespurchased directly from the market,and about,1,000 locallyproduced electronic titles.The locally produced titles are notavailable in the market but require local digitization andnegotiation with publishers with regard to the right to use(RTU)them so as to meet the learning needs of the OUHKcommunity.Nearl...

  12. The home concept in poetic texts: new ways of understanding

    Directory of Open Access Journals (Sweden)

    С А Радзиевская

    2010-03-01

    Full Text Available The article focuses on the analysis of the HOME concept in American poetic texts and on the description of the model of its content. Linguocognitive mechanisms of the formation of the images of home are revealed.

  13. Monolingual accounting dictionaries for EFL text production

    DEFF Research Database (Denmark)

    Nielsen, Sandro

    2006-01-01

    Monolingual accounting dictionaries are important for producing financial reporting texts in English in an international setting, because of the lack of specialised bilingual dictionaries. As the intended user groups have different factual and linguistic competences, they require specific types...... text production. The monolingual accounting dictionary needs to include information about UK, US and international accounting terms, their grammatical properties, their potential for being combined with other words in collocations, phrases and sentences in order to meet user requirements. Data items...... of information. By identifying and analysing the users' factual and linguistic competences, user needs, use-situations and the stages involved in producing accounting texts in English as a foreign language, lexicographers will have a sound basis for designing the optimal English accounting dictionary for EFL...

  14. Monolingual Accounting Dictionaries for EFL Text Production

    DEFF Research Database (Denmark)

    Nielsen, Sandro

    2009-01-01

    Monolingual accounting dictionaries are important for producing financial reporting texts in English in an international setting, because of the lack of specialised bilingual dictionaries. As the intended user groups have different factual and linguistic competences, they require specific types...... text production. The monolingual accounting dictionary needs to include information about UK, US and international accounting terms, their grammatical properties, their potential for being combined with other words in collocations, phrases and sentences in order to meet user requirements. Data items...... of information. By identifying and analysing the users' factual and linguistic competences, user needs, use-situations and the stages involved in producing accounting texts in English as a foreign language, lexicographers will have a sound basis for designing the optimal English accounting dictionary for EFL...

  15. Texte, intertexte et iconotexte dans le message publicitaire

    Directory of Open Access Journals (Sweden)

    I. C. CORJAN

    2007-12-01

    Full Text Available The paper states the fact that although the roles of the text and of the image in an advertisement are dynamic and appear in variable proportions depending on numerous criteria, they are also beautifully balanced, especially if compared from a functional point of view, determining and depending upon each other in a coherent discursive unity. Between text and image there are usually the following relations of reciprocity: support (argumentation, redundancy, paraphrase; complementarity (in-formation that is autonomous in form, but convergent in content; rhetorical amplification (meaning transfer, metaphorisation, hyperbolisation etc.; opposition (antithesis, exaggeration, non-antagonistic contradiction etc.. In general, both the verbal and the iconic develop, much to their mutual advantage, an extra meaning which the former establishes linguistically, and the latter exaggerates visually. In this case, however, the image always goes beyond the verbal information, thanks to its deep-rooted polysemy. Thus, in the special case of printed advertising, there are a few prevalent icono-textual structures, with obvious intertextual and paratextual functions: intertextuality of the writ-ten text; figurative iconic intertextuality; icono-textual intertextuality; double intertextuality: verbal-written and icono-textual. The end of the study puts forward a new formulation of the icono-text and of the typology of printed advertising included in written media and indoor/ outdoor posting.

  16. Functions of Expressions of Futurality in Professional Economic Texts

    Directory of Open Access Journals (Sweden)

    Mikuláš Martin

    2016-07-01

    Full Text Available The aim of this corpus-based study is to identify the functions that selected expressions of futurality can express in professional economic texts. The classification of functions is established on the corpus of seven economic books. Excerpted instances of futural constructions are analysed with respect to textual and interpersonal functions as defined by Halliday. Futurality is interpreted broadly to include all lexical and grammatical means referring to the future. This approach makes it also possible to analyse futurality as a means of text coherence. Hence the core grammatical means are interpreted along with co-occurring lexical means under the two categories of functions to provide a comprehensive model of text coherence with regard to futurality. Frequency analysis shows that core futural expressions are not distributed equally throughout the corpus. While some expressions (e.g., will and the present simple tense dominate, others prove to be rather insignificant (e.g., be on the point/verge of, the present progressive tense. In addition, both lexical and grammatical constructions regularly co-occur in clusters, contributing to the coherence of the economic texts.

  17. Recognizing Cursive Typewritten Text Using Segmentation-Free System

    Directory of Open Access Journals (Sweden)

    Mohammad S. Khorsheed

    2015-01-01

    Full Text Available Feature extraction plays an important role in text recognition as it aims to capture essential characteristics of the text image. Feature extraction algorithms widely range between robust and hard to extract features and noise sensitive and easy to extract features. Among those feature types are statistical features which are derived from the statistical distribution of the image pixels. This paper presents a novel method for feature extraction where simple statistical features are extracted from a one-pixel wide window that slides across the text line. The feature set is clustered in the feature space using vector quantization. The feature vector sequence is then injected to a classification engine for training and recognition purposes. The recognition system is applied to a data corpus which includes cursive Arabic text of more than 600 A4-size sheets typewritten in multiple computer-generated fonts. The system performance is compared to a previously published system from the literature with a similar engine but a different feature set.

  18. How Full is Full Employment? : How Tools and Not Theory Explained Full Employment

    NARCIS (Netherlands)

    Rodenburg, P.

    2016-01-01

    The post-war debate on full employment policy was blurred and unclear since the concept of full employment itself was theoretically unclear and un-operational. Unable to theoretically determine the unemployment level of full employment, economists tried to find more empirically based ways to

  19. Using Text Models In Diagnostic Tasks.

    Directory of Open Access Journals (Sweden)

    Korostil Yuriy

    2015-09-01

    Full Text Available This paper contains developing of a method of solving diagnostic tasks for complex technical objects (STO based on using text models (TMi to describe the functioning of STO. A TMi model is a text description, in normalized form, of all fragments of STO functioning process. The description of TMi is for med using semantic vocabularies of different types, which are generated on the basis of usage of information about all the aspects of STO construction and functioning. Such interpretation description is a subject area for tasks of STO diagnostics. Detection of malfunction and deviations of a functioning process of STO from an established functioning mode is implemented on the basis of analysis of semantic parameters of text description of the STO functioning process in order to determine semantic anomalies which occur in the descriptions of the STO functioning process, as well as in the descriptions of fragments of its functioning. Semantic anomalies occur in case when values of semantic parameters go beyond their established limits.

  20. Modeling statistical properties of written text.

    Directory of Open Access Journals (Sweden)

    M Angeles Serrano

    Full Text Available Written text is one of the fundamental manifestations of human language, and the study of its universal regularities can give clues about how our brains process information and how we, as a society, organize and share it. Among these regularities, only Zipf's law has been explored in depth. Other basic properties, such as the existence of bursts of rare words in specific documents, have only been studied independently of each other and mainly by descriptive models. As a consequence, there is a lack of understanding of linguistic processes as complex emergent phenomena. Beyond Zipf's law for word frequencies, here we focus on burstiness, Heaps' law describing the sublinear growth of vocabulary size with the length of a document, and the topicality of document collections, which encode correlations within and across documents absent in random null models. We introduce and validate a generative model that explains the simultaneous emergence of all these patterns from simple rules. As a result, we find a connection between the bursty nature of rare words and the topical organization of texts and identify dynamic word ranking and memory across documents as key mechanisms explaining the non trivial organization of written text. Our research can have broad implications and practical applications in computer science, cognitive science and linguistics.

  1. INNER DIALOGICITY OF MEDICAL SCIENTIFIC TEXTS

    Directory of Open Access Journals (Sweden)

    Efremova Nataliya Vladimirovna

    2015-06-01

    Full Text Available The author studies inner dialogicity as an integral property of a scientist's thinking activity, a way of a scientific idea development, one of the cognitive and discursive mechanisms of new knowledge formation, its crystallization and dementalisation in a text, as a way of search for truth. Such approach to dialogicity in the study of a scientific text makes it possible to analyze the cogitative processes proceeding in human consciousness and cognitive activity, allows to fully understand the stated scientific concept, to define pragmatic strategies of the author, to plunge into his reflexive world. On the material of medical scientific texts of N.M. Amosov and F. G. Uglov, famous scientists in the field of cardio surgery, it is established that traces of internal dialogicity manifestation in the textual space of scientists actualize the origin of new knowledge, the change of author's semantic positions, his ability to reflect, compare, analyze his own thoughts and actions, to estimate oneself and the features of thinking process which are realized in logic of a statement of the scientific concept, an explanation of concepts, terms at judgment of the points of view of contemporaries and predecessors, adherents and scientist's opponents, and also orientation to the addressee's presupposition, activization of his cogitative activity. Linguistic, discursive, verbal analysis singles out the impact on the addressee, his mental activity.

  2. AUTHENTIC TEXTS FOR CRITICAL READING ACTIVITIES

    Directory of Open Access Journals (Sweden)

    Ila Amalia

    2016-03-01

    Full Text Available This research takes an action research aimed at promoting critical reading (“thinking” while reading skills using authentic materials among the students. This research also aims to reveal the students perception on using critical reading skills in reading activities. Nineteen English Education Department students who took Reading IV class, participated in this project. There were three cycles with three different critical reading strategies were applied. Meanwhile, the authentic materials were taken from newspaper and internet articles. The result revealed that the use of critical reading strategies along with the use of authentic materials has improved students’ critical reading skills as seen from the improvement of each cycle - the students critical reading skill was 54% (fair in the cycle 1 improved to 68% (average in cycle 2, and 82% (good in cycle 3.. In addition, based on the critical reading skill criteria, the students’ critical reading skill has improved from 40% (nearly meet to 80% (exceed. Meanwhile, from the students’ perception questionnaire, it was shown that 63% students agreed the critical reading activity using authentic text could improve critical thinking and 58% students agreed that doing critical reading activity could improve reading comprehension. The result had the implication that the use of authentic texts could improve students’ critical reading skills if it was taught by performing not lecturing them. Selectively choosing various strategies and materials can trigger students’ activeness in responding to a text, that eventually shape their critical reading skills.

  3. Cell Phoning and Texting While Driving

    Directory of Open Access Journals (Sweden)

    Judy Honoria Rosaire Telemaque

    2015-07-01

    Full Text Available A qualitative phenomenological study was conducted on the consequences of cell phone use while operating a vehicle. We discussed why talking and texting on cell phones are so popular through the analysis of our interviews with police officers, driving instructors, and parents of teens and young adults. The participants came from central, northeastern, northwestern, and southeastern Connecticut. All had exposure with respect to the effects of cell phone usage problem. The study reached a point of theoretical saturation or redundancy by which the analysis no longer resulted in new themes. We concluded that the discoveries revealed the necessity for education, expansion of technology, and additional driver education preparation, which may provide a path for leadership to help solve the problem.

  4. Using ontology network structure in text mining.

    Science.gov (United States)

    Berndt, Donald J; McCart, James A; Luther, Stephen L

    2010-11-13

    Statistical text mining treats documents as bags of words, with a focus on term frequencies within documents and across document collections. Unlike natural language processing (NLP) techniques that rely on an engineered vocabulary or a full-featured ontology, statistical approaches do not make use of domain-specific knowledge. The freedom from biases can be an advantage, but at the cost of ignoring potentially valuable knowledge. The approach proposed here investigates a hybrid strategy based on computing graph measures of term importance over an entire ontology and injecting the measures into the statistical text mining process. As a starting point, we adapt existing search engine algorithms such as PageRank and HITS to determine term importance within an ontology graph. The graph-theoretic approach is evaluated using a smoking data set from the i2b2 National Center for Biomedical Computing, cast as a simple binary classification task for categorizing smoking-related documents, demonstrating consistent improvements in accuracy.

  5. Achieving single channel, full duplex wireless communication

    KAUST Repository

    Choi, Jung Il; Jain, Mayank; Srinivasan, Kannan; Levis, Phil; Katti, Sachin

    2010-01-01

    This paper discusses the design of a single channel full-duplex wireless transceiver. The design uses a combination of RF and baseband techniques to achieve full-duplexing with minimal effect on link reliability. Experiments on real nodes show the full-duplex prototype achieves median performance that is within 8% of an ideal full-duplexing system. This paper presents Antenna Cancellation, a novel technique for self-interference cancellation. In conjunction with existing RF interference cancellation and digital baseband interference cancellation, antenna cancellation achieves the amount of self-interference cancellation required for full-duplex operation. The paper also discusses potential MAC and network gains with full-duplexing. It suggests ways in which a full-duplex system can solve some important problems with existing wireless systems including hidden terminals, loss of throughput due to congestion, and large end-to-end delays. Copyright 2010 ACM.

  6. Habermas's Expressivist Theology: Chalice Half-Full?

    Directory of Open Access Journals (Sweden)

    Felmon John Davis

    2015-07-01

    Full Text Available The article addresses the question of the respect owed to believers and their faiths and states that a demand for respect for the person of the believer does not imply a demand for respect for their faith. However, being 'respect' a complex and ambiguous notion, the article studies some arguments that go in the direction of justifying the move from respect for persons to respect for their beliefs. According to Habermas, there is a respect citizens of a democracy owe each other that requires taking each other's opinions seriously, including their religiously motivated opinions. What is more, Habermas claims that we all have something to learn from each other. The articles argues against this line of thought and states there is no obligation to respect anything about people's moral claims except their right to make them. The article argues against Habermas's approach by showing its epistemological and ontological inconsistency and concludes that respect for persons as moral reason-givers or as fellow-citizens does not lead to any substantial respect for the contents of their claims.

  7. Making School Development Credible. Text, Context, Irony

    Directory of Open Access Journals (Sweden)

    Mats Börjesson

    2012-01-01

    Full Text Available

    The article argues for the importance of an open, reflexive-methodological approach when switching between studying text, context and researcher activity. Close linguistic analysis can benefit from being linked with the researcher’s contextualisation of his empirical material as well as with more distanced readings. The more specific starting point for this article is that school development, like other similar terms such as school improvement and the like, makes use of linguistic building blocks with which whole narratives about today’s and tomorrow’s schools can be constructed. The subject of the study is a short text issued by the Swedish Schools Inspectorate (Skolinspektionen. Government language changes according to the authorities’ role in society and their own definitions of their functions, and an important aspect here is the legitimacy of the authorities’ texts. By means of various kinds of close linguistic analysis, the above-mentioned text is studied with regard to choice of categories, hierarchies of modalisation and the rhetorical effects of different types of formulations in a broader political-social landscape. The article concludes with a reflective discussion on the relationship between government language and irony as a stylistic device – a device that is based on the results of the close empirical analysis.[i]



    [i] The article is part of the project ”School  Development as Narrative”, funded by the Swedish Research Council. The author would like to thank the two reviewers for very valuable comments.

  8. Reconfigurable Full-Page Braille Displays

    Science.gov (United States)

    Garner, H. Douglas

    1994-01-01

    Electrically actuated braille display cells of proposed type arrayed together to form full-page braille displays. Like other braille display cells, these provide changeable patterns of bumps driven by digitally recorded text stored on magnetic tapes or in solid-state electronic memories. Proposed cells contain electrorheological fluid. Viscosity of such fluid increases in strong electrostatic field.

  9. The interjection in old Romanian texts

    Directory of Open Access Journals (Sweden)

    Margareta Manu Magda

    2017-09-01

    Full Text Available The paper tries to identify the special problems posed by the study of interjection based on the examination of a corpus of texts from the old Romanian (1600–1780, referring to texts from modern Romanian. We have watched how certain interjectional formations have acquired, through diachronic expansion, new grammatical, semantic and pragmatic values.The structure of the paper is the following: the introduction (§1 summarizes the author’s position on the status of the interjection category at a morphosyntactic, semantic and pragmatic level (§1.1 and on the relation between different linguistic structures and their grammaticalization / pragmaticalization process (§1.2. The second section (§2 refers to the specific routes followed by the evolution of the various categories of the analysed interjections, from the old Romanian to the modern Romanian: the presentatives adecă, iată, ni (§2.1, the hortatives haide, ni (§2.2, the addressing particles bre, măi (§2.3, the connectors with demarcation signal function adevăr, amin (§2.4. The third section (§3 has as objective the description of a species of delocutive derivation, illustrated in Romanian by the lexicalized semantic variants of the secondary interjection Doamne!. The study concludes with several final considerations regarding the results of the research (§4.

  10. Speech Act Classification of German Advertising Texts

    Directory of Open Access Journals (Sweden)

    Артур Нарманович Мамедов

    2015-12-01

    Full Text Available This paper uses the theory of speech acts and the underlying concept of pragmalinguistics to determine the types of speech acts and their classification in the German advertising printed texts. We ascertain that the advertising of cars and accessories, household appliances and computer equipment, watches, fancy goods, food, pharmaceuticals, and financial, insurance, legal services and also airline advertising is dominated by a pragmatic principle, which is based on demonstrating information about the benefits of a product / service. This influences the frequent usage of certain speech acts. The dominant form of exposure is to inform the recipient-user about the characteristics of the advertised product. This information is fore-grounded by means of stylistic and syntactic constructions specific to the advertisement (participial constructions, appositional constructions which contribute to emphasize certain notional components within the framework of the advertising text. Stylistic and syntactic devices of reduction (parceling constructions convey the author's idea. Other means like repetitions, enumerations etc are used by the advertiser to strengthen his selling power. The advertiser focuses the attention of the consumer on the characteristics of the product seeking to convince him of the utility of the product and to influence his/ her buying behavior.

  11. Helios: Understanding Solar Evolution Through Text Analytics

    Energy Technology Data Exchange (ETDEWEB)

    Randazzese, Lucien [SRI International, Menlo Park, CA (United States)

    2016-12-02

    This proof-of-concept project focused on developing, testing, and validating a range of bibliometric, text analytic, and machine-learning based methods to explore the evolution of three photovoltaic (PV) technologies: Cadmium Telluride (CdTe), Dye-Sensitized solar cells (DSSC), and Multi-junction solar cells. The analytical approach to the work was inspired by previous work by the same team to measure and predict the scientific prominence of terms and entities within specific research domains. The goal was to create tools that could assist domain-knowledgeable analysts in investigating the history and path of technological developments in general, with a focus on analyzing step-function changes in performance, or “breakthroughs,” in particular. The text-analytics platform developed during this project was dubbed Helios. The project relied on computational methods for analyzing large corpora of technical documents. For this project we ingested technical documents from the following sources into Helios: Thomson Scientific Web of Science (papers), the U.S. Patent & Trademark Office (patents), the U.S. Department of Energy (technical documents), the U.S. National Science Foundation (project funding summaries), and a hand curated set of full-text documents from Thomson Scientific and other sources.

  12. PEDANT: Parallel Texts in Göteborg

    Directory of Open Access Journals (Sweden)

    Daniel Ridings

    2012-09-01

    Full Text Available

    The article presents the status of the PEDANT project with parallel corpora at the Language Bank at Göteborg University. The solutions for access to the corpus data are presented. Access is provided by way of the internet and standard applications and SGML-aware programming tools. The SGML format for encoding translation pairs is outlined together. The methods allow working with everything from plain text to texts densely encoded with linguistic information.

     

    In hierdie artikel word 'n beskrywing gegee van die stand van die PEDANT-projek met parallelle korpora by die Taalbank by die Universiteit van Göteborg. Oplossings vir die verkryging van toegang tot die korpusdata word aangedui. Toegang word verskaf deur middel van die Internet en standaardtoepassings en SGML-sensitiewe programmeringshulpmiddels. Die SGML-formaat vir die enkodering van vertaalpare word gesamentlik geskets. Hierdie metodes laat toe dat gewerk kan word met enigiets vanaf suiwer teks tot tekste wat taalkundig dig geëtiketteer is.

     

  13. Three Writers of Arabic Texts in Yogyakarta

    Directory of Open Access Journals (Sweden)

    Muhamad Murtadlo

    2015-02-01

    Full Text Available This study examines the use of the Arabic alphabet in religious literature in Yogyakarta. This study uses a case study on three figure writers of religious texts that using the Arabic alphabet in southern part of Central Java (Yogyakarta, namely Asrori Ahmad (Magelang, Ali Maksum (Yogyakarta, and Ahmad Mujab Mahalli (Bantul. This study concluded that the writing of religious texts in Arabic alphabet in the southern Java area had been carried out by means of using Arabic Pegon, and only a few people who wrote in the Arabic language. The transmission of Arabic Pegon in Yogyakarta is allegedly from north coast of Java, especially from Lasem / East Java. The tradition of Arabic language teaching in the pesantrens still focuses mostly on the reading effort, communication, and understanding and it is not oriented to the writing skill. The presence of international journals initiated by the College of Islamic religious institutions and the effort of translation business into Arabic from certain institutions gives an opportunity to strengthen the use of the Arabic alphabet in Indonesia.

  14. A Novel Current-Mode Full-Wave Rectifier Based on One CDTA and Two Diodes

    Directory of Open Access Journals (Sweden)

    F. Khateb

    2010-09-01

    Full Text Available Precision rectifiers are important building blocks for analog signal processing. The traditional approach based on diodes and operational amplifiers (OpAmps exhibits undesirable effects caused by limited OpAmp slew rate and diode commutations. In the paper, a full-wave rectifier based on one CDTA and two Schottky diodes is presented. The PSpice simulation results are included.

  15. Developed Turbulence: From Full Simulations to Full Mode Reductions

    NARCIS (Netherlands)

    Grossmann, Siegfried; Lohse, Detlef; Reeh, Achim

    1996-01-01

    Developed Navier-Stokes turbulence is simulated with varying wave-vector mode reductions. The flatness and the skewness of the velocity derivative depend on the degree of mode reduction. They show a crossover towards the value of the full numerical simulation when the viscous subrange starts to be

  16. Attenuation in the translation of bilingual journalistic texts

    Directory of Open Access Journals (Sweden)

    Armando González Salinas

    2015-12-01

    Full Text Available First approach to the identification of mitigation/intensification markers in translating journalistic articles from bilingual publication in English and its translation into Spanish. There are three articles from Newsweek Magazine, a useful tool to use with translation students. The objective is the Annotated Translation of finished articles and book chapters that, after a pre-screening analysis, foster the translation of the Spanish version from written English texts. The first journalistic article with a historical theme is: The Return of Ruthless Richard – El Regreso del despiadado Ricardo (Ricardo III, used to detect the mitigation/intensification markers in both versions. Steps: 1. Sentence by sentence analysis in English, to find mitigation characteristics. 2. Review selected sentences in English with the Spanish counterpart. 3. Comparison and contrast of both versions. 4. Discuss similarities/differences to notice if the transfer of markers signals mitigation/intensification in both versions. 5. Discuss findings and write comments of translation aspects whose changes are discussed: annotated translation. Although most mitigation studies are based on oral discourse (not excluding the written text, this research considers written texts as a means and an end, in the observation and description, since original versions are unchangeable; this opens the option to edit and modify, which is promoted, before the final version elaborated between students and researchers is reached. Some comments are included as the result of what is described above.

  17. Poetic prayer as a "text in text" in Ukrainian Cyrillic editions of the Baroque age

    Directory of Open Access Journals (Sweden)

    Kurhanova O.

    2016-01-01

    Full Text Available The article observes a prayer as a meta-genre of Baroque Ukrainian poetry, found at the pages of Ukrainian Cyrillic religious editions of the late 16th - early 18th centuries. Double nature of baroque poetic prayer is underlined. Poetic prayer originates from the tradition of liturgical prayer and contains it as a "text in text", at the same time poetical prayer is inserted as a "text in text" into the editions, in which it is printed. Two types of semantic connections between baroque poetic prayers and the text of their editions are described. The first type is presented by the poetry, which contains an image of a person, who took part in the text creation and in the process of its edition: the author/editor (in the role of the prayer addresser or the patron of art (in the role of a person, about whom the prayer request was made. The topics of prayer appellations in the poetry of this group are requests for earthly and heavenly boons for the author/editor/patron of the edition, for positive reception of the book; thanksgivings for the help in the book writing/publishing etc. The poetry of the second type contains images, which are central for the text of an edition. These are, as a rule, addressees of the prayer text - God, Saint Virgin, Angels and Saints. The content of a prayer appeal in the poetry of this type is a request for salvation and help in spiritual self-perfection of a lyric hero or a church community. It is defined that the content of poetic prayers, printed at pages of Cyrillic editions of the late 16th - early 18th centuries, was influenced both by text elements of the edition (topic, central images, and by non-textual factors (illustration plot, accompanied by a poem, prayer intention of an author/editor, existence of a patron of edition, different circumstances of editorial process etc.

  18. Automatic vowels selection and ranking in Russian enciphered texts

    Directory of Open Access Journals (Sweden)

    Yuri I. Petrenko

    2018-01-01

    Full Text Available This work was developed while teaching students the cryptanalysis. The course includes the study of statistics of (Russian encrypted texts. The purpose of training is to learn how to extract redundant information of the text and to descript the cryptogram without a password. One of the most comfortable methods for learning is a simple substitution and similar encryption systems, which are presented in most courses of cryptography. This paper presents a method of automatic separation of vowels and consonants in Russian texts, which releases some of the redundancy of the cipher text. In addition, this method greatly facilitates the task of descript some other symmetric ciphers which may be reduced to simple substitution.The aim of this work is to develop and implement a method for the automatic selection of vowels in Russian texts, enciphered by a simple substitution and similar encryption systems.According to the theory of Shannon, for unambiguous decoding of the text you want the redundancy of the text that exceeds the entropy of the password. After the separation of vowels and consonants redundancy of the text increases to one bit per symbol, this allows you to open shorter encrypted texts. Moreover, the separation of vowels and consonants greatly simplifies the cryptanalysis of some ciphers. For instance, cryptanalysis of the most famous encryption method - method of simple substitution-requires selection of one of N! possible passwords (where N is the number of letters in the alphabet. For the Russian language it is 33! or nearly 2 to 123rd degrees of options. After the separation of vowels and consonants you will need a selection of 10!*23!, or nearly 2 to 96th degrees of options. The number of combinations is reduced to one hundred million times, that makes the cryptanalysis much easier. The program that implements this method first creates a matrix of the probabilities of bigrams of the text.For this matrix Markov criterion calculated

  19. Stemming of Slovenian library science texts

    Directory of Open Access Journals (Sweden)

    Polona Vilar

    2002-01-01

    Full Text Available The theme of the article is the preparation of a stemming algorithm for Slovenian library science texts. The procedure consisted of three phases: learning, testing and evaluation.The preparation of the optimal stemmer for Slovenian texts from the field of library science is presented, its testing and comparison with two other stemmers for the Slovenian language: the Popovič stemmer and the Generic stemmer. A corpus of 790.000 words from the field of library science was used for learning. Lists of stems, word endings and stop-words were built. In the testing phase, the component parts of the algorithm were tested on an additional corpus of 167.000 words. In the evaluation phase, a comparison of the three stemmers processing the same word corpus was made. The results of each stemmer were compared with an intellectually prepared control result of the stemming of the corpus. It consisted of groups of semantically connected words with no errors. Understemming was especially monitored – the number of stems for semantically connected words, produced by an algorithm. The results were statistically processed with the Kruskal-Wallis test. The Optimal stemmer produced the best results.It matched best with the reference results and also gave the smallest number of stems for one semantic meaning. The Popovič stemmer followed closely. The Generic stemmer proved to be the least accurate. The procedures described in the thesis can represent a platform for the development of the tools for automatic indexing and retrieval for library science texts in Slovenian language.

  20. NAMED ENTITY RECOGNITION FROM BIOMEDICAL TEXT -AN INFORMATION EXTRACTION TASK

    Directory of Open Access Journals (Sweden)

    N. Kanya

    2016-07-01

    Full Text Available Biomedical Text Mining targets the Extraction of significant information from biomedical archives. Bio TM encompasses Information Retrieval (IR and Information Extraction (IE. The Information Retrieval will retrieve the relevant Biomedical Literature documents from the various Repositories like PubMed, MedLine etc., based on a search query. The IR Process ends up with the generation of corpus with the relevant document retrieved from the Publication databases based on the query. The IE task includes the process of Preprocessing of the document, Named Entity Recognition (NER from the documents and Relationship Extraction. This process includes Natural Language Processing, Data Mining techniques and machine Language algorithm. The preprocessing task includes tokenization, stop word Removal, shallow parsing, and Parts-Of-Speech tagging. NER phase involves recognition of well-defined objects such as genes, proteins or cell-lines etc. This process leads to the next phase that is extraction of relationships (IE. The work was based on machine learning algorithm Conditional Random Field (CRF.

  1. Full-F gyrofluid model

    DEFF Research Database (Denmark)

    Madsen, Jens

    2013-01-01

    variables. The fluid moment hierarchy is closed by approximating the gyrokinetic distribution function as a finite order Hermite-Laguerre polynomial and by determining closure approximations for terms involving the gyrokinetic gyro-averaging operator. The model exactly conserves the gyrokinetic full......A global electromagnetic gyrofluid model based on the full-F gyrokinetic model is derived. The gyrofluid moment variables are not split into fluctuating and equilibrium parts. Profiles are evolved freely, and gyro-averaging operators are not parametrized, but are functions of the gyrofluid moment...

  2. The socio-demographics of texting

    DEFF Research Database (Denmark)

    Ling, Richard; Bertel, Troels Fibæk; Sundsøy, Pål

    2012-01-01

    Who texts, and with whom do they text? This article examines the use of texting using metered traffic data from a large dataset (nearly 400 million anonymous text messages). We ask 1) How much do different age groups use mobile phone based texting (SMS)? 2) How wide is the circle of texting...

  3. PaperBLAST: Text Mining Papers for Information about Homologs.

    Science.gov (United States)

    Price, Morgan N; Arkin, Adam P

    2017-01-01

    Large-scale genome sequencing has identified millions of protein-coding genes whose function is unknown. Many of these proteins are similar to characterized proteins from other organisms, but much of this information is missing from annotation databases and is hidden in the scientific literature. To make this information accessible, PaperBLAST uses EuropePMC to search the full text of scientific articles for references to genes. PaperBLAST also takes advantage of curated resources (Swiss-Prot, GeneRIF, and EcoCyc) that link protein sequences to scientific articles. PaperBLAST's database includes over 700,000 scientific articles that mention over 400,000 different proteins. Given a protein of interest, PaperBLAST quickly finds similar proteins that are discussed in the literature and presents snippets of text from relevant articles or from the curators. PaperBLAST is available at http://papers.genomics.lbl.gov/. IMPORTANCE With the recent explosion of genome sequencing data, there are now millions of uncharacterized proteins. If a scientist becomes interested in one of these proteins, it can be very difficult to find information as to its likely function. Often a protein whose sequence is similar, and which is likely to have a similar function, has been studied already, but this information is not available in any database. To help find articles about similar proteins, PaperBLAST searches the full text of scientific articles for protein identifiers or gene identifiers, and it links these articles to protein sequences. Then, given a protein of interest, it can quickly find similar proteins in its database by using standard software (BLAST), and it can show snippets of text from relevant papers. We hope that PaperBLAST will make it easier for biologists to predict proteins' functions.

  4. PaperBLAST: Text Mining Papers for Information about Homologs

    International Nuclear Information System (INIS)

    Price, Morgan N.; Arkin, Adam P.

    2017-01-01

    Large-scale genome sequencing has identified millions of protein-coding genes whose function is unknown. Many of these proteins are similar to characterized proteins from other organisms, but much of this information is missing from annotation databases and is hidden in the scientific literature. To make this information accessible, PaperBLAST uses EuropePMC to search the full text of scientific articles for references to genes. PaperBLAST also takes advantage of curated resources (Swiss-Prot, GeneRIF, and EcoCyc) that link protein sequences to scientific articles. PaperBLAST’s database includes over 700,000 scientific articles that mention over 400,000 different proteins. Given a protein of interest, PaperBLAST quickly finds similar proteins that are discussed in the literature and presents snippets of text from relevant articles or from the curators. With the recent explosion of genome sequencing data, there are now millions of uncharacterized proteins. If a scientist becomes interested in one of these proteins, it can be very difficult to find information as to its likely function. Often a protein whose sequence is similar, and which is likely to have a similar function, has been studied already, but this information is not available in any database. To help find articles about similar proteins, PaperBLAST searches the full text of scientific articles for protein identifiers or gene identifiers, and it links these articles to protein sequences. Then, given a protein of interest, it can quickly find similar proteins in its database by using standard software (BLAST), and it can show snippets of text from relevant papers. We hope that PaperBLAST will make it easier for biologists to predict proteins’ functions.

  5. The Fiction of Full BEKK

    NARCIS (Netherlands)

    C-L. Chang (Chia-Lin); M.J. McAleer (Michael)

    2017-01-01

    textabstractThe purpose of the paper is to show that univariate GARCH is not a special case of multivariate GARCH, specifically the Full BEKK model, except under parametric restrictions on the off-diagonal elements of the random coefficient autoregressive coefficient matrix, provides the regularity

  6. Empty calories and phantom fullness

    NARCIS (Netherlands)

    Camps, Guido; Mars, Monica; Graaf, de Kees; Smeets, Paul A.M.

    2016-01-01

    Background: Stomach fullness is a determinant of satiety. Although both the viscosity and energy content have been shown to delay gastric emptying, their relative importance is not well understood. Objective: We compared the relative effects of and interactions between the viscosity and energy

  7. Rectilinear Full Steiner Tree Generation

    DEFF Research Database (Denmark)

    Zachariasen, Martin

    1999-01-01

    The fastest exact algorithm (in practice) for the rectilinear Steiner tree problem in the plane uses a two-phase scheme: First, a small but sufficient set of full Steiner trees (FSTs) is generated and then a Steiner minimum tree is constructed from this set by using simple backtrack search, dynamic...

  8. Full autonomy; Autarkie im Komplettpaket

    Energy Technology Data Exchange (ETDEWEB)

    Augsten, Eva

    2011-05-31

    Normally, those who talk of full solar autonomy refer to the annual balance of a house. Now, architect Timo Leukefeld and Helma Eigenheimbau AG presented a really autonomous solar house which is available on a turnkey basis for 363,000 Euros.

  9. [Endoscopic full-thickness resection].

    Science.gov (United States)

    Meier, B; Schmidt, A; Caca, K

    2016-08-01

    Conventional endoscopic resection techniques such as endoscopic mucosal resection (EMR) or endoscopic submucosal dissection (ESD) are powerful tools for the treatment of gastrointestinal (GI) neoplasms. However, those techniques are limited to the superficial layers of the GI wall (mucosa and submucosa). Lesions without lifting sign (usually arising from deeper layers) or lesions in difficult anatomic positions (appendix, diverticulum) are difficult - if not impossible - to resect using conventional techniques, due to the increased risk of complications. For larger lesions (>2 cm), ESD appears to be superior to the conventional techniques because of the en bloc resection, but the procedure is technically challenging, time consuming, and associated with complications even in experienced hands. Since the development of the over-the-scope clips (OTSC), complications like bleeding or perforation can be endoscopically better managed. In recent years, different endoscopic full-thickness resection techniques came to the focus of interventional endoscopy. Since September 2014, the full-thickness resection device (FTRD) has the CE marking in Europe for full-thickness resection in the lower GI tract. Technically the device is based on the OTSC system and combines OTSC application and snare polypectomy in one step. This study shows all full-thickness resection techniques currently available, but clearly focuses on the experience with the FTRD in the lower GI tract.

  10. Pedoinformatics Approach to Soil Text Analytics

    Science.gov (United States)

    Furey, J.; Seiter, J.; Davis, A.

    2017-12-01

    The several extant schema for the classification of soils rely on differing criteria, but the major soil science taxonomies, including the United States Department of Agriculture (USDA) and the international harmonized World Reference Base for Soil Resources systems, are based principally on inferred pedogenic properties. These taxonomies largely result from compiled individual observations of soil morphologies within soil profiles, and the vast majority of this pedologic information is contained in qualitative text descriptions. We present text mining analyses of hundreds of gigabytes of parsed text and other data in the digitally available USDA soil taxonomy documentation, the Soil Survey Geographic (SSURGO) database, and the National Cooperative Soil Survey (NCSS) soil characterization database. These analyses implemented iPython calls to Gensim modules for topic modelling, with latent semantic indexing completed down to the lowest taxon level (soil series) paragraphs. Via a custom extension of the Natural Language Toolkit (NLTK), approximately one percent of the USDA soil series descriptions were used to train a classifier for the remainder of the documents, essentially by treating soil science words as comprising a novel language. While location-specific descriptors at the soil series level are amenable to geomatics methods, unsupervised clustering of the occurrence of other soil science words did not closely follow the usual hierarchy of soil taxa. We present preliminary phrasal analyses that may account for some of these effects.

  11. Psychologie des discours et didactique des textes (Psychology of Discourse and the Teaching of Texts).

    Science.gov (United States)

    Bronckart, Jean-Paul, Ed.

    1995-01-01

    This collection of articles on the nature of discourse and writing instruction include: "Une demarche de psychologie de discours; quelques aspects introductifs" ("An Application of Discourse Psychology; Introductory Thoughts") (Jean-Paul Bronckart); "Les procedes de prise en charge enonciative dans trois genres de texts expositifs" ("The Processes…

  12. Biosemiotics: Communication and Causation (Information included

    Directory of Open Access Journals (Sweden)

    Juan Ramon Álvarez

    2009-11-01

    Full Text Available Pretensions of Biosemiotics as a unified approach to biological information are critically scrutinized within the study of different projects of semiotisation of nature and naturalization ot cultural processes. Main textual references and arguments are presented and critically pondered. Biosemiotics is here presented as an analytical method to study communication as founded in causality.

  13. Connected text reading and differences in text reading fluency in adult readers.

    Directory of Open Access Journals (Sweden)

    Sebastian Wallot

    Full Text Available The process of connected text reading has received very little attention in contemporary cognitive psychology. This lack of attention is in parts due to a research tradition that emphasizes the role of basic lexical constituents, which can be studied in isolated words or sentences. However, this lack of attention is in parts also due to the lack of statistical analysis techniques, which accommodate interdependent time series. In this study, we investigate text reading performance with traditional and nonlinear analysis techniques and show how outcomes from multiple analyses can used to create a more detailed picture of the process of text reading. Specifically, we investigate reading performance of groups of literate adult readers that differ in reading fluency during a self-paced text reading task. Our results indicate that classical metrics of reading (such as word frequency do not capture text reading very well, and that classical measures of reading fluency (such as average reading time distinguish relatively poorly between participant groups. Nonlinear analyses of distribution tails and reading time fluctuations provide more fine-grained information about the reading process and reading fluency.

  14. Politeness Strategies Used in Text Messaging

    Directory of Open Access Journals (Sweden)

    Shahrzad Eshghinejad

    2016-03-01

    Full Text Available One aspect of short message service (SMS communication through a cell phone is use of politeness strategies. As it is extensively argued that females are more polite language users, the present study sought to describe the strategies used by these two groups and to find out whether there is any significant difference between male and female English as a foreign language (EFL learners in the use of positive and negative politeness strategies in sending SMS to their professors, considering that there is an asymmetric power relation and social distance between them. To this end, a corpus of 300 L1 (Persian and L2 (English request messages was compiled. Results of qualitative and quantitative data analysis showed no significant difference between the two groups. Results of the study have implication in politeness research.

  15. DIALOGUE JOURNAL FOR TEACHING RECOUNT TEXTS

    Directory of Open Access Journals (Sweden)

    Rohani Rohani

    2017-12-01

    Full Text Available The current study is a quasi experiment to reveal the effectiveness of dialog journal on recount writing. Sixty eighth graders participated in the research. A pretest, posttest, and questionnaire were administered to obtain the data. After treatments were given, the mean score of the experimental group increased from 65.80 to 75.00. The t-test showed that there was a significant difference between the experimental group and the control group with t-value 2.128, higher than the t-table (2.00. The questionnaire revealed that the students‘ response to the dialogue journal was relatively positive. The written feedback was perceived to be effective and fun.

  16. Text and Subject Position after Althusser

    Directory of Open Access Journals (Sweden)

    Antony Easthope

    1994-01-01

    Full Text Available Althusser's achievement is that he redefined Marxism. He reconceptualizes history and totality in terms of different times, construes knowledge as the outcome of a process of construction, and interprets subjectivity as an effect of ideology and unconscious processes. Unfortunately, Althusser's functionalist view of ideology claims that the subject recognizes itself as a subject because it duplicates— reflects—an absolute subject. However, Lacan's notion of the mirror stage remedies this fault. Lacan's subject always misrecognizes itself in a process of contradiction that threatens the stability of any given social order. Moreover, unlike Foucault's subject, which is limited in that subjectivity is folded back into a vaguely expanded notion of "power," this revised Althusserian subject allows careful reading of texts. The critic does not simply read against the grain; he or she exposes the multiple points of identification offered the reader. For example, Wordsworth's "The Solitary Reaper" installs the reader in multiple positions: a devotee of high culture and the national canon, a lover of the verbal signifier and its play, a consumer of confessional discourse, and a masculine "I" desiring a laboring, singing woman.

  17. General description of magnetic fluctuations in TEXT

    International Nuclear Information System (INIS)

    Kim, Y.J.

    1989-01-01

    The magnetic fluctuations in TEXT (R = 1m, a = 0.26m, ohmically heated tokamak with a full poloidal limiter) have been extensively measured with magnetic probes in the shadow of the limiter with an instrumental range of f -1 (m rms p (f > 50kHz) at the limiter radius is found to be of order 10 -5 T, which is too small to produce significant transport directly. Over the range of discharge parameters in TEXT, the B rms p (f > 50kHz) is observed to have a strong q a dependence (q a -2.2 ) and also a density dependence (n eo -0.8 ). Furthermore, the magnetic fluctuations show a significant correlation with edge electrostatic density fluctuations measured by Langmiur probe inside the limiter radius, and extending along magnetic field lines. Phase variation of the correlated components suggests k double-prime/k perpendicular ∼ 0.005. The B p rms (f >50kHz) is also found to be little dependent on parallel electric field E double-prime. Magnetic fluctuations in both low and high frequency ranges have been characterized by their response to gas puffing, pellet injection, impurity injection, and the effect of an ergodic magnetic limiter. The behavior of magnetic fluctuations with electron cyclotron resonance heating (ECRH) has been also investigated in detail

  18. El manual como texto Schoolbook as text

    Directory of Open Access Journals (Sweden)

    Agustín Escolano Benito

    2012-12-01

    Full Text Available Este trabajo aborda la cuestión de la identidad del libro escolar como un género textual específico en el contexto de la manualística clásica y moderna, contextualizando los análisis en el marco de la cultura de la escuela tradicional y en la era de la revolución digital y bajo una perspectiva historiográfica y teórica. También plantea el nacimiento y primeros desarrollos de la manualística como campo intelectual y académico y sus contribuciones a la definición de la identidad del libro escolar.This paper discusses the question of identifying a coursebook as a specific text genre in the context of the classical and modern manualistics, situating the analysis within the traditional school culture and the digital revolution era, under a historical and theoretical perspective. It also covers the birth and initial development of manualistics as an intelectual and academic field and its contributions to the definition of the schoolbook identity.

  19. Challenges in the interpretation of lyric texts

    Directory of Open Access Journals (Sweden)

    Buljan-Legati Ivana

    2016-01-01

    Full Text Available It is possible, perhaps, to choose the right path to the answer to the questions how poetry has been disappearing over the centuries and has lost its purpose in the ever greater void of outer space and how it has turned from a common and welcome social activity into a phenomenon that will have to leave its fellow-townspeople due to enormous suspicion about the communal language, the world view of the majority and the material world, if first, (at least a rough reconstruction of the sense and nature of continual changes in the poetic mechanism has been done (a more detailed overview would extend the paper enormously, as well as of the changes in style and the reception of poetry, since each choice of a possible linguistic system in a particular historical period soon heralded its own boundary line. From a popular, entertaining and educational genre as a transparent means of social communication, which has brought the individual into a community by generating stable certainty, and gave him the sense of control over his own destiny and meaning, lyrics will outgrow proportionally the aesthetic dimensions of its texts (which will subsequently substitute the foretoken of literacy, becoming less comparable and surmountable, in certain periods almost a nontransferable artistic view. In such circumstances, the public will start to have less understanding and tolerance for its 'weaknesses'.

  20. Interview als Text vs. Interview als Interaktion

    Directory of Open Access Journals (Sweden)

    Arnulf Deppermann

    2013-09-01

    Full Text Available Das Interview ist nach wie vor das beliebteste sozialwissenschaftliche Verfahren des Datengewinns. Ökonomie der Erhebung, Vergleichbarkeit und die Möglichkeit, Einsicht in Praxisbereiche und historisch-biografische Dimensionen zu erhalten, die der direkten Beobachtung kaum zugänglich sind, machen seine Attraktivität aus. Zugleich mehren sich Kritiken, die seine Leistungsfähigkeit problematisieren, indem sie auf die begrenzte Reichweite der Explikationsfähigkeiten der Befragten, die Reaktivität der Erhebung oder die Differenz zwischen Handeln und dem Bericht über Handeln verweisen. Im Beitrag wird zwischen Ansätzen, die das Interview als Text, und solchen, die es als Interaktion verstehen, unterschieden. Nach dem Text-Verständnis werden Interviews unter inhaltlichen Gesichtspunkten analysiert und als Zugang zu einer vorgängigen sozialen oder psychischen Wirklichkeit angesehen. Das Interaktions-Verständnis versteht Interviews dagegen als situierte Praxis, in welcher im Hier und Jetzt von InterviewerInnen und Befragten gemeinsam soziale Sinnstrukturen hergestellt werden. Anhand ubiquitärer Phänomene der Interviewinteraktion – Fragen, Antworten und die Selbstpositionierung von InterviewerInnen und Befragten – werden Praktiken des interaktiv-performativen Handelns im Interview dargestellt. Ihre Relevanz für die Interviewkonstitution und ihre Erkenntnispotenziale für die Interviewauswertung werden aufgezeigt. Es wird dafür plädiert, die interaktive Konstitutionsweise von Interviews empirisch zu erforschen und methodisch konsequent zu berücksichtigen. URN: http://nbn-resolving.de/urn:nbn:de:0114-fqs1303131

  1. Text summarization as a decision support aid

    Directory of Open Access Journals (Sweden)

    Workman T

    2012-05-01

    Full Text Available Abstract Background PubMed data potentially can provide decision support information, but PubMed was not exclusively designed to be a point-of-care tool. Natural language processing applications that summarize PubMed citations hold promise for extracting decision support information. The objective of this study was to evaluate the efficiency of a text summarization application called Semantic MEDLINE, enhanced with a novel dynamic summarization method, in identifying decision support data. Methods We downloaded PubMed citations addressing the prevention and drug treatment of four disease topics. We then processed the citations with Semantic MEDLINE, enhanced with the dynamic summarization method. We also processed the citations with a conventional summarization method, as well as with a baseline procedure. We evaluated the results using clinician-vetted reference standards built from recommendations in a commercial decision support product, DynaMed. Results For the drug treatment data, Semantic MEDLINE enhanced with dynamic summarization achieved average recall and precision scores of 0.848 and 0.377, while conventional summarization produced 0.583 average recall and 0.712 average precision, and the baseline method yielded average recall and precision values of 0.252 and 0.277. For the prevention data, Semantic MEDLINE enhanced with dynamic summarization achieved average recall and precision scores of 0.655 and 0.329. The baseline technique resulted in recall and precision scores of 0.269 and 0.247. No conventional Semantic MEDLINE method accommodating summarization for prevention exists. Conclusion Semantic MEDLINE with dynamic summarization outperformed conventional summarization in terms of recall, and outperformed the baseline method in both recall and precision. This new approach to text summarization demonstrates potential in identifying decision support data for multiple needs.

  2. Workflows for Full Waveform Inversions

    Science.gov (United States)

    Boehm, Christian; Krischer, Lion; Afanasiev, Michael; van Driel, Martin; May, Dave A.; Rietmann, Max; Fichtner, Andreas

    2017-04-01

    Despite many theoretical advances and the increasing availability of high-performance computing clusters, full seismic waveform inversions still face considerable challenges regarding data and workflow management. While the community has access to solvers which can harness modern heterogeneous computing architectures, the computational bottleneck has fallen to these often manpower-bounded issues that need to be overcome to facilitate further progress. Modern inversions involve huge amounts of data and require a tight integration between numerical PDE solvers, data acquisition and processing systems, nonlinear optimization libraries, and job orchestration frameworks. To this end we created a set of libraries and applications revolving around Salvus (http://salvus.io), a novel software package designed to solve large-scale full waveform inverse problems. This presentation focuses on solving passive source seismic full waveform inversions from local to global scales with Salvus. We discuss (i) design choices for the aforementioned components required for full waveform modeling and inversion, (ii) their implementation in the Salvus framework, and (iii) how it is all tied together by a usable workflow system. We combine state-of-the-art algorithms ranging from high-order finite-element solutions of the wave equation to quasi-Newton optimization algorithms using trust-region methods that can handle inexact derivatives. All is steered by an automated interactive graph-based workflow framework capable of orchestrating all necessary pieces. This naturally facilitates the creation of new Earth models and hopefully sparks new scientific insights. Additionally, and even more importantly, it enhances reproducibility and reliability of the final results.

  3. LHCb : Full Experiment System Test

    CERN Multimedia

    Cattaneo, M

    2009-01-01

    LHCb had been planning to commission its High Level Trigger software and Data Quality monitoring procedures using real collisions data from the LHC pilot run. Following the LHC incident on 19th September 2008, it was decided to commission the system using simulated data. This “Full Experiment System Test” consists of: - Injection of simulated minimum bias events into the full HLT farm, after selection by a simulated Level 0 trigger. - Processing in the HLT farm to achieve the output rate expected for nominal LHC luminosity running, sustained over the typical duration of an LHC fill. - Real time Data Quality validation of the HLT output, validation of calibration and alignment parameters for use in the reconstruction. - Transmission of the event data, calibration data and book-keeping information to Tier1 sites and full reconstruction of the event data. - Data Quality validation of the reconstruction output. We will report on the preparations and results of FEST09, and on the status of commissioning for no...

  4. Text Messaging Interventions on Cancer Screening Rates: A Systematic Review.

    Science.gov (United States)

    Uy, Catherine; Lopez, Jennifer; Trinh-Shevrin, Chau; Kwon, Simona C; Sherman, Scott E; Liang, Peter S

    2017-08-24

    Despite high-quality evidence demonstrating that screening reduces mortality from breast, cervical, colorectal, and lung cancers, a substantial portion of the population remains inadequately screened. There is a critical need to identify interventions that increase the uptake and adoption of evidence-based screening guidelines for preventable cancers at the community practice level. Text messaging (short message service, SMS) has been effective in promoting behavioral change in various clinical settings, but the overall impact and reach of text messaging interventions on cancer screening are unknown. The objective of this systematic review was to assess the effect of text messaging interventions on screening for breast, cervical, colorectal, and lung cancers. We searched multiple databases for studies published between the years 2000 and 2017, including PubMed, EMBASE, and the Cochrane Library, to identify controlled trials that measured the effect of text messaging on screening for breast, cervical, colorectal, or lung cancers. Study quality was evaluated using the Cochrane risk of bias tool. Our search yielded 2238 citations, of which 31 underwent full review and 9 met inclusion criteria. Five studies examined screening for breast cancer, one for cervical cancer, and three for colorectal cancer. No studies were found for lung cancer screening. Absolute screening rates for individuals who received text message interventions were 0.6% to 15.0% higher than for controls. Unadjusted relative screening rates for text message recipients were 4% to 63% higher compared with controls. Text messaging interventions appear to moderately increase screening rates for breast and cervical cancer and may have a small effect on colorectal cancer screening. Benefit was observed in various countries, including resource-poor and non-English-speaking populations. Given the paucity of data, additional research is needed to better quantify the effectiveness of this promising intervention

  5. On the Origins of the Lubok Text The Register of Dames and Handsome Maidens

    Directory of Open Access Journals (Sweden)

    Alexandra A. Pletneva

    2015-08-01

    Full Text Available This article is devoted to the attempts to identify the prototype and the sources of The Register of Dames and Handsome Maidens, which was part of the lubok tradition of the 17th–18th centuries. This text is a list of female proper names provided with humorous commentaries. The satirical texts of the 17th century frequently replicate the structure of the documents and literary texts of those times. Some satirical texts represent the structure of church services, an alphabet prayer, a petition, etc. Among the manuscript and lubok texts of the 17th–18th centuries, there is a significant number of texts which include lists of names with humorous characteristics. These texts may be dated back to the Old Russian dictionaries of proper names, e.g., the Commentaries on the Names in Alphabetical Order by Maximus the Greek. Unlike the other texts organized as lists of names, The Register of Dames is related to the wedding ritual, and its probable source can be identified as the register of potential brides for Tsar Alexey Mikhailovich, which was compiled in 1669-1670, when Tsar Alexey was planning to marry for a second time. The text of the Register includes all but two of the names that appear in the probable prototype, including rather rare names. Furthermore, the name Natalia—that is, the tsar’s future wife—is particularly emphasized.

  6. Bengali text summarization by sentence extraction

    OpenAIRE

    Sarkar, Kamal

    2012-01-01

    Text summarization is a process to produce an abstract or a summary by selecting significant portion of the information from one or more texts. In an automatic text summarization process, a text is given to the computer and the computer returns a shorter less redundant extract or abstract of the original text(s). Many techniques have been developed for summarizing English text(s). But, a very few attempts have been made for Bengali text summarization. This paper presents a method for Bengali ...

  7. Text Clustering Algorithm Based on Random Cluster Core

    Directory of Open Access Journals (Sweden)

    Huang Long-Jun

    2016-01-01

    Full Text Available Nowadays clustering has become a popular text mining algorithm, but the huge data can put forward higher requirements for the accuracy and performance of text mining. In view of the performance bottleneck of traditional text clustering algorithm, this paper proposes a text clustering algorithm with random features. This is a kind of clustering algorithm based on text density, at the same time using the neighboring heuristic rules, the concept of random cluster is introduced, which effectively reduces the complexity of the distance calculation.

  8. Compensatory function of crime fiction texts in mass culture

    Directory of Open Access Journals (Sweden)

    G. O. Krapivnyk

    2014-09-01

    Full Text Available Philosophical and anthropological analysis allowed to confirm that the compensatory function is one of the basic functions of formulaic crime fiction texts in mass culture. It closely interacts and has no clear borderlines with other anthropological functions of the crime fiction genre. Structuring and systematizing human consciousness, entertaining and educating the recipient, the compensatory function positively influences the addresser, the addressee and the modern society. In the mass culture the function under consideration has a number of aspects, in particular, therapeutic, entertaining, educating etc. It transforms into a mechanism of regulating mental state when life processes are becoming more complicated and human psyche needs regular training and relaxation (including the creation of virtual situations in the crime fiction world, associated with crime, violence, aggression and their effects for all the related parties. However, the compensatory function of crime fiction texts also follows classical traditions, oriented on rationalism and enlightenment, as well as encourages regulation of moral and legal state system in the times of Modernity and Postmodernity, that is promotes moral and legal culture in the society.

  9. Textpresso Central: a customizable platform for searching, text mining, viewing, and curating biomedical literature.

    Science.gov (United States)

    Müller, H-M; Van Auken, K M; Li, Y; Sternberg, P W

    2018-03-09

    The biomedical literature continues to grow at a rapid pace, making the challenge of knowledge retrieval and extraction ever greater. Tools that provide a means to search and mine the full text of literature thus represent an important way by which the efficiency of these processes can be improved. We describe the next generation of the Textpresso information retrieval system, Textpresso Central (TPC). TPC builds on the strengths of the original system by expanding the full text corpus to include the PubMed Central Open Access Subset (PMC OA), as well as the WormBase C. elegans bibliography. In addition, TPC allows users to create a customized corpus by uploading and processing documents of their choosing. TPC is UIMA compliant, to facilitate compatibility with external processing modules, and takes advantage of Lucene indexing and search technology for efficient handling of millions of full text documents. Like Textpresso, TPC searches can be performed using keywords and/or categories (semantically related groups of terms), but to provide better context for interpreting and validating queries, search results may now be viewed as highlighted passages in the context of full text. To facilitate biocuration efforts, TPC also allows users to select text spans from the full text and annotate them, create customized curation forms for any data type, and send resulting annotations to external curation databases. As an example of such a curation form, we describe integration of TPC with the Noctua curation tool developed by the Gene Ontology (GO) Consortium. Textpresso Central is an online literature search and curation platform that enables biocurators and biomedical researchers to search and mine the full text of literature by integrating keyword and category searches with viewing search results in the context of the full text. It also allows users to create customized curation interfaces, use those interfaces to make annotations linked to supporting evidence statements

  10. A full quantum network scheme

    International Nuclear Information System (INIS)

    Ma Hai-Qiang; Wei Ke-Jin; Yang Jian-Hui; Li Rui-Xue; Zhu Wu

    2014-01-01

    We present a full quantum network scheme using a modified BB84 protocol. Unlike other quantum network schemes, it allows quantum keys to be distributed between two arbitrary users with the help of an intermediary detecting user. Moreover, it has good expansibility and prevents all potential attacks using loopholes in a detector, so it is more practical to apply. Because the fiber birefringence effects are automatically compensated, the scheme is distinctly stable in principle and in experiment. The simple components for every user make our scheme easier for many applications. The experimental results demonstrate the stability and feasibility of this scheme. (general)

  11. Cusiana Cupiagua. Towards full production

    International Nuclear Information System (INIS)

    Carta Petrolera

    1998-01-01

    When production facilities at Cupiagua are ready around mid-year, the Cusiana Cupiagua complex will start full production, this is the largest oil project in Colombia to date. The Cupiagua CPF covers over 60 hectares on the tableland of the same name in the jurisdiction of Aguazul, a town in Casanare province in northeast Colombia, and currently considered the country's oil cradle. Work is being carried out in a sort of niche that appears to be protected by a natural wall made up of the folds of the Casanare foothills

  12. CHANT (CHinese ANcient Texts): a comprehensive database of all ancient Chinese texts up to 600 AD

    OpenAIRE

    Ho, Che Wah

    2006-01-01

    The CHinese ANcient Texts (CHANT) database is a long-term project which began in 1988 to build up a comprehensive database of all ancient Chinese texts up to the sixth century AD. The project is near completion and the entire database, which includes both traditional and excavated materials, will be released on the CHANT Web site (www.chant.org) in mid-2002. With more than a decade of experience in establishing an electronic Chinese literary database, we have gained much insight useful to the...

  13. Social Media Text Classification by Enhancing Well-Formed Text Trained Model

    Directory of Open Access Journals (Sweden)

    Phat Jotikabukkana

    2016-09-01

    Full Text Available Social media are a powerful communication tool in our era of digital information. The large amount of user-generated data is a useful novel source of data, even though it is not easy to extract the treasures from this vast and noisy trove. Since classification is an important part of text mining, many techniques have been proposed to classify this kind of information. We developed an effective technique of social media text classification by semi-supervised learning utilizing an online news source consisting of well-formed text. The computer first automatically extracts news categories, well-categorized by publishers, as classes for topic classification. A bag of words taken from news articles provides the initial keywords related to their category in the form of word vectors. The principal task is to retrieve a set of new productive keywords. Term Frequency-Inverse Document Frequency weighting (TF-IDF and Word Article Matrix (WAM are used as main methods. A modification of WAM is recomputed until it becomes the most effective model for social media text classification. The key success factor was enhancing our model with effective keywords from social media. A promising result of 99.50% accuracy was achieved, with more than 98.5% of Precision, Recall, and F-measure after updating the model three times.

  14. Text Analysis: Critical Component of Planning for Text-Based Discussion Focused on Comprehension of Informational Texts

    Science.gov (United States)

    Kucan, Linda; Palincsar, Annemarie Sullivan

    2018-01-01

    This investigation focuses on a tool used in a reading methods course to introduce reading specialist candidates to text analysis as a critical component of planning for text-based discussions. Unlike planning that focuses mainly on important text content or information, a text analysis approach focuses both on content and how that content is…

  15. SIAM 2007 Text Mining Competition dataset

    Data.gov (United States)

    National Aeronautics and Space Administration — Subject Area: Text Mining Description: This is the dataset used for the SIAM 2007 Text Mining competition. This competition focused on developing text mining...

  16. The Effect of Different Modes of English Captioning on EFL Learners' General Listening Comprehension: Full Text vs. Keyword Captions

    Science.gov (United States)

    Behroozizad, Sorayya; Majidi, Sudabeh

    2015-01-01

    This study investigated the effect of different modes of English captioning on EFL learners' general listening comprehension. To this end, forty-five intermediate-level learners were selected based on their scores on a standardized English proficiency test (PET) to carry out the study. Then, the selected participants were randomly assigned into…

  17. Sådan søger du på emneord i SocINDEX with Full Text

    DEFF Research Database (Denmark)

    2017-01-01

    Valg af materiale/medie/form: YouTube Valg af arbejdsform: E-læring Begrundelse for valg af materiale/medie/form/arbejdsform: Flipped Classroom......Valg af materiale/medie/form: YouTube Valg af arbejdsform: E-læring Begrundelse for valg af materiale/medie/form/arbejdsform: Flipped Classroom...

  18. Measurement of [Formula: see text] polarisation in [Formula: see text] collisions at [Formula: see text] = 7 TeV.

    Science.gov (United States)

    Aaij, R; Adeva, B; Adinolfi, M; Affolder, A; Ajaltouni, Z; Albrecht, J; Alessio, F; Alexander, M; Ali, S; Alkhazov, G; Alvarez Cartelle, P; Alves, A A; Amato, S; Amerio, S; Amhis, Y; An, L; Anderlini, L; Anderson, J; Andreassen, R; Andreotti, M; Andrews, J E; Appleby, R B; Aquines Gutierrez, O; Archilli, F; Artamonov, A; Artuso, M; Aslanides, E; Auriemma, G; Baalouch, M; Bachmann, S; Back, J J; Badalov, A; Balagura, V; Baldini, W; Barlow, R J; Barschel, C; Barsuk, S; Barter, W; Batozskaya, V; Bauer, Th; Bay, A; Beddow, J; Bedeschi, F; Bediaga, I; Belogurov, S; Belous, K; Belyaev, I; Ben-Haim, E; Bencivenni, G; Benson, S; Benton, J; Berezhnoy, A; Bernet, R; Bettler, M-O; van Beuzekom, M; Bien, A; Bifani, S; Bird, T; Bizzeti, A; Bjørnstad, P M; Blake, T; Blanc, F; Blouw, J; Blusk, S; Bocci, V; Bondar, A; Bondar, N; Bonivento, W; Borghi, S; Borgia, A; Borsato, M; Bowcock, T J V; Bowen, E; Bozzi, C; Brambach, T; van den Brand, J; Bressieux, J; Brett, D; Britsch, M; Britton, T; Brook, N H; Brown, H; Bursche, A; Busetto, G; Buytaert, J; Cadeddu, S; Calabrese, R; Callot, O; Calvi, M; Calvo Gomez, M; Camboni, A; Campana, P; Campora Perez, D; Carbone, A; Carboni, G; Cardinale, R; Cardini, A; Carranza-Mejia, H; Carson, L; Carvalho Akiba, K; Casse, G; Cassina, L; Castillo Garcia, L; Cattaneo, M; Cauet, Ch; Cenci, R; Charles, M; Charpentier, Ph; Cheung, S-F; Chiapolini, N; Chrzaszcz, M; Ciba, K; Cid Vidal, X; Ciezarek, G; Clarke, P E L; Clemencic, M; Cliff, H V; Closier, J; Coca, C; Coco, V; Cogan, J; Cogneras, E; Collins, P; Comerma-Montells, A; Contu, A; Cook, A; Coombes, M; Coquereau, S; Corti, G; Corvo, M; Counts, I; Couturier, B; Cowan, G A; Craik, D C; Cruz Torres, M; Cunliffe, S; Currie, R; D'Ambrosio, C; Dalseno, J; David, P; David, P N Y; Davis, A; De Bruyn, K; De Capua, S; De Cian, M; De Miranda, J M; De Paula, L; De Silva, W; De Simone, P; Decamp, D; Deckenhoff, M; Del Buono, L; Déléage, N; Derkach, D; Deschamps, O; Dettori, F; Di Canto, A; Dijkstra, H; Donleavy, S; Dordei, F; Dorigo, M; Dosil Suárez, A; Dossett, D; Dovbnya, A; Dupertuis, F; Durante, P; Dzhelyadin, R; Dziurda, A; Dzyuba, A; Easo, S; Egede, U; Egorychev, V; Eidelman, S; Eisenhardt, S; Eitschberger, U; Ekelhof, R; Eklund, L; El Rifai, I; Elsasser, Ch; Esen, S; Evans, T; Falabella, A; Färber, C; Farinelli, C; Farry, S; Ferguson, D; Fernandez Albor, V; Ferreira Rodrigues, F; Ferro-Luzzi, M; Filippov, S; Fiore, M; Fiorini, M; Firlej, M; Fitzpatrick, C; Fiutowski, T; Fontana, M; Fontanelli, F; Forty, R; Francisco, O; Frank, M; Frei, C; Frosini, M; Fu, J; Furfaro, E; Gallas Torreira, A; Galli, D; Gandelman, M; Gandini, P; Gao, Y; Garofoli, J; Garra Tico, J; Garrido, L; Gaspar, C; Gauld, R; Gavardi, L; Gersabeck, E; Gersabeck, M; Gershon, T; Ghez, Ph; Gianelle, A; Giani, S; Gibson, V; Giubega, L; Gligorov, V V; Göbel, C; Golubkov, D; Golutvin, A; Gomes, A; Gordon, H; Gotti, C; Grabalosa Gándara, M; Graciani Diaz, R; Granado Cardoso, L A; Graugés, E; Graziani, G; Grecu, A; Greening, E; Gregson, S; Griffith, P; Grillo, L; Grünberg, O; Gui, B; Gushchin, E; Guz, Yu; Gys, T; Hadjivasiliou, C; Haefeli, G; Haen, C; Haines, S C; Hall, S; Hamilton, B; Hampson, T; Han, X; Hansmann-Menzemer, S; Harnew, N; Harnew, S T; Harrison, J; Hartmann, T; He, J; Head, T; Heijne, V; Hennessy, K; Henrard, P; Henry, L; Hernando Morata, J A; van Herwijnen, E; Heß, M; Hicheur, A; Hill, D; Hoballah, M; Hombach, C; Hulsbergen, W; Hunt, P; Hussain, N; Hutchcroft, D; Hynds, D; Iakovenko, V; Idzik, M; Ilten, P; Jacobsson, R; Jaeger, A; Jalocha, J; Jans, E; Jaton, P; Jawahery, A; Jezabek, M; Jing, F; John, M; Johnson, D; Jones, C R; Joram, C; Jost, B; Jurik, N; Kaballo, M; Kandybei, S; Kanso, W; Karacson, M; Karbach, T M; Kelsey, M; Kenyon, I R; Ketel, T; Khanji, B; Khurewathanakul, C; Klaver, S; Kochebina, O; Kolpin, M; Komarov, I; Koopman, R F; Koppenburg, P; Korolev, M; Kozlinskiy, A; Kravchuk, L; Kreplin, K; Kreps, M; Krocker, G; Krokovny, P; Kruse, F; Kucharczyk, M; Kudryavtsev, V; Kurek, K; Kvaratskheliya, T; La Thi, V N; Lacarrere, D; Lafferty, G; Lai, A; Lambert, D; Lambert, R W; Lanciotti, E; Lanfranchi, G; Langenbruch, C; Latham, T; Lazzeroni, C; Le Gac, R; van Leerdam, J; Lees, J-P; Lefèvre, R; Leflat, A; Lefrançois, J; Leo, S; Leroy, O; Lesiak, T; Leverington, B; Li, Y; Liles, M; Lindner, R; Linn, C; Lionetto, F; Liu, B; Liu, G; Lohn, S; Longstaff, I; Longstaff, I; Lopes, J H; Lopez-March, N; Lowdon, P; Lu, H; Lucchesi, D; Luisier, J; Luo, H; Lupato, A; Luppi, E; Lupton, O; Machefert, F; Machikhiliyan, I V; Maciuc, F; Maev, O; Malde, S; Manca, G; Mancinelli, G; Manzali, M; Maratas, J; Marchand, J F; Marconi, U; Marino, P; Märki, R; Marks, J; Martellotti, G; Martens, A; Martín Sánchez, A; Martinelli, M; Martinez Santos, D; Martinez Vidal, F; Martins Tostes, D; Massafferri, A; Matev, R; Mathe, Z; Matteuzzi, C; Mazurov, A; McCann, M; McCarthy, J; McNab, A; McNulty, R; McSkelly, B; Meadows, B; Meier, F; Meissner, M; Merk, M; Milanes, D A; Minard, M-N; Molina Rodriguez, J; Monteil, S; Moran, D; Morandin, M; Morawski, P; Mordà, A; Morello, M J; Moron, J; Mountain, R; Muheim, F; Müller, K; Muresan, R; Muster, B; Naik, P; Nakada, T; Nandakumar, R; Nasteva, I; Needham, M; Neri, N; Neubert, S; Neufeld, N; Neuner, M; Nguyen, A D; Nguyen, T D; Nguyen-Mau, C; Nicol, M; Niess, V; Niet, R; Nikitin, N; Nikodem, T; Novoselov, A; Oblakowska-Mucha, A; Obraztsov, V; Oggero, S; Ogilvy, S; Okhrimenko, O; Oldeman, R; Onderwater, G; Orlandea, M; Otalora Goicochea, J M; Owen, P; Oyanguren, A; Pal, B K; Palano, A; Palombo, F; Palutan, M; Panman, J; Papanestis, A; Pappagallo, M; Parkes, C; Parkinson, C J; Passaleva, G; Patel, G D; Patel, M; Patrignani, C; Pazos Alvarez, A; Pearce, A; Pellegrino, A; Penso, G; Pepe Altarelli, M; Perazzini, S; Perez Trigo, E; Perret, P; Perrin-Terrin, M; Pescatore, L; Pesen, E; Petridis, K; Petrolini, A; Picatoste Olloqui, E; Pietrzyk, B; Pilař, T; Pinci, D; Pistone, A; Playfer, S; Plo Casasus, M; Polci, F; Polok, G; Poluektov, A; Polycarpo, E; Popov, A; Popov, D; Popovici, B; Potterat, C; Powell, A; Prisciandaro, J; Pritchard, A; Prouve, C; Pugatch, V; Puig Navarro, A; Punzi, G; Qian, W; Rachwal, B; Rademacker, J H; Rakotomiaramanana, B; Rama, M; Rangel, M S; Raniuk, I; Rauschmayr, N; Raven, G; Redford, S; Reichert, S; Reid, M M; Dos Reis, A C; Ricciardi, S; Richards, A; Rinnert, K; Rives Molina, V; Roa Romero, D A; Robbe, P; Rodrigues, A B; Rodrigues, E; Rodriguez Perez, P; Roiser, S; Romanovsky, V; Romero Vidal, A; Rotondo, M; Rouvinet, J; Ruf, T; Ruffini, F; Ruiz, H; Ruiz Valls, P; Sabatino, G; Saborido Silva, J J; Sagidova, N; Sail, P; Saitta, B; Salustino Guimaraes, V; Sanchez Mayordomo, C; Sanmartin Sedes, B; Santacesaria, R; Santamarina Rios, C; Santovetti, E; Sapunov, M; Sarti, A; Satriano, C; Satta, A; Savrie, M; Savrina, D; Schiller, M; Schindler, H; Schlupp, M; Schmelling, M; Schmidt, B; Schneider, O; Schopper, A; Schune, M-H; Schwemmer, R; Sciascia, B; Sciubba, A; Seco, M; Semennikov, A; Senderowska, K; Sepp, I; Serra, N; Serrano, J; Sestini, L; Seyfert, P; Shapkin, M; Shapoval, I; Shcheglov, Y; Shears, T; Shekhtman, L; Shevchenko, V; Shires, A; Silva Coutinho, R; Simi, G; Sirendi, M; Skidmore, N; Skwarnicki, T; Smith, N A; Smith, E; Smith, E; Smith, J; Smith, M; Snoek, H; Sokoloff, M D; Soler, F J P; Soomro, F; Souza, D; Souza De Paula, B; Spaan, B; Sparkes, A; Spinella, F; Spradlin, P; Stagni, F; Stahl, S; Steinkamp, O; Stenyakin, O; Stevenson, S; Stoica, S; Stone, S; Storaci, B; Stracka, S; Straticiuc, M; Straumann, U; Stroili, R; Subbiah, V K; Sun, L; Sutcliffe, W; Swientek, K; Swientek, S; Syropoulos, V; Szczekowski, M; Szczypka, P; Szilard, D; Szumlak, T; T'Jampens, S; Teklishyn, M; Tellarini, G; Teodorescu, E; Teubert, F; Thomas, C; Thomas, E; van Tilburg, J; Tisserand, V; Tobin, M; Tolk, S; Tomassetti, L; Tonelli, D; Topp-Joergensen, S; Torr, N; Tournefier, E; Tourneur, S; Tran, M T; Tresch, M; Tsaregorodtsev, A; Tsopelas, P; Tuning, N; Ubeda Garcia, M; Ukleja, A; Ustyuzhanin, A; Uwer, U; Vagnoni, V; Valenti, G; Vallier, A; Vazquez Gomez, R; Vazquez Regueiro, P; Vázquez Sierra, C; Vecchi, S; Velthuis, J J; Veltri, M; Veneziano, G; Vesterinen, M; Viaud, B; Vieira, D; Vieites Diaz, M; Vilasis-Cardona, X; Vollhardt, A; Volyanskyy, D; Voong, D; Vorobyev, A; Vorobyev, V; Voß, C; Voss, H; de Vries, J A; Waldi, R; Wallace, C; Wallace, R; Walsh, J; Wandernoth, S; Wang, J; Ward, D R; Watson, N K; Webber, A D; Websdale, D; Whitehead, M; Wicht, J; Wiedner, D; Wiggers, L; Wilkinson, G; Williams, M P; Williams, M; Wilson, F F; Wimberley, J; Wishahi, J; Wislicki, W; Witek, M; Wormser, G; Wotton, S A; Wright, S; Wu, S; Wyllie, K; Xie, Y; Xing, Z; Xu, Z; Yang, Z; Yuan, X; Yushchenko, O; Zangoli, M; Zavertyaev, M; Zhang, F; Zhang, L; Zhang, W C; Zhang, Y; Zhelezov, A; Zhokhov, A; Zhong, L; Zvyagin, A

    The polarisation of prompt [Formula: see text] mesons is measured by performing an angular analysis of [Formula: see text] decays using proton-proton collision data, corresponding to an integrated luminosity of 1.0[Formula: see text], collected by the LHCb detector at a centre-of-mass energy of 7 TeV. The polarisation is measured in bins of transverse momentum [Formula: see text] and rapidity [Formula: see text] in the kinematic region [Formula: see text] and [Formula: see text], and is compared to theoretical models. No significant polarisation is observed.

  19. CERC Dataset (Full Hadza Data)

    DEFF Research Database (Denmark)

    2016-01-01

    The dataset includes demographic, behavioral, and religiosity data from eight different populations from around the world. The samples were drawn from: (1) Coastal and (2) Inland Tanna, Vanuatu; (3) Hadzaland, Tanzania; (4) Lovu, Fiji; (5) Pointe aux Piment, Mauritius; (6) Pesqueiro, Brazil; (7......) Kyzyl, Tyva Republic; and (8) Yasawa, Fiji. Related publication: Purzycki, et al. (2016). Moralistic Gods, Supernatural Punishment and the Expansion of Human Sociality. Nature, 530(7590): 327-330....

  20. Scaling laws for TEXT plasma profiles

    International Nuclear Information System (INIS)

    McCool, S.C.; Bravenec, R.V.; Chen, J.Y.; Foster, M.S.; Li, W.L.; Ouroura, A.; Phillips, P.E.; Richards, B.; Wenzel, K.W.; Zhang, Z.M.

    1994-01-01

    Regression analysis has been performed on a number of measured profiles including temperature and density vs. nominal macroscopic operating parameters for TEXT tokamak (pre-upgrade) ohmic plasmas. The resulting simple empirical model has enabled the authors to quickly approximate profiles of electron temperature and density, ion temperature, and soft x-ray brightness, as well as the scalar quantities: total radiated power, q=1 radius, sawtooth period and amplitude, and energy confinement time as a power law of toroidal field, plasma current, chord average density, and fueling gas atomic weight. The model profiles are only applicable to the plasma interior, i.e. within the limiter radius. In most cases the predicted model profiles are within the experimental error bars of measured profiles and are more accurate at predicting profile variation for small operating parameter changes than the measured profiles

  1. Robust keyword retrieval method for OCRed text

    Science.gov (United States)

    Fujii, Yusaku; Takebe, Hiroaki; Tanaka, Hiroshi; Hotta, Yoshinobu

    2011-01-01

    Document management systems have become important because of the growing popularity of electronic filing of documents and scanning of books, magazines, manuals, etc., through a scanner or a digital camera, for storage or reading on a PC or an electronic book. Text information acquired by optical character recognition (OCR) is usually added to the electronic documents for document retrieval. Since texts generated by OCR generally include character recognition errors, robust retrieval methods have been introduced to overcome this problem. In this paper, we propose a retrieval method that is robust against both character segmentation and recognition errors. In the proposed method, the insertion of noise characters and dropping of characters in the keyword retrieval enables robustness against character segmentation errors, and character substitution in the keyword of the recognition candidate for each character in OCR or any other character enables robustness against character recognition errors. The recall rate of the proposed method was 15% higher than that of the conventional method. However, the precision rate was 64% lower.

  2. AJI BLEGODAWA TEXT IN THE PERSPECTIVE OF FUNCTIONAL SYSTEMIC LINGUISTICTS

    Directory of Open Access Journals (Sweden)

    I Wayan Rasna

    2012-11-01

    Full Text Available This research give answers to the following five problems; they are (1 the lexico grammar of AjiBlegodawa Text (Text Aji Blegodawa; hereon abbreviated to TAB; (2 the context of situation (registerand the context of culture (genre of TAB; (4 the ideational, interpersonal and textual meanings of TAB;and (5 the values in TAB. Note taking method was employed for collecting the data needed for thelexicogrammar, the context of situation, the functions, meanings, and the values. The data needed for thecultural context were collected by note taking, questionnaire, observation and structured interview.Structured interview, in which eleven informants were interviewed, was also employed for collecting dataneeded for the values. Functional system linguistics (hereon abbreviated to FSL introduced by Hallidaywas employed to analyze the data (Halliday, 1985: 2004; 2005; (Halliday and Maththiessen, 2004.The findings show that the frequencies of the processes in the text are as follows: the materialprocess appears 674 times (52.29%; the relational process takes place 233 times (18.08% and the mentalprocess occurs 177 (13.73%.With regard to circumstances, the circumstance of location is the most dominant followed by thecircumstance of manner. From the context of situation, it can be identified that the field is black magic;from the participants, it can be identified that the main participant is Blegodawa. The mode issimultaneously used to form the configuration of meaning. It can be revealed that the main participantsupported by the supporting participants kill the victim. Viewed from the cultural point of view, thecultural norms referred to in TAB destroy life. The linguistic functionsin TAB are: 1 ideational function which includes belief, the tradition of the magic world, taboo,historical relationship and ritual; 2 interpersonal function which includes interactive function and selfexpressive function and 3 textual function. The meanings in TAB include

  3. Graphics in Text: A Bibliography. Monograph No. 6.

    Science.gov (United States)

    Macdonald-Ross, Michael; Smith, Eleanor

    This bibliography lists books and articles discussing graphic aspects of human communication. References have been selected for their relevance to the design of self-instructional texts for the adult learner; for the most part, research on younger children, on non-text media, and on non-educational texts is not included. Items are organized into…

  4. Terminology extraction from medical texts in Polish.

    Science.gov (United States)

    Marciniak, Małgorzata; Mykowiecka, Agnieszka

    2014-01-01

    Hospital documents contain free text describing the most important facts relating to patients and their illnesses. These documents are written in specific language containing medical terminology related to hospital treatment. Their automatic processing can help in verifying the consistency of hospital documentation and obtaining statistical data. To perform this task we need information on the phrases we are looking for. At the moment, clinical Polish resources are sparse. The existing terminologies, such as Polish Medical Subject Headings (MeSH), do not provide sufficient coverage for clinical tasks. It would be helpful therefore if it were possible to automatically prepare, on the basis of a data sample, an initial set of terms which, after manual verification, could be used for the purpose of information extraction. Using a combination of linguistic and statistical methods for processing over 1200 children hospital discharge records, we obtained a list of single and multiword terms used in hospital discharge documents written in Polish. The phrases are ordered according to their presumed importance in domain texts measured by the frequency of use of a phrase and the variety of its contexts. The evaluation showed that the automatically identified phrases cover about 84% of terms in domain texts. At the top of the ranked list, only 4% out of 400 terms were incorrect while out of the final 200, 20% of expressions were either not domain related or syntactically incorrect. We also observed that 70% of the obtained terms are not included in the Polish MeSH. Automatic terminology extraction can give results which are of a quality high enough to be taken as a starting point for building domain related terminological dictionaries or ontologies. This approach can be useful for preparing terminological resources for very specific subdomains for which no relevant terminologies already exist. The evaluation performed showed that none of the tested ranking procedures were

  5. Umbilical cord blood glucose levels in full-term newborns

    Directory of Open Access Journals (Sweden)

    A. L. Karpova

    2014-01-01

    Full Text Available The purpose of the investigation was to determine the umbilical cord venous blood level of glucose in full-term newborns and its relationship to the mode of delivery. The investigation included 102 full-term newborn infants, including 33 and 69 babies born via cesar-ean and vaginal delivery, respectively. Umbilical cord serum glucose levels were determined by the glucose oxidase test using a Sap-phire-400 biochemical analyzer. In healthy full-term newborns, the mean umbilical cord blood glucose levels were 4,29±0,88 mmol/1 (minimum, 2,9 mmol/1 and maximum, 5,9 mmol/1. In the babies born via cesarean delivery, the umbilical cord blood concentration of glucose was ascertained to be significantly lower than in those born vaginally (3,84+0,71 mmol/1 versus 4,51+0,87 mmol/1; /><0,0001. Abdominal delivery can be apparently considered to be a risk factor for hypoglycemia in neonatal infants.

  6. What Do You Do With Hands Like These? Close Reading Facilitates Exploration and Text Creation

    Directory of Open Access Journals (Sweden)

    Lindsey Moses

    2013-05-01

    Full Text Available This article shares instructional ideas to enhance language and literacy experiences involving the reading and writing processes of young bilinguals (Spanish and English in Colorado, USA when engaging with informational texts. Informational texts provide language scaffolds for young bilinguals because they build on their background knowledge about the world around them. Drawing on their recognition of real-world concepts found in informational texts, teaching ideas that enrich both academic and social vocabulary are shared. These teaching ideas suggest moving beyond the read aloud and individual reading of informational texts; they suggest instead teaching young learners to ‘read like writers’ and utilize Jenkins and Page’s What Do You Do With a Tail Like This? (2003 as a mentor text. This article includes relevant research, teaching ideas and classroom examples for scaffolding a close reading, ultimately resulting in intercultural explorations as the children share their writing about their home contexts.

  7. Texts A and B of the Horothesia Dossier at Istros

    Directory of Open Access Journals (Sweden)

    James H. Oliver

    2002-10-01

    Full Text Available A bilingual inscription from Istros surviving in two copies is reedited, an archive of decisions of the first and second centuries A.D. concerning the city’s privileges, especially as to fishing.

  8. Examining Text Complexity in the Early Grades

    Science.gov (United States)

    Fitzgerald, Jill; Elmore, Jeff; Hiebert, Elfrieda H.; Koons, Heather H.; Bowen, Kimberly; Sanford-Moore, Eleanor E.; Stenner, A. Jackson

    2016-01-01

    The Common Core raises the stature of texts to new heights, creating a hubbub. The fuss is especially messy at the early grades, where children are expected to read more complex texts than in the past. But early-grades teachers have been given little actionable guidance about text complexity. The authors recently examined early-grades texts to…

  9. PUBLIC SERVICE ADVERTISING: AN ANALYSIS ON TEXT AND SEMIOTICS

    Directory of Open Access Journals (Sweden)

    Ni Wayan Sukarini

    2012-07-01

    Full Text Available This study concerns with text and semiotics analysis on the use of language in public service advertising (PSA. PSA in this study is the text which is especially on health. There are three problems that are analysed in this research, namely: (1 grammatical structure and the lexical of the text; (2 the relationship of trichotomies (representamen, object, and interpretant with the three components of sign in nonverbal aspect; and (3 ideologies and messages conveyed in the verbal and nonverbal signs. Three methods applied in this research respectively including descriptive, qualitative, and interpretative. The type of data was the written one which was taken from printed media in the forms of poster and brochure. The data was collected through five procedures, they are clipping, numbering, coding, picturing, and documenting. As a scientific writing, a number of theories must be applied for the analysis. The relevant theories are semantics, semiotics, speech act, hermeneutics, language function, and text structure. These six theories were applied eclecticly in analysing the grammatical structure, lexicals, signs, and the structure of texts in order to elaborate the meaning, ideology, and message which were being conveyed through the texts of PSA. The result of the analysis showed that the grammatical structure applied in the PSA of health could be classified into the simple structure in the forms of phrase, clause, and sentence. The use of verbs dominated initially in order to express the imperative meaning but still had the purpose of being persuasive. Kinds of lexicals found were very close to disease, reproduction, and health either the general terms, for example victims, medicine or the specific ones like HIV/AIDS, Odha, perinatal, nifas, jampersal, sadari. From the nonverbal aspect, the relationship of trichotomy with the three of sign components are more realistics in the Object with its three sub components. Triadic relationship of three sub

  10. PUBLIC SERVICE ADVERTISING: AN ANALYSIS ON TEXT AND SEMIOTICS

    Directory of Open Access Journals (Sweden)

    Ni Wayan Sukarini

    2015-07-01

    Full Text Available This study concerns with text and semiotics analysis on the use of language in public service advertising (PSA. PSA in this study is the text which is especially on health. There are three problems that are analysed in this research, namely: (1 grammatical structure and the lexical of the text; (2 the relationship of trichotomies (representamen, object, and interpretant with the three components of sign in nonverbal aspect; and (3 ideologies and messages conveyed in the verbal and nonverbal signs. Three methods applied in this research respectively including descriptive, qualitative, and interpretative. The type of data was the written one which was taken from printed media in the forms of poster and brochure. The data was collected through five procedures, they are clipping, numbering, coding, picturing, and documenting. As a scientific writing, a number of theories must be applied for the analysis. The relevant theories are semantics, semiotics, speech act, hermeneutics, language function, and text structure. These six theories were applied eclecticly in analysing the grammatical structure, lexicals, signs, and the structure of texts in order to elaborate the meaning, ideology, and message which were being conveyed through the texts of PSA. The result of the analysis showed that the grammatical structure applied in the PSA of health could be classified into the simple structure in the forms of phrase, clause, and sentence. The use of verbs dominated initially in order to express the imperative meaning but still had the purpose of being persuasive. Kinds of lexicals found were very close to disease, reproduction, and health either the general terms, for example victims, medicine or the specific ones like HIV/AIDS, Odha, perinatal, nifas, jampersal, sadari. From the nonverbal aspect, the relationship of trichotomy with the three of sign components are more realistics in the Object with its three sub components. Triadic relationship of three sub

  11. Gender bias in an occupational therapy text.

    Science.gov (United States)

    Reese, C C

    1987-06-01

    Willard and Spackman's Occupational Therapy does not create an awareness and understanding of the role of women in the field of occupational therapy. Nor does the text include general policy statements or reflections on how gender bias affects our work. What is our true consciousness as women therapists? Maria Mies (1983) wrote that women consent to their own oppression or subordination through silence. "Only when there is a rupture in the 'normal' life of a woman, a divorce, an end of a relationship, is there a chance for her to become conscious of her true condition which had been unconsciously submerged in a patriarchal system" (p. 125). True consciousness occurs in occupational therapy when practitioners avoid the use of activities or occupation in therapy. This is our "rupture". I believe the profession needs to develop a policy statement discussing gender concerns in our theory and practice linked to the progress made by women scientists in anthropology, psychology, sociology, history, and literature. It is critical to good treatment that the gender role factor be included in our research on the generic impact of activity on the individual and small group. As female occupational therapists we have the opportunity to make a significant imprint on a gender-based understanding of the health value of activities in our daily lives.

  12. Incorporating other texts: Intertextuality in Malaysian CSR reports

    Directory of Open Access Journals (Sweden)

    Kumaran Rajandran

    2016-11-01

    Full Text Available In Malaysia, corporate social responsibility (CSR is relatively new but corporations have been required to engage in and disclose their CSR. A typical genre for disclosure is CSR reports and these reports often refer to other texts. The article investigates the act of referencing to other texts or intertextuality in Malaysian CSR reports. It creates an archive of CEO Statements and Environment Sections in CSR reports and studies the archive for keywords, which can identify the incorporated texts. The function of these texts is examined in relation to Malaysia’s corporate context. CSR reports contain explicit references to documents (policies, regulations, reports, research, standards and to individuals/groups (CEOs, stakeholders, expert organizations. The incorporated texts display variation in corporate control, which organizes these texts along an intertextual cline. The cline helps to identify corporate and non-corporate sources among the texts. The selection of incorporated texts may reflect government and stock exchange demands. The texts are not standardized and are relevant for the CSR domain and corporations, where these texts monitor and justify CSR performance. Yet, the incorporated texts may perpetuate inexact reporting because corporations select the texts and the parts of texts to refer to. Since these texts have been employed to scrutinize initiatives and results, CSR reports can claim to represent the “truth” about a corporation’s CSR. Hence, intertextuality serves corporate interests.

  13. Transforming Ethnomathematical Ideas in Western Mathematics Curriculum Texts

    Science.gov (United States)

    Dickenson-Jones, Amelia

    2008-01-01

    When ethnomathematical ideas, that is, the mathematical ideas of different cultural groups, are included in mathematics curriculum texts they can become part of the learning experience in various ways. Once included in western classroom mathematics texts, the ethnomathematical ideas become transformed. The transformations involve changes in form…

  14. Calculus of Elementary Functions, Part II. Student Text. Revised Edition.

    Science.gov (United States)

    Herriot, Sarah T.; And Others

    This course is intended for students who have a thorough knowledge of college preparatory mathematics, including algebra, axiomatic geometry, trigonometry, and analytic geometry. This text, Part II, contains material designed to follow Part I. Chapters included in this text are: (6) Derivatives of Exponential and Related Functions; (7) Area and…

  15. Review Document: Full Software Trigger

    CERN Document Server

    Albrecht, J; Raven, G

    2014-01-01

    This document presents a trigger system for the upgraded LHCb detector, scheduled to begin operation in 2020. This document serves as input for the internal review towards the "DAQ, online and trigger TDR". The proposed trigger system is implemented entirely in software. In this document we show that track reconstruction of a similar quality to that available in the offline algorithms can be performed on the full inelastic $pp$-collision rate, without prior event selections implemented in custom hardware and without relying upon a partial event reconstruction. A track nding eciency of 98.8 % relative to oine can be achieved for tracks with $p_T >$ 500 MeV/$c$. The CPU time required for this reconstruction is about 40 % of the available budget. Proof-of-principle selections are presented which demonstrate that excellent performance is achievable using an inclusive beauty trigger, in addition to exclusive beauty and charm triggers. Finally, it is shown that exclusive beauty and charm selections that do not intr...

  16. Runwien: a text-based interface for the WIEN package

    Science.gov (United States)

    Otero de la Roza, A.; Luaña, Víctor

    2009-05-01

    A new text-based interface for WIEN2k, the full-potential linearized augmented plane-waves (FPLAPW) program, is presented. This code provides an easy to use, yet powerful way of generating arbitrarily large sets of calculations. Thus, properties over a potential energy surface and WIEN2k parameter exploration can be calculated using a simple input text file. This interface also provides new capabilities to the WIEN2k package, such as the calculation of elastic constants on hexagonal systems or the automatic gathering of relevant information. Additionally, runwien is modular, flexible and intuitive. Program summaryProgram title: runwien Catalogue identifier: AECM_v1_0 Program summary URL:http://cpc.cs.qub.ac.uk/summaries/AECM_v1_0.html Program obtainable from: CPC Program Library, Queen's University, Belfast, N. Ireland Licensing provisions: GPL version 3 No. of lines in distributed program, including test data, etc.: 62 567 No. of bytes in distributed program, including test data, etc.: 610 973 Distribution format: tar.gz Programming language: gawk (with locale POSIX or similar) Computer: All running Unix, Linux Operating system: Unix, GNU/Linux Classification: 7.3 External routines: WIEN2k ( http://www.wien2k.at/), GAWK ( http://www.gnu.org/software/gawk/), rename by L. Wall, a Perl script which renames files, modified by R. Barker to check for the existence of target files, gnuplot ( http://www.gnuplot.info/) Subprograms used:Cat Id: ADSY_v1_0/AECB_v1_0, Title: GIBBS/CRITIC, Reference: CPC 158 (2004) 57/CPC 999 (2009) 999 Nature of problem: Creation of a text-based, batch-oriented interface for the WIEN2k package. Solution method: WIEN2k solves the Kohn-Sham equations of a solid using the FPLAPW formalism. Runwien interprets an input file containing the description of the geometry and structure of the solid and drives the execution of the WIEN2k programs. The input is simplified thanks to the default values of the WIEN2k parameters known to runwien. Additional

  17. SAW Classification Algorithm for Chinese Text Classification

    OpenAIRE

    Xiaoli Guo; Huiyu Sun; Tiehua Zhou; Ling Wang; Zhaoyang Qu; Jiannan Zang

    2015-01-01

    Considering the explosive growth of data, the increased amount of text data’s effect on the performance of text categorization forward the need for higher requirements, such that the existing classification method cannot be satisfied. Based on the study of existing text classification technology and semantics, this paper puts forward a kind of Chinese text classification oriented SAW (Structural Auxiliary Word) algorithm. The algorithm uses the special space effect of Chinese text where words...

  18. Emotiogenic Cognitive Function of Modern School Teaching Texts

    Directory of Open Access Journals (Sweden)

    Любовь Васильевна Ерохина

    2015-12-01

    Full Text Available The article is devoted to the analysis of emotional attractiveness of modern school educational texts and ecological/non-ecological influence upon pupils’ cognition in teaching communication. Reasoning is based on the thesis that - emotional attractiveness of modern school educational texts opposes their cognitive function. Emotional educational text profile and its components are under consideration. The article is concerned with ecological and cognitive and emotional asymmetry content. The material under focus is printed texts of some of modern school textbooks, teaching methodical aids, academic competitions, mass media information from the cognitive ecology point of view.

  19. A Comparative Study in Communication Texts & Theories

    Directory of Open Access Journals (Sweden)

    Mahdi Mohsenian Rad

    2011-04-01

    Full Text Available The present article is to study the characteristics and general nature of audience and is described it as "Audience phantasm" in developing countries. It firstly notes that there will be consequences such as audience distancing themselves from local and official media if policy makers and media officials' imaginations of audiences become far from related bare facts have happened in the era of global media and the nature of audiences. Knowing and analyzing the image of audiences presented in new communication theories. Accordingly, as the abovementioned authorities’ viewpoint of audiences exposed to media messages keeps distance from the true nature of media activities, media‐message receivers and their current position in the booming market of media, as termed by Mohsenyan Rad as “Message Bazaar”, there will possibly be disastrous social, cultural, political, and even economic consequences with regard to media uses.Then the history and definitions of “audience”, "uses & gratifications theory" and the concept of "Audience Phantasm" is described. After that, based on those and the increased options as well as the right of selecting of today-audience in the situation of message bazaar, as a result the characteristics of them are explained.

  20. Paratextual Transactions: Text and Off Text in William Blake’s Milton and Jerusalem

    Directory of Open Access Journals (Sweden)

    Annalisa Volpone

    2016-07-01

    Full Text Available Il saggio si concentra sulle dinamiche tra testo e paratesto negli ultimi due libri profetici di William Blake, Milton e Jerusalem. A partire dalle riflessioni di Genette in Soglie, si ragionerà su come parola e immagine interagiscono nelle tavole e concorrono a produrre significato. I libri profetici, in quanto opera d’arte “integrata”, implicano una partecipazione attiva da parte del lettore, che deve saper interpretare i molteplici livelli di interazione tra le componenti testuali e paratestuali, all’interno della complessa cornice rappresentata dalla letteratura apocalittica alla quale l’ultima produzione di Blake appartiene.