WorldWideScience

Sample records for relevant full text

  1. The Medline/full-text research project.

    Science.gov (United States)

    McKinin, E J; Sievert, M; Johnson, E D; Mitchell, J A

    1991-05-01

    This project was designed to test the relative efficacy of index terms and full-text for the retrieval of documents in those MEDLINE journals for which full-text searching was also available. The full-text files used were MEDIS from Mead Data Central and CCML from BRS Information Technologies. One hundred clinical medical topics were searched in these two files as well as the MEDLINE file to accumulate the necessary data. It was found that full-text identified significantly more relevant articles than did the indexed file, MEDLINE. The full-text searches, however, lacked the precision of searches done in the indexed file. Most relevant items missed in the full-text files, but identified in MEDLINE, were missed because the searcher failed to account for some aspect of natural language, used a logical or positional operator that was too restrictive, or included a concept which was implied, but not expressed in the natural language. Very few of the unique relevant full-text citations would have been retrieved by title or abstract alone. Finally, as of July, 1990 the more current issue of a journal was just as likely to appear in MEDLINE as in one of the full-text files.

  2. FTP: Full-Text Publishing?

    Science.gov (United States)

    Jul, Erik

    1992-01-01

    Describes the use of file transfer protocol (FTP) on the INTERNET computer network and considers its use as an electronic publishing system. The differing electronic formats of text files are discussed; the preparation and access of documents are described; and problems are addressed, including a lack of consistency. (LRW)

  3. Academic Journal Embargoes and Full Text Databases.

    Science.gov (United States)

    Brooks, Sam

    2003-01-01

    Documents the reasons for embargoes of academic journals in full text databases (i.e., publisher-imposed delays on the availability of full text content) and provides insight regarding common misconceptions. Tables present data on selected journals covering a cross-section of subjects and publishers and comparing two full text business databases.…

  4. The Weaknesses of Full-Text Searching

    Science.gov (United States)

    Beall, Jeffrey

    2008-01-01

    This paper provides a theoretical critique of the deficiencies of full-text searching in academic library databases. Because full-text searching relies on matching words in a search query with words in online resources, it is an inefficient method of finding information in a database. This matching fails to retrieve synonyms, and it also retrieves…

  5. Where Full-Text Is Viable.

    Science.gov (United States)

    Cotton, P. L.

    1987-01-01

    Defines two types of online databases: source, referring to those intended to be complete in themselves, whether full-text or abstracts; and bibliographic, meaning those that are not complete. Predictions are made about the future growth rate of these two types of databases, as well as full-text versus abstract databases. (EM)

  6. Systematic characterizations of text similarity in full text biomedical publications.

    Science.gov (United States)

    Sun, Zhaohui; Errami, Mounir; Long, Tara; Renard, Chris; Choradia, Nishant; Garner, Harold

    2010-09-15

    Computational methods have been used to find duplicate biomedical publications in MEDLINE. Full text articles are becoming increasingly available, yet the similarities among them have not been systematically studied. Here, we quantitatively investigated the full text similarity of biomedical publications in PubMed Central. 72,011 full text articles from PubMed Central (PMC) were parsed to generate three different datasets: full texts, sections, and paragraphs. Text similarity comparisons were performed on these datasets using the text similarity algorithm eTBLAST. We measured the frequency of similar text pairs and compared it among different datasets. We found that high abstract similarity can be used to predict high full text similarity with a specificity of 20.1% (95% CI [17.3%, 23.1%]) and sensitivity of 99.999%. Abstract similarity and full text similarity have a moderate correlation (Pearson correlation coefficient: -0.423) when the similarity ratio is above 0.4. Among pairs of articles in PMC, method sections are found to be the most repetitive (frequency of similar pairs, methods: 0.029, introduction: 0.0076, results: 0.0043). In contrast, among a set of manually verified duplicate articles, results are the most repetitive sections (frequency of similar pairs, results: 0.94, methods: 0.89, introduction: 0.82). Repetition of introduction and methods sections is more likely to be committed by the same authors (odds of a highly similar pair having at least one shared author, introduction: 2.31, methods: 1.83, results: 1.03). There is also significantly more similarity in pairs of review articles than in pairs containing one review and one nonreview paper (frequency of similar pairs: 0.0167 and 0.0023, respectively). While quantifying abstract similarity is an effective approach for finding duplicate citations, a comprehensive full text analysis is necessary to uncover all potential duplicate citations in the scientific literature and is helpful when

  7. Selecting Full-Text Undergraduate Periodicals Databases.

    Science.gov (United States)

    Still, Julie M.; Kassabian, Vibiana

    1999-01-01

    Examines how libraries and librarians can compare full-text general periodical indices, using ProQuest Direct, Periodical Abstracts (via Ovid), and EBSCOhost as examples. Explores breadth and depth of coverage; manipulation of results (email/download/print); ease of use (searching); and indexing quirks. (AEF)

  8. Multilingual access to full text databases

    International Nuclear Information System (INIS)

    Fluhr, C.; Radwan, K.

    1990-05-01

    Many full text databases are available in only one language, or more, they may contain documents in different languages. Even if the user is able to understand the language of the documents in the database, it could be easier for him to express his need in his own language. For the case of databases containing documents in different languages, it is more simple to formulate the query in one language only and to retrieve documents in different languages. This paper present the developments and the first experiments of multilingual search, applied to french-english pair, for text data in nuclear field, based on the system SPIRIT. After reminding the general problems of full text databases search by queries formulated in natural language, we present the methods used to reformulate the queries and show how they can be expanded for multilingual search. The first results on data in nuclear field are presented (AFCEN norms and INIS abstracts). 4 refs

  9. Database citation in full text biomedical articles.

    Science.gov (United States)

    Kafkas, Şenay; Kim, Jee-Hyub; McEntyre, Johanna R

    2013-01-01

    Molecular biology and literature databases represent essential infrastructure for life science research. Effective integration of these data resources requires that there are structured cross-references at the level of individual articles and biological records. Here, we describe the current patterns of how database entries are cited in research articles, based on analysis of the full text Open Access articles available from Europe PMC. Focusing on citation of entries in the European Nucleotide Archive (ENA), UniProt and Protein Data Bank, Europe (PDBe), we demonstrate that text mining doubles the number of structured annotations of database record citations supplied in journal articles by publishers. Many thousands of new literature-database relationships are found by text mining, since these relationships are also not present in the set of articles cited by database records. We recommend that structured annotation of database records in articles is extended to other databases, such as ArrayExpress and Pfam, entries from which are also cited widely in the literature. The very high precision and high-throughput of this text-mining pipeline makes this activity possible both accurately and at low cost, which will allow the development of new integrated data services.

  10. Is searching full text more effective than searching abstracts?

    Directory of Open Access Journals (Sweden)

    Lin Jimmy

    2009-02-01

    Full Text Available Abstract Background With the growing availability of full-text articles online, scientists and other consumers of the life sciences literature now have the ability to go beyond searching bibliographic records (title, abstract, metadata to directly access full-text content. Motivated by this emerging trend, I posed the following question: is searching full text more effective than searching abstracts? This question is answered by comparing text retrieval algorithms on MEDLINE® abstracts, full-text articles, and spans (paragraphs within full-text articles using data from the TREC 2007 genomics track evaluation. Two retrieval models are examined: bm25 and the ranking algorithm implemented in the open-source Lucene search engine. Results Experiments show that treating an entire article as an indexing unit does not consistently yield higher effectiveness compared to abstract-only search. However, retrieval based on spans, or paragraphs-sized segments of full-text articles, consistently outperforms abstract-only search. Results suggest that highest overall effectiveness may be achieved by combining evidence from spans and full articles. Conclusion Users searching full text are more likely to find relevant articles than searching only abstracts. This finding affirms the value of full text collections for text retrieval and provides a starting point for future work in exploring algorithms that take advantage of rapidly-growing digital archives. Experimental results also highlight the need to develop distributed text retrieval algorithms, since full-text articles are significantly longer than abstracts and may require the computational resources of multiple machines in a cluster. The MapReduce programming model provides a convenient framework for organizing such computations.

  11. Is searching full text more effective than searching abstracts?

    Science.gov (United States)

    Lin, Jimmy

    2009-02-03

    With the growing availability of full-text articles online, scientists and other consumers of the life sciences literature now have the ability to go beyond searching bibliographic records (title, abstract, metadata) to directly access full-text content. Motivated by this emerging trend, I posed the following question: is searching full text more effective than searching abstracts? This question is answered by comparing text retrieval algorithms on MEDLINE abstracts, full-text articles, and spans (paragraphs) within full-text articles using data from the TREC 2007 genomics track evaluation. Two retrieval models are examined: bm25 and the ranking algorithm implemented in the open-source Lucene search engine. Experiments show that treating an entire article as an indexing unit does not consistently yield higher effectiveness compared to abstract-only search. However, retrieval based on spans, or paragraphs-sized segments of full-text articles, consistently outperforms abstract-only search. Results suggest that highest overall effectiveness may be achieved by combining evidence from spans and full articles. Users searching full text are more likely to find relevant articles than searching only abstracts. This finding affirms the value of full text collections for text retrieval and provides a starting point for future work in exploring algorithms that take advantage of rapidly-growing digital archives. Experimental results also highlight the need to develop distributed text retrieval algorithms, since full-text articles are significantly longer than abstracts and may require the computational resources of multiple machines in a cluster. The MapReduce programming model provides a convenient framework for organizing such computations.

  12. Mining biological networks from full-text articles.

    Science.gov (United States)

    Czarnecki, Jan; Shepherd, Adrian J

    2014-01-01

    The study of biological networks is playing an increasingly important role in the life sciences. Many different kinds of biological system can be modelled as networks; perhaps the most important examples are protein-protein interaction (PPI) networks, metabolic pathways, gene regulatory networks, and signalling networks. Although much useful information is easily accessible in publicly databases, a lot of extra relevant data lies scattered in numerous published papers. Hence there is a pressing need for automated text-mining methods capable of extracting such information from full-text articles. Here we present practical guidelines for constructing a text-mining pipeline from existing code and software components capable of extracting PPI networks from full-text articles. This approach can be adapted to tackle other types of biological network.

  13. Classification of protein-protein interaction full-text documents using text and citation network features.

    Science.gov (United States)

    Kolchinsky, Artemy; Abi-Haidar, Alaa; Kaur, Jasleen; Hamed, Ahmed Abdeen; Rocha, Luis M

    2010-01-01

    We participated (as Team 9) in the Article Classification Task of the Biocreative II.5 Challenge: binary classification of full-text documents relevant for protein-protein interaction. We used two distinct classifiers for the online and offline challenges: 1) the lightweight Variable Trigonometric Threshold (VTT) linear classifier we successfully introduced in BioCreative 2 for binary classification of abstracts and 2) a novel Naive Bayes classifier using features from the citation network of the relevant literature. We supplemented the supplied training data with full-text documents from the MIPS database. The lightweight VTT classifier was very competitive in this new full-text scenario: it was a top-performing submission in this task, taking into account the rank product of the Area Under the interpolated precision and recall Curve, Accuracy, Balanced F-Score, and Matthew's Correlation Coefficient performance measures. The novel citation network classifier for the biomedical text mining domain, while not a top performing classifier in the challenge, performed above the central tendency of all submissions, and therefore indicates a promising new avenue to investigate further in bibliome informatics.

  14. 48 CFR 2852.102-270 - Incorporation in full text.

    Science.gov (United States)

    2010-10-01

    ... 48 Federal Acquisition Regulations System 6 2010-10-01 2010-10-01 true Incorporation in full text... 2852.102-270 Incorporation in full text. JAR provisions or clauses shall be incorporated in solicitations and contracts in full text. ...

  15. 48 CFR 1952.102-2 - Incorporation in full text.

    Science.gov (United States)

    2010-10-01

    ... 48 Federal Acquisition Regulations System 6 2010-10-01 2010-10-01 true Incorporation in full text... Clauses 1952.102-2 Incorporation in full text. All IAAR provisions and clauses shall be incorporated in solicitations and/or contracts in full text. ...

  16. Full Text Psychology Journals Available from Popular Library Databases

    Science.gov (United States)

    Joswick, Kathleen E.

    2006-01-01

    The author identified 433 core journals in psychology and investigated their full text availability in popular databases. While 62 percent of the studied journals were available in at least one database, access from individual databases ranged from 1.4 percent to 38.1 percent of the titles. The full text of influential psychology journals is not…

  17. "Free full text articles": where to search for them?

    Science.gov (United States)

    Singh, Ashish; Singh, Manish; Singh, Ajai Kumar; Singh, Deepti; Singh, Pratibha; Sharma, Abhishek

    2011-07-01

    References form the backbone of any medical literature. Presently, because of high inflation, it is very difficult for any library/organization/college to purchase all journals. The condition is even worse for an individual person, such as private practitioners. The solution lies in the free availability of full-text articles. Here, the authors share their experiences about the accessibility of free full-text articles.

  18. Searching for Bill and Jane: Electronic Full-Text Literature.

    Science.gov (United States)

    Still, Julie; Kassabian, Vibiana

    1998-01-01

    Examines electronic full-text literature available on the World Wide Web and on CD-ROM. Discusses authors and genres, electronic texts, and fees. Highlights Shakespeare, Jane Austen, and nature writing. Provides a bibliography of Web guides, specialized Shakespeare pages, and pages dealing with the Shakespeare authorship debate and secondary…

  19. Extractive text summarization system to aid data extraction from full text in systematic review development.

    Science.gov (United States)

    Bui, Duy Duc An; Del Fiol, Guilherme; Hurdle, John F; Jonnalagadda, Siddhartha

    2016-12-01

    Extracting data from publication reports is a standard process in systematic review (SR) development. However, the data extraction process still relies too much on manual effort which is slow, costly, and subject to human error. In this study, we developed a text summarization system aimed at enhancing productivity and reducing errors in the traditional data extraction process. We developed a computer system that used machine learning and natural language processing approaches to automatically generate summaries of full-text scientific publications. The summaries at the sentence and fragment levels were evaluated in finding common clinical SR data elements such as sample size, group size, and PICO values. We compared the computer-generated summaries with human written summaries (title and abstract) in terms of the presence of necessary information for the data extraction as presented in the Cochrane review's study characteristics tables. At the sentence level, the computer-generated summaries covered more information than humans do for systematic reviews (recall 91.2% vs. 83.8%, p<0.001). They also had a better density of relevant sentences (precision 59% vs. 39%, p<0.001). At the fragment level, the ensemble approach combining rule-based, concept mapping, and dictionary-based methods performed better than individual methods alone, achieving an 84.7% F-measure. Computer-generated summaries are potential alternative information sources for data extraction in systematic review development. Machine learning and natural language processing are promising approaches to the development of such an extractive summarization system. Copyright © 2016 Elsevier Inc. All rights reserved.

  20. Subject Retrieval from Full-Text Databases in the Humanities

    Science.gov (United States)

    East, John W.

    2007-01-01

    This paper examines the problems involved in subject retrieval from full-text databases of secondary materials in the humanities. Ten such databases were studied and their search functionality evaluated, focusing on factors such as Boolean operators, document surrogates, limiting by subject area, proximity operators, phrase searching, wildcards,…

  1. Full text and figure display improves bioscience literature search.

    Directory of Open Access Journals (Sweden)

    Anna Divoli

    Full Text Available When reading bioscience journal articles, many researchers focus attention on the figures and their captions. This observation led to the development of the BioText literature search engine, a freely available Web-based application that allows biologists to search over the contents of Open Access Journals, and see figures from the articles displayed directly in the search results. This article presents a qualitative assessment of this system in the form of a usability study with 20 biologist participants using and commenting on the system. 19 out of 20 participants expressed a desire to use a bioscience literature search engine that displays articles' figures alongside the full text search results. 15 out of 20 participants said they would use a caption search and figure display interface either frequently or sometimes, while 4 said rarely and 1 said undecided. 10 out of 20 participants said they would use a tool for searching the text of tables and their captions either frequently or sometimes, while 7 said they would use it rarely if at all, 2 said they would never use it, and 1 was undecided. This study found evidence, supporting results of an earlier study, that bioscience literature search systems such as PubMed should show figures from articles alongside search results. It also found evidence that full text and captions should be searched along with the article title, metadata, and abstract. Finally, for a subset of users and information needs, allowing for explicit search within captions for figures and tables is a useful function, but it is not entirely clear how to cleanly integrate this within a more general literature search interface. Such a facility supports Open Access publishing efforts, as it requires access to full text of documents and the lifting of restrictions in order to show figures in the search interface.

  2. SSRF-PDM and its full-text retrieval improvement

    International Nuclear Information System (INIS)

    Tong Xingfan; Deng Huiyu; Li Zhiming

    2011-01-01

    Project and data management is essential for Shanghai Synchrotron Radiation Facility (SSRF) which is a huge scientific platform for science research and technology development in China. With Product Data Management (PDM) system, SSRF improves its information service greatly. In this paper, we introduce the network structure, configuration modules and client terminals of the PDM system and the improvement in full-text retrieval subsystem, including its algorithms and details of implement in order to optimize the retrieval system.(authors)

  3. Full text and figure display improves bioscience literature search.

    Science.gov (United States)

    Divoli, Anna; Wooldridge, Michael A; Hearst, Marti A

    2010-04-14

    When reading bioscience journal articles, many researchers focus attention on the figures and their captions. This observation led to the development of the BioText literature search engine, a freely available Web-based application that allows biologists to search over the contents of Open Access Journals, and see figures from the articles displayed directly in the search results. This article presents a qualitative assessment of this system in the form of a usability study with 20 biologist participants using and commenting on the system. 19 out of 20 participants expressed a desire to use a bioscience literature search engine that displays articles' figures alongside the full text search results. 15 out of 20 participants said they would use a caption search and figure display interface either frequently or sometimes, while 4 said rarely and 1 said undecided. 10 out of 20 participants said they would use a tool for searching the text of tables and their captions either frequently or sometimes, while 7 said they would use it rarely if at all, 2 said they would never use it, and 1 was undecided. This study found evidence, supporting results of an earlier study, that bioscience literature search systems such as PubMed should show figures from articles alongside search results. It also found evidence that full text and captions should be searched along with the article title, metadata, and abstract. Finally, for a subset of users and information needs, allowing for explicit search within captions for figures and tables is a useful function, but it is not entirely clear how to cleanly integrate this within a more general literature search interface. Such a facility supports Open Access publishing efforts, as it requires access to full text of documents and the lifting of restrictions in order to show figures in the search interface.

  4. Layout-aware text extraction from full-text PDF of scientific articles

    Directory of Open Access Journals (Sweden)

    Ramakrishnan Cartic

    2012-05-01

    Full Text Available Abstract Background The Portable Document Format (PDF is the most commonly used file format for online scientific publications. The absence of effective means to extract text from these PDF files in a layout-aware manner presents a significant challenge for developers of biomedical text mining or biocuration informatics systems that use published literature as an information source. In this paper we introduce the ‘Layout-Aware PDF Text Extraction’ (LA-PDFText system to facilitate accurate extraction of text from PDF files of research articles for use in text mining applications. Results Our paper describes the construction and performance of an open source system that extracts text blocks from PDF-formatted full-text research articles and classifies them into logical units based on rules that characterize specific sections. The LA-PDFText system focuses only on the textual content of the research articles and is meant as a baseline for further experiments into more advanced extraction methods that handle multi-modal content, such as images and graphs. The system works in a three-stage process: (1 Detecting contiguous text blocks using spatial layout processing to locate and identify blocks of contiguous text, (2 Classifying text blocks into rhetorical categories using a rule-based method and (3 Stitching classified text blocks together in the correct order resulting in the extraction of text from section-wise grouped blocks. We show that our system can identify text blocks and classify them into rhetorical categories with Precision1 = 0.96% Recall = 0.89% and F1 = 0.91%. We also present an evaluation of the accuracy of the block detection algorithm used in step 2. Additionally, we have compared the accuracy of the text extracted by LA-PDFText to the text from the Open Access subset of PubMed Central. We then compared this accuracy with that of the text extracted by the PDF2Text system, 2commonly used to extract text from PDF

  5. UKPMC: a full text article resource for the life sciences.

    Science.gov (United States)

    McEntyre, Johanna R; Ananiadou, Sophia; Andrews, Stephen; Black, William J; Boulderstone, Richard; Buttery, Paula; Chaplin, David; Chevuru, Sandeepreddy; Cobley, Norman; Coleman, Lee-Ann; Davey, Paul; Gupta, Bharti; Haji-Gholam, Lesley; Hawkins, Craig; Horne, Alan; Hubbard, Simon J; Kim, Jee-Hyub; Lewin, Ian; Lyte, Vic; MacIntyre, Ross; Mansoor, Sami; Mason, Linda; McNaught, John; Newbold, Elizabeth; Nobata, Chikashi; Ong, Ernest; Pillai, Sharmila; Rebholz-Schuhmann, Dietrich; Rosie, Heather; Rowbotham, Rob; Rupp, C J; Stoehr, Peter; Vaughan, Philip

    2011-01-01

    UK PubMed Central (UKPMC) is a full-text article database that extends the functionality of the original PubMed Central (PMC) repository. The UKPMC project was launched as the first 'mirror' site to PMC, which in analogy to the International Nucleotide Sequence Database Collaboration, aims to provide international preservation of the open and free-access biomedical literature. UKPMC (http://ukpmc.ac.uk) has undergone considerable development since its inception in 2007 and now includes both a UKPMC and PubMed search, as well as access to other records such as Agricola, Patents and recent biomedical theses. UKPMC also differs from PubMed/PMC in that the full text and abstract information can be searched in an integrated manner from one input box. Furthermore, UKPMC contains 'Cited By' information as an alternative way to navigate the literature and has incorporated text-mining approaches to semantically enrich content and integrate it with related database resources. Finally, UKPMC also offers added-value services (UKPMC+) that enable grantees to deposit manuscripts, link papers to grants, publish online portfolios and view citation information on their papers. Here we describe UKPMC and clarify the relationship between PMC and UKPMC, providing historical context and future directions, 10 years on from when PMC was first launched.

  6. Layout-aware text extraction from full-text PDF of scientific articles.

    Science.gov (United States)

    Ramakrishnan, Cartic; Patnia, Abhishek; Hovy, Eduard; Burns, Gully Apc

    2012-05-28

    The Portable Document Format (PDF) is the most commonly used file format for online scientific publications. The absence of effective means to extract text from these PDF files in a layout-aware manner presents a significant challenge for developers of biomedical text mining or biocuration informatics systems that use published literature as an information source. In this paper we introduce the 'Layout-Aware PDF Text Extraction' (LA-PDFText) system to facilitate accurate extraction of text from PDF files of research articles for use in text mining applications. Our paper describes the construction and performance of an open source system that extracts text blocks from PDF-formatted full-text research articles and classifies them into logical units based on rules that characterize specific sections. The LA-PDFText system focuses only on the textual content of the research articles and is meant as a baseline for further experiments into more advanced extraction methods that handle multi-modal content, such as images and graphs. The system works in a three-stage process: (1) Detecting contiguous text blocks using spatial layout processing to locate and identify blocks of contiguous text, (2) Classifying text blocks into rhetorical categories using a rule-based method and (3) Stitching classified text blocks together in the correct order resulting in the extraction of text from section-wise grouped blocks. We show that our system can identify text blocks and classify them into rhetorical categories with Precision1 = 0.96% Recall = 0.89% and F1 = 0.91%. We also present an evaluation of the accuracy of the block detection algorithm used in step 2. Additionally, we have compared the accuracy of the text extracted by LA-PDFText to the text from the Open Access subset of PubMed Central. We then compared this accuracy with that of the text extracted by the PDF2Text system, 2commonly used to extract text from PDF. Finally, we discuss preliminary error analysis for

  7. Full text clustering and relationship network analysis of biomedical publications.

    Directory of Open Access Journals (Sweden)

    Renchu Guan

    Full Text Available Rapid developments in the biomedical sciences have increased the demand for automatic clustering of biomedical publications. In contrast to current approaches to text clustering, which focus exclusively on the contents of abstracts, a novel method is proposed for clustering and analysis of complete biomedical article texts. To reduce dimensionality, Cosine Coefficient is used on a sub-space of only two vectors, instead of computing the Euclidean distance within the space of all vectors. Then a strategy and algorithm is introduced for Semi-supervised Affinity Propagation (SSAP to improve analysis efficiency, using biomedical journal names as an evaluation background. Experimental results show that by avoiding high-dimensional sparse matrix computations, SSAP outperforms conventional k-means methods and improves upon the standard Affinity Propagation algorithm. In constructing a directed relationship network and distribution matrix for the clustering results, it can be noted that overlaps in scope and interests among BioMed publications can be easily identified, providing a valuable analytical tool for editors, authors and readers.

  8. Multilingual access to full text databases; Acces multilingue aux bases de donnees en texte integral

    Energy Technology Data Exchange (ETDEWEB)

    Fluhr, C; Radwan, K [Institut National des Sciences et Techniques Nucleaires (INSTN), Centre d` Etudes de Saclay, 91 - Gif-sur-Yvette (France)

    1990-05-01

    Many full text databases are available in only one language, or more, they may contain documents in different languages. Even if the user is able to understand the language of the documents in the database, it could be easier for him to express his need in his own language. For the case of databases containing documents in different languages, it is more simple to formulate the query in one language only and to retrieve documents in different languages. This paper present the developments and the first experiments of multilingual search, applied to french-english pair, for text data in nuclear field, based on the system SPIRIT. After reminding the general problems of full text databases search by queries formulated in natural language, we present the methods used to reformulate the queries and show how they can be expanded for multilingual search. The first results on data in nuclear field are presented (AFCEN norms and INIS abstracts). 4 refs.

  9. Pharmspresso: a text mining tool for extraction of pharmacogenomic concepts and relationships from full text.

    Science.gov (United States)

    Garten, Yael; Altman, Russ B

    2009-02-05

    Pharmacogenomics studies the relationship between genetic variation and the variation in drug response phenotypes. The field is rapidly gaining importance: it promises drugs targeted to particular subpopulations based on genetic background. The pharmacogenomics literature has expanded rapidly, but is dispersed in many journals. It is challenging, therefore, to identify important associations between drugs and molecular entities--particularly genes and gene variants, and thus these critical connections are often lost. Text mining techniques can allow us to convert the free-style text to a computable, searchable format in which pharmacogenomic concepts (such as genes, drugs, polymorphisms, and diseases) are identified, and important links between these concepts are recorded. Availability of full text articles as input into text mining engines is key, as literature abstracts often do not contain sufficient information to identify these pharmacogenomic associations. Thus, building on a tool called Textpresso, we have created the Pharmspresso tool to assist in identifying important pharmacogenomic facts in full text articles. Pharmspresso parses text to find references to human genes, polymorphisms, drugs and diseases and their relationships. It presents these as a series of marked-up text fragments, in which key concepts are visually highlighted. To evaluate Pharmspresso, we used a gold standard of 45 human-curated articles. Pharmspresso identified 78%, 61%, and 74% of target gene, polymorphism, and drug concepts, respectively. Pharmspresso is a text analysis tool that extracts pharmacogenomic concepts from the literature automatically and thus captures our current understanding of gene-drug interactions in a computable form. We have made Pharmspresso available at http://pharmspresso.stanford.edu.

  10. A text-mining system for extracting metabolic reactions from full-text articles.

    Science.gov (United States)

    Czarnecki, Jan; Nobeli, Irene; Smith, Adrian M; Shepherd, Adrian J

    2012-07-23

    Increasingly biological text mining research is focusing on the extraction of complex relationships relevant to the construction and curation of biological networks and pathways. However, one important category of pathway - metabolic pathways - has been largely neglected.Here we present a relatively simple method for extracting metabolic reaction information from free text that scores different permutations of assigned entities (enzymes and metabolites) within a given sentence based on the presence and location of stemmed keywords. This method extends an approach that has proved effective in the context of the extraction of protein-protein interactions. When evaluated on a set of manually-curated metabolic pathways using standard performance criteria, our method performs surprisingly well. Precision and recall rates are comparable to those previously achieved for the well-known protein-protein interaction extraction task. We conclude that automated metabolic pathway construction is more tractable than has often been assumed, and that (as in the case of protein-protein interaction extraction) relatively simple text-mining approaches can prove surprisingly effective. It is hoped that these results will provide an impetus to further research and act as a useful benchmark for judging the performance of more sophisticated methods that are yet to be developed.

  11. Meditation on OM: Relevance from ancient texts and contemporary science

    Directory of Open Access Journals (Sweden)

    Kumar Sanjay

    2010-01-01

    Full Text Available Background: In Indian scriptures the sacred syllable Om is the primordial sound from which all other sounds and creation emerge which signifies the Supreme Power. Aims: To explore the significance of the syllable OM from ancient texts and effects of OM meditation in contemporary science. Descriptions from ancient texts: The descriptions of Om have been taken from four Upanisads (Mundaka, Mandukya, Svetasvatara, and Katha, the Bhagvad Gita, and Patanjali′s Yoga Sutras. Scientific studies on Om: Autonomic and respiratory studies suggest that there is a combination of mental alertness with physiological rest during the practice of Om meditation. Evoked potentials studies suggest a decrease in sensory transmission time at the level of the auditory association cortices, along with recruitment of more neurons at mesencephalic-diencephalic levels. Conclusion: It is considered that a person who realizes Om, merges with the Absolute. Scientific studies on Om suggest that the mental repetition of Om results in physiological alertness, and increased sensitivity to sensory transmission.

  12. Full text clustering and relationship network analysis of biomedical publications.

    Science.gov (United States)

    Guan, Renchu; Yang, Chen; Marchese, Maurizio; Liang, Yanchun; Shi, Xiaohu

    2014-01-01

    Rapid developments in the biomedical sciences have increased the demand for automatic clustering of biomedical publications. In contrast to current approaches to text clustering, which focus exclusively on the contents of abstracts, a novel method is proposed for clustering and analysis of complete biomedical article texts. To reduce dimensionality, Cosine Coefficient is used on a sub-space of only two vectors, instead of computing the Euclidean distance within the space of all vectors. Then a strategy and algorithm is introduced for Semi-supervised Affinity Propagation (SSAP) to improve analysis efficiency, using biomedical journal names as an evaluation background. Experimental results show that by avoiding high-dimensional sparse matrix computations, SSAP outperforms conventional k-means methods and improves upon the standard Affinity Propagation algorithm. In constructing a directed relationship network and distribution matrix for the clustering results, it can be noted that overlaps in scope and interests among BioMed publications can be easily identified, providing a valuable analytical tool for editors, authors and readers.

  13. Extracting Characteristics of the Study Subjects from Full-Text Articles.

    Science.gov (United States)

    Demner-Fushman, Dina; Mork, James G

    Characteristics of the subjects of biomedical research are important in determining if a publication describing the research is relevant to a search. To facilitate finding relevant publications, MEDLINE citations provide Medical Subject Headings that describe the subjects' characteristics, such as their species, gender, and age. We seek to improve the recommendation of these headings by the Medical Text Indexer (MTI) that supports manual indexing of MEDLINE. To that end, we explore the potential of the full text of the publications. Using simple recall-oriented rule-based methods we determined that adding sentences extracted from the methods sections and captions to the abstracts prior to MTI processing significantly improved recall and F1 score with only a slight drop in precision. Improvements were also achieved in directly assigning several headings extracted from the full text. These results indicate the need for further development of automated methods capable of leveraging the full text for indexing.

  14. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    release of intracellular myocyte components. Clinical sequelae to rhabdomyolysis include hypovolemia, hyperkalemia, metabolic acidosis and acute renal failure which is the most serious complication. Renal failure is caused by renal vasoconstriction, myoglobin and heme protein toxicity. Usual explanations of the cause of.

  15. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    Case presentation. A seventy five year old Libyan man was seen in the urology department of Tripoli Medical Centre, Tripoli, Libya with six month history of left loin pain. The patient noted a mass in the left loin two days before he was assessed in the hospital. Also he started to vomit. There was no history of haematuria.

  16. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    impact of using a robotic dispensing machine in community pharmacies was gathered using a structured questionnaire and analysed in ... dispensing time was also shorter and staff satisfaction increased. ... reference customers who were using a ROWA robotic .... Costs situation Purchase price Stock value Personnel costs.

  17. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    –60 years, in Al-Jala Women Hospital in. Tripoli, Libya. Haemoglobin concentration was measured using an automated haematology analyzer. ... i.e. by relatives and friends of the patient needing blood. A .... More attention should be given.

  18. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    2009-07-20

    Jul 20, 2009 ... Table 1: Influenza pandemics of the 20th and 21st century. Name of ... could be responsible for the rapid human -to- human transmission [21]. Using evolutionary analysis to estimate the timescale of the origins, Smith and his research team from The. University of ... The biology of influenza A viruses is very.

  19. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    2009-01-11

    Jan 11, 2009 ... battery, retail for approximately £30GBP although bulk buying ..... care to store them carefully. Electrode costs .... nerve stimulation does not relieve in labour pain: updated ... (Online : Update Software), 2003(3): p. CD003222.

  20. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    Abstract; The detection of single base mismatches in DNA is important for diagnostics, treatment of ... nucleic acid detectors, and show how such exciplexes can register the presence of .... Titration experiments were carried out using a stock.

  1. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    from Turkey, Mexico, Brazil, USA, and Spain determined some criteria in favor ... female gender, and higher level of education. [9-14]. ... teachers and/or workers in that facility. Then a random ..... Psychosocial profile in favor of organ donation.

  2. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    1Department of Surgery, Obafemi Awolowo, 2Department of Community Health,. Obafemi ... the use of mesh, either open or laparoscopic [15,21], but this ... recurrence. METHODS AND PATIENTS .... TAH-BSO* = Total abdominal hysterectomy and bilateral salpingoophorectomy. Recurrent I.H. # = Recurrent inguinal hernia.

  3. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    fishermen were enrolled at three Marine stations in Basra, Iraq. Demographic data, types .... that are used to sting and kill their prey or for defense. ... cardiotoxic, and dermatonecrotic toxins [1,6]. Figure 3: 4 ... May last a few weeks. The hands ...

  4. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    2009-05-03

    May 3, 2009 ... The patient made an uneventful recovery. The final histopathology report was consistent with metastatic renal carcinoma. The patient was referred to the oncologist but unfortunately defaulted further treatment. . She is currently well and disease free 24 months after metastatectomy. Electronic PDF security ...

  5. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    healing process when wound tensile strength is very low or absent (days 0-30). It is during this time, when .... and two (4.5%) were supraumbilical. Table 1: Age distribution and the outcome of surgery in the 44 women with incisional hernia. Variable. Frequency. Percentage. Age. 60. 9. 15. 10. 6. 4.

  6. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    Original Article. Effects of Gender and Seasonal Variation on the Prevalence of. Bacterial Septicaemia Among Young Children in Benin City,. Nigeria. Omoregie R1,2, Egbe CA2, Ogefere HO1,3, Igbarumah I2, Omijie RE2. 1School of Medical Laboratory Sciences, 2Department of Medical Microbiology, University of Benin.

  7. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    Figure 2 axial CT section with contrast media showing extension of lesion. Figure 3 Photomicrograph revealing many dilated cavernous lymphatic channels filled with eosinophilic coagulum. (Haematoxylin and Eosin section Orginal magnification 40 X). Discussion. Cystic hygroma, known as cystic lymphangioma is a.

  8. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    Abstract; The expression of EGFR and p53 has not been adequately studied as a prognostic tool in urinary bladder tumors. We analyzed 74 bladder cancer samples from Egypt for EGFR and p53 expression using immunohistochemistry. The tumors .... have some potential value in differential diagnosis of problem cases, but ...

  9. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    receptor; TNF-α: Tumor Necrosis Factor–alpha; TGF-β1: transforming growth factor-β1. INTRODUCTION .... hormones that mediate inflammatory and immune responses in a ..... score, lactate, and base deficit), as well as treatment with agents ...

  10. Exploring Culturally Relevant Texts with Kindergartners and Their Families

    Science.gov (United States)

    Schrodt, Katie; Fain, Jeanne Gilliam; Hasty, Michelle

    2015-01-01

    This article shares a monthly curricular model that can help teachers and families bridge the gap between home and school by using literature and response inspired from the cultures in the classroom. Integrating culturally responsive texts and family response journals into the classroom and the home can help children, their families, and peers…

  11. Challenges for automatically extracting molecular interactions from full-text articles.

    Science.gov (United States)

    McIntosh, Tara; Curran, James R

    2009-09-24

    The increasing availability of full-text biomedical articles will allow more biomedical knowledge to be extracted automatically with greater reliability. However, most Information Retrieval (IR) and Extraction (IE) tools currently process only abstracts. The lack of corpora has limited the development of tools that are capable of exploiting the knowledge in full-text articles. As a result, there has been little investigation into the advantages of full-text document structure, and the challenges developers will face in processing full-text articles. We manually annotated passages from full-text articles that describe interactions summarised in a Molecular Interaction Map (MIM). Our corpus tracks the process of identifying facts to form the MIM summaries and captures any factual dependencies that must be resolved to extract the fact completely. For example, a fact in the results section may require a synonym defined in the introduction. The passages are also annotated with negated and coreference expressions that must be resolved.We describe the guidelines for identifying relevant passages and possible dependencies. The corpus includes 2162 sentences from 78 full-text articles. Our corpus analysis demonstrates the necessity of full-text processing; identifies the article sections where interactions are most commonly stated; and quantifies the proportion of interaction statements requiring coherent dependencies. Further, it allows us to report on the relative importance of identifying synonyms and resolving negated expressions. We also experiment with an oracle sentence retrieval system using the corpus as a gold-standard evaluation set. We introduce the MIM corpus, a unique resource that maps interaction facts in a MIM to annotated passages within full-text articles. It is an invaluable case study providing guidance to developers of biomedical IR and IE systems, and can be used as a gold-standard evaluation set for full-text IR tasks.

  12. A Full-Text-Based Search Engine for Finding Highly Matched Documents Across Multiple Categories

    Science.gov (United States)

    Nguyen, Hung D.; Steele, Gynelle C.

    2016-01-01

    This report demonstrates the full-text-based search engine that works on any Web-based mobile application. The engine has the capability to search databases across multiple categories based on a user's queries and identify the most relevant or similar. The search results presented here were found using an Android (Google Co.) mobile device; however, it is also compatible with other mobile phones.

  13. Improving e-book access via a library-developed full-text search tool.

    Science.gov (United States)

    Foust, Jill E; Bergen, Phillip; Maxeiner, Gretchen L; Pawlowski, Peter N

    2007-01-01

    This paper reports on the development of a tool for searching the contents of licensed full-text electronic book (e-book) collections. The Health Sciences Library System (HSLS) provides services to the University of Pittsburgh's medical programs and large academic health system. The HSLS has developed an innovative tool for federated searching of its e-book collections. Built using the XML-based Vivísimo development environment, the tool enables a user to perform a full-text search of over 2,500 titles from the library's seven most highly used e-book collections. From a single "Google-style" query, results are returned as an integrated set of links pointing directly to relevant sections of the full text. Results are also grouped into categories that enable more precise retrieval without reformulation of the search. A heuristic evaluation demonstrated the usability of the tool and a web server log analysis indicated an acceptable level of usage. Based on its success, there are plans to increase the number of online book collections searched. This library's first foray into federated searching has produced an effective tool for searching across large collections of full-text e-books and has provided a good foundation for the development of other library-based federated searching products.

  14. Improving e-book access via a library-developed full-text search tool*

    Science.gov (United States)

    Foust, Jill E.; Bergen, Phillip; Maxeiner, Gretchen L.; Pawlowski, Peter N.

    2007-01-01

    Purpose: This paper reports on the development of a tool for searching the contents of licensed full-text electronic book (e-book) collections. Setting: The Health Sciences Library System (HSLS) provides services to the University of Pittsburgh's medical programs and large academic health system. Brief Description: The HSLS has developed an innovative tool for federated searching of its e-book collections. Built using the XML-based Vivísimo development environment, the tool enables a user to perform a full-text search of over 2,500 titles from the library's seven most highly used e-book collections. From a single “Google-style” query, results are returned as an integrated set of links pointing directly to relevant sections of the full text. Results are also grouped into categories that enable more precise retrieval without reformulation of the search. Results/Evaluation: A heuristic evaluation demonstrated the usability of the tool and a web server log analysis indicated an acceptable level of usage. Based on its success, there are plans to increase the number of online book collections searched. Conclusion: This library's first foray into federated searching has produced an effective tool for searching across large collections of full-text e-books and has provided a good foundation for the development of other library-based federated searching products. PMID:17252065

  15. Effects of Coherence and Relevance on Shallow and Deep Text Processing.

    Science.gov (United States)

    Lehman, Stephen; Schraw, Gregory

    2002-01-01

    Examines the effects of coherence and relevance on shallow and deeper text processing, testing the hypothesis that enhancing the relevance of text segments compensates for breaks in local and global coherence. Results reveal that breaks in local coherence had no effect on any outcome measures, whereas relevance enhanced deeper processing.…

  16. Efficient extraction of protein-protein interactions from full-text articles.

    Science.gov (United States)

    Hakenberg, Jörg; Leaman, Robert; Vo, Nguyen Ha; Jonnalagadda, Siddhartha; Sullivan, Ryan; Miller, Christopher; Tari, Luis; Baral, Chitta; Gonzalez, Graciela

    2010-01-01

    Proteins and their interactions govern virtually all cellular processes, such as regulation, signaling, metabolism, and structure. Most experimental findings pertaining to such interactions are discussed in research papers, which, in turn, get curated by protein interaction databases. Authors, editors, and publishers benefit from efforts to alleviate the tasks of searching for relevant papers, evidence for physical interactions, and proper identifiers for each protein involved. The BioCreative II.5 community challenge addressed these tasks in a competition-style assessment to evaluate and compare different methodologies, to make aware of the increasing accuracy of automated methods, and to guide future implementations. In this paper, we present our approaches for protein-named entity recognition, including normalization, and for extraction of protein-protein interactions from full text. Our overall goal is to identify efficient individual components, and we compare various compositions to handle a single full-text article in between 10 seconds and 2 minutes. We propose strategies to transfer document-level annotations to the sentence-level, which allows for the creation of a more fine-grained training corpus; we use this corpus to automatically derive around 5,000 patterns. We rank sentences by relevance to the task of finding novel interactions with physical evidence, using a sentence classifier built from this training corpus. Heuristics for paraphrasing sentences help to further remove unnecessary information that might interfere with patterns, such as additional adjectives, clauses, or bracketed expressions. In BioCreative II.5, we achieved an f-score of 22 percent for finding protein interactions, and 43 percent for mapping proteins to UniProt IDs; disregarding species, f-scores are 30 percent and 55 percent, respectively. On average, our best-performing setup required around 2 minutes per full text. All data and pattern sets as well as Java classes that

  17. Understanding disciplinary vocabularies using a full-text enabled domain-independent term extraction approach.

    Science.gov (United States)

    Yan, Erjia; Williams, Jake; Chen, Zheng

    2017-01-01

    Publication metadata help deliver rich analyses of scholarly communication. However, research concepts and ideas are more effectively expressed through unstructured fields such as full texts. Thus, the goals of this paper are to employ a full-text enabled method to extract terms relevant to disciplinary vocabularies, and through them, to understand the relationships between disciplines. This paper uses an efficient, domain-independent term extraction method to extract disciplinary vocabularies from a large multidisciplinary corpus of PLoS ONE publications. It finds a power-law pattern in the frequency distributions of terms present in each discipline, indicating a semantic richness potentially sufficient for further study and advanced analysis. The salient relationships amongst these vocabularies become apparent in application of a principal component analysis. For example, Mathematics and Computer and Information Sciences were found to have similar vocabulary use patterns along with Engineering and Physics; while Chemistry and the Social Sciences were found to exhibit contrasting vocabulary use patterns along with the Earth Sciences and Chemistry. These results have implications to studies of scholarly communication as scholars attempt to identify the epistemological cultures of disciplines, and as a full text-based methodology could lead to machine learning applications in the automated classification of scholarly work according to disciplinary vocabularies.

  18. A comprehensive and quantitative comparison of text-mining in 15 million full-text articles versus their corresponding abstracts.

    Science.gov (United States)

    Westergaard, David; Stærfeldt, Hans-Henrik; Tønsberg, Christian; Jensen, Lars Juhl; Brunak, Søren

    2018-02-01

    Across academia and industry, text mining has become a popular strategy for keeping up with the rapid growth of the scientific literature. Text mining of the scientific literature has mostly been carried out on collections of abstracts, due to their availability. Here we present an analysis of 15 million English scientific full-text articles published during the period 1823-2016. We describe the development in article length and publication sub-topics during these nearly 250 years. We showcase the potential of text mining by extracting published protein-protein, disease-gene, and protein subcellular associations using a named entity recognition system, and quantitatively report on their accuracy using gold standard benchmark data sets. We subsequently compare the findings to corresponding results obtained on 16.5 million abstracts included in MEDLINE and show that text mining of full-text articles consistently outperforms using abstracts only.

  19. A comprehensive and quantitative comparison of text-mining in 15 million full-text articles versus their corresponding abstracts

    Science.gov (United States)

    Westergaard, David; Stærfeldt, Hans-Henrik

    2018-01-01

    Across academia and industry, text mining has become a popular strategy for keeping up with the rapid growth of the scientific literature. Text mining of the scientific literature has mostly been carried out on collections of abstracts, due to their availability. Here we present an analysis of 15 million English scientific full-text articles published during the period 1823–2016. We describe the development in article length and publication sub-topics during these nearly 250 years. We showcase the potential of text mining by extracting published protein–protein, disease–gene, and protein subcellular associations using a named entity recognition system, and quantitatively report on their accuracy using gold standard benchmark data sets. We subsequently compare the findings to corresponding results obtained on 16.5 million abstracts included in MEDLINE and show that text mining of full-text articles consistently outperforms using abstracts only. PMID:29447159

  20. "What is relevant in a text document?": An interpretable machine learning approach.

    Directory of Open Access Journals (Sweden)

    Leila Arras

    Full Text Available Text documents can be described by a number of abstract concepts such as semantic category, writing style, or sentiment. Machine learning (ML models have been trained to automatically map documents to these abstract concepts, allowing to annotate very large text collections, more than could be processed by a human in a lifetime. Besides predicting the text's category very accurately, it is also highly desirable to understand how and why the categorization process takes place. In this paper, we demonstrate that such understanding can be achieved by tracing the classification decision back to individual words using layer-wise relevance propagation (LRP, a recently developed technique for explaining predictions of complex non-linear classifiers. We train two word-based ML models, a convolutional neural network (CNN and a bag-of-words SVM classifier, on a topic categorization task and adapt the LRP method to decompose the predictions of these models onto words. Resulting scores indicate how much individual words contribute to the overall classification decision. This enables one to distill relevant information from text documents without an explicit semantic information extraction step. We further use the word-wise relevance scores for generating novel vector-based document representations which capture semantic information. Based on these document vectors, we introduce a measure of model explanatory power and show that, although the SVM and CNN models perform similarly in terms of classification accuracy, the latter exhibits a higher level of explainability which makes it more comprehensible for humans and potentially more useful for other applications.

  1. Prognosis Essay Scoring and Article Relevancy Using Multi-Text Features and Machine Learning

    Directory of Open Access Journals (Sweden)

    Arif Mehmood

    2017-01-01

    Full Text Available This study develops a model for essay scoring and article relevancy. Essay scoring is a costly process when we consider the time spent by an evaluator. It may lead to inequalities of the effort by various evaluators to apply the same evaluation criteria. Bibliometric research uses the evaluation criteria to find relevancy of articles instead. Researchers mostly face relevancy issues while searching articles. Therefore, they classify the articles manually. However, manual classification is burdensome due to time needed for evaluation. The proposed model performs automatic essay evaluation using multi-text features and ensemble machine learning. The proposed method is implemented in two data sets: a Kaggle short answer data set for essay scoring that includes four ranges of disciplines (Science, Biology, English, and English language Arts, and a bibliometric data set having IoT (Internet of Things and non-IoT classes. The efficacy of the model is measured against the Tandalla and AutoP approach using Cohen’s kappa. The model achieves kappa values of 0.80 and 0.83 for the first and second data sets, respectively. Kappa values show that the proposed model has better performance than those of earlier approaches.

  2. A comprehensive and quantitative comparison of text-mining in 15 million full-text articles versus their corresponding abstracts

    DEFF Research Database (Denmark)

    Westergaard, David; Stærfeldt, Hans Henrik; Tønsberg, Christian

    2018-01-01

    Across academia and industry, text mining has become a popular strategy for keeping up with the rapid growth of the scientific literature. Text mining of the scientific literature has mostly been carried out on collections of abstracts, due to their availability. Here we present an analysis of 15...... subcellular associations using a named entity recognition system, and quantitatively report on their accuracy using gold standard benchmark data sets. We subsequently compare the findings to corresponding results obtained on 16.5 million abstracts included in MEDLINE and show that text mining of full...... million English scientific full-text articles published during the period 1823-2016. We describe the development in article length and publication sub-topics during these nearly 250 years. We showcase the potential of text mining by extracting published protein-protein, disease-gene, and protein...

  3. BC4GO: a full-text corpus for the BioCreative IV GO task.

    Science.gov (United States)

    Van Auken, Kimberly; Schaeffer, Mary L; McQuilton, Peter; Laulederkind, Stanley J F; Li, Donghui; Wang, Shur-Jen; Hayman, G Thomas; Tweedie, Susan; Arighi, Cecilia N; Done, James; Müller, Hans-Michael; Sternberg, Paul W; Mao, Yuqing; Wei, Chih-Hsuan; Lu, Zhiyong

    2014-01-01

    Gene function curation via Gene Ontology (GO) annotation is a common task among Model Organism Database groups. Owing to its manual nature, this task is considered one of the bottlenecks in literature curation. There have been many previous attempts at automatic identification of GO terms and supporting information from full text. However, few systems have delivered an accuracy that is comparable with humans. One recognized challenge in developing such systems is the lack of marked sentence-level evidence text that provides the basis for making GO annotations. We aim to create a corpus that includes the GO evidence text along with the three core elements of GO annotations: (i) a gene or gene product, (ii) a GO term and (iii) a GO evidence code. To ensure our results are consistent with real-life GO data, we recruited eight professional GO curators and asked them to follow their routine GO annotation protocols. Our annotators marked up more than 5000 text passages in 200 articles for 1356 distinct GO terms. For evidence sentence selection, the inter-annotator agreement (IAA) results are 9.3% (strict) and 42.7% (relaxed) in F1-measures. For GO term selection, the IAAs are 47% (strict) and 62.9% (hierarchical). Our corpus analysis further shows that abstracts contain ∼ 10% of relevant evidence sentences and 30% distinct GO terms, while the Results/Experiment section has nearly 60% relevant sentences and >70% GO terms. Further, of those evidence sentences found in abstracts, less than one-third contain enough experimental detail to fulfill the three core criteria of a GO annotation. This result demonstrates the need of using full-text articles for text mining GO annotations. Through its use at the BioCreative IV GO (BC4GO) task, we expect our corpus to become a valuable resource for the BioNLP research community. Database URL: http://www.biocreative.org/resources/corpora/bc-iv-go-task-corpus/. Published by Oxford University Press 2014. This work is written by US

  4. Managing nuclear knowledge: IAEA activities and international coordination. Including resource material full text CD-ROM

    International Nuclear Information System (INIS)

    2005-06-01

    The present CD-ROM summarizes some activities carried out by the Departments of Nuclear Energy and Nuclear Safety and Security in the area of nuclear knowledge management in the period 2003-2005. It comprises, as open resource, most of the relevant documents in full text, including policy level documents, reports, presentation material by Member States and meeting summaries. The collection starts with a reprint of the report to the IAEA General Conference 2004 on Nuclear Knowledge [GOV/2004/56-GC(48)/12] summarizing the developments in nuclear knowledge management since the 47th session of the General Conference in 2003 and covers Managing Nuclear Knowledge including safety issues and Information and Strengthening Education and Training for Capacity Building. It contains an excerpt on Nuclear Knowledge from the General Conference Resolution [GC(48)/RES/13] on Strengthening the Agency's Activities Related to Nuclear Science, Technology and Applications. On the CD-ROM itself, all documents can easily be accessed by clicking on their titles on the subject pages (also printed at the end of this Working Material). Part 1 of the CD-ROM covers the activities in the period 2003-2005 and part 2 presents a resource material full text CD-ROM on Managing Nuclear Knowledge issued in October 2003

  5. Full-text publication of abstracts in emergency medicine in Denmark.

    Science.gov (United States)

    Ravn, Anne Katrine; Petersen, Dan Brun; Folkestad, Lars; Hallas, Peter; Brabrand, Mikkel

    2014-05-24

    Abstracts presented at medical conferences or scientific meetings should ideally be published as full-text articles in peer-reviewed journals after initial presentation and feedback regardless of the findings. The aim of this survey was to determine the publication rate of papers presented at the Danish Emergency Medicine Conferences in 2009, 2010 and 2011. Abstracts presented at the conferences were identified and authors contacted to obtain publication information. A further search was conducted using relevant databases. Publication rates for the 2009 and 2010 were approximately 30% (25-31.6%). The publication rate for the 2011 conference was 14.5% within 18 months with an additional 9% under review prior to publication. When comparing full-text publication rates from DEMC to previous international studies in EM Danish EM research community has similar publication rates. However, other more established specialties have higher publication levels. Knowledge of reasons for non-publication could lead to efforts to promote publication like funding; the possibility of discussion between authors and editors at conferences; "publication mentors"; and/or research courses provided by the Danish Society of Emergency Medicine.

  6. The Flip Sides of Full-Text: Superindex and the Harvard Business Review/Online.

    Science.gov (United States)

    Dadlez, Eva M.

    1984-01-01

    This article illustrates similarities between two different types of full-text databases--Superindex, Harvard Business Review/Online--and uses them as arena to demonstrate search and display applications of full-text. The selection of logical operators, full-text search strategies, and keywords and Bibliographic Retrieval Service's Occurrence…

  7. Retrieval of publications addressing shared decision making: an evaluation of full-text searches on medical journal websites.

    Science.gov (United States)

    Blanc, Xavier; Collet, Tinh-Hai; Auer, Reto; Iriarte, Pablo; Krause, Jan; Légaré, France; Cornuz, Jacques; Clair, Carole

    2015-04-07

    Full-text searches of articles increase the recall, defined by the proportion of relevant publications that are retrieved. However, this method is rarely used in medical research due to resource constraints. For the purpose of a systematic review of publications addressing shared decision making, a full-text search method was required to retrieve publications where shared decision making does not appear in the title or abstract. The objective of our study was to assess the efficiency and reliability of full-text searches in major medical journals for identifying shared decision making publications. A full-text search was performed on the websites of 15 high-impact journals in general internal medicine to look up publications of any type from 1996-2011 containing the phrase "shared decision making". The search method was compared with a PubMed search of titles and abstracts only. The full-text search was further validated by requesting all publications from the same time period from the individual journal publishers and searching through the collected dataset. The full-text search for "shared decision making" on journal websites identified 1286 publications in 15 journals compared to 119 through the PubMed search. The search within the publisher-provided publications of 6 journals identified 613 publications compared to 646 with the full-text search on the respective journal websites. The concordance rate was 94.3% between both full-text searches. Full-text searching on medical journal websites is an efficient and reliable way to identify relevant articles in the field of shared decision making for review or other purposes. It may be more widely used in biomedical research in other fields in the future, with the collaboration of publishers and journals toward open-access data.

  8. Moving beyond Text Highlights: Inferring Users' Interests to Improve the Relevance of Retrieval

    Science.gov (United States)

    Balakrishnan, Vimala; Mehmood, Yasir; Nagappan, Yoganathan

    2016-01-01

    Introduction: Studies have indicated that users' text highlighting behaviour can be further manipulated to improve the relevance of retrieved results. This article reports on a study that examined users' text highlight frequency, length and users' copy-paste actions. Method: A binary voting mechanism was employed to determine the weights for the…

  9. Searching Harvard Business Review Online. . . Lessons in Searching a Full Text Database.

    Science.gov (United States)

    Tenopir, Carol

    1985-01-01

    This article examines the Harvard Business Review Online (HBRO) database (bibliographic description fields, abstracts, extracted information, full text, subject descriptors) and reports on 31 sample HBRO searches conducted in Bibliographic Retrieval Services to test differences between searching full text and searching bibliographic record. Sample…

  10. Full-Text Linking: Affiliated versus Nonaffiliated Access in a Free Database.

    Science.gov (United States)

    Grogg, Jill E.; Andreadis, Debra K.; Kirk, Rachel A.

    2002-01-01

    Presents a comparison of access to full-text articles from a free bibliographic database (PubSCIENCE) for affiliated and unaffiliated users. Found that affiliated users had access to more full-text articles than unaffiliated users had, and that both types of users could increase their level of access through additional searching and greater…

  11. tagtog: interactive and text-mining-assisted annotation of gene mentions in PLOS full-text articles.

    Science.gov (United States)

    Cejuela, Juan Miguel; McQuilton, Peter; Ponting, Laura; Marygold, Steven J; Stefancsik, Raymund; Millburn, Gillian H; Rost, Burkhard

    2014-01-01

    The breadth and depth of biomedical literature are increasing year upon year. To keep abreast of these increases, FlyBase, a database for Drosophila genomic and genetic information, is constantly exploring new ways to mine the published literature to increase the efficiency and accuracy of manual curation and to automate some aspects, such as triaging and entity extraction. Toward this end, we present the 'tagtog' system, a web-based annotation framework that can be used to mark up biological entities (such as genes) and concepts (such as Gene Ontology terms) in full-text articles. tagtog leverages manual user annotation in combination with automatic machine-learned annotation to provide accurate identification of gene symbols and gene names. As part of the BioCreative IV Interactive Annotation Task, FlyBase has used tagtog to identify and extract mentions of Drosophila melanogaster gene symbols and names in full-text biomedical articles from the PLOS stable of journals. We show here the results of three experiments with different sized corpora and assess gene recognition performance and curation speed. We conclude that tagtog-named entity recognition improves with a larger corpus and that tagtog-assisted curation is quicker than manual curation. DATABASE URL: www.tagtog.net, www.flybase.org.

  12. The Implementation of Cosine Similarity to Calculate Text Relevance between Two Documents

    Science.gov (United States)

    Gunawan, D.; Sembiring, C. A.; Budiman, M. A.

    2018-03-01

    Rapidly increasing number of web pages or documents leads to topic specific filtering in order to find web pages or documents efficiently. This is a preliminary research that uses cosine similarity to implement text relevance in order to find topic specific document. This research is divided into three parts. The first part is text-preprocessing. In this part, the punctuation in a document will be removed, then convert the document to lower case, implement stop word removal and then extracting the root word by using Porter Stemming algorithm. The second part is keywords weighting. Keyword weighting will be used by the next part, the text relevance calculation. Text relevance calculation will result the value between 0 and 1. The closer value to 1, then both documents are more related, vice versa.

  13. Full-text publication of abstract-presented work in sport and exercise psychology.

    Science.gov (United States)

    Shue, Sarah; Warden, Stuart

    2018-01-01

    Meetings promote information sharing, but do not enable full dissemination of details. A systematic search was conducted for abstracts presented at the 2010 and 2011 Association of Applied Sport Psychology Annual Conferences to determine the full-text dissemination rate of work presented in abstract form and investigate factors influencing this rate. Systematic searches were sequentially conducted to determine whether the abstract-presented work had been published in full-text format in the 5 years following presentation. If a potential full-text publication was identified, information from the conference abstract (eg, results, number of participants in the sample(s), measurement tools used and so on) was compared with the full text to ensure the two entities represented the same body of work. Abstract factors of interest were assessed using logistic regression. Ninety-four out of 423 presented abstracts (22.2%) were published in full text. Odds of full-text publication increased if the abstract was from an international institution, presented in certain conference sections or presented as a lecture. Those attending professional conferences should be cautious when translating data presented at conferences into their applied work because of the low rate of peer-reviewed and full-text publication of the information.

  14. Investigating and Annotating the Role of Citation in Biomedical Full-Text Articles.

    Science.gov (United States)

    Yu, Hong; Agarwal, Shashank; Frid, Nadya

    2009-11-01

    Citations are ubiquitous in scientific articles and play important roles for representing the semantic content of a full-text biomedical article. In this work, we manually examined full-text biomedical articles to analyze the semantic content of citations in full-text biomedical articles. After developing a citation relation schema and annotation guideline, our pilot annotation results show an overall agreement of 0.71, and here we report on the research challenges and the lessons we've learned while trying to overcome them. Our work is a first step toward automatic citation classification in full-text biomedical articles, which may contribute to many text mining tasks, including information retrieval, extraction, summarization, and question answering.

  15. Full-text publication of abstract-presented work in physical therapy: do therapists publish what they preach?

    Science.gov (United States)

    Smith, Heather D; Bogenschutz, Elizabeth D; Bayliss, Amy J; Altenburger, Peter A; Warden, Stuart J

    2011-02-01

    Professional meetings, such as the American Physical Therapy Association's (APTA's) Combined Sections Meeting (CSM), provide forums for sharing information relevant to physical therapy. An indicator of whether therapists fully disseminate their work is the number of full-text peer-reviewed publications that result. The purposes of this study were: (1) to determine the full-text publication rate of work presented in abstract form at CSM and (2) to investigate factors influencing this rate. A systematic search was undertaken to locate full-text publications of work presented in abstract form within the Orthopaedic and Sports Physical Therapy sections at CSM between 2000 and 2004. Eligible publications were published within 5 years following abstract presentation. The influences of APTA section, year of abstract presentation, institution of origin, study design, sample size, study significance, reporting of a funding source, and presentation type on full-text publication rate were assessed. Characteristics of full-text publications were explored. Work presented in 1 out of 4 abstracts (25.4%) progressed to full-text publication. Odds of full-text publication increased if the abstract originated from a doctorate-granting or "other" institution, reported findings of an experimental study, reported a statistically significant finding, included a larger sample size, disclosed a funding source, or was presented as a platform presentation. More than one third (37.8%) of full-text publications were published in the Journal of Orthopaedic and Sports Physical Therapy or Physical Therapy, and 4 out of 10 full-text publications (39.2%) contained at least one major change from information presented in abstract form. The full-text publication rate for information presented in abstract form within the Orthopaedic and Sports Physical Therapy sections at CSM is low relative to comparative disciplines. Caution should be exercised when translating information presented at CSM into

  16. MeSH: a window into full text for document summarization.

    Science.gov (United States)

    Bhattacharya, Sanmitra; Ha-Thuc, Viet; Srinivasan, Padmini

    2011-07-01

    Previous research in the biomedical text-mining domain has historically been limited to titles, abstracts and metadata available in MEDLINE records. Recent research initiatives such as TREC Genomics and BioCreAtIvE strongly point to the merits of moving beyond abstracts and into the realm of full texts. Full texts are, however, more expensive to process not only in terms of resources needed but also in terms of accuracy. Since full texts contain embellishments that elaborate, contextualize, contrast, supplement, etc., there is greater risk for false positives. Motivated by this, we explore an approach that offers a compromise between the extremes of abstracts and full texts. Specifically, we create reduced versions of full text documents that contain only important portions. In the long-term, our goal is to explore the use of such summaries for functions such as document retrieval and information extraction. Here, we focus on designing summarization strategies. In particular, we explore the use of MeSH terms, manually assigned to documents by trained annotators, as clues to select important text segments from the full text documents. Our experiments confirm the ability of our approach to pick the important text portions. Using the ROUGE measures for evaluation, we were able to achieve maximum ROUGE-1, ROUGE-2 and ROUGE-SU4 F-scores of 0.4150, 0.1435 and 0.1782, respectively, for our MeSH term-based method versus the maximum baseline scores of 0.3815, 0.1353 and 0.1428, respectively. Using a MeSH profile-based strategy, we were able to achieve maximum ROUGE F-scores of 0.4320, 0.1497 and 0.1887, respectively. Human evaluation of the baselines and our proposed strategies further corroborates the ability of our method to select important sentences from the full texts. sanmitra-bhattacharya@uiowa.edu; padmini-srinivasan@uiowa.edu.

  17. Building a protein name dictionary from full text: a machine learning term extraction approach

    Directory of Open Access Journals (Sweden)

    Campagne Fabien

    2005-04-01

    Full Text Available Abstract Background The majority of information in the biological literature resides in full text articles, instead of abstracts. Yet, abstracts remain the focus of many publicly available literature data mining tools. Most literature mining tools rely on pre-existing lexicons of biological names, often extracted from curated gene or protein databases. This is a limitation, because such databases have low coverage of the many name variants which are used to refer to biological entities in the literature. Results We present an approach to recognize named entities in full text. The approach collects high frequency terms in an article, and uses support vector machines (SVM to identify biological entity names. It is also computationally efficient and robust to noise commonly found in full text material. We use the method to create a protein name dictionary from a set of 80,528 full text articles. Only 8.3% of the names in this dictionary match SwissProt description lines. We assess the quality of the dictionary by studying its protein name recognition performance in full text. Conclusion This dictionary term lookup method compares favourably to other published methods, supporting the significance of our direct extraction approach. The method is strong in recognizing name variants not found in SwissProt.

  18. Full-text publication of abstracts presented at European Orthodontic Society congresses

    NARCIS (Netherlands)

    Livas, Christos; Pandis, Nikolaos; Ren, Yijin

    2014-01-01

    INTRODUCTION: Empirical evidence has indicated that only a subsample of studies conducted reach full-text publication and this phenomenon has become known as publication bias. A form of publication bias is the selectively delayed full publication of conference abstracts. The objective of this

  19. SERVICES OF FULL-TEXT SEARCHING IN A DISTRIBUTED INFORMATION ENVIRONMENT (PROJECT HUMANITARIANA

    Directory of Open Access Journals (Sweden)

    S. K. Lyapin

    2015-01-01

    Full Text Available Problem statement. We justify the possibility of full-text search services application in both universal and specialized (in terms of resource base digital libraries for the extraction and analysis of the context knowledge in the humanities. The architecture and services of virtual information and resource center for extracting knowledge from the humanitarian texts generated by «Humanitariana» project are described. The functional integration of the resources and services for a full-text search in a distributed decentralized environment, organized in the Internet / Intranet architecture under the control of the client (user browser accessing a variety of independent servers. An algorithm for a distributed full-text query implementation is described. Methods. Method of combining requency-ranked and paragraph-oriented full-text queries is used: the first are used for the preliminary analysis of the subject area or a combination product (explication of "vertical" context, or macro context, the second - for the explication of "horizontal" context, or micro context within copyright paragraph. The results of the frequency-ranked queries are used to compile paragraph-oriented queries. Results. The results of textual research are shown on the topics "The question of fact in Russian philosophy", "The question of loneliness in Russian philosophy and culture". About 50 pieces of context knowledge on the total resource base of about 2,500 full-text resources have been explicated and briefly described to their further expert investigating. Practical significance. The proposed technology (advanced full-text searching services in a distributed information environment can be used for the information support of humanitarian studies and education in the humanities, for functional integration of resources and services of various organizations, for carrying out interdisciplinary research.

  20. Evaluating Open-Source Full-Text Search Engines for Matching ICD-10 Codes.

    Science.gov (United States)

    Jurcău, Daniel-Alexandru; Stoicu-Tivadar, Vasile

    2016-01-01

    This research presents the results of evaluating multiple free, open-source engines on matching ICD-10 diagnostic codes via full-text searches. The study investigates what it takes to get an accurate match when searching for a specific diagnostic code. For each code the evaluation starts by extracting the words that make up its text and continues with building full-text search queries from the combinations of these words. The queries are then run against all the ICD-10 codes until a match indicates the code in question as a match with the highest relative score. This method identifies the minimum number of words that must be provided in order for the search engines choose the desired entry. The engines analyzed include a popular Java-based full-text search engine, a lightweight engine written in JavaScript which can even execute on the user's browser, and two popular open-source relational database management systems.

  1. The structural and content aspects of abstracts versus bodies of full text journal articles are different.

    Science.gov (United States)

    Cohen, K Bretonnel; Johnson, Helen L; Verspoor, Karin; Roeder, Christophe; Hunter, Lawrence E

    2010-09-29

    An increase in work on the full text of journal articles and the growth of PubMedCentral have the opportunity to create a major paradigm shift in how biomedical text mining is done. However, until now there has been no comprehensive characterization of how the bodies of full text journal articles differ from the abstracts that until now have been the subject of most biomedical text mining research. We examined the structural and linguistic aspects of abstracts and bodies of full text articles, the performance of text mining tools on both, and the distribution of a variety of semantic classes of named entities between them. We found marked structural differences, with longer sentences in the article bodies and much heavier use of parenthesized material in the bodies than in the abstracts. We found content differences with respect to linguistic features. Three out of four of the linguistic features that we examined were statistically significantly differently distributed between the two genres. We also found content differences with respect to the distribution of semantic features. There were significantly different densities per thousand words for three out of four semantic classes, and clear differences in the extent to which they appeared in the two genres. With respect to the performance of text mining tools, we found that a mutation finder performed equally well in both genres, but that a wide variety of gene mention systems performed much worse on article bodies than they did on abstracts. POS tagging was also more accurate in abstracts than in article bodies. Aspects of structure and content differ markedly between article abstracts and article bodies. A number of these differences may pose problems as the text mining field moves more into the area of processing full-text articles. However, these differences also present a number of opportunities for the extraction of data types, particularly that found in parenthesized text, that is present in article bodies

  2. Facilitating Full-text Access to Biomedical Literature Using Open Access Resources.

    Science.gov (United States)

    Kang, Hongyu; Hou, Zhen; Li, Jiao

    2015-01-01

    Open access (OA) resources and local libraries often have their own literature databases, especially in the field of biomedicine. We have developed a method of linking a local library to a biomedical OA resource facilitating researchers' full-text article access. The method uses a model based on vector space to measure similarities between two articles in local library and OA resources. The method achieved an F-score of 99.61%. This method of article linkage and mapping between local library and OA resources is available for use. Through this work, we have improved the full-text access of the biomedical OA resources.

  3. Full text publication rates of studies presented at an international emergency medicine scientific meeting.

    Science.gov (United States)

    Chan, Jannet W M; Graham, Colin A

    2011-09-01

    The publication rate of full text papers following an abstract presentation at a medical conference is variable, and few studies have examined the situation with respect to international emergency medicine conferences. This retrospective study aimed to identify the publication rate of abstracts presented at the 2006 International Conference on Emergency Medicine (ICEM) held in Halifax, Canada. The full text publication rate was 33.2%, similar to previous emergency medicine meetings. English language barriers may play a role in the low publication rate seen.

  4. Impact of Metadata on Full-text Information Retrieval Performance: An Experimental Research on a Small Scale Turkish Corpus

    Directory of Open Access Journals (Sweden)

    Çağdaş Çapkın

    2016-12-01

    Full Text Available Information institutions use text-based information retrieval systems to store, index and retrieve metadata, full-text, or both metadata and full-text (hybrid contents. The aim of this research was to evaluate impact of these contents on information retrieval performance. For this purpose, metadata (MIR, full-text (FIR and hybrid (HIR content information retrieval systems were developed with default Lucene information retrieval model for a small scale Turkish corpus. In order to evaluate performance of this three systems, “precision - recall” and “normalized recall” tests were conducted. Experimental findings showed that there were no significant differences between MIR and FIR in mean average precision (MAP performance. On the other hand, MAP performance of HIR was significantly higher in comparison to MIR and FIR. When information retrieval performance was evaluated as user-centered, the “normalized recall” performances of MIR and HIR were significantly higher than FIR. Additionally, there were no significant differences between the systems in retrieved relevant document means. Processing different types of contents such as metadata and full-text had some advantages and disadvantages for information retrieval systems in terms of term management. The advantages brought together in hybrid content processing (HIR and information retrieval performance improved.

  5. 10. National Nuclear Science and Technologies Congress Proceedings Full Texts Volume 1

    International Nuclear Information System (INIS)

    2009-01-01

    X. National Nuclear Science and Technologies Congress was held on 6-9 October 2009 in Mugla, Turkey in the course of collaborative organization undertaken by Turkish Atomic Energy Authority, Mugla University and Sitki Kocman Foundation. This first volume of Proceedings Book contains 75 submitted presentations and 36 of them are full texts on applications of nuclear techniques.

  6. Endnote Referencing Software: Importing references from an Ebsco database, attaching full text, organising your Endnote library

    OpenAIRE

    Turner, Susan

    2017-01-01

    This video demonstrates importing bibliographic references from EBSCO Discovery Service, the same method can be used for all EBSCO databases. \\ud The video also demonstrates how to attach full text files to the references and how to organise your references within the endnote library using groups.

  7. Design of an On-Line Query Language for Full Text Patent Search.

    Science.gov (United States)

    Glantz, Richard S.

    The design of an English-like query language and an interactive computer environment for searching the full text of the U.S. patent collection are discussed. Special attention is paid to achieving a transparent user interface, to providing extremely broad search capabilities (including nested substitution classes, Kleene star events, and domain…

  8. Preparing College Students To Search Full-Text Databases: Is Instruction Necessary?

    Science.gov (United States)

    Riley, Cheryl; Wales, Barbara

    Full-text databases allow Central Missouri State University's clients to access some of the serials that libraries have had to cancel due to escalating subscription costs; EbscoHost, the subject of this study, is one such database. The database is available free to all Missouri residents. A survey was designed consisting of 21 questions intended…

  9. Full Text or Abstract? : Examining Topic Coherence Scores Using Latent Dirichlet Allocation

    NARCIS (Netherlands)

    Syed, S.; Spruit, M.

    2017-01-01

    This paper assesses topic coherence and human topic ranking of uncovered latent topics from scientific publications when utilizing the topic model latent Dirichlet allocation (LDA) on abstract and full-text data. The coherence of a topic, used as a proxy for topic quality, is based on the

  10. Automatically classifying sentences in full-text biomedical articles into Introduction, Methods, Results and Discussion.

    Science.gov (United States)

    Agarwal, Shashank; Yu, Hong

    2009-12-01

    Biomedical texts can be typically represented by four rhetorical categories: Introduction, Methods, Results and Discussion (IMRAD). Classifying sentences into these categories can benefit many other text-mining tasks. Although many studies have applied different approaches for automatically classifying sentences in MEDLINE abstracts into the IMRAD categories, few have explored the classification of sentences that appear in full-text biomedical articles. We first evaluated whether sentences in full-text biomedical articles could be reliably annotated into the IMRAD format and then explored different approaches for automatically classifying these sentences into the IMRAD categories. Our results show an overall annotation agreement of 82.14% with a Kappa score of 0.756. The best classification system is a multinomial naïve Bayes classifier trained on manually annotated data that achieved 91.95% accuracy and an average F-score of 91.55%, which is significantly higher than baseline systems. A web version of this system is available online at-http://wood.ims.uwm.edu/full_text_classifier/.

  11. Full-text automated detection of surgical site infections secondary to neurosurgery in Rennes, France.

    Science.gov (United States)

    Campillo-Gimenez, Boris; Garcelon, Nicolas; Jarno, Pascal; Chapplain, Jean Marc; Cuggia, Marc

    2013-01-01

    The surveillance of Surgical Site Infections (SSI) contributes to the management of risk in French hospitals. Manual identification of infections is costly, time-consuming and limits the promotion of preventive procedures by the dedicated teams. The introduction of alternative methods using automated detection strategies is promising to improve this surveillance. The present study describes an automated detection strategy for SSI in neurosurgery, based on textual analysis of medical reports stored in a clinical data warehouse. The method consists firstly, of enrichment and concept extraction from full-text reports using NOMINDEX, and secondly, text similarity measurement using a vector space model. The text detection was compared to the conventional strategy based on self-declaration and to the automated detection using the diagnosis-related group database. The text-mining approach showed the best detection accuracy, with recall and precision equal to 92% and 40% respectively, and confirmed the interest of reusing full-text medical reports to perform automated detection of SSI.

  12. Beyond genes, proteins, and abstracts: Identifying scientific claims from full-text biomedical articles.

    Science.gov (United States)

    Blake, Catherine

    2010-04-01

    Massive increases in electronically available text have spurred a variety of natural language processing methods to automatically identify relationships from text; however, existing annotated collections comprise only bioinformatics (gene-protein) or clinical informatics (treatment-disease) relationships. This paper introduces the Claim Framework that reflects how authors across biomedical spectrum communicate findings in empirical studies. The Framework captures different levels of evidence by differentiating between explicit and implicit claims, and by capturing under-specified claims such as correlations, comparisons, and observations. The results from 29 full-text articles show that authors report fewer than 7.84% of scientific claims in an abstract, thus revealing the urgent need for text mining systems to consider the full-text of an article rather than just the abstract. The results also show that authors typically report explicit claims (77.12%) rather than an observations (9.23%), correlations (5.39%), comparisons (5.11%) or implicit claims (2.7%). Informed by the initial manual annotations, we introduce an automated approach that uses syntax and semantics to identify explicit claims automatically and measure the degree to which each feature contributes to the overall precision and recall. Results show that a combination of semantics and syntax is required to achieve the best system performance. 2009 Elsevier Inc. All rights reserved.

  13. [Exploration and construction of the full-text database of acupuncture literature in the Republic of China].

    Science.gov (United States)

    Fei, Lin; Zhao, Jing; Leng, Jiahao; Zhang, Shujian

    2017-10-12

    The ALIPORC full-text database is targeted at a specific full-text database of acupuncture literature in the Republic of China. Starting in 2015, till now, the database has been getting completed, focusing on books relevant with acupuncture, articles and advertising documents, accomplished or published in the Republic of China. The construction of this database aims to achieve the source sharing of acupuncture medical literature in the Republic of China through the retrieval approaches to diversity and accurate content presentation, contributes to the exchange of scholars, reduces the paper damage caused by paging and simplify the retrieval of the rare literature. The writers have made the explanation of the database in light of sources, characteristics and current situation of construction; and have discussed on improving the efficiency and integrity of the database and deepening the development of acupuncture literature in the Republic of China.

  14. Database citation in supplementary data linked to Europe PubMed Central full text biomedical articles.

    Science.gov (United States)

    Kafkas, Şenay; Kim, Jee-Hyub; Pi, Xingjun; McEntyre, Johanna R

    2015-01-01

    In this study, we present an analysis of data citation practices in full text research articles and their corresponding supplementary data files, made available in the Open Access set of articles from Europe PubMed Central. Our aim is to investigate whether supplementary data files should be considered as a source of information for integrating the literature with biomolecular databases. Using text-mining methods to identify and extract a variety of core biological database accession numbers, we found that the supplemental data files contain many more database citations than the body of the article, and that those citations often take the form of a relatively small number of articles citing large collections of accession numbers in text-based files. Moreover, citation of value-added databases derived from submission databases (such as Pfam, UniProt or Ensembl) is common, demonstrating the reuse of these resources as datasets in themselves. All the database accession numbers extracted from the supplementary data are publicly accessible from http://dx.doi.org/10.5281/zenodo.11771. Our study suggests that supplementary data should be considered when linking articles with data, in curation pipelines, and in information retrieval tasks in order to make full use of the entire research article. These observations highlight the need to improve the management of supplemental data in general, in order to make this information more discoverable and useful.

  15. Getting more out of biomedical documents with GATE's full lifecycle open source text analytics.

    Directory of Open Access Journals (Sweden)

    Hamish Cunningham

    Full Text Available This software article describes the GATE family of open source text analysis tools and processes. GATE is one of the most widely used systems of its type with yearly download rates of tens of thousands and many active users in both academic and industrial contexts. In this paper we report three examples of GATE-based systems operating in the life sciences and in medicine. First, in genome-wide association studies which have contributed to discovery of a head and neck cancer mutation association. Second, medical records analysis which has significantly increased the statistical power of treatment/outcome models in the UK's largest psychiatric patient cohort. Third, richer constructs in drug-related searching. We also explore the ways in which the GATE family supports the various stages of the lifecycle present in our examples. We conclude that the deployment of text mining for document abstraction or rich search and navigation is best thought of as a process, and that with the right computational tools and data collection strategies this process can be made defined and repeatable. The GATE research programme is now 20 years old and has grown from its roots as a specialist development tool for text processing to become a rather comprehensive ecosystem, bringing together software developers, language engineers and research staff from diverse fields. GATE now has a strong claim to cover a uniquely wide range of the lifecycle of text analysis systems. It forms a focal point for the integration and reuse of advances that have been made by many people (the majority outside of the authors' own group who work in text processing for biomedicine and other areas. GATE is available online under GNU open source licences and runs on all major operating systems. Support is available from an active user and developer community and also on a commercial basis.

  16. Full-text publication of abstracts in emergency medicine in Denmark

    DEFF Research Database (Denmark)

    Ravn, Anne Katrine; Petersen, Dan Brun; Folkestad, Lars

    2014-01-01

    INTRODUCTION: Abstracts presented at medical conferences or scientific meetings should ideally be published as full-text articles in peer-reviewed journals after initial presentation and feedback regardless of the findings. The aim of this survey was to determine the publication rate of papers...... similar publication rates. However, other more established specialties have higher publication levels. Knowledge of reasons for non-publication could lead to efforts to promote publication like funding; the possibility of discussion between authors and editors at conferences; "publication mentors"; and...

  17. The Establishment of the Chinese Full-text Electronic Periodical Database and Service System

    Directory of Open Access Journals (Sweden)

    Huei-Chu Chang

    2003-12-01

    Full Text Available A database covers important journals to critical mass, with powerful search interface, and easy for remote access is the most reasonable electronic resource for users. This article try to start from the project of digitizing bio-medical journals in Taiwan area to the CEPS, discuss the related issues about the selection of journals, the digitized of back issues, the copyright transfer from authors to database producers, the feedback to authors for payment from revenue. It also talks about the flow of journal publishing, marketing, function and the proposed cost-effectiveness in CEPS.[Article content in Chinese

  18. Using distant supervised learning to identify protein subcellular localizations from full-text scientific articles.

    Science.gov (United States)

    Zheng, Wu; Blake, Catherine

    2015-10-01

    Databases of curated biomedical knowledge, such as the protein-locations reflected in the UniProtKB database, provide an accurate and useful resource to researchers and decision makers. Our goal is to augment the manual efforts currently used to curate knowledge bases with automated approaches that leverage the increased availability of full-text scientific articles. This paper describes experiments that use distant supervised learning to identify protein subcellular localizations, which are important to understand protein function and to identify candidate drug targets. Experiments consider Swiss-Prot, the manually annotated subset of the UniProtKB protein knowledge base, and 43,000 full-text articles from the Journal of Biological Chemistry that contain just under 11.5 million sentences. The system achieves 0.81 precision and 0.49 recall at sentence level and an accuracy of 57% on held-out instances in a test set. Moreover, the approach identifies 8210 instances that are not in the UniProtKB knowledge base. Manual inspection of the 50 most likely relations showed that 41 (82%) were valid. These results have immediate benefit to researchers interested in protein function, and suggest that distant supervision should be explored to complement other manual data curation efforts. Copyright © 2015 Elsevier Inc. All rights reserved.

  19. Full-text publication of abstracts presented at European Orthodontic Society congresses.

    Science.gov (United States)

    Livas, Christos; Pandis, Nikolaos; Ren, Yijin

    2014-10-01

    Empirical evidence has indicated that only a subsample of studies conducted reach full-text publication and this phenomenon has become known as publication bias. A form of publication bias is the selectively delayed full publication of conference abstracts. The objective of this article was to examine the publication status of oral abstracts and poster-presentation abstracts, included in the scientific program of the 82nd and 83rd European Orthodontic Society (EOS) congresses, held in 2006 and 2007, and to identify factors associated with full-length publication. A systematic search of PubMed and Google Scholar databases was performed in April 2013 using author names and keywords from the abstract title to locate abstract and full-article publications. Information regarding mode of presentation, type of affiliation, geographical origin, statistical results, and publication details were collected and analyzed using univariable and multivariable logistic regression. Approximately 51 per cent of the EOS 2006 and 55 per cent of the EOS 2007 abstracts appeared in print more than 5 years post congress. A mean period of 1.32 years elapsed between conference and publication date. Mode of presentation (oral or poster), use of statistical analysis, and research subject area were significant predictors for publication success. Inherent discrepancies of abstract reporting, mainly related to presentation of preliminary results and incomplete description of methods, may be considered in analogous studies. On average 52.2 per cent of the abstracts presented at the two EOS conferences reached full publication. Abstracts presented orally, including statistical analysis, were more likely to get published. © The Author 2013. Published by Oxford University Press on behalf of the European Orthodontic Society. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  20. Large-scale extraction of gene interactions from full-text literature using DeepDive.

    Science.gov (United States)

    Mallory, Emily K; Zhang, Ce; Ré, Christopher; Altman, Russ B

    2016-01-01

    A complete repository of gene-gene interactions is key for understanding cellular processes, human disease and drug response. These gene-gene interactions include both protein-protein interactions and transcription factor interactions. The majority of known interactions are found in the biomedical literature. Interaction databases, such as BioGRID and ChEA, annotate these gene-gene interactions; however, curation becomes difficult as the literature grows exponentially. DeepDive is a trained system for extracting information from a variety of sources, including text. In this work, we used DeepDive to extract both protein-protein and transcription factor interactions from over 100,000 full-text PLOS articles. We built an extractor for gene-gene interactions that identified candidate gene-gene relations within an input sentence. For each candidate relation, DeepDive computed a probability that the relation was a correct interaction. We evaluated this system against the Database of Interacting Proteins and against randomly curated extractions. Our system achieved 76% precision and 49% recall in extracting direct and indirect interactions involving gene symbols co-occurring in a sentence. For randomly curated extractions, the system achieved between 62% and 83% precision based on direct or indirect interactions, as well as sentence-level and document-level precision. Overall, our system extracted 3356 unique gene pairs using 724 features from over 100,000 full-text articles. Application source code is publicly available at https://github.com/edoughty/deepdive_genegene_app russ.altman@stanford.edu Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.

  1. Early Career Researchers Demand Full-text and Rely on Google to Find Scholarly Sources

    Directory of Open Access Journals (Sweden)

    Richard Hayman

    2017-12-01

    Full Text Available A Review of: Nicholas, D., Boukacem-Zeghmouri, C., Rodríguez-Bravo, B., Xu, J., Watkinson, A., Abrizah, A., Herman, E., & Świgoń, M. (2017. Where and how early career researchers find scholarly information. Learned Publishing, 30(1, 19-29. http://dx.doi.org/10.1002/leap.1087 Abstract Objective – To examine the attitudes and information behaviours of early career researchers (ECRs when locating scholarly information. Design – Qualitative longitudinal study. Setting – Research participants from the United Kingdom, United States of America, China, France, Malaysia, Poland, and Spain. Subjects – A total 116 participants from various disciplines, aged 35 and younger, who were holding or had previously held a research position, but not in a tenured position. All participants held a doctorate or were in the process of earning one. Methods – Using structured interviews of 60-90 minutes, researchers asked 60 questions of each participant via face-to-face, Skype, or telephone interviews. The interview format and questions were formed via focus groups. Main Results – As part of a longitudinal project, results reported are limited to the first year of the study, and focused on three primary questions identified by the authors: where do ECRs find scholarly information, whether they use their smartphones to locate and read scholarly information, and what social media do they use to find scholarly information. Researchers describe how ECRs themselves interpreted the phrase scholarly information to primarily mean journal articles, while the researchers themselves had a much expanded definition to include professional and “scholarly contacts, ideas, and data” (p. 22. This research shows that Google and Google Scholar are widely used by ECRs for locating scholarly information regardless of discipline, language, or geography. Their analysis by country points to currency and the combined breadth-and-depth search experience that Google provides as

  2. The Searchbench - Combining Sentence-semantic, Full-text and Bibliographic Search in Digital Libraries

    Directory of Open Access Journals (Sweden)

    Ulrich Schäfer

    2013-02-01

    Full Text Available We describe a novel approach to precise searching in the full content of digital libraries. The Searchbench (for search workbench is based on sentence-wise syntactic and semantic natural language processing (NLP of both born-digital and scanned publications in PDF format. The term born-digital means natively digital, i.e. prepared electronically using typesetting systems such as LaTeX, OpenOffice, and the like. In the Searchbench, queries can be formulated as (possibly underspecified statements, consisting of simple subject-predicate-object constructs such as ‘algorithm improves word alignment’. This reduces the number of false hits in large document collections when the search words happen to appear close to each other, but are not semantically related. The method also abstracts from passive voice and predicate synonyms. Moreover, negated statements can be excluded from the search results, and negated antonym predicates again count as synonyms (e.g. not include = exclude.In the Searchbench, a sentence-semantic search can be combined with search filters for classical full-text, bibliographic metadata and automatically computed domain terms. Auto-suggest fields facilitate text input. Queries can be bookmarked or emailed. Furthermore, a novel citation browser in the Searchbench allows graphical navigation in citation networks. These have been extracted automatically from metadata and paper texts. The citation browser displays short phrases from citation sentences at the edges in the citation graph and thus allows students and researchers to quickly browse publications and immerse into a new research field. By clicking on a citation edge, the original citation sentence is shown in context, and optionally also in the original PDF layout.To showcase the usefulness of our research, we have a applied it to a collection of currently approx. 25,000 open access research papers in the field of computational linguistics and language technology, the ACL

  3. Full Text Searching and Customization in the NASA ADS Abstract Service

    Science.gov (United States)

    Eichhorn, G.; Accomazzi, A.; Grant, C. S.; Kurtz, M. J.; Henneken, E. A.; Thompson, D. M.; Murray, S. S.

    2004-01-01

    The NASA-ADS Abstract Service provides a sophisticated search capability for the literature in Astronomy, Planetary Sciences, Physics/Geophysics, and Space Instrumentation. The ADS is funded by NASA and access to the ADS services is free to anybody worldwide without restrictions. It allows the user to search the literature by author, title, and abstract text. The ADS database contains over 3.6 million references, with 965,000 in the Astronomy/Planetary Sciences database, and 1.6 million in the Physics/Geophysics database. 2/3 of the records have full abstracts, the rest are table of contents entries (titles and author lists only). The coverage for the Astronomy literature is better than 95% from 1975. Before that we cover all major journals and many smaller ones. Most of the journal literature is covered back to volume 1. We now get abstracts on a regular basis from most journals. Over the last year we have entered basically all conference proceedings tables of contents that are available at the Harvard Smithsonian Center for Astrophysics library. This has greatly increased the coverage of conference proceedings in the ADS. The ADS also covers the ArXiv Preprints. We download these preprints every night and index all the preprints. They can be searched either together with the other abstracts or separately. There are currently about 260,000 preprints in that database. In January 2004 we have introduced two new services, full text searching and a personal notification service called "myADS". As all other ADS services, these are free to use for anybody.

  4. An exploratory analysis of PubMed's free full-text limit on citation retrieval for clinical questions.

    Science.gov (United States)

    Krieger, Mary M; Richter, Randy R; Austin, Tricia M

    2008-10-01

    The research sought to determine (1) how use of the PubMed free full-text (FFT) limit affects citation retrieval and (2) how use of the FFT limit impacts the types of articles and levels of evidence retrieved. Four clinical questions based on a research agenda for physical therapy were searched in PubMed both with and without the use of the FFT limit. Retrieved citations were examined for relevancy to each question. Abstracts of relevant citations were reviewed to determine the types of articles and levels of evidence. Descriptive analysis was used to compare the total number of citations, number of relevant citations, types of articles, and levels of evidence both with and without the use of the FFT limit. Across all 4 questions, the FFT limit reduced the number of citations to 11.1% of the total number of citations retrieved without the FFT limit. Additionally, high-quality evidence such as systematic reviews and randomized controlled trials were missed when the FFT limit was used. Health sciences librarians play a key role in educating users about the potential impact the FFT limit has on the number of citations, types of articles, and levels of evidence retrieved.

  5. Full-text publication of abstracts presented at meetings of a Latin American scientific society.

    Science.gov (United States)

    Dicembrino, Manuela; Anderson, Mariana; Vely, Ana Gabriela; Ossorio, María Fabiana; Ferrero, Fernando

    2014-12-01

    To estimate the proportion of abstracts presented at meetings of the Latin American Society for Pediatric Research that are fully-published, to describe the reasons for not publishing papers, and to assess the impact of funding on the publication rate. Abstracts presented at meetings held between 2005 and 2009 were included. Authors were contacted and invited to take a survey on the publication of their work or the reasons not to do it. Information was collected on 232 (71.4%) of the 325 abstracts presented. Of these, 58.6% were fully-published (136/232). Funded studies (40.0%) had more chances of publication (OR: 2.2; 95% CI: 1.2-3.9). "Lack of time" was the most common reason for failure to publish (35/96). 58.6% of abstracts presented at meetings of the Latin American Society for Pediatric Research, were published as full-text articles; lack of time was the most common reason for failure to publish. Funded research had more chances of being published.

  6. E2FM: an encrypted and compressed full-text index for collections of genomic sequences.

    Science.gov (United States)

    Montecuollo, Ferdinando; Schmid, Giovannni; Tagliaferri, Roberto

    2017-09-15

    Next Generation Sequencing (NGS) platforms and, more generally, high-throughput technologies are giving rise to an exponential growth in the size of nucleotide sequence databases. Moreover, many emerging applications of nucleotide datasets-as those related to personalized medicine-require the compliance with regulations about the storage and processing of sensitive data. We have designed and carefully engineered E 2 FM -index, a new full-text index in minute space which was optimized for compressing and encrypting nucleotide sequence collections in FASTA format and for performing fast pattern-search queries. E 2 FM -index allows to build self-indexes which occupy till to 1/20 of the storage required by the input FASTA file, thus permitting to save about 95% of storage when indexing collections of highly similar sequences; moreover, it can exactly search the built indexes for patterns in times ranging from few milliseconds to a few hundreds milliseconds, depending on pattern length. Source code is available at https://github.com/montecuollo/E2FM . ferdinando.montecuollo@unicampania.it. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  7. Full text publication rates of papers presented at the British Foot and Ankle Society.

    Science.gov (United States)

    Marsland, D; Mumith, A; Taylor, H P

    2017-07-13

    Techniques in foot and ankle surgery have expanded rapidly in recent years, often presented at national society meetings. It is important that research is published to guide evidence based practice. Many abstracts however do not go on to full text publication. A database was created of all abstracts presented at BOFAS meetings from 2009 to 2013. Computerised searches were performed using PubMed and Google search engines. In total 341 papers were presented, with an overall publication rate of 31.7%. Of 251 clinical papers, 200 were case series (79.6%). Factors associated with publication success included basic science studies, papers related to arthroscopic surgery and research performed outside the UK. A relatively low conversion rate from presentation to publication could be as a result of papers failing to pass the scrutiny of peer review, or that the work is never formally submitted for publication. The information from this study could be used to prioritise future research and promote higher quality research. Copyright © 2017 European Foot and Ankle Society. Published by Elsevier Ltd. All rights reserved.

  8. Scholarly Electronic Full-Text Publications via the Internet: Issues and Impacts

    Science.gov (United States)

    Kosmin, Linda J.

    1999-01-01

    On-line access to complete texts of scholarly journal articles, conference papers, and books is facilitated by rapidly developing World-wide Web Internet access and capabilities. Meanwhile, print publications continue to be produced and read in spite of the proliferation of many networked electronic publications. The purpose of this presentation is to highlight fundamental issues impacting stakeholder groups, as the trend continues towards migration from paper to affordable ubiquitous networked full-text publications. Librarians, publishers, authors and end-users have various viewpoints, interests, and concerns. There are many issues challenging all stakeholder groups. For instance, all share concerns about administering copyright compliance and enforcing fair use. Uncontrollable electronic downstreaming could result in infringed copyright, while limiting a publisher's entitled revenue stream. Moreover, metered fee-based access may hamper scholarly information research. And, self-authoring on the Internet without peer filtering could lead to information clutter. Many related issues challenge librarians in particular. Among these are rising journal subscription prices, regardless if offered in print or electronic. Some electronic offerings are independent of print, others supplement or duplicate print; several publishers presently require subscribing to print in order to access electronic. Furthermore, numbers of publications are n'ow being marketed via the Internet directly to end-users, which can be viewed as encouraging users to bypass the traditional library. A key issue challenging publishers today is the rapidly expanding electronic user base that is demanding delivery of added-value full-text to desktop computers. Also of growing concern appears to be the decline in print sales to libraries, thereby reducing traditional revenue stream potential. Nowadays, publishers are more hesitant about investing in the production of publications geared toward small niche

  9. Generation of silver standard concept annotations from biomedical texts with special relevance to phenotypes.

    Directory of Open Access Journals (Sweden)

    Anika Oellrich

    Full Text Available Electronic health records and scientific articles possess differing linguistic characteristics that may impact the performance of natural language processing tools developed for one or the other. In this paper, we investigate the performance of four extant concept recognition tools: the clinical Text Analysis and Knowledge Extraction System (cTAKES, the National Center for Biomedical Ontology (NCBO Annotator, the Biomedical Concept Annotation System (BeCAS and MetaMap. Each of the four concept recognition systems is applied to four different corpora: the i2b2 corpus of clinical documents, a PubMed corpus of Medline abstracts, a clinical trails corpus and the ShARe/CLEF corpus. In addition, we assess the individual system performances with respect to one gold standard annotation set, available for the ShARe/CLEF corpus. Furthermore, we built a silver standard annotation set from the individual systems' output and assess the quality as well as the contribution of individual systems to the quality of the silver standard. Our results demonstrate that mainly the NCBO annotator and cTAKES contribute to the silver standard corpora (F1-measures in the range of 21% to 74% and their quality (best F1-measure of 33%, independent from the type of text investigated. While BeCAS and MetaMap can contribute to the precision of silver standard annotations (precision of up to 42%, the F1-measure drops when combined with NCBO Annotator and cTAKES due to a low recall. In conclusion, the performances of individual systems need to be improved independently from the text types, and the leveraging strategies to best take advantage of individual systems' annotations need to be revised. The textual content of the PubMed corpus, accession numbers for the clinical trials corpus, and assigned annotations of the four concept recognition systems as well as the generated silver standard annotation sets are available from http://purl.org/phenotype/resources. The textual content

  10. FULIR Full-text Institutional Repository of the Ruđer Bošković Institute

    Directory of Open Access Journals (Sweden)

    Macan, B.

    2014-11-01

    administration is conducted by the librarians, who also provide a helpdesk service for scientists with depositing and copyright issues. On the depositing side of the repository, FULIR has implemented plugins for automatic checking of copyright issues for archiving articles published in journals by automatic searching of SHERPA/RoMEO database and displaying relevant information to users. There is also the possibility of importing records from other databases, such as arXive or PubMED through items ID’s. There are various search and browse possibilities available in FULIR for users, whereof a full-text search is worth mentioning. Statistical plugin (IRStats2 – Beta – version 0.0.4 is also implemented, as well as altmetric plugin (Altmetric – version 1.0.5. FULIR is fully compatible with OpenAIRE infrastructure and as such is the first OA repository in Croatia enabling RBI scientists to satisfy the conditions of European Commission for archiving full-texts of published papers financed under the Horizon 2020 program in OpenAIRE compatible OAR. Future plans for FULIR are related with further work on interoperability issues with the Croatian Scientific Bibliography – CROSBI, to enable pushing and pulling records from one system to another. There is also the need to educate the scientists about the advantages of OA and the importance of archiving full- text documents into institutional repositories, as well as educating them about copyright issues. Helping other Croatian institutions in implementing their own OAR, possibly through centralized infrastructure on the national level is also one of forthcoming activities where experience gathered with FULIR will be of the utmost importance.

  11. Getting more out of biomedical documents with GATE's full lifecycle open source text analytics.

    Science.gov (United States)

    Cunningham, Hamish; Tablan, Valentin; Roberts, Angus; Bontcheva, Kalina

    2013-01-01

    This software article describes the GATE family of open source text analysis tools and processes. GATE is one of the most widely used systems of its type with yearly download rates of tens of thousands and many active users in both academic and industrial contexts. In this paper we report three examples of GATE-based systems operating in the life sciences and in medicine. First, in genome-wide association studies which have contributed to discovery of a head and neck cancer mutation association. Second, medical records analysis which has significantly increased the statistical power of treatment/outcome models in the UK's largest psychiatric patient cohort. Third, richer constructs in drug-related searching. We also explore the ways in which the GATE family supports the various stages of the lifecycle present in our examples. We conclude that the deployment of text mining for document abstraction or rich search and navigation is best thought of as a process, and that with the right computational tools and data collection strategies this process can be made defined and repeatable. The GATE research programme is now 20 years old and has grown from its roots as a specialist development tool for text processing to become a rather comprehensive ecosystem, bringing together software developers, language engineers and research staff from diverse fields. GATE now has a strong claim to cover a uniquely wide range of the lifecycle of text analysis systems. It forms a focal point for the integration and reuse of advances that have been made by many people (the majority outside of the authors' own group) who work in text processing for biomedicine and other areas. GATE is available online under GNU open source licences and runs on all major operating systems. Support is available from an active user and developer community and also on a commercial basis.

  12. Interactive Effects of Working Memory Self-Regulatory Ability and Relevance Instructions on Text Processing

    Science.gov (United States)

    Hamilton, Nancy Jo

    2012-01-01

    Reading is a process that requires the enactment of many cognitive processes. Each of these processes uses a certain amount of working memory resources, which are severely constrained by biology. More efficiency in the function of working memory may mediate the biological limits of same. Reading relevancy instructions may be one such method to…

  13. The BioC-BioGRID corpus: full text articles annotated for curation of protein–protein and genetic interactions

    Science.gov (United States)

    Kim, Sun; Chatr-aryamontri, Andrew; Chang, Christie S.; Oughtred, Rose; Rust, Jennifer; Wilbur, W. John; Comeau, Donald C.; Dolinski, Kara; Tyers, Mike

    2017-01-01

    A great deal of information on the molecular genetics and biochemistry of model organisms has been reported in the scientific literature. However, this data is typically described in free text form and is not readily amenable to computational analyses. To this end, the BioGRID database systematically curates the biomedical literature for genetic and protein interaction data. This data is provided in a standardized computationally tractable format and includes structured annotation of experimental evidence. BioGRID curation necessarily involves substantial human effort by expert curators who must read each publication to extract the relevant information. Computational text-mining methods offer the potential to augment and accelerate manual curation. To facilitate the development of practical text-mining strategies, a new challenge was organized in BioCreative V for the BioC task, the collaborative Biocurator Assistant Task. This was a non-competitive, cooperative task in which the participants worked together to build BioC-compatible modules into an integrated pipeline to assist BioGRID curators. As an integral part of this task, a test collection of full text articles was developed that contained both biological entity annotations (gene/protein and organism/species) and molecular interaction annotations (protein–protein and genetic interactions (PPIs and GIs)). This collection, which we call the BioC-BioGRID corpus, was annotated by four BioGRID curators over three rounds of annotation and contains 120 full text articles curated in a dataset representing two major model organisms, namely budding yeast and human. The BioC-BioGRID corpus contains annotations for 6409 mentions of genes and their Entrez Gene IDs, 186 mentions of organism names and their NCBI Taxonomy IDs, 1867 mentions of PPIs and 701 annotations of PPI experimental evidence statements, 856 mentions of GIs and 399 annotations of GI evidence statements. The purpose, characteristics and possible future

  14. The BioC-BioGRID corpus: full text articles annotated for curation of protein-protein and genetic interactions.

    Science.gov (United States)

    Islamaj Dogan, Rezarta; Kim, Sun; Chatr-Aryamontri, Andrew; Chang, Christie S; Oughtred, Rose; Rust, Jennifer; Wilbur, W John; Comeau, Donald C; Dolinski, Kara; Tyers, Mike

    2017-01-01

    A great deal of information on the molecular genetics and biochemistry of model organisms has been reported in the scientific literature. However, this data is typically described in free text form and is not readily amenable to computational analyses. To this end, the BioGRID database systematically curates the biomedical literature for genetic and protein interaction data. This data is provided in a standardized computationally tractable format and includes structured annotation of experimental evidence. BioGRID curation necessarily involves substantial human effort by expert curators who must read each publication to extract the relevant information. Computational text-mining methods offer the potential to augment and accelerate manual curation. To facilitate the development of practical text-mining strategies, a new challenge was organized in BioCreative V for the BioC task, the collaborative Biocurator Assistant Task. This was a non-competitive, cooperative task in which the participants worked together to build BioC-compatible modules into an integrated pipeline to assist BioGRID curators. As an integral part of this task, a test collection of full text articles was developed that contained both biological entity annotations (gene/protein and organism/species) and molecular interaction annotations (protein-protein and genetic interactions (PPIs and GIs)). This collection, which we call the BioC-BioGRID corpus, was annotated by four BioGRID curators over three rounds of annotation and contains 120 full text articles curated in a dataset representing two major model organisms, namely budding yeast and human. The BioC-BioGRID corpus contains annotations for 6409 mentions of genes and their Entrez Gene IDs, 186 mentions of organism names and their NCBI Taxonomy IDs, 1867 mentions of PPIs and 701 annotations of PPI experimental evidence statements, 856 mentions of GIs and 399 annotations of GI evidence statements. The purpose, characteristics and possible future

  15. Extraction of Pluvial Flood Relevant Volunteered Geographic Information (VGI by Deep Learning from User Generated Texts and Photos

    Directory of Open Access Journals (Sweden)

    Yu Feng

    2018-01-01

    Full Text Available In recent years, pluvial floods caused by extreme rainfall events have occurred frequently. Especially in urban areas, they lead to serious damages and endanger the citizens’ safety. Therefore, real-time information about such events is desirable. With the increasing popularity of social media platforms, such as Twitter or Instagram, information provided by voluntary users becomes a valuable source for emergency response. Many applications have been built for disaster detection and flood mapping using crowdsourcing. Most of the applications so far have merely used keyword filtering or classical language processing methods to identify disaster relevant documents based on user generated texts. As the reliability of social media information is often under criticism, the precision of information retrieval plays a significant role for further analyses. Thus, in this paper, high quality eyewitnesses of rainfall and flooding events are retrieved from social media by applying deep learning approaches on user generated texts and photos. Subsequently, events are detected through spatiotemporal clustering and visualized together with these high quality eyewitnesses in a web map application. Analyses and case studies are conducted during flooding events in Paris, London and Berlin.

  16. Dynamic programming re-ranking for PPI interactor and pair extraction in full-text articles

    Science.gov (United States)

    2011-01-01

    Background Experimentally verified protein-protein interactions (PPIs) cannot be easily retrieved by researchers unless they are stored in PPI databases. The curation of such databases can be facilitated by employing text-mining systems to identify genes which play the interactor role in PPIs and to map these genes to unique database identifiers (interactor normalization task or INT) and then to return a list of interaction pairs for each article (interaction pair task or IPT). These two tasks are evaluated in terms of the area under curve of the interpolated precision/recall (AUC iP/R) score because the order of identifiers in the output list is important for ease of curation. Results Our INT system developed for the BioCreAtIvE II.5 INT challenge achieved a promising AUC iP/R of 43.5% by using a support vector machine (SVM)-based ranking procedure. Using our new re-ranking algorithm, we have been able to improve system performance (AUC iP/R) by 1.84%. Our experimental results also show that with the re-ranked INT results, our unsupervised IPT system can achieve a competitive AUC iP/R of 23.86%, which outperforms the best BC II.5 INT system by 1.64%. Compared to using only SVM ranked INT results, using re-ranked INT results boosts AUC iP/R by 7.84%. Statistical significance t-test results show that our INT/IPT system with re-ranking outperforms that without re-ranking by a statistically significant difference. Conclusions In this paper, we present a new re-ranking algorithm that considers co-occurrence among identifiers in an article to improve INT and IPT ranking results. Combining the re-ranked INT results with an unsupervised approach to find associations among interactors, the proposed method can boost the IPT performance. We also implement score computation using dynamic programming, which is faster and more efficient than traditional approaches. PMID:21342534

  17. Commercial Database Design vs. Library Terminology Comprehension: Why Do Students Print Abstracts Instead of Full-Text Articles?

    Science.gov (United States)

    Imler, Bonnie; Eichelberger, Michelle

    2014-01-01

    When asked to print the full text of an article, many undergraduate college students print the abstract instead of the full text. This study seeks to determine the underlying cause(s) of this confusion. In this quantitative study, participants (n = 40) performed five usability tasks to assess ease of use and usefulness of five commercial library…

  18. download full text

    African Journals Online (AJOL)

    Hence, the main objective of the research was to carry out scientific studies on its ... The animals were sacrificed on day 30 after the NIB scoring and blood sample ... effect on locomotion and rearing activities when compared with the control.

  19. download full text

    African Journals Online (AJOL)

    The overshadowing of education policies in foreign language education at primary .... Cummins states that a threshold level of linguistics competence must be ..... language education planning is designed to accommodate these interests.

  20. download full text

    African Journals Online (AJOL)

    Keywords: Technology, French as a foreign language, Learners, Instruction ... This translates to an increase of 3.7 percent or 1.4 million new mobile subscriptions ... technology (ICT) in foreign language learning and the availability as well as capacities ..... In spite of the many benefits of creating an authentic French learning ...

  1. download full text

    African Journals Online (AJOL)

    Dale E. Zand (1997) argues that People once stood in awe of electricity, until ... in today's information-driven organizations: knowledge, trust, and power. ..... people's culture and resistance to anti-corruption efforts constitute the firmly fixed load.

  2. download full text

    African Journals Online (AJOL)

    Epidemiological study has shown that 2.5 million deaths occurred every year as a result of vaccine-preventable diseases, mainly in Africa and Asia among children less than 5 years old (GIVS, 2005). Immunization is the process of conferring increased resistance to an infectious disease by a means other than experiencing ...

  3. download full text

    African Journals Online (AJOL)

    UNIVERSITY OF BENIN

    By paying strict attention to the manipulation of action and dialogue, the short story ... through the workings of the human mind as he reacts to various predicaments. .... In “A Caring Man,” in A Forest of Flowers, Ken Saro-Wiwa illustrates the theme of .... until his small dirty pillow is thrown out of the window of the moving train.

  4. download full text

    African Journals Online (AJOL)

    ... country and mass migration of the farming communities to IDP camps in major cities ..... "Global Warming Impact: Flood Events, Wet-Dry Conditions and Changing ... Global Environmental Change, Vol. 16, pp. 268-281.Web. Adger, W. N. (1999). "Social Vulnerability to Climate Change and Extremes in Coastal Vietnam.

  5. download full text

    African Journals Online (AJOL)

    Pablo Rubio Gijon

    Hishongwa belongs to a generation of writers who created a new style of expression in .... authority, can turn this authority into something even more autocratic. ... leadership of that (liberation) struggle” (Haarhoff 224), Hishongwa's Marrying ...

  6. download full text

    African Journals Online (AJOL)

    In English, this class includes the particles how, too, so, and as (Ibid). (3) Mary is ...... Doctoral thesis (unpublished), University of Dar es Salaam. Goodness, D. .... manga. fat. corpulent. 36. mbindipindi. green. 37. mwalo. naughty. absurd. 38.

  7. download full text

    African Journals Online (AJOL)

    Adopting a surveillance system for antibacterial use has therefore become a more realistic ..... Financial support was obtained from the African Poverty Related Infection ... classification and Defined Daily Dose system methodology in Canada.

  8. download full text

    African Journals Online (AJOL)

    Oita Etyang

    The concept democracy has been part of man's political life for ages. ... Taking the queue form Bratton and Mattes, we add that prospects of a stable democracy are ..... of the resulting instability that emanate from entrenched ethnic cleavages.

  9. download full text

    African Journals Online (AJOL)

    TAOFEEK YUSUF

    The data used were obtained through questionnaires administered to ... Keywords: academic performance, engineering education, undergraduate students, and .... and commitment to studies irrespective of any form of learning task Yusuf et al.

  10. download full text

    African Journals Online (AJOL)

    Njeri

    He took his children to St Marys, I could not afford to do so. ... place in universities should have been an important learning space for students. ... just us we are fascinated by Manchester football clubs and western movies as well as music.

  11. download full text

    African Journals Online (AJOL)

    Language and Meaning: A Syntactic Study of Wale Okediran's Strange Encounters ... own communication role, making assertions, asking questions, giving orders, ... I will go straight to the police with all the things you stole from the hospital.

  12. Full-text

    African Journals Online (AJOL)

    ADOWIE PERE

    consideration the needs of the current generation without risking ability of future generations to attain their needs. Evaluation of .... If an element or a number such as x and a collection such as A ... defined as definitive and accurate. This also ...

  13. download full text

    African Journals Online (AJOL)

    paula fiona mwikali

    The Portrayal of Masculinity in Dholuo Ohangla Music ... The Luo culture is built on patriarchy and the socialization of the children ..... A leader must be strong because those he/ she leads look up to him/her for direction, assistance and development. ... Being a loyal lieutenant of Orange Democratic Movement, Anyanga ...

  14. Coreference annotation and resolution in the Colorado Richly Annotated Full Text (CRAFT) corpus of biomedical journal articles.

    Science.gov (United States)

    Cohen, K Bretonnel; Lanfranchi, Arrick; Choi, Miji Joo-Young; Bada, Michael; Baumgartner, William A; Panteleyeva, Natalya; Verspoor, Karin; Palmer, Martha; Hunter, Lawrence E

    2017-08-17

    Coreference resolution is the task of finding strings in text that have the same referent as other strings. Failures of coreference resolution are a common cause of false negatives in information extraction from the scientific literature. In order to better understand the nature of the phenomenon of coreference in biomedical publications and to increase performance on the task, we annotated the Colorado Richly Annotated Full Text (CRAFT) corpus with coreference relations. The corpus was manually annotated with coreference relations, including identity and appositives for all coreferring base noun phrases. The OntoNotes annotation guidelines, with minor adaptations, were used. Interannotator agreement ranges from 0.480 (entity-based CEAF) to 0.858 (Class-B3), depending on the metric that is used to assess it. The resulting corpus adds nearly 30,000 annotations to the previous release of the CRAFT corpus. Differences from related projects include a much broader definition of markables, connection to extensive annotation of several domain-relevant semantic classes, and connection to complete syntactic annotation. Tool performance was benchmarked on the data. A publicly available out-of-the-box, general-domain coreference resolution system achieved an F-measure of 0.14 (B3), while a simple domain-adapted rule-based system achieved an F-measure of 0.42. An ensemble of the two reached F of 0.46. Following the IDENTITY chains in the data would add 106,263 additional named entities in the full 97-paper corpus, for an increase of 76% percent in the semantic classes of the eight ontologies that have been annotated in earlier versions of the CRAFT corpus. The project produced a large data set for further investigation of coreference and coreference resolution in the scientific literature. The work raised issues in the phenomenon of reference in this domain and genre, and the paper proposes that many mentions that would be considered generic in the general domain are not

  15. Reported estimates of diagnostic accuracy in ophthalmology conference abstracts were not associated with full-text publication.

    Science.gov (United States)

    Korevaar, Daniël A; Cohen, Jérémie F; Spijker, René; Saldanha, Ian J; Dickersin, Kay; Virgili, Gianni; Hooft, Lotty; Bossuyt, Patrick M M

    2016-11-01

    To assess whether conference abstracts that report higher estimates of diagnostic accuracy are more likely to reach full-text publication in a peer-reviewed journal. We identified abstracts describing diagnostic accuracy studies, presented between 2007 and 2010 at the Association for Research in Vision and Ophthalmology (ARVO) Annual Meeting. We extracted reported estimates of sensitivity, specificity, area under the receiver operating characteristic curve (AUC), and diagnostic odds ratio (DOR). Between May and July 2015, we searched MEDLINE and EMBASE to identify corresponding full-text publications; if needed, we contacted abstract authors. Cox regression was performed to estimate associations with full-text publication, where sensitivity, specificity, and AUC were logit transformed, and DOR was log transformed. A full-text publication was found for 226/399 (57%) included abstracts. There was no association between reported estimates of sensitivity and full-text publication (hazard ratio [HR] 1.09 [95% confidence interval {CI} 0.98, 1.22]). The same applied to specificity (HR 1.00 [95% CI 0.88, 1.14]), AUC (HR 0.91 [95% CI 0.75, 1.09]), and DOR (HR 1.01 [95% CI 0.94, 1.09]). Almost half of the ARVO conference abstracts describing diagnostic accuracy studies did not reach full-text publication. Studies in abstracts that mentioned higher accuracy estimates were not more likely to be reported in a full-text publication. Copyright © 2016 Elsevier Inc. All rights reserved.

  16. Generation of silver standard concept annotations from biomedical texts with special relevance to phenotypes.

    Science.gov (United States)

    Oellrich, Anika; Collier, Nigel; Smedley, Damian; Groza, Tudor

    2015-01-01

    Electronic health records and scientific articles possess differing linguistic characteristics that may impact the performance of natural language processing tools developed for one or the other. In this paper, we investigate the performance of four extant concept recognition tools: the clinical Text Analysis and Knowledge Extraction System (cTAKES), the National Center for Biomedical Ontology (NCBO) Annotator, the Biomedical Concept Annotation System (BeCAS) and MetaMap. Each of the four concept recognition systems is applied to four different corpora: the i2b2 corpus of clinical documents, a PubMed corpus of Medline abstracts, a clinical trails corpus and the ShARe/CLEF corpus. In addition, we assess the individual system performances with respect to one gold standard annotation set, available for the ShARe/CLEF corpus. Furthermore, we built a silver standard annotation set from the individual systems' output and assess the quality as well as the contribution of individual systems to the quality of the silver standard. Our results demonstrate that mainly the NCBO annotator and cTAKES contribute to the silver standard corpora (F1-measures in the range of 21% to 74%) and their quality (best F1-measure of 33%), independent from the type of text investigated. While BeCAS and MetaMap can contribute to the precision of silver standard annotations (precision of up to 42%), the F1-measure drops when combined with NCBO Annotator and cTAKES due to a low recall. In conclusion, the performances of individual systems need to be improved independently from the text types, and the leveraging strategies to best take advantage of individual systems' annotations need to be revised. The textual content of the PubMed corpus, accession numbers for the clinical trials corpus, and assigned annotations of the four concept recognition systems as well as the generated silver standard annotation sets are available from http://purl.org/phenotype/resources. The textual content of the Sh

  17. A full-text english database of testimonies of those exposed to radiation near the Semipalatinsk nuclear test site, Kazakhstan

    OpenAIRE

    Matsuo, Masatsugu; Kawano, Noriyuki; Hirabayashi, Kyoko; Tooka, Yasuyuki; Apsalikov, Kazbek Negamatovich; Hoshi, Masaharu

    2004-01-01

    The present paper is a sequel to the initial report (Kawano et al 2003a) of the project for a full-text Japanese database of the testimonies of those exposed to radiation near the nuclear test site of Semipalatinsk, Kazakhstan. 139 testimonies were gathered in four villages near Semipalatinsk in 2002. We translated them into English from Russian and Kazakh, and created a full-text database by using a Latin script text retrieval program, TERESA. The present paper attempts at essentially the sa...

  18. Empirical investigations into full-text protein interaction Article Categorization Task (ACT) in the BioCreative II.5 Challenge.

    Science.gov (United States)

    Lan, Man; Su, Jian

    2010-01-01

    The selection of protein interaction documents is one important application for biology research and has a direct impact on the quality of downstream BioNLP applications, i.e., information extraction and retrieval, summarization, QA, etc. The BioCreative II.5 Challenge Article Categorization task (ACT) involves doing a binary text classification to determine whether a given structured full-text article contains protein interaction information. This may be the first attempt at classification of full-text protein interaction documents in wide community. In this paper, we compare and evaluate the effectiveness of different section types in full-text articles for text classification. Moreover, in practice, the less number of true-positive samples results in unstable performance and unreliable classifier trained on it. Previous research on learning with skewed class distributions has altered the class distribution using oversampling and downsampling. We also investigate the skewed protein interaction classification and analyze the effect of various issues related to the choice of external sources, oversampling training sets, classifiers, etc. We report on the various factors above to show that 1) a full-text biomedical article contains a wealth of scientific information important to users that may not be completely represented by abstracts and/or keywords, which improves the accuracy performance of classification and 2) reinforcing true-positive samples significantly increases the accuracy and stability performance of classification.

  19. An Enhanced Text-Mining Framework for Extracting Disaster Relevant Data through Social Media and Remote Sensing Data Fusion

    Science.gov (United States)

    Scheele, C. J.; Huang, Q.

    2016-12-01

    In the past decade, the rise in social media has led to the development of a vast number of social media services and applications. Disaster management represents one of such applications leveraging massive data generated for event detection, response, and recovery. In order to find disaster relevant social media data, current approaches utilize natural language processing (NLP) methods based on keywords, or machine learning algorithms relying on text only. However, these approaches cannot be perfectly accurate due to the variability and uncertainty in language used on social media. To improve current methods, the enhanced text-mining framework is proposed to incorporate location information from social media and authoritative remote sensing datasets for detecting disaster relevant social media posts, which are determined by assessing the textual content using common text mining methods and how the post relates spatiotemporally to the disaster event. To assess the framework, geo-tagged Tweets were collected for three different spatial and temporal disaster events: hurricane, flood, and tornado. Remote sensing data and products for each event were then collected using RealEarthTM. Both Naive Bayes and Logistic Regression classifiers were used to compare the accuracy within the enhanced text-mining framework. Finally, the accuracies from the enhanced text-mining framework were compared to the current text-only methods for each of the case study disaster events. The results from this study address the need for more authoritative data when using social media in disaster management applications.

  20. On the Creation of Hypertext Links in Full-Text Documents: Measurement of Inter-Linker Consistency.

    Science.gov (United States)

    Ellis, David; And Others

    1994-01-01

    Describes a study in which several different sets of hypertext links are inserted by different people in full-text documents. The degree of similarity between the sets is measured using coefficients and topological indices. As in comparable studies of inter-indexer consistency, the sets of links used by different people showed little similarity.…

  1. Comparing data accuracy between structured abstracts and full-text journal articles: implications in their use for informing clinical decisions.

    Science.gov (United States)

    Fontelo, Paul; Gavino, Alex; Sarmiento, Raymond Francis

    2013-12-01

    The abstract is the most frequently read section of a research article. The use of 'Consensus Abstracts', a clinician-oriented web application formatted for mobile devices to search MEDLINE/PubMed, for informing clinical decisions was proposed recently; however, inaccuracies between abstracts and the full-text article have been shown. Efforts have been made to improve quality. We compared data in 60 recent-structured abstracts and full-text articles from six highly read medical journals. Data inaccuracies were identified and then classified as either clinically significant or not significant. Data inaccuracies were observed in 53.33% of articles ranging from 3.33% to 45% based on the IMRAD format sections. The Results section showed the highest discrepancies (45%) although these were deemed to be mostly not significant clinically except in one. The two most common discrepancies were mismatched numbers or percentages (11.67%) and numerical data or calculations found in structured abstracts but not mentioned in the full text (40%). There was no significant relationship between journals and the presence of discrepancies (Fisher's exact p value =0.3405). Although we found a high percentage of inaccuracy between structured abstracts and full-text articles, these were not significant clinically. The inaccuracies do not seem to affect the conclusion and interpretation overall. Structured abstracts appear to be informative and may be useful to practitioners as a resource for guiding clinical decisions.

  2. Desktop Access to Full-Text NACA and NASA Reports: Systems Developed by NASA Langley Technical Library

    Science.gov (United States)

    Ambur, Manjula Y.; Adams, David L.; Trinidad, P. Paul

    1997-01-01

    NASA Langley Technical Library has been involved in developing systems for full-text information delivery of NACA/NASA technical reports since 1991. This paper will describe the two prototypes it has developed and the present production system configuration. The prototype systems are a NACA CD-ROM of thirty-three classic paper NACA reports and a network-based Full-text Electronic Reports Documents System (FEDS) constructed from both paper and electronic formats of NACA and NASA reports. The production system is the DigiDoc System (DIGItal Documents) presently being developed based on the experiences gained from the two prototypes. DigiDoc configuration integrates the on-line catalog database World Wide Web interface and PDF technology to provide a powerful and flexible search and retrieval system. It describes in detail significant achievements and lessons learned in terms of data conversion, storage technologies, full-text searching and retrieval, and image databases. The conclusions from the experiences of digitization and full- text access and future plans for DigiDoc system implementation are discussed.

  3. The consistency between scientific papers presented at the Orthopaedic Trauma Association and their subsequent full-text publication.

    Science.gov (United States)

    Preston, Charles F; Bhandari, Mohit; Fulkerson, Eric; Ginat, Danial; Egol, Kenneth A; Koval, Kenneth J

    2006-02-01

    To determine the consistency of conclusions/statements made in podium presentations at the annual meeting of the Orthopaedic Trauma Association (OTA) with those in subsequent full-text publications. Also, to evaluate the nature and consistency of study design, methods, sample sizes, results and assign a corresponding level of evidence. Abstracts of the scientific programs of the OTA from 1994 to 1997 (N = 254) were queried by using the PubMed database to identify those studies resulting in a peer-reviewed, full-text publication. Of the 169 articles retrieved, 137 studies were the basis of our study after the exclusion criteria were applied: non-English language, basic science studies, anatomic dissection studies, and articles published in non-peer-reviewed journals. Information was abstracted onto a data form: first from the abstract published in the final meeting program, and then from the published journal article. Information was recorded regarding study issues, including the study design, primary objective, sample size, and statistical methods. We provided descriptive statistics about the frequency of consistent results between abstracts and full-text publications. The results were recorded as percentages and a 95% confidence interval was applied to each value. Study results were recorded for the abstract and full-text publication comparing results and the overall conclusion. A level of scientific-based evidence was assigned to each full-text publication. The final conclusion of the study remained the same 93.4% of the time. The method of study was an observational case series 52% of the time and a statement regarding the rate of patient follow-up was reported 42% of the time. Of the studies published, 18.2% consisted of a sample size smaller than the previously presented abstract. When the published papers had their level of evidence graded, 11% were level I, 16% level II, 17% level III, and 56% level IV. Authors conclusions were consistent with those in full-text

  4. Publication trends of shared decision making in 15 high impact medical journals: a full-text review with bibliometric analysis.

    Science.gov (United States)

    Blanc, Xavier; Collet, Tinh-Hai; Auer, Reto; Fischer, Roland; Locatelli, Isabella; Iriarte, Pablo; Krause, Jan; Légaré, France; Cornuz, Jacques

    2014-08-09

    Shared Decision Making (SDM) is increasingly advocated as a model for medical decision making. However, there is still low use of SDM in clinical practice. High impact factor journals might represent an efficient way for its dissemination. We aimed to identify and characterize publication trends of SDM in 15 high impact medical journals. We selected the 15 general and internal medicine journals with the highest impact factor publishing original articles, letters and editorials. We retrieved publications from 1996 to 2011 through the full-text search function on each journal website and abstracted bibliometric data. We included publications of any type containing the phrase "shared decision making" or five other variants in their abstract or full text. These were referred to as SDM publications. A polynomial Poisson regression model with logarithmic link function was used to assess the evolution across the period of the number of SDM publications according to publication characteristics. We identified 1285 SDM publications out of 229,179 publications in 15 journals from 1996 to 2011. The absolute number of SDM publications by journal ranged from 2 to 273 over 16 years. SDM publications increased both in absolute and relative numbers per year, from 46 (0.32% relative to all publications from the 15 journals) in 1996 to 165 (1.17%) in 2011. This growth was exponential (P Full-text search retrieved ten times more SDM publications than a similar PubMed search (1285 vs. 119 respectively). This review in full-text showed that SDM publications increased exponentially in major medical journals from 1996 to 2011. This growth might reflect an increased dissemination of the SDM concept to the medical community.

  5. How accessibility influences citation counts: The case of citations to the full text articles available from ResearchGate

    Directory of Open Access Journals (Sweden)

    Mohammad Sababi

    2017-08-01

    Full Text Available It is generally believed that the number of citations to an article can positively be correlated to its free online availability. In the present study, we investigated the possible impact of academic social networks on the number of citations. We chose the social web service “ResearchGate” as a case. This website acts both as a social network to connect researchers, and at the same time, as an open access repository to publish post-print version of the accepted manuscripts and final versions of open access articles. We collected the data of 1823 articles published by the authors from four different universities. By analyzing these data, we showed that although different levels of full text availability are observed for the four universities, there is always a significant positive correlation between full text availability and the citation count. Moreover, we showed that both post-print version and publisher’s version (i.e., final published version of the archived manuscripts receive more citations than non-OA articles, and the difference in the citation counts of post-print manuscripts and publisher’s version articles is nonsignificant.

  6. Conversion rates of abstracts presented at the Canadian Rheumatology Association Annual Meetings into full-text journal articles.

    Science.gov (United States)

    Yacyshyn, Elaine A; Soong, Laura C

    2017-06-01

    Dissemination of research studies is important for research ideas to be transformed from initial abstracts to full publications. Analyses of the scientific impact and publication record of the Canadian Rheumatology Association (CRA) Annual meeting have not been previously described. This study determines the publication rate of abstracts presented at the CRA Annual Meetings 2005-2013 to full-text journal articles and the factors associated with publication. Program records of previous CRA meetings from 2005 to 2013 were obtained. Abstracts were searched for corresponding full-text publication in Google Scholar and PubMed using a search algorithm. Abstracts and subsequent published articles were evaluated for type of abstract, time to publication, study type, publishing journal, and journal impact factor. A total of 1401 abstracts were included in the study, 567 of which were converted to full publications. The average time to publication was 19.7 months, with 89% of abstracts published within 3 years of being presented. Eighty-three percent of abstracts were clinical in nature, and 58% of published studies were observational in design. Articles were published in a wide range of journals, with the top publisher being the Journal of Rheumatology (31%). Average time to publication was 19.7 months. Eighty-six percent of articles had a Journal Impact Factor > 2. Overall, 40.5% of abstracts presented at the CRA Annual Meetings 2005-2013 were published. Further research is needed to determine barriers and reasons for abstracts not being published as full-text articles.

  7. [Full-text publication of abstracts presented at the 33th Argentinean pediatric meeting and non publication related factors].

    Science.gov (United States)

    Canosa, Daniela; Ferrero, Fernando; Melamud, Ariel; Otero, Paula D; Merech, Raúl S; Ceriani Cernadas, José M

    2011-02-01

    There is no information about non publication of research presented at scientific meetings in Argentina. We analyzed the full-text publication rate of abstracts presented at the 33° Argentinean Pediatric Congress (APC), time to achieve publication, and factors associated with publication or non-publication. Survey-based cross-sectional study, including authors of abstracts presented at the 33° APC. The survey included age, gender, specialty and sub-specialty, professional area and reason of publication or non-publication. We randomly selected 140/894 presented abstracts. Only 16 abstracts (11.4%) were subsequently published in full, requiring 27±15 months. There were no association between full-text publication and author's characteristics. "Oral presentations" were more likely to be subsequently published (p= 0.018). In non published abstracts, 95% were not submitted by the author, more frequently because of "lack of time" (35.9%). Only 11.4% of abstracts were subsequently published in full. Oral presentation was associated with a higher publication rate. Most frequent cause for non-publication was non submission due to lack of time.

  8. Identifying Scientific Project-generated Data Citation from Full-text Articles: An Investigation of TCGA Data Citation

    Directory of Open Access Journals (Sweden)

    Jiao Li

    2016-06-01

    Full Text Available Purpose: In the open science era, it is typical to share project-generated scientific data by depositing it in an open and accessible database. Moreover, scientific publications are preserved in a digital library archive. It is challenging to identify the data usage that is mentioned in literature and associate it with its source. Here, we investigated the data usage of a government-funded cancer genomics project, The Cancer Genome Atlas (TCGA, via a full-text literature analysis. Design/methodology/approach: We focused on identifying articles using the TCGA dataset and constructing linkages between the articles and the specific TCGA dataset. First, we collected 5,372 TCGA-related articles from PubMed Central (PMC. Second, we constructed a benchmark set with 25 full-text articles that truly used the TCGA data in their studies, and we summarized the key features of the benchmark set. Third, the key features were applied to the remaining PMC full-text articles that were collected from PMC. Findings: The amount of publications that use TCGA data has increased significantly since 2011, although the TCGA project was launched in 2005. Additionally, we found that the critical areas of focus in the studies that use the TCGA data were glioblastoma multiforme, lung cancer, and breast cancer; meanwhile, data from the RNA-sequencing (RNA-seq platform is the most preferable for use. Research limitations: The current workflow to identify articles that truly used TCGA data is labor-intensive. An automatic method is expected to improve the performance. Practical implications: This study will help cancer genomics researchers determine the latest advancements in cancer molecular therapy, and it will promote data sharing and data-intensive scientific discovery. Originality/value: Few studies have been conducted to investigate data usage by government-funded projects/programs since their launch. In this preliminary study, we extracted articles that use TCGA data

  9. Large-scale automatic extraction of side effects associated with targeted anticancer drugs from full-text oncological articles.

    Science.gov (United States)

    Xu, Rong; Wang, QuanQiu

    2015-06-01

    Targeted anticancer drugs such as imatinib, trastuzumab and erlotinib dramatically improved treatment outcomes in cancer patients, however, these innovative agents are often associated with unexpected side effects. The pathophysiological mechanisms underlying these side effects are not well understood. The availability of a comprehensive knowledge base of side effects associated with targeted anticancer drugs has the potential to illuminate complex pathways underlying toxicities induced by these innovative drugs. While side effect association knowledge for targeted drugs exists in multiple heterogeneous data sources, published full-text oncological articles represent an important source of pivotal, investigational, and even failed trials in a variety of patient populations. In this study, we present an automatic process to extract targeted anticancer drug-associated side effects (drug-SE pairs) from a large number of high profile full-text oncological articles. We downloaded 13,855 full-text articles from the Journal of Oncology (JCO) published between 1983 and 2013. We developed text classification, relationship extraction, signaling filtering, and signal prioritization algorithms to extract drug-SE pairs from downloaded articles. We extracted a total of 26,264 drug-SE pairs with an average precision of 0.405, a recall of 0.899, and an F1 score of 0.465. We show that side effect knowledge from JCO articles is largely complementary to that from the US Food and Drug Administration (FDA) drug labels. Through integrative correlation analysis, we show that targeted drug-associated side effects positively correlate with their gene targets and disease indications. In conclusion, this unique database that we built from a large number of high-profile oncological articles could facilitate the development of computational models to understand toxic effects associated with targeted anticancer drugs. Copyright © 2015 Elsevier Inc. All rights reserved.

  10. The Effect of Different Modes of English Captioning on EFL learners’ General Listening Comprehension: Full text Vs. Keyword Captions

    Directory of Open Access Journals (Sweden)

    Sorayya Behroozizad

    2015-08-01

    Full Text Available This study investigated the effect of different modes of English captioning on EFL learners’ general listening comprehension. To this end, forty five intermediate-level learners were selected based on their scores on a standardized English proficiency test (PET to carry out the study. Then, the selected participants were randomly assigned into two experimental groups (full-captions and keyword-captions and one control group (no-captions. Research instrumentation included a pre-test and a post-test following an experimental design. Participants took a pre-test and a post-test containing 50 multiple-choice questions (25question for pre-test and 25 question for post-test selected from a standard listening test PET, and also 15 treatment sessions. The findings showed significant differences among full-captions, keyword-captions, and no-captions in terms of their effect on learners’ general listening comprehension. This study provided some pedagogical implications for teaching listening through using different modes of captions. Keywords: Caption, full caption, keyword caption, listening comprehension

  11. Factors Affecting Subsequent Full-text Publication of Papers Presented at the Annual Conference of the Indian Academy of Pediatrics.

    Science.gov (United States)

    Khalil, Sumaira; Mishra, Devendra; Mishra, Ruchi; Gupta, Shalu

    2017-02-15

    To study the factors associated with the subsequent (over next 9 years) full-text publication of papers presented at the 44th National Conference of Indian Academy of Pediatrics (PEDICON), 2007. All papers presented at PEDICON 2007 were searched for subsequent full-text publication over the next 9 years in English-language journals by an internet-based search. The published papers were compared with the conference-abstracts. 74 (16%) of the 450 abstracts presented were subsequently published; 61 (82.4%) in Medline-indexed journals. Majority (50, 67.6%) of the papers was published within the first 36 mo in journals with mean (SD) impact factor of 2.62 (1.63). The factors significantly associated with subsequent publication were papers presented as award papers (Pfull-papers, 55% had a change in title; authors were changed in 65%, and participants' numbers were dissimilar in 8.6%. There is a need to identify the factors responsible for this low rate of subsequent publication, and interventions to improve it both at institutional and researchers' level.

  12. BioC-compatible full-text passage detection for protein-protein interactions using extended dependency graph.

    Science.gov (United States)

    Peng, Yifan; Arighi, Cecilia; Wu, Cathy H; Vijay-Shanker, K

    2016-01-01

    There has been a large growth in the number of biomedical publications that report experimental results. Many of these results concern detection of protein-protein interactions (PPI). In BioCreative V, we participated in the BioC task and developed a PPI system to detect text passages with PPIs in the full-text articles. By adopting the BioC format, the output of the system can be seamlessly added to the biocuration pipeline with little effort required for the system integration. A distinctive feature of our PPI system is that it utilizes extended dependency graph, an intermediate level of representation that attempts to abstract away syntactic variations in text. As a result, we are able to use only a limited set of rules to extract PPI pairs in the sentences, and additional rules to detect additional passages for PPI pairs. For evaluation, we used the 95 articles that were provided for the BioC annotation task. We retrieved the unique PPIs from the BioGRID database for these articles and show that our system achieves a recall of 83.5%. In order to evaluate the detection of passages with PPIs, we further annotated Abstract and Results sections of 20 documents from the dataset and show that an f-value of 80.5% was obtained. To evaluate the generalizability of the system, we also conducted experiments on AIMed, a well-known PPI corpus. We achieved an f-value of 76.1% for sentence detection and an f-value of 64.7% for unique PPI detection.Database URL: http://proteininformationresource.org/iprolink/corpora. © The Author(s) 2016. Published by Oxford University Press.

  13. The film’s the thing: film translation and its effect on a silent, edited and full text Hamlet The film’s the thing: film translation and its effect on a silent, edited and full text Hamlet

    Directory of Open Access Journals (Sweden)

    Janete R. Costa

    2008-04-01

    Full Text Available Translation is, at its best, a difficult path to tred, especially in a global, multicultural society. A word that defines an object may be in need of careful consideration and modification, not only to convey its individual meaning, but also to place it in the concept or intent when linked with others words forming a thought. The process is particularly complex when pairing a word with an image as is done in film. In the 1960’s, the American television classic, Star Trek, added new words as well as additional meaning to old words in the English lexicon. The definition of these words was clearly given in visual images that can still be recalled today. A typical exchange of dialogue may read: Captain, according to my tricorder, there is no intelligent life on this planet. Beam him up, Scotty. Energise. Translation is, at its best, a difficult path to tred, especially in a global, multicultural society. A word that defines an object may be in need of careful consideration and modification, not only to convey its individual meaning, but also to place it in the concept or intent when linked with others words forming a thought. The process is particularly complex when pairing a word with an image as is done in film. In the 1960’s, the American television classic, Star Trek, added new words as well as additional meaning to old words in the English lexicon. The definition of these words was clearly given in visual images that can still be recalled today. A typical exchange of dialogue may read: Captain, according to my tricorder, there is no intelligent life on this planet. Beam him up, Scotty. Energise.

  14. Construction of phosphorylation interaction networks by text mining of full-length articles using the eFIP system.

    Science.gov (United States)

    Tudor, Catalina O; Ross, Karen E; Li, Gang; Vijay-Shanker, K; Wu, Cathy H; Arighi, Cecilia N

    2015-01-01

    Protein phosphorylation is a reversible post-translational modification where a protein kinase adds a phosphate group to a protein, potentially regulating its function, localization and/or activity. Phosphorylation can affect protein-protein interactions (PPIs), abolishing interaction with previous binding partners or enabling new interactions. Extracting phosphorylation information coupled with PPI information from the scientific literature will facilitate the creation of phosphorylation interaction networks of kinases, substrates and interacting partners, toward knowledge discovery of functional outcomes of protein phosphorylation. Increasingly, PPI databases are interested in capturing the phosphorylation state of interacting partners. We have previously developed the eFIP (Extracting Functional Impact of Phosphorylation) text mining system, which identifies phosphorylated proteins and phosphorylation-dependent PPIs. In this work, we present several enhancements for the eFIP system: (i) text mining for full-length articles from the PubMed Central open-access collection; (ii) the integration of the RLIMS-P 2.0 system for the extraction of phosphorylation events with kinase, substrate and site information; (iii) the extension of the PPI module with new trigger words/phrases describing interactions and (iv) the addition of the iSimp tool for sentence simplification to aid in the matching of syntactic patterns. We enhance the website functionality to: (i) support searches based on protein roles (kinases, substrates, interacting partners) or using keywords; (ii) link protein entities to their corresponding UniProt identifiers if mapped and (iii) support visual exploration of phosphorylation interaction networks using Cytoscape. The evaluation of eFIP on full-length articles achieved 92.4% precision, 76.5% recall and 83.7% F-measure on 100 article sections. To demonstrate eFIP for knowledge extraction and discovery, we constructed phosphorylation-dependent interaction

  15. Publication rates of full-text journal articles converted from abstracts presented during the 22(nd) Turkish National Urology Congress.

    Science.gov (United States)

    Kocaaslan, Ramazan; Kayalı, Yunus; Tok, Adem; Tepeler, Abdulkadir

    2016-03-01

    To analyze the publication rates of full-text journal articles converted from the abstracts presented in the 22(nd) Turkish National Urology Congress in 2012. A total of 576 abstracts accepted for presentation at the 22(nd) Turkish National Urology Association Meeting were identified from the published abstract book. The abstracts were categorized into subsections such as endourology and pediatric urology. The subsequent publication rate for the studies was evaluated by scanning PubMed Medline. Abstracts published before the proceedings were excluded from the study. The abstracts were categorized as being presented orally (n=155), by poster (n=421), or by video (n=78). Of the 28 (18.3%) of 155 oral and 34 (8.15%) of 421 poster presentations, were subsequently published in several journals until March 2015. The publication rates of the abstracts based on urology subsections were as follows: neurology (25%), andrology (18.6%), endourology (17.2%), urolithiasis (15.3%), general urology (12.5%), infectious diseases (7.14%), pediatric urology (6.25%), uro-gynecology (6.06%), reconstructive urology (5.8%), and urooncology (3.8%). The average time to publication was 11.77 (0-33) months. This is the first study assessing the publication rates of abstracts presented at a Turkish National Urology Congress. It reveals that more qualified randomized studies need to be done to improve the rate of publication.

  16. Combining automatic table classification and relationship extraction in extracting anticancer drug-side effect pairs from full-text articles.

    Science.gov (United States)

    Xu, Rong; Wang, QuanQiu

    2015-02-01

    Anticancer drug-associated side effect knowledge often exists in multiple heterogeneous and complementary data sources. A comprehensive anticancer drug-side effect (drug-SE) relationship knowledge base is important for computation-based drug target discovery, drug toxicity predication and drug repositioning. In this study, we present a two-step approach by combining table classification and relationship extraction to extract drug-SE pairs from a large number of high-profile oncological full-text articles. The data consists of 31,255 tables downloaded from the Journal of Oncology (JCO). We first trained a statistical classifier to classify tables into SE-related and -unrelated categories. We then extracted drug-SE pairs from SE-related tables. We compared drug side effect knowledge extracted from JCO tables to that derived from FDA drug labels. Finally, we systematically analyzed relationships between anti-cancer drug-associated side effects and drug-associated gene targets, metabolism genes, and disease indications. The statistical table classifier is effective in classifying tables into SE-related and -unrelated (precision: 0.711; recall: 0.941; F1: 0.810). We extracted a total of 26,918 drug-SE pairs from SE-related tables with a precision of 0.605, a recall of 0.460, and a F1 of 0.520. Drug-SE pairs extracted from JCO tables is largely complementary to those derived from FDA drug labels; as many as 84.7% of the pairs extracted from JCO tables have not been included a side effect database constructed from FDA drug labels. Side effects associated with anticancer drugs positively correlate with drug target genes, drug metabolism genes, and disease indications. Copyright © 2014 Elsevier Inc. All rights reserved.

  17. Saddle Slow Manifolds and Canard Orbits in [Formula: see text] and Application to the Full Hodgkin-Huxley Model.

    Science.gov (United States)

    Hasan, Cris R; Krauskopf, Bernd; Osinga, Hinke M

    2018-04-19

    Many physiological phenomena have the property that some variables evolve much faster than others. For example, neuron models typically involve observable differences in time scales. The Hodgkin-Huxley model is well known for explaining the ionic mechanism that generates the action potential in the squid giant axon. Rubin and Wechselberger (Biol. Cybern. 97:5-32, 2007) nondimensionalized this model and obtained a singularly perturbed system with two fast, two slow variables, and an explicit time-scale ratio ε. The dynamics of this system are complex and feature periodic orbits with a series of action potentials separated by small-amplitude oscillations (SAOs); also referred to as mixed-mode oscillations (MMOs). The slow dynamics of this system are organized by two-dimensional locally invariant manifolds called slow manifolds which can be either attracting or of saddle type.In this paper, we introduce a general approach for computing two-dimensional saddle slow manifolds and their stable and unstable fast manifolds. We also develop a technique for detecting and continuing associated canard orbits, which arise from the interaction between attracting and saddle slow manifolds, and provide a mechanism for the organization of SAOs in [Formula: see text]. We first test our approach with an extended four-dimensional normal form of a folded node. Our results demonstrate that our computations give reliable approximations of slow manifolds and canard orbits of this model. Our computational approach is then utilized to investigate the role of saddle slow manifolds and associated canard orbits of the full Hodgkin-Huxley model in organizing MMOs and determining the firing rates of action potentials. For ε sufficiently large, canard orbits are arranged in pairs of twin canard orbits with the same number of SAOs. We illustrate how twin canard orbits partition the attracting slow manifold into a number of ribbons that play the role of sectors of rotations. The upshot is that we

  18. Functionally relevant microorganisms to enhanced biological phosphorus removal performance at full-scale wastewater treatment plants in the United States.

    Science.gov (United States)

    Gu, April Z; Saunders, A; Neethling, J B; Stensel, H D; Blackall, L L

    2008-08-01

    The abundance and relevance ofAccumulibacter phosphatis (presumed to be polyphosphate-accumulating organisms [PAOs]), Competibacter phosphatis (presumed to be glycogen-accumulating organisms [GAOs]), and tetrad-forming organisms (TFOs) to phosphorus removal performance at six full-scale enhanced biological phosphorus removal (EBPR) wastewater treatment plants were investigated. Coexistence of various levels of candidate PAOs and GAOs were found at these facilities. Accumulibacter were found to be 5 to 20% of the total bacterial population, and Competibacter were 0 to 20% of the total bacteria population. The TFO abundance varied from nondetectable to dominant. Anaerobic phosphorus (P) release to acetate uptake ratios (P(rel)/HAc(up)) obtained from bench tests were correlated positively with the abundance ratio of Accumulibacter/(Competibacter +TFOs) and negatively with the abundance of (Competibacter +TFOs) for all plants except one, suggesting the relevance of these candidate organisms to EBPR processes. However, effluent phosphorus concentration, amount of phosphorus removed, and process stability in an EBPR system were not directly related to high PAO abundance or mutually exclusive with a high GAO fraction. The plant that had the lowest average effluent phosphorus and highest stability rating had the lowest P(rel)/HAc(up) and the most TFOs. Evaluation of full-scale EBPR performance data indicated that low effluent phosphorus concentration and high process stability are positively correlated with the influent readily biodegradable chemical oxygen demand-to-phosphorus ratio. A system-level carbon-distribution-based conceptual model is proposed for capturing the dynamic competition between PAOs and GAOs and their effect on an EBPR process, and the results from this study seem to support the model hypothesis.

  19. The Protein-Protein Interaction tasks of BioCreative III: classification/ranking of articles and linking bio-ontology concepts to full text.

    Science.gov (United States)

    Krallinger, Martin; Vazquez, Miguel; Leitner, Florian; Salgado, David; Chatr-Aryamontri, Andrew; Winter, Andrew; Perfetto, Livia; Briganti, Leonardo; Licata, Luana; Iannuccelli, Marta; Castagnoli, Luisa; Cesareni, Gianni; Tyers, Mike; Schneider, Gerold; Rinaldi, Fabio; Leaman, Robert; Gonzalez, Graciela; Matos, Sergio; Kim, Sun; Wilbur, W John; Rocha, Luis; Shatkay, Hagit; Tendulkar, Ashish V; Agarwal, Shashank; Liu, Feifan; Wang, Xinglong; Rak, Rafal; Noto, Keith; Elkan, Charles; Lu, Zhiyong; Dogan, Rezarta Islamaj; Fontaine, Jean-Fred; Andrade-Navarro, Miguel A; Valencia, Alfonso

    2011-10-03

    Determining usefulness of biomedical text mining systems requires realistic task definition and data selection criteria without artificial constraints, measuring performance aspects that go beyond traditional metrics. The BioCreative III Protein-Protein Interaction (PPI) tasks were motivated by such considerations, trying to address aspects including how the end user would oversee the generated output, for instance by providing ranked results, textual evidence for human interpretation or measuring time savings by using automated systems. Detecting articles describing complex biological events like PPIs was addressed in the Article Classification Task (ACT), where participants were asked to implement tools for detecting PPI-describing abstracts. Therefore the BCIII-ACT corpus was provided, which includes a training, development and test set of over 12,000 PPI relevant and non-relevant PubMed abstracts labeled manually by domain experts and recording also the human classification times. The Interaction Method Task (IMT) went beyond abstracts and required mining for associations between more than 3,500 full text articles and interaction detection method ontology concepts that had been applied to detect the PPIs reported in them. A total of 11 teams participated in at least one of the two PPI tasks (10 in ACT and 8 in the IMT) and a total of 62 persons were involved either as participants or in preparing data sets/evaluating these tasks. Per task, each team was allowed to submit five runs offline and another five online via the BioCreative Meta-Server. From the 52 runs submitted for the ACT, the highest Matthew's Correlation Coefficient (MCC) score measured was 0.55 at an accuracy of 89% and the best AUC iP/R was 68%. Most ACT teams explored machine learning methods, some of them also used lexical resources like MeSH terms, PSI-MI concepts or particular lists of verbs and nouns, some integrated NER approaches. For the IMT, a total of 42 runs were evaluated by comparing

  20. The Protein-Protein Interaction tasks of BioCreative III: classification/ranking of articles and linking bio-ontology concepts to full text

    Science.gov (United States)

    2011-01-01

    Background Determining usefulness of biomedical text mining systems requires realistic task definition and data selection criteria without artificial constraints, measuring performance aspects that go beyond traditional metrics. The BioCreative III Protein-Protein Interaction (PPI) tasks were motivated by such considerations, trying to address aspects including how the end user would oversee the generated output, for instance by providing ranked results, textual evidence for human interpretation or measuring time savings by using automated systems. Detecting articles describing complex biological events like PPIs was addressed in the Article Classification Task (ACT), where participants were asked to implement tools for detecting PPI-describing abstracts. Therefore the BCIII-ACT corpus was provided, which includes a training, development and test set of over 12,000 PPI relevant and non-relevant PubMed abstracts labeled manually by domain experts and recording also the human classification times. The Interaction Method Task (IMT) went beyond abstracts and required mining for associations between more than 3,500 full text articles and interaction detection method ontology concepts that had been applied to detect the PPIs reported in them. Results A total of 11 teams participated in at least one of the two PPI tasks (10 in ACT and 8 in the IMT) and a total of 62 persons were involved either as participants or in preparing data sets/evaluating these tasks. Per task, each team was allowed to submit five runs offline and another five online via the BioCreative Meta-Server. From the 52 runs submitted for the ACT, the highest Matthew's Correlation Coefficient (MCC) score measured was 0.55 at an accuracy of 89% and the best AUC iP/R was 68%. Most ACT teams explored machine learning methods, some of them also used lexical resources like MeSH terms, PSI-MI concepts or particular lists of verbs and nouns, some integrated NER approaches. For the IMT, a total of 42 runs were

  1. Open-Source Tools for Enhancing Full-Text Searching of OPACs: Use of Koha, Greenstone and Fedora

    Science.gov (United States)

    Anuradha, K. T.; Sivakaminathan, R.; Kumar, P. Arun

    2011-01-01

    Purpose: There are many library automation packages available as open-source software, comprising two modules: staff-client module and online public access catalogue (OPAC). Although the OPAC of these library automation packages provides advanced features of searching and retrieval of bibliographic records, none of them facilitate full-text…

  2. Acoustic measurements of a full-scale rotor with four tip shapes. Volume 1: Text, appendices A and B

    Science.gov (United States)

    Mosher, M.

    1984-01-01

    A full-scale helicopter with four different blade-tip geometries was tested in the 40- by 80-foot wind tunnel at Ames Research Center. Performance, loads, and noise were measured. The four tip shapes tested were rectangular, tapered, swept, and swept-tapered. Noise measurements from that test are presented in the form of tables and plots. The noise data include measurements of the sound pressure level in dB, dBA, and tone-corrected PNdB, for all of the conditions tested. Detailed measurements, 1/3-octave spectra and time-histories for some selected data are included as well as plots of dBA as function of test condition. Some performance measurements are given to aid interpretation of the noise data.

  3. The Dayton Agenda: Full Text

    Science.gov (United States)

    Journal of Research on Christian Education, 2009

    2009-01-01

    In November 1997, 140 researchers, administrators, and others interested in the support of nonpublic schools gathered at the University of Dayton to develop a research agenda for American private education. What developed over the several hours of intense sessions was an agenda that has given direction to researchers well into the 21st century.…

  4. The Effect of Text Materials with Relevant Language, Illustrations and Content Upon the Reading Achievement and Reading Preference (Attitude) of Black Primary and Intermediate Inner-City Students.

    Science.gov (United States)

    Grant, Gloria W.

    The purpose of this study was to examine the effect of text materials with relevant language, illustrations, and content upon the reading achievement and reading preference (attitude) of black primary and intermediate grade inner-city students. The subjects for the study were 330 black students enrolled in three schools in a large urban area. A…

  5. A comparison of the accuracy of clinical decisions based on full-text articles and on journal abstracts alone: a study among residents in a tertiary care hospital.

    Science.gov (United States)

    Marcelo, Alvin; Gavino, Alex; Isip-Tan, Iris Thiele; Apostol-Nicodemus, Leilanie; Mesa-Gaerlan, Faith Joan; Firaza, Paul Nimrod; Faustorilla, John Francis; Callaghan, Fiona M; Fontelo, Paul

    2013-04-01

    Many clinicians depend solely on journal abstracts to guide clinical decisions. This study aims to determine if there are differences in the accuracy of responses to simulated cases between resident physicians provided with an abstract only and those with full-text articles. It also attempts to describe their information-seeking behaviour. Seventy-seven resident physicians from four specialty departments of a tertiary care hospital completed a paper-based questionnaire with clinical simulation cases, then randomly assigned to two intervention groups-access to abstracts-only and access to both abstracts and full-text. While having access to medical literature, they completed an online version of the same questionnaire. The average improvement across departments was not significantly different between the abstracts-only group and the full-text group (p=0.44), but when accounting for an interaction between intervention and department, the effect was significant (p=0.049) with improvement greater with full-text in the surgery department. Overall, the accuracy of responses was greater after the provision of either abstracts-only or full-text (pfull-text articles were more accurate than those guided by abstracts alone, but the results seem to be driven by a significant difference in one department.

  6. Progress in the Full-Text Publication Rate of Orthopaedic and Sport Physical Therapy Abstracts Presented at the American Physical Therapy Association's Combined Sections Meeting.

    Science.gov (United States)

    Warden, Stuart J; Fletcher, Jacquelyn M; Barker, Rick G; Guildenbecher, Elizabeth A; Gorkis, Colleen E; Thompson, William R

    2017-10-07

    Study Design Descriptive study. Background Professional meetings, such as the American Physical Therapy Association's (APTA's) Combined Sections Meeting (CSM), provide forums for sharing information. However, it was reported that only one-quarter of orthopaedic and sports physical therapy abstracts presented at the CSM between 2000 and 2004 went on to full-text publication. This low conversion rate raises a number of concerns regarding the full dissemination of work within the profession. Objectives The purpose of this study was to determine the full-text publication rate of work presented in abstract form at subsequent CSMs and investigate factors influencing the rate. Methods A systematic search was undertaken to locate full-text publications of orthopaedic and sports physical therapy abstracts presented at CSMs between 2005 and 2011. Eligible publications were published within 5 years following abstract presentation. The influences of year of abstract presentation, APTA section, presentation type, institution of origin, study design, and study significance were assessed. Results Over one-third (38.6%) of presented abstracts progressed to full-text publication. Odds of full-text publication increased if the abstract was presented as a platform presentation, originated from a doctorate-granting institution, reported findings of an experimental study, or reported a statistically significant finding. Conclusion The full-text publication rate for orthopaedic and sports physical therapy abstracts presented at recent CSMs has increased by over 50% compared to that reported for the preceding period. The rate is now in the range of that reported in comparable clinical disciplines, demonstrating important progress in the full dissemination of work within the profession. J Orthop Sports Phys Ther, Epub 7 Oct 2017. doi:10.2519/jospt.2018.7581.

  7. Salton and Buckley’s Landmark Research in Experimental Text Information Retrieval. A Review of: Salton, G., & Buckley, C. (1990. Improving retrieval performance by relevance feedback. Journal of the American Society for Information Science, 41(4, 288–297.

    Directory of Open Access Journals (Sweden)

    Christine F. Marton

    2011-01-01

    Full Text Available Objectives – To compare the performance of the vector space model and the probabilistic weighting model of relevance feedback for the overall purpose of determining the most useful relevance feedback procedures. The amount of improvement that can be obtained from searching several test document collections with only one feedback iteration of each relevance feedback model was measured.Design – The experimental design consisted of 72 different tests: 2 different relevance feedback methods, each with 6 permutations, on 6 test document collections of various sizes. A residual collection method was utilized to ascertain the “true advantage provided by the relevance feedback process.” (Salton & Buckley, 1990, p. 293Setting – Department of Computer Science at Cornell University.Subjects – Six test document collections.Methods – Relevance feedback is an effective technique for query modification that provides significant improvement in search performance. Relevance feedback entails both “term reweighting,” the modification of term weights based on term use in retrieved relevant and non-relevant documents, and “query expansion,” which is the addition of new terms from relevant documents retrieved (Harman, 1992. Salton and Buckley (1990 evaluated two established relevance feedback models based on the vector space model (a spatial model and the probabilistic model, respectively. Harman (1992 describes the two key differences between these competing models of relevance feedback.[The vector space model merges] document vectors and original query vectors. This automatically reweights query terms by adding the weights from the actual occurrence of those query terms in the relevant documents, and subtracting the weights of those terms occurring in the non-relevant documents. Queries are automatically expanded by adding all the terms not in the original query that are in the relevant documents and non-relevant documents. They are expanded

  8. Full text publication rates of research abstracts presented at the European Society of Endodontology (ESE) Congresses in the last 20 years.

    Science.gov (United States)

    Tzanetakis, G N; Tzimpoulas, N; Floratos, S; Agrafioti, A; Kontakiotis, E G; Shemesh, H

    2017-06-26

    To evaluate the full-text publication rates of scientific research abstracts presented at the European Society of Endodontology (ESE) Congresses held between 1993 and 2013 (a total of 11 occasions) and to determine factors associated with the manuscripts. An electronic database search was conducted from January 2015 to December 2016 to identify full text English written publications of the research abstracts presented at the last 11 ESE Biennial Congresses from 1993 to 2013. For each occasion, research abstract information were retrieved from the International Endodontic Journal (IEJ) through the official website of the ESE and the following parameters for each abstract presentation were recorded: Year of presentation, first author's affiliation, geographic origin, and type of study. Following full-text article identification, additional information was recorded such as: Year and journal of publication, elapsed time until full publication and number of authors per presentation and publication. A total of 1165 research abstracts were presented, of which 401 (34.4%) were finally published as full-length articles. Overall 235 articles (58.6%) were published either in the International Endodontic Journal (IEJ, 35.7%) or Journal of Endodontics (JOE, 22.9%). The mean time between abstract presentation and full-text publication was 18.95 months. Munich (2001) had the highest publication rate (44%) whereas Lisbon (2013) had the highest number of published articles (77). Turkey was the country with the highest number of published abstracts (56). However, the Netherlands was the country with the highest number of publications related to the number of presentations (21/26) (80.7%). Differences in authorship between presentation and full publication were found in 179 (44.6%) articles. A substantial number of research abstracts presented at ESE congresses were not published in peer reviewed journals. Authors prefer to publish their research papers in international journals with

  9. A corpus of full-text journal articles is a robust evaluation tool for revealing differences in performance of biomedical natural language processing tools.

    Science.gov (United States)

    Verspoor, Karin; Cohen, Kevin Bretonnel; Lanfranchi, Arrick; Warner, Colin; Johnson, Helen L; Roeder, Christophe; Choi, Jinho D; Funk, Christopher; Malenkiy, Yuriy; Eckert, Miriam; Xue, Nianwen; Baumgartner, William A; Bada, Michael; Palmer, Martha; Hunter, Lawrence E

    2012-08-17

    We introduce the linguistic annotation of a corpus of 97 full-text biomedical publications, known as the Colorado Richly Annotated Full Text (CRAFT) corpus. We further assess the performance of existing tools for performing sentence splitting, tokenization, syntactic parsing, and named entity recognition on this corpus. Many biomedical natural language processing systems demonstrated large differences between their previously published results and their performance on the CRAFT corpus when tested with the publicly available models or rule sets. Trainable systems differed widely with respect to their ability to build high-performing models based on this data. The finding that some systems were able to train high-performing models based on this corpus is additional evidence, beyond high inter-annotator agreement, that the quality of the CRAFT corpus is high. The overall poor performance of various systems indicates that considerable work needs to be done to enable natural language processing systems to work well when the input is full-text journal articles. The CRAFT corpus provides a valuable resource to the biomedical natural language processing community for evaluation and training of new models for biomedical full text publications.

  10. Fate of abstracts presented at a National Turkish Orthopedics and Traumatology Congress: publication rates and consistency of abstracts compared with their subsequent full-text publications.

    Science.gov (United States)

    Yalçınkaya, Merter; Bagatur, Erdem

    2013-01-01

    The aim of this study was to evaluate the publication rates of full-text articles after presentation of abstracts at a Turkish National Orthopaedics and Traumatology Congress, determine the time lag from the congress date to publication of full-text articles and assess the consistency between abstracts and the subsequent publications. All abstracts from the scientific program of the 20th Turkish National Orthopaedics and Traumatology Congress (2007) were identified and computerized PubMed searches were conducted to determine whether an abstract had been followed by publication of a full-text article and key features were compared to evaluate their consistency. The time lag to publication and the impact factors of the journals where the articles were published were noted. Of the 770 abstracts (264 oral, 506 poster presentations), 227 (29.5%) were followed by a full-text and 116 (44%) of the 264 oral and 111 (22%) of the 506 poster presentations were published. The mean time to publication was 14.9±16.075 (range: 33 to 55) months. Thirty-three (14.5%) were published prior to the presentation at the congress. The likelihood of publication decreased after the third year (26 of 227, 11.5%). A total of 182 (80.2%) articles showed inconsistencies with the abstract; 74 (32.6%) minor, 14 (6.2%) major, and 94 (41.4%) minor and major inconsistencies. The mean impact factor of the journals was 1.152±0.858. The vast majority of abstracts presented at this congress were not followed by publication of a full-text article. Additionally, frequent inconsistencies between the final published article and the original abstract indicated the inadequacy of quality of reporting in abstracts.

  11. Sustainable development relevant comparison of the greenhouse gas emissions from the full energy chains of different energy sources

    International Nuclear Information System (INIS)

    Van De Vate, J.F.

    1997-01-01

    It is emphasized that sustainable energy planning should account for the emissions of all greenhouse gases (GHGs) from the whole energy chain, hence accounting not only carbon dioxide as the greenhouse gas and not only for the emissions from the combustion of fossil fuels. Lowering greenhouse gas emissions from the worldwide energy use can be done most effectively by accounting in energy planning for the full-energy-chain (FENCH) emissions of all GHGs. Only energy sources with similar output can be compared. This study investigates electricity generating technologies, which are compared in terms their GHG emission factors to be expressed in CO 2 -equivalents per kW.h(e). Earlier IAEA expert meetings are reviewed. A general meeting made general recommendations about methods and input data bases for FENCH-GHG analysis. Two more recent meetings dealt with the energy chains of nuclear and hydropower. The site-specific character of the emission factors of these energy sources is discussed. Both electricity generators have emission factors in the range of 5-30 g CO 2 -equiv./kW.h(e), which is very low compared to the FENCH-GHG emission factors of fossil-fueled power generation and of most of the renewable power generators. (author)

  12. ScienceCentral: open access full-text archive of scientific journals based on Journal Article Tag Suite regardless of their languages.

    Science.gov (United States)

    Huh, Sun

    2013-01-01

    ScienceCentral, a free or open access, full-text archive of scientific journal literature at the Korean Federation of Science and Technology Societies, was under test in September 2013. Since it is a Journal Article Tag Suite-based full text database, extensible markup language files of all languages can be presented, according to Unicode Transformation Format 8-bit encoding. It is comparable to PubMed Central: however, there are two distinct differences. First, its scope comprises all science fields; second, it accepts all language journals. Launching ScienceCentral is the first step for free access or open access academic scientific journals of all languages to leap to the world, including scientific journals from Croatia.

  13. Discrepancies between Abstracts Presented at International Association for Dental Research Annual Sessions from 2004 to 2005 and Full-Text Publication.

    Science.gov (United States)

    Prasad, Soni; Lee, Damian J; Yuan, Judy Chia-Chun; Barao, Valentim A R; Shyamsunder, Nodesh; Sukotjo, Cortino

    2012-01-01

    Purpose. The purpose of this study was to evaluate the discrepancies between abstracts presented at the IADR meeting (2004-2005) and their full-text publication. Material and Methods. Abstracts from the Prosthodontic Section of IADR meeting were obtained. The following information was collected: abstract title, number of authors, study design, statistical analysis, outcome, and funding source. PubMed was used to identify the full-text publication of the abstracts. The discrepancies between the abstract and the full-text publication were examined, categorized as major and minor discrepancies, and quantified. The data were collected and analyzed using descriptive analysis. Frequency and percentage of major and minor discrepancies were calculated. Results. A total of 109 (95.6%) articles showed changes from their abstracts. Seventy-four (65.0%) and 105 (92.0%) publications had at least one major and one minor discrepancies, respectively. Minor discrepancies were more prevalent (92.0%) than major discrepancies (65.0%). The most common minor discrepancy was observed in the title (80.7%), and most common major discrepancies were seen in results (48.2%). Conclusion. Minor discrepancies were more prevalent than major discrepancies. The data presented in this study may be useful to establish a more comprehensive structured abstract requirement for future meetings.

  14. Analysis of full-text publication and publishing predictors of abstracts presented at an Italian public health meeting (2005-2007).

    Science.gov (United States)

    Castaldi, S; Giacometti, M; Toigo, W; Bert, F; Siliquini, R

    2015-09-29

    In Public Health, a thorough review of abstract quality evaluations and the publication history of studies presented at scientific meetings has never been conducted. To analyse the long-term outcome of quality abstracts submitted to conferences of Italian Society of Hygiene and Public Health (SItI) from 2005 to 2007, we conducted a second analysis of previously published material aiming to estimate full-text publication rate of high quality abstract presented at Italian public health meetings, and to identify predictors of full-text publication. The search was undertaken through scientific databases and search engines and through the web sites of the major Italian journals of Public Health. For each publication confirmed as a full text paper, the journal name, impact factor, year of publication, gender of the first author, type of study design, characteristics of the results and sample size were collected. The overall publication rate of the abstracts presented is 23.5%; most of the papers were published in Public Health journals (average impact factor: 3.007). Non universitary affiliation had resulted in a lower probability of publication, while some of the Conference topics had predisposed the studies to an increased likelihood of publication as well as poster form presentation. The method presented in this study provides a good framework for the evaluation of the scientific evidence. The findings achieved should be taken into consideration by the Scientific Societies during the contributions selection phase, with the aim of achieving a continuous improvement of work quality. In the future, it would be interesting to survey the abstract authors to identify reasons for unpublished data.

  15. An Observational Study of Abstracts Presented at the American College of Veterinary Surgeon Annual Meetings (2001-2008) and Their Subsequent Full-Text Publication.

    Science.gov (United States)

    Meyers, Katherine E; Lindem, Margaret J; Giuffrida, Michelle A

    2016-07-01

    To determine the frequency of abstracts presented at American College of Veterinary Surgeons (ACVS) meetings from 2001 to 2008 that were published as complete articles, to identify abstract characteristics associated with final full-text publication, and to examine consistency of information between abstracts and final full-text publications. Observational bibliographic study. Abstracts were retrieved from published proceedings. Published articles were retrieved from bibliographic databases. Features of abstract and article authorship, design, and content were recorded. Regression analysis identified abstract features associated with article publication, and evaluated consistency between abstracts and final publications. Seven hundred eighty-two of 1078 (73%) abstracts were published as complete articles. Median time to publication was 1 year; 90% were published within 3 years. Abstracts originating from academic institutions were published more often than abstracts from practice or industry sites (odds ratio 2.61, 95% confidence interval 1.68-4.05). Compared to their conference abstracts, 49% of articles contained major inconsistences including changes in study design, interventions, outcomes, sample size, and results. For each year elapsed between presentation and publication, the odds of major inconsistency increased 2.4 times (odds ratio 2.36, 95% confidence interval 1.57-3.55) for retrospective studies and 1.4 times (odds ratio 1.35, 95% confidence interval 1.17-1.56) for other study designs. Changes in study title and authorship were frequent, particularly in publications that contained major inconsistencies. ACVS abstracts were promptly and reliably published, but final full-text publications often differed substantially from the original abstracts. © Copyright 2016 by The American College of Veterinary Surgeons.

  16. Print versus a culturally-relevant Facebook and text message delivered intervention to promote physical activity in African American women: a randomized pilot trial.

    Science.gov (United States)

    Joseph, Rodney P; Keller, Colleen; Adams, Marc A; Ainsworth, Barbara E

    2015-03-27

    African American women report insufficient physical activity and are disproportionally burdened by associated disease conditions; indicating the need for innovative approaches to promote physical activity in this underserved population. Social media platforms (i.e. Facebook) and text messaging represent potential mediums to promote physical activity. This paper reports the results of a randomized pilot trial evaluating a theory-based (Social Cognitive Theory) multi-component intervention using Facebook and text-messages to promote physical activity among African American women. Participants (N = 29) were randomly assigned to receive one of two multi-component physical activity interventions over 8 weeks: a culturally-relevant, Social Cognitive Theory-based, intervention delivered by Facebook and text message (FI) (n = 14), or a non-culturally tailored print-based intervention (PI) (n = 15) consisting of promotion brochures mailed to their home. The primary outcome of physical activity was assessed by ActiGraph GT3X+ accelerometers. Secondary outcomes included self-reported physical activity, physical activity-related psychosocial variables, and participant satisfaction. All randomized participants (N = 29) completed the study. Accelerometer measured physical activity showed that FI participants decreased sedentary time (FI = -74 minutes/week vs. PI = +118 minute/week) and increased light intensity (FI = +95 minutes/week vs. PI = +59 minutes/week) and moderate-lifestyle intensity physical activity (FI = + 27 minutes/week vs. PI = -34 minutes/week) in comparison to PI participants (all P's  .05). Results of secondary outcomes showed that in comparison to the PI, FI participants self-reported greater increases in moderate-to-vigorous physical activity (FI = +62 minutes/week vs. PI = +6 minutes/week; P = .015) and had greater enhancements in self-regulation for physical activity (P program to a friend

  17. http://www.isarder.org/isardercom/2013vol5issue1/vol5_issue1_article06full_text.PDF

    Directory of Open Access Journals (Sweden)

    Suat TEKER

    2013-03-01

    Full Text Available This study has pointed out that a new version of Build-Operate-Transfer (BOT financing model generated out from classical BOT model can be used for highway financing. The classical BOT, one of the most popular PPP models has been oftenly employed by various countries for financing of large scale public projects. Over the last 20 year period a number of infrastructure projects in Turkey such as natural gas plants, airports and hydro electric power plants were constructed by using BOT model. In this study, the new version of BOT model is implemented on the projected Ankara-İzmir Highway Project. This highway project can be constructed at a lower project cost by using the suggested BOT model compared to the classical BOT model. Therefore, the lower project cost results in a lower toll rate.

  18. Text-mining as a methodology to assess eating disorder-relevant factors: Comparing mentions of fitness tracking technology across online communities.

    Science.gov (United States)

    McCaig, Duncan; Bhatia, Sudeep; Elliott, Mark T; Walasek, Lukasz; Meyer, Caroline

    2018-05-07

    Text-mining offers a technique to identify and extract information from a large corpus of textual data. As an example, this study presents the application of text-mining to assess and compare interest in fitness tracking technology across eating disorder and health-related online communities. A list of fitness tracking technology terms was developed, and communities (i.e., 'subreddits') on a large online discussion platform (Reddit) were compared regarding the frequency with which these terms occurred. The corpus used in this study comprised all comments posted between May 2015 and January 2018 (inclusive) on six subreddits-three eating disorder-related, and three relating to either fitness, weight-management, or nutrition. All comments relating to the same 'thread' (i.e., conversation) were concatenated, and formed the cases used in this study (N = 377,276). Within the eating disorder-related subreddits, the findings indicated that a 'pro-eating disorder' subreddit, which is less recovery focused than the other eating disorder subreddits, had the highest frequency of fitness tracker terms. Across all subreddits, the weight-management subreddit had the highest frequency of the fitness tracker terms' occurrence, and MyFitnessPal was the most frequently mentioned fitness tracker. The technique exemplified here can potentially be used to assess group differences to identify at-risk populations, generate and explore clinically relevant research questions in populations who are difficult to recruit, and scope an area for which there is little extant literature. The technique also facilitates methodological triangulation of research findings obtained through more 'traditional' techniques, such as surveys or interviews. © 2018 Wiley Periodicals, Inc.

  19. Conversion rates of abstracts presented at the Urological Society of Australia and New Zealand (USANZ) Annual Scientific Meeting into full-text journal articles.

    Science.gov (United States)

    Yoon, Peter D; Chalasani, Venu; Woo, Henry H

    2012-08-01

    What's known on the subject? and What does the study add? It is well known that the transition of a presented abstract in a scientific meeting to a journal article improves the quality of the meeting and prevents an abstract being incorporated into meta-analyses or practice guidelines without proper appraisal. This is the first analysis of USANZ Annual Scientific Meeting abstracts' conversion to full publication. With relatively low publication rates compared to other international meetings, this review identifies the need for mechanisms to encourage USANZ researchers to convert their abstracts into published articles. The numbers and characteristics of the abstracts presented at the Annual Scientific Meetings (ASM) of the Urological Society of Australia and New Zealand (USANZ) that are converted to peer-reviewed publications have not previously been analysed and published. We undertook a review of all abstracts presented at the USANZ ASM from 2005 to 2009. A PubMed search was performed between 15 June and 15 July 2012, using a search algorithm to identify the full-text publications of the presented abstracts. Correlation between abstract characteristics and publication rate was then examined to distinguish the predictors for publications. Of 614 abstracts that were presented at USANZ ASM between 2005 and 2009, 183 papers were published, giving a publication rate of 29.80%. The papers were predominantly published in urological journals and were more likely to be published if they were presented by an international author or were retrospective studies or if basic science research. The mean (SD) time to publication was 14.46 (13.89) months and the mean Impact Factor of journals where papers were published was 2.90. The overall publication rate was relatively low compared with other urological meetings held in America and Europe. USANZ has a challenge of encouraging higher-quality research from the authors to further enhance its publication rate and consequently the

  20. Multi-stage gene normalization for full-text articles with context-based species filtering for dynamic dictionary entry selection.

    Science.gov (United States)

    Tsai, Richard Tzong-Han; Lai, Po-Ting

    2011-10-03

    Gene normalization (GN) is the task of identifying the unique database IDs of genes and proteins in literature. The best-known public competition of GN systems is the GN task of the BioCreative challenge, which has been held four times since 2003. The last two BioCreatives, II.5 & III, had two significant differences from earlier tasks: firstly, they provided full-length articles in addition to abstracts; and secondly, they included multiple species without providing species ID information. Full papers introduce more complex targets for GN processing, while the inclusion of multiple species vastly increases the potential size of dictionaries needed for GN. BioCreative III GN uses Threshold Average Precision at a median of k errors per query (TAP-k), a new measure closely related to the well-known average precision, but also reflecting the reliability of the score provided by each GN system. To use full-paper text, we employed a multi-stage GN algorithm and a ranking method which exploit information in different sections and parts of a paper. To handle the inclusion of multiple unknown species, we developed two context-based dynamic strategies to select dictionary entries related to the species that appear in the paper-section-wide and article-wide context. Our originally submitted BioCreative III system uses a static dictionary containing only the most common species entries. It already exceeds the BioCreative III average team performance by at least 24% in every evaluation. However, using our proposed dynamic dictionary strategies, we were able to further improve TAP-5, TAP-10, and TAP-20 by 16.47%, 13.57% and 6.01%, respectively in the Gold 50 test set. Our best dynamic strategy outperforms the best BioCreative III systems in TAP-10 on the Silver 50 test set and in TAP-5 on the Silver 507 set. Our experimental results demonstrate the superiority of our proposed dynamic dictionary selection strategies over our original static strategy and most BioCreative III

  1. Antecedent thermal injury worsens split-thickness skin graft quality: A clinically relevant porcine model of full-thickness burn, excision and grafting.

    Science.gov (United States)

    Carlsson, Anders H; Rose, Lloyd F; Fletcher, John L; Wu, Jesse C; Leung, Kai P; Chan, Rodney K

    2017-02-01

    Current standard of care for full-thickness burn is excision followed by autologous split-thickness skin graft placement. Skin grafts are also frequently used to cover surgical wounds not amenable to linear closure. While all grafts have potential to contract, clinical observation suggests that antecedent thermal injury worsens contraction and impairs functional and aesthetic outcomes. This study evaluates the impact of antecedent full-thickness burn on split-thickness skin graft scar outcomes and the potential mediating factors. Full-thickness contact burns (100°C, 30s) were created on the backs of anesthetized female Yorkshire Pigs. After seven days, burn eschar was tangentially excised and covered with 12/1000th inch (300μm) split-thickness skin graft. For comparison, unburned wounds were created by sharp excision to fat before graft application. From 7 to 120days post-grafting, planimetric measurements, digital imaging and biopsies for histology, immunohistochemistry and gene expression were obtained. At 120days post-grafting, the Observer Scar Assessment Scale, colorimetry, contour analysis and optical graft height assessments were performed. Twenty-nine porcine wounds were analyzed. All measured metrics of clinical skin quality were significantly worse (pskin graft quality, likely by multiple mechanisms including burn-related inflammation, microscopically inadequate excision, and dysregulation of tissue remodeling. A valid, reliable, clinically relevant model of full-thickness burn, excision and skin replacement therapy has been demonstrated. Future research to enhance quality of skin replacement therapies should be directed toward modulation of inflammation and assessments for complete excision. Copyright © 2016 Elsevier Ltd and ISBI. All rights reserved.

  2. Systematic review finds that study data not published in full text articles have unclear impact on meta-analyses results in medical research.

    Science.gov (United States)

    Schmucker, Christine M; Blümle, Anette; Schell, Lisa K; Schwarzer, Guido; Oeller, Patrick; Cabrera, Laura; von Elm, Erik; Briel, Matthias; Meerpohl, Joerg J

    2017-01-01

    A meta-analysis as part of a systematic review aims to provide a thorough, comprehensive and unbiased statistical summary of data from the literature. However, relevant study results could be missing from a meta-analysis because of selective publication and inadequate dissemination. If missing outcome data differ systematically from published ones, a meta-analysis will be biased with an inaccurate assessment of the intervention effect. As part of the EU-funded OPEN project (www.open-project.eu) we conducted a systematic review that assessed whether the inclusion of data that were not published at all and/or published only in the grey literature influences pooled effect estimates in meta-analyses and leads to different interpretation. Systematic review of published literature (methodological research projects). Four bibliographic databases were searched up to February 2016 without restriction of publication year or language. Methodological research projects were considered eligible for inclusion if they reviewed a cohort of meta-analyses which (i) compared pooled effect estimates of meta-analyses of health care interventions according to publication status of data or (ii) examined whether the inclusion of unpublished or grey literature data impacts the result of a meta-analysis. Seven methodological research projects including 187 meta-analyses comparing pooled treatment effect estimates according to different publication status were identified. Two research projects showed that published data showed larger pooled treatment effects in favour of the intervention than unpublished or grey literature data (Ratio of ORs 1.15, 95% CI 1.04-1.28 and 1.34, 95% CI 1.09-1.66). In the remaining research projects pooled effect estimates and/or overall findings were not significantly changed by the inclusion of unpublished and/or grey literature data. The precision of the pooled estimate was increased with narrower 95% confidence interval. Although we may anticipate that

  3. Healthy full-term infants' brain responses to emotionally and linguistically relevant sounds using a multi-feature mismatch negativity (MMN) paradigm.

    Science.gov (United States)

    Kostilainen, Kaisamari; Wikström, Valtteri; Pakarinen, Satu; Videman, Mari; Karlsson, Linnea; Keskinen, Maria; Scheinin, Noora M; Karlsson, Hasse; Huotilainen, Minna

    2018-03-23

    We evaluated the feasibility of a multi-feature mismatch negativity (MMN) paradigm in studying auditory processing of healthy newborns. The aim was to examine the automatic change-detection and processing of semantic and emotional information in speech in newborns. Brain responses of 202 healthy newborns were recorded with a multi-feature paradigm including a Finnish bi-syllabic pseudo-word/ta-ta/as a standard stimulus, six linguistically relevant deviant stimuli and three emotionally relevant stimuli (happy, sad, angry). Clear responses to emotional sounds were found already at the early latency window 100-200 ms, whereas responses to linguistically relevant minor changes and emotional stimuli at the later latency window 300-500 ms did not reach significance. Moreover, significant interaction between gender and emotional stimuli was found in the early latency window. Further studies on using multi-feature paradigms with linguistic and emotional stimuli in newborns are needed, especially those containing of follow-ups, enabling the assessment of the predictive value of early variations between subjects. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.

  4. User perspectives on relevance criteria

    DEFF Research Database (Denmark)

    Maglaughlin, Kelly L.; Sonnenwald, Diane H.

    2002-01-01

    , partially relevant, or not relevant to their information need; and explained their decisions in an interview. Analysis revealed 29 criteria, discussed positively and negatively, that were used by the participants when selecting passages that contributed or detracted from a document's relevance......This study investigates the use of criteria to assess relevant, partially relevant, and not-relevant documents. Study participants identified passages within 20 document representations that they used to make relevance judgments; judged each document representation as a whole to be relevant...... matter, thought catalyst), full text (e.g., audience, novelty, type, possible content, utility), journal/publisher (e.g., novelty, main focus, perceived quality), and personal (e.g., competition, time requirements). Results further indicate that multiple criteria are used when making relevant, partially...

  5. The Development of Relevance in Information Retrieval

    Directory of Open Access Journals (Sweden)

    Mu-hsuan Huang

    1997-12-01

    Full Text Available This article attempts to investigate the notion of relevance in information retrieval. It discusses various definitions for relevance from historical viewpoints and the characteristics of relevance judgments. Also, it introduces empirical results of important related researches.[Article content in Chinese

  6. Text Mining.

    Science.gov (United States)

    Trybula, Walter J.

    1999-01-01

    Reviews the state of research in text mining, focusing on newer developments. The intent is to describe the disparate investigations currently included under the term text mining and provide a cohesive structure for these efforts. A summary of research identifies key organizations responsible for pushing the development of text mining. A section…

  7. A Customizable Text Classifier for Text Mining

    Directory of Open Access Journals (Sweden)

    Yun-liang Zhang

    2007-12-01

    Full Text Available Text mining deals with complex and unstructured texts. Usually a particular collection of texts that is specified to one or more domains is necessary. We have developed a customizable text classifier for users to mine the collection automatically. It derives from the sentence category of the HNC theory and corresponding techniques. It can start with a few texts, and it can adjust automatically or be adjusted by user. The user can also control the number of domains chosen and decide the standard with which to choose the texts based on demand and abundance of materials. The performance of the classifier varies with the user's choice.

  8. Contrast-induced nephropathy in patients with diabetes mellitus between iso- and low-osmolar contrast media: A meta-analysis of full-text prospective, randomized controlled trials.

    Science.gov (United States)

    Han, Xiao-Fang; Zhang, Xin-Xiu; Liu, Ke-Mei; Tan, Hua; Zhang, Qiu

    2018-01-01

    This study was conducted to compare iso-osmolar contrast medium, iodixanol, with low-osmolar contrast media (LOCM) for assessing contrast-induced nephropathy (CIN) incidence, exclusively in the diabetic population. A systematic search was conducted for full-text, prospective, randomized controlled trials (RCTs). The primary outcome was incidence of CIN. Medline, Cochrane Central Register of Controlled Trials, and other sources were searched until May 31, 2017. Twelve RCTs finally met the search criteria. Iodixanol did not significantly reduce the risk of CIN (risk ratio [RR]: 0.72, 95% confidence interval (CI): [0.49, 1.04], p = 0.08). However, there was significantly reduced risk of CIN when iodixanol was compared to a LOCM agent iohexol (RR: 0.32, 95% CI [0.12, 0.89]). There were no differences between iodixanol and the other non-iohexol LOCM (RR: 0.92, 95% CI [0.68, 1.25]). In diabetic populations, iodixanol is not associated with a significant reduction of CIN risk. Iodixanol is associated with a reduced risk of CIN compared with iohexol, whereas no significant difference between iodixanol and other LOCM could be found.

  9. Profiles of Dialogue for Relevance

    Directory of Open Access Journals (Sweden)

    Douglas Walton

    2016-12-01

    Full Text Available This paper uses argument diagrams, argumentation schemes, and some tools from formal argumentation systems developed in artificial intelligence to build a graph-theoretic model of relevance shown to be applicable (with some extensions as a practical method for helping a third party judge issues of relevance or irrelevance of an argument in real examples. Examples used to illustrate how the method works are drawn from disputes about relevance in natural language discourse, including a criminal trial and a parliamentary debate.

  10. Clinical Relevance of Adipokines

    Directory of Open Access Journals (Sweden)

    Matthias Blüher

    2012-10-01

    Full Text Available The incidence of obesity has increased dramatically during recent decades. Obesity increases the risk for metabolic and cardiovascular diseases and may therefore contribute to premature death. With increasing fat mass, secretion of adipose tissue derived bioactive molecules (adipokines changes towards a pro-inflammatory, diabetogenic and atherogenic pattern. Adipokines are involved in the regulation of appetite and satiety, energy expenditure, activity, endothelial function, hemostasis, blood pressure, insulin sensitivity, energy metabolism in insulin sensitive tissues, adipogenesis, fat distribution and insulin secretion in pancreatic β-cells. Therefore, adipokines are clinically relevant as biomarkers for fat distribution, adipose tissue function, liver fat content, insulin sensitivity, chronic inflammation and have the potential for future pharmacological treatment strategies for obesity and its related diseases. This review focuses on the clinical relevance of selected adipokines as markers or predictors of obesity related diseases and as potential therapeutic tools or targets in metabolic and cardiovascular diseases.

  11. Instant Sublime Text starter

    CERN Document Server

    Haughee, Eric

    2013-01-01

    A starter which teaches the basic tasks to be performed with Sublime Text with the necessary practical examples and screenshots. This book requires only basic knowledge of the Internet and basic familiarity with any one of the three major operating systems, Windows, Linux, or Mac OS X. However, as Sublime Text 2 is primarily a text editor for writing software, many of the topics discussed will be specifically relevant to software development. That being said, the Sublime Text 2 Starter is also suitable for someone without a programming background who may be looking to learn one of the tools of

  12. Conditions governing the acceptance of radioactive wastes by the Hauptabteilung Dekontaminationsbetriebe (HDB). Full text of legal provisions, issue no.6 of July 1, 1991, as amended until January 1, 1995

    International Nuclear Information System (INIS)

    1995-01-01

    The conditions apply to the acceptance of radwaste by the Main Decontamination Dept. (HDB) of Karlsruhe Research Center, including radioactive remnants, contaminated plant components, and primary waste from the following waste generators: Institutes of the Karlsruhe Research Center, facilities located within the Center but run by other organisations, other outside facilities not linked with the Center, as e.g. waste generators in Baden-Wuerttemberg obliged to deliver their radwaste to the Radwaste Collecting Site of the Land of Baden-Wuerttemberg. Amendments are marked at the right-hand margin of the text

  13. Relevance: An Interdisciplinary and Information Science Perspective

    Directory of Open Access Journals (Sweden)

    Howard Greisdorf

    2000-01-01

    Full Text Available Although relevance has represented a key concept in the field of information science for evaluating information retrieval effectiveness, the broader context established by interdisciplinary frameworks could provide greater depth and breadth to on-going research in the field. This work provides an overview of the nature of relevance in the field of information science with a cursory view of how cross-disciplinary approaches to relevance could represent avenues for further investigation into the evaluative characteristics of relevance as a means for enhanced understanding of human information behavior.

  14. Abstracts to be Delivered at the 2014 Annual Conference of the Association of Medical Microbiology and Infectious Disease Canada, April 3 to 5, Victoria, British Columbia, Alphabetized According to the Surname of the First Author. Full-text Abstracts Can be Accessed at www.pulsus.com

    Directory of Open Access Journals (Sweden)

    2014-01-01

    Full Text Available This document presents the titles of the abstracts to be presented at the 2014 Annual Conference of the Association of Medical Microbiology and Infectious Disease Canada (April 3 to 5, Victoria, British Columbia. The full-text abstracts are available online.

  15. Context and Structure in Automated Full-Text Information Access

    Science.gov (United States)

    1994-04-29

    Meisei, Makayo, Nitsuko and Tamura, all of Japan; Goldstar, Samsung and OPC of South Korea, and Sun Moon Star of Taiwan; AT&T says the practices have...IN MALAYSIA [ ... ] Another example topic description is shown below: Topic 034 <dom> Domain: Science and Technology <title>Topic: Entities Involved In

  16. HighWire Free Online Full-text Articles

    Science.gov (United States)

    Journal of Lipid Research all articles after 12 months Journal of Medical Ethics all articles 1 Jan 1975 Anticancer Research all articles after 2 years every Jan. Antimicrobial Agents and Chemotherapy all articles BMJ Open Diabetes Research & Care free site BMJ Open Gastroenterology free site BMJ Open

  17. Relevant closure: a new form of defeasible reasoning for description logics

    CSIR Research Space (South Africa)

    Casini, G

    2014-09-01

    Full Text Available Relevant Closure and Minimal Relevant Closure. As the names suggest, both rely on defining a version of relevance. Our formalisation of relevance in this context is based on the notion of a justification (a minimal subset of sentences implying a given...

  18. Plagiarism in Academic Texts

    Directory of Open Access Journals (Sweden)

    Marta Eugenia Rojas-Porras

    2012-08-01

    Full Text Available The ethical and social responsibility of citing the sources in a scientific or artistic work is undeniable. This paper explores, in a preliminary way, academic plagiarism in its various forms. It includes findings based on a forensic analysis. The purpose of this paper is to raise awareness on the importance of considering these details when writing and publishing a text. Hopefully, this analysis may put the issue under discussion.

  19. Developmental Readiness of Normal Full Term Infants To Progress from Exclusive Breastfeeding to the Introduction of Complementary Foods: Reviews of the Relevant Literature Concerning Infant Immunologic, Gastrointestinal, Oral Motor and Maternal Reproductive and Lactational Development.

    Science.gov (United States)

    Naylor, Audrey J., Ed.; Morrow, Ardythe L., Ed.

    This review of the developmental readiness of normal, full-term infants to progress from exclusive breastfeeding to the introduction of complementary foods is the result of the international debate regarding the best age to introduce complementary foods into the diet of the breastfed human infant. After a list of definitions, four papers focus on:…

  20. Relevance of nanotechnology to Africa: synthesis, applications and safety

    CSIR Research Space (South Africa)

    Musee, N

    2012-07-01

    Full Text Available In this chapter, two nanotechnology-based applications relevant to Africa in promoting sustainability and achievement of the Millennium Development Goals (MDGs) are presented. The applications comprise the provision of therapeutic treatment...

  1. Vocabulary Constraint on Texts

    Directory of Open Access Journals (Sweden)

    C. Sutarsyah

    2008-01-01

    Full Text Available This case study was carried out in the English Education Department of State University of Malang. The aim of the study was to identify and describe the vocabulary in the reading text and to seek if the text is useful for reading skill development. A descriptive qualitative design was applied to obtain the data. For this purpose, some available computer programs were used to find the description of vocabulary in the texts. It was found that the 20 texts containing 7,945 words are dominated by low frequency words which account for 16.97% of the words in the texts. The high frequency words occurring in the texts were dominated by function words. In the case of word levels, it was found that the texts have very limited number of words from GSL (General Service List of English Words (West, 1953. The proportion of the first 1,000 words of GSL only accounts for 44.6%. The data also show that the texts contain too large proportion of words which are not in the three levels (the first 2,000 and UWL. These words account for 26.44% of the running words in the texts.  It is believed that the constraints are due to the selection of the texts which are made of a series of short-unrelated texts. This kind of text is subject to the accumulation of low frequency words especially those of content words and limited of words from GSL. It could also defeat the development of students' reading skills and vocabulary enrichment.

  2. Evolutionary Relevance Facilitates Visual Information Processing

    Directory of Open Access Journals (Sweden)

    Russell E. Jackson

    2013-07-01

    Full Text Available Visual search of the environment is a fundamental human behavior that perceptual load affects powerfully. Previously investigated means for overcoming the inhibitions of high perceptual load, however, generalize poorly to real-world human behavior. We hypothesized that humans would process evolutionarily relevant stimuli more efficiently than evolutionarily novel stimuli, and evolutionary relevance would mitigate the repercussions of high perceptual load during visual search. Animacy is a significant component to evolutionary relevance of visual stimuli because perceiving animate entities is time-sensitive in ways that pose significant evolutionary consequences. Participants completing a visual search task located evolutionarily relevant and animate objects fastest and with the least impact of high perceptual load. Evolutionarily novel and inanimate objects were located slowest and with the highest impact of perceptual load. Evolutionary relevance may importantly affect everyday visual information processing.

  3. Chinese legal texts – Quantitative Description

    Directory of Open Access Journals (Sweden)

    Ľuboš GAJDOŠ

    2017-06-01

    Full Text Available The aim of the paper is to provide a quantitative description of legal Chinese. This study adopts the approach of corpus-based analyses and it shows basic statistical parameters of legal texts in Chinese, namely the length of a sentence, the proportion of part of speech etc. The research is conducted on the Chinese monolingual corpus Hanku. The paper also discusses the issues of statistical data processing from various corpora, e.g. the tokenisation and part of speech tagging and their relevance to study of registers variation.

  4. Bible Translation And Relevance Theory | Deist | Stellenbosch ...

    African Journals Online (AJOL)

    Stellenbosch Papers in Linguistics Plus. Journal Home · ABOUT THIS JOURNAL · Advanced Search · Current Issue · Archives · Journal Home > Vol 22 (1992) >. Log in or Register to get access to full text downloads. Username, Password, Remember me, or Register. Bible Translation And Relevance Theory. F Deist ...

  5. Directed Activities Related to Text: Text Analysis and Text Reconstruction.

    Science.gov (United States)

    Davies, Florence; Greene, Terry

    This paper describes Directed Activities Related to Text (DART), procedures that were developed and are used in the Reading for Learning Project at the University of Nottingham (England) to enhance learning from texts and that fall into two broad categories: (1) text analysis procedures, which require students to engage in some form of analysis of…

  6. Making academic research more relevant: A few suggestions

    Directory of Open Access Journals (Sweden)

    Abinash Panda

    2014-09-01

    Full Text Available Academic research in the domain of management scholarship, though steeped in scientific and methodological rigour, is generally found to be of little relevance to practice. The authors of this paper have revisited the rigour-relevance debate in light of recent developments and with special reference to the management research scenario in India. The central thesis of the argument is that the gulf between rigour and relevance needs to be bridged to make academic research more relevant to business organizations and practitioners. They have offered some suggestions to enhance the relevance of academic research to practice.

  7. Relevance Feedback in Content Based Image Retrieval: A Review

    Directory of Open Access Journals (Sweden)

    Manesh B. Kokare

    2011-01-01

    Full Text Available This paper provides an overview of the technical achievements in the research area of relevance feedback (RF in content-based image retrieval (CBIR. Relevance feedback is a powerful technique in CBIR systems, in order to improve the performance of CBIR effectively. It is an open research area to the researcher to reduce the semantic gap between low-level features and high level concepts. The paper covers the current state of art of the research in relevance feedback in CBIR, various relevance feedback techniques and issues in relevance feedback are discussed in detail.

  8. Zum Bildungspotenzial biblischer Texte

    Directory of Open Access Journals (Sweden)

    Theis, Joachim

    2017-11-01

    Full Text Available Biblical education as a holistic process goes far beyond biblical learning. It must be understood as a lifelong process, in which both biblical texts and their understanders operate appropriating their counterpart in a dialogical way. – Neither does the recipient’s horizon of understanding appear as an empty room, which had to be filled with the text only, nor is the latter a dead material one could only examine cognitively. The recipient discovers the meaning of the biblical text recomposing it by existential appropriation. So the text is brought to live in each individual reality. Both scientific insights and subjective structures as well as the understanders’ community must be included to avoid potential one-sidednesses. Unfortunately, a special negative association obscures the approach of the bible very often: Still biblical work as part of religious education appears in a cognitively oriented habit, which is neither regarding the vitality and sovereignty of the biblical texts nor the students’ desire for meaning. Moreover, the bible is getting misused for teaching moral terms or pontifications. Such downfalls can be disrupted by biblical didactics which are empowerment didactics. Regarding the sovereignty of biblical texts, these didactics assist the understander with his/her individuation by opening the texts with focus on the understander’s otherness. Thus each the text and the recipient become subjects in a dialogue. The approach of the Biblical-Enabling-Didactics leads the Bible to become always new a book of life. Understanding them from within their hermeneutics, empowerment didactics could be raised to the principle of biblical didactics in general and grow into an essential element of holistic education.

  9. Microdosing: Concept, application and relevance

    Directory of Open Access Journals (Sweden)

    Tushar Tewari

    2010-01-01

    Full Text Available The use of microdose pharmacokinetic studies as an essential tool in drug development is still to catch on. While this approach promises potential cost savings and a quantum leap in efficiencies of the drug development process, major hurdles still need to be overcome before the technique becomes commonplace and part of routine practice. Clear regulations in Europe and the USA have had an enabling effect. The lack of enabling provisions for microdosing studies in Indian regulation, despite low risk and manifest relevance for the local drug development industry, is inconsistent with the country′s aspirations to be among the leaders in pharmaceutical research.

  10. U.S. Cavalry: Still Relevant in Full Spectrum Operations

    Science.gov (United States)

    2010-05-21

    major railway line 51 Jonathean Gawne, The Americans in Brittany, 1944: The Battle for Brest . ( Paris : Historie & Collections, 2002), 19. 52 Gawne. The...Jonathean. The Americans in Brittany, 1944: The Battle for Brest . Paris , FR: Historie & Collections, 2002. Gillie, Mildred H. Forging the Thunderbolt...traversed the northern coast.54 This rail line was crucial to improving Allied communication between Brest and Rennes and supporting the advance of

  11. Text Mining for Protein Docking.

    Directory of Open Access Journals (Sweden)

    Varsha D Badal

    2015-12-01

    Full Text Available The rapidly growing amount of publicly available information from biomedical research is readily accessible on the Internet, providing a powerful resource for predictive biomolecular modeling. The accumulated data on experimentally determined structures transformed structure prediction of proteins and protein complexes. Instead of exploring the enormous search space, predictive tools can simply proceed to the solution based on similarity to the existing, previously determined structures. A similar major paradigm shift is emerging due to the rapidly expanding amount of information, other than experimentally determined structures, which still can be used as constraints in biomolecular structure prediction. Automated text mining has been widely used in recreating protein interaction networks, as well as in detecting small ligand binding sites on protein structures. Combining and expanding these two well-developed areas of research, we applied the text mining to structural modeling of protein-protein complexes (protein docking. Protein docking can be significantly improved when constraints on the docking mode are available. We developed a procedure that retrieves published abstracts on a specific protein-protein interaction and extracts information relevant to docking. The procedure was assessed on protein complexes from Dockground (http://dockground.compbio.ku.edu. The results show that correct information on binding residues can be extracted for about half of the complexes. The amount of irrelevant information was reduced by conceptual analysis of a subset of the retrieved abstracts, based on the bag-of-words (features approach. Support Vector Machine models were trained and validated on the subset. The remaining abstracts were filtered by the best-performing models, which decreased the irrelevant information for ~ 25% complexes in the dataset. The extracted constraints were incorporated in the docking protocol and tested on the Dockground unbound

  12. Üstverinin Tam-Metin Bilgi Erişim Performansı Üzerindeki Etkisi: Küçük Ölçekli Türkçe Külliyat Üzerinde Deneysel Bir Araştırma / Impact of Metadata on Full-text Information Retrieval Performance: An Experimental Research on a Small Scale Turkish Corpus

    OpenAIRE

    Çapkın, Çağdaş

    2016-01-01

    Information institutions use text-based information retrieval systems to store, index and retrieve metadata, full-text, or both metadata and full-text (hybrid) contents. The aim of this research was to evaluate impact of these contents on information retrieval performance. For this purpose, metadata (MIR), full-text (FIR) and hybrid (HIR) content information retrieval systems were developed with default Lucene information retrieval model for a small scale Turkish corpus. In order to evaluate ...

  13. Text Maps: Helping Students Navigate Informational Texts.

    Science.gov (United States)

    Spencer, Brenda H.

    2003-01-01

    Notes that a text map is an instructional approach designed to help students gain fluency in reading content area materials. Discusses how the goal is to teach students about the important features of the material and how the maps can be used to build new understandings. Presents the procedures for preparing and using a text map. (SG)

  14. Text mining for the biocuration workflow.

    Science.gov (United States)

    Hirschman, Lynette; Burns, Gully A P C; Krallinger, Martin; Arighi, Cecilia; Cohen, K Bretonnel; Valencia, Alfonso; Wu, Cathy H; Chatr-Aryamontri, Andrew; Dowell, Karen G; Huala, Eva; Lourenço, Anália; Nash, Robert; Veuthey, Anne-Lise; Wiegers, Thomas; Winter, Andrew G

    2012-01-01

    Molecular biology has become heavily dependent on biological knowledge encoded in expert curated biological databases. As the volume of biological literature increases, biocurators need help in keeping up with the literature; (semi-) automated aids for biocuration would seem to be an ideal application for natural language processing and text mining. However, to date, there have been few documented successes for improving biocuration throughput using text mining. Our initial investigations took place for the workshop on 'Text Mining for the BioCuration Workflow' at the third International Biocuration Conference (Berlin, 2009). We interviewed biocurators to obtain workflows from eight biological databases. This initial study revealed high-level commonalities, including (i) selection of documents for curation; (ii) indexing of documents with biologically relevant entities (e.g. genes); and (iii) detailed curation of specific relations (e.g. interactions); however, the detailed workflows also showed many variabilities. Following the workshop, we conducted a survey of biocurators. The survey identified biocurator priorities, including the handling of full text indexed with biological entities and support for the identification and prioritization of documents for curation. It also indicated that two-thirds of the biocuration teams had experimented with text mining and almost half were using text mining at that time. Analysis of our interviews and survey provide a set of requirements for the integration of text mining into the biocuration workflow. These can guide the identification of common needs across curated databases and encourage joint experimentation involving biocurators, text mining developers and the larger biomedical research community.

  15. Weitere Texte physiognomischen Inhalts

    Directory of Open Access Journals (Sweden)

    Böck, Barbara

    2004-12-01

    Full Text Available The present article offers the edition of three cuneiform texts belonging to the Akkadian handbook of omens drawn from the physical appearance as well as the morals and behaviour of man. The book comprising up to 27 chapters with more than 100 omens each was entitled in antiquity Alamdimmû. The edition of the three cuneiform tablets completes, thus, the author's monographic study on the ancient Mesopotamian divinatory discipline of physiognomy (Die babylonisch-assyrische Morphoskopie (Wien 2000 [=AfO Beih. 27].

    En este artículo se presenta la editio princeps de tres textos cuneiformes conservados en el British Museum (Londres y el Vorderasiatisches Museum (Berlín, que pertenecen al libro asirio-babilonio de presagios fisiognómicos. Este libro, titulado originalmente Alamdimmû ('forma, figura', consta de 27 capítulos, cada uno con más de cien presagios escritos en lengua acadia. Los tres textos completan así el estudio monográfico de la autora sobre la disciplina adivinatoria de la fisiognomía en el antiguo Oriente (Die babylonisch-assyrische Morphoskopie (Wien 2000 [=AfO Beih. 27].

  16. Profiling School Shooters: Automatic Text-Based Analysis

    Directory of Open Access Journals (Sweden)

    Yair eNeuman

    2015-06-01

    Full Text Available School shooters present a challenge to both forensic psychiatry and law enforcement agencies. The relatively small number of school shooters, their various charateristics, and the lack of in-depth analysis of all of the shooters prior to the shooting add complexity to our understanding of this problem. In this short paper, we introduce a new methodology for automatically profiling school shooters. The methodology involves automatic analysis of texts and the production of several measures relevant for the identification of the shooters. Comparing texts written by six school shooters to 6056 texts written by a comparison group of male subjects, we found that the shooters' texts scored significantly higher on the Narcissistic Personality dimension as well as on the Humilated and Revengeful dimensions. Using a ranking/priorization procedure, similar to the one used for the automatic identification of sexual predators, we provide support for the validity and relevance of the proposed methodology.

  17. Monolingual accounting dictionaries for EFL text production

    Directory of Open Access Journals (Sweden)

    Sandro Nielsen

    2006-10-01

    Full Text Available Monolingual accounting dictionaries are important for producing financial reporting texts in English in an international setting, because of the lack of specialised bilingual dictionaries. As the intended user groups have different factual and linguistic competences, they require specific types of information. By identifying and analysing the users' factual and linguistic competences, user needs, use-situations and the stages involved in producing accounting texts in English as a foreign language, lexicographers will have a sound basis for designing the optimal English accounting dictionary for EFL text production. The monolingual accounting dictionary needs to include information about UK, US and international accounting terms, their grammatical properties, their potential for being combined with other words in collocations, phrases and sentences in order to meet user requirements. Data items that deal with these aspects are necessary for the international user group as they produce subject-field specific and register-specific texts in a foreign language, and the data items are relevant for the various stages in text production: draft writing, copyediting, stylistic editing and proofreading.

  18. Passage relevance models for genomics search

    Directory of Open Access Journals (Sweden)

    Frieder Ophir

    2009-03-01

    Full Text Available Abstract We present a passage relevance model for integrating syntactic and semantic evidence of biomedical concepts and topics using a probabilistic graphical model. Component models of topics, concepts, terms, and document are represented as potential functions within a Markov Random Field. The probability of a passage being relevant to a biologist's information need is represented as the joint distribution across all potential functions. Relevance model feedback of top ranked passages is used to improve distributional estimates of query concepts and topics in context, and a dimensional indexing strategy is used for efficient aggregation of concept and term statistics. By integrating multiple sources of evidence including dependencies between topics, concepts, and terms, we seek to improve genomics literature passage retrieval precision. Using this model, we are able to demonstrate statistically significant improvements in retrieval precision using a large genomics literature corpus.

  19. The Relevance of Hegel's Logic

    Directory of Open Access Journals (Sweden)

    John W Burbidge

    2007-12-01

    Full Text Available Hegel defines his Logic as the science that thinks about thinking.nbsp; But when we interpret that work as outlining what happens when we reason we are vulnerable to Fregersquo;s charge of psychologism.nbsp; I use Hegelrsquo;s tripartite distinction among understanding, dialectical and speculative reason as operations of pure thought to suggest how thinking can work with objective concepts.nbsp; In the last analysis, however, our ability to move from the subjective contingency of representations and ideas to the pure concepts we think develops from mechanical memory, which separates sign from sense so hat we can focus simply on the latter.nbsp; By becoming aware of the connections that underlie our thinking processes we may be able to both move beyond the abstractions of symbolic logic and clarify what informal logicians call relevance.

  20. Text-Fabric

    NARCIS (Netherlands)

    Roorda, Dirk

    2016-01-01

    Text-Fabric is a Python3 package for Text plus Annotations. It provides a data model, a text file format, and a binary format for (ancient) text plus (linguistic) annotations. The emphasis of this all is on: data processing; sharing data; and contributing modules. A defining characteristic is that

  1. Contextual Text Mining

    Science.gov (United States)

    Mei, Qiaozhu

    2009-01-01

    With the dramatic growth of text information, there is an increasing need for powerful text mining systems that can automatically discover useful knowledge from text. Text is generally associated with all kinds of contextual information. Those contexts can be explicit, such as the time and the location where a blog article is written, and the…

  2. XML and Free Text.

    Science.gov (United States)

    Riggs, Ken Roger

    2002-01-01

    Discusses problems with marking free text, text that is either natural language or semigrammatical but unstructured, that prevent well-formed XML from marking text for readily available meaning. Proposes a solution to mark meaning in free text that is consistent with the intended simplicity of XML versus SGML. (Author/LRW)

  3. E-text

    DEFF Research Database (Denmark)

    Finnemann, Niels Ole

    2018-01-01

    text can be defined by taking as point of departure the digital format in which everything is represented in the binary alphabet. While the notion of text, in most cases, lends itself to be independent of medium and embodiment, it is also often tacitly assumed that it is, in fact, modeled around...... the print medium, rather than written text or speech. In late 20th century, the notion of text was subject to increasing criticism as in the question raised within literary text theory: is there a text in this class? At the same time, the notion was expanded by including extra linguistic sign modalities...

  4. Texting on the Move

    Science.gov (United States)

    ... text. What's the Big Deal? The problem is multitasking. No matter how young and agile we are, ... on something other than the road. In fact, driving while texting (DWT) can be more dangerous than ...

  5. Text Coherence in Translation

    Science.gov (United States)

    Zheng, Yanping

    2009-01-01

    In the thesis a coherent text is defined as a continuity of senses of the outcome of combining concepts and relations into a network composed of knowledge space centered around main topics. And the author maintains that in order to obtain the coherence of a target language text from a source text during the process of translation, a translator can…

  6. Why relevance theory is relevant for lexicography

    DEFF Research Database (Denmark)

    Bothma, Theo; Tarp, Sven

    2014-01-01

    This article starts by providing a brief summary of relevance theory in information science in relation to the function theory of lexicography, explaining the different types of relevance, viz. objective system relevance and the subjective types of relevance, i.e. topical, cognitive, situational...... that is very important for lexicography as well as for information science, viz. functional relevance. Since all lexicographic work is ultimately aimed at satisfying users’ information needs, the article then discusses why the lexicographer should take note of all these types of relevance when planning a new...... dictionary project, identifying new tasks and responsibilities of the modern lexicographer. The article furthermore discusses how relevance theory impacts on teaching dictionary culture and reference skills. By integrating insights from lexicography and information science, the article contributes to new...

  7. Text mining for the biocuration workflow

    Science.gov (United States)

    Hirschman, Lynette; Burns, Gully A. P. C; Krallinger, Martin; Arighi, Cecilia; Cohen, K. Bretonnel; Valencia, Alfonso; Wu, Cathy H.; Chatr-Aryamontri, Andrew; Dowell, Karen G.; Huala, Eva; Lourenço, Anália; Nash, Robert; Veuthey, Anne-Lise; Wiegers, Thomas; Winter, Andrew G.

    2012-01-01

    Molecular biology has become heavily dependent on biological knowledge encoded in expert curated biological databases. As the volume of biological literature increases, biocurators need help in keeping up with the literature; (semi-) automated aids for biocuration would seem to be an ideal application for natural language processing and text mining. However, to date, there have been few documented successes for improving biocuration throughput using text mining. Our initial investigations took place for the workshop on ‘Text Mining for the BioCuration Workflow’ at the third International Biocuration Conference (Berlin, 2009). We interviewed biocurators to obtain workflows from eight biological databases. This initial study revealed high-level commonalities, including (i) selection of documents for curation; (ii) indexing of documents with biologically relevant entities (e.g. genes); and (iii) detailed curation of specific relations (e.g. interactions); however, the detailed workflows also showed many variabilities. Following the workshop, we conducted a survey of biocurators. The survey identified biocurator priorities, including the handling of full text indexed with biological entities and support for the identification and prioritization of documents for curation. It also indicated that two-thirds of the biocuration teams had experimented with text mining and almost half were using text mining at that time. Analysis of our interviews and survey provide a set of requirements for the integration of text mining into the biocuration workflow. These can guide the identification of common needs across curated databases and encourage joint experimentation involving biocurators, text mining developers and the larger biomedical research community. PMID:22513129

  8. TEXT DEIXIS IN NARRATIVE SEQUENCES

    Directory of Open Access Journals (Sweden)

    Josep Rivera

    2007-06-01

    Full Text Available This study looks at demonstrative descriptions, regarding them as text-deictic procedures which contribute to weave discourse reference. Text deixis is thought of as a metaphorical referential device which maps the ground of utterance onto the text itself. Demonstrative expressions with textual antecedent-triggers, considered as the most important text-deictic units, are identified in a narrative corpus consisting of J. M. Barrie’s Peter Pan and its translation into Catalan. Some linguistic and discourse variables related to DemNPs are analysed to characterise adequately text deixis. It is shown that this referential device is usually combined with abstract nouns, thus categorising and encapsulating (non-nominal complex discourse entities as nouns, while performing a referential cohesive function by means of the text deixis + general noun type of lexical cohesion.

  9. Library Users Expect Link Resolvers to Provide Full Text While Librarians Expect Accurate Results. A review of: Wakimoto, Jina Choi, David S. Walker, and Katherine S. Dabbour. “The Myths and Realities of SFX in Academic Libraries.” The Journal of Academic Librarianship 32.2 (Mar. 2006: 127‐ 36.

    Directory of Open Access Journals (Sweden)

    Wendy Furlan

    2006-12-01

    Full Text Available Objective – To determine how successfulthe link resolver, SFX, is in meeting the expectations of library users and librarians.Design – Analysis of an online user survey, library staff focus groups, retrospective analysis of system statistics, and test searches.Setting – Two California State University campus libraries in the United States: Northbridge, with over 31,000 students on campus, and San Marcos, with over 7,300 students on campus.Subjects – A total of 453 online survey responses were submitted from library users, 421 from Northbridge and 32 from SanMarcos. Twenty librarians took part in the focus groups conducted with library staff consisting of 14 of the 23 librarians from Northbridge (2 from technical services and 12 from public services, and 6 of the 10 San Marcos librarians (3 from technical services and 3 from public services. No further information was provided on the characteristics of the subjects.Methods – An online survey was offered to users of the two campus libraries for a two week period in May 2004. The survey consisted of 8 questions, 7 fixed response and 1 free text. Survey distribution was enabled via a different mechanism at each campus. The Northbridge library offered the survey to users via a pop‐up window each time the SFX service was clicked on, while the San Marcos library presented the survey as a link from the library’s home page. Survey responses from both campuses were combined and analysed together. Focus groups were conducted with librarians from each campus library on April 20th, 21st, and 29th, 2004. Librarians attended focus groups only with others from their own campus. Statistics were gathered from each campus’ local SFX system for the 3‐month period from September 14, 2004, to December 14,2004. Statistics from each campus were combined for analysis. The authors also conducted 224 test searches over the 3‐month period from July to September, 2004.Main results – Analysis of the

  10. Interconnectedness und digitale Texte

    Directory of Open Access Journals (Sweden)

    Detlev Doherr

    2013-04-01

    Full Text Available Zusammenfassung Die multimedialen Informationsdienste im Internet werden immer umfangreicher und umfassender, wobei auch die nur in gedruckter Form vorliegenden Dokumente von den Bibliotheken digitalisiert und ins Netz gestellt werden. Über Online-Dokumentenverwaltungen oder Suchmaschinen können diese Dokumente gefunden und dann in gängigen Formaten wie z.B. PDF bereitgestellt werden. Dieser Artikel beleuchtet die Funktionsweise der Humboldt Digital Library, die seit mehr als zehn Jahren Dokumente von Alexander von Humboldt in englischer Übersetzung im Web als HDL (Humboldt Digital Library kostenfrei zur Verfügung stellt. Anders als eine digitale Bibliothek werden dabei allerdings nicht nur digitalisierte Dokumente als Scan oder PDF bereitgestellt, sondern der Text als solcher und in vernetzter Form verfügbar gemacht. Das System gleicht damit eher einem Informationssystem als einer digitalen Bibliothek, was sich auch in den verfügbaren Funktionen zur Auffindung von Texten in unterschiedlichen Versionen und Übersetzungen, Vergleichen von Absätzen verschiedener Dokumente oder der Darstellung von Bilden in ihrem Kontext widerspiegelt. Die Entwicklung von dynamischen Hyperlinks auf der Basis der einzelnen Textabsätze der Humboldt‘schen Werke in Form von Media Assets ermöglicht eine Nutzung der Programmierschnittstelle von Google Maps zur geographischen wie auch textinhaltlichen Navigation. Über den Service einer digitalen Bibliothek hinausgehend, bietet die HDL den Prototypen eines mehrdimensionalen Informationssystems, das mit dynamischen Strukturen arbeitet und umfangreiche thematische Auswertungen und Vergleiche ermöglicht. Summary The multimedia information services on Internet are becoming more and more comprehensive, even the printed documents are digitized and republished as digital Web documents by the libraries. Those digital files can be found by search engines or management tools and provided as files in usual formats as

  11. Dictionaries for text production

    DEFF Research Database (Denmark)

    Fuertes-Olivera, Pedro; Bergenholtz, Henning

    2018-01-01

    Dictionaries for Text Production are information tools that are designed and constructed for helping users to produce (i.e. encode) texts, both oral and written texts. These can be broadly divided into two groups: (a) specialized text production dictionaries, i.e., dictionaries that only offer...... a small amount of lexicographic data, most or all of which are typically used in a production situation, e.g. synonym dictionaries, grammar and spelling dictionaries, collocation dictionaries, concept dictionaries such as the Longman Language Activator, which is advertised as the World’s First Production...... Dictionary; (b) general text production dictionaries, i.e., dictionaries that offer all or most of the lexicographic data that are typically used in a production situation. A review of existing production dictionaries reveals that there are many specialized text production dictionaries but only a few general...

  12. SparkText: Biomedical Text Mining on Big Data Framework.

    Directory of Open Access Journals (Sweden)

    Zhan Ye

    Full Text Available Many new biomedical research articles are published every day, accumulating rich information, such as genetic variants, genes, diseases, and treatments. Rapid yet accurate text mining on large-scale scientific literature can discover novel knowledge to better understand human diseases and to improve the quality of disease diagnosis, prevention, and treatment.In this study, we designed and developed an efficient text mining framework called SparkText on a Big Data infrastructure, which is composed of Apache Spark data streaming and machine learning methods, combined with a Cassandra NoSQL database. To demonstrate its performance for classifying cancer types, we extracted information (e.g., breast, prostate, and lung cancers from tens of thousands of articles downloaded from PubMed, and then employed Naïve Bayes, Support Vector Machine (SVM, and Logistic Regression to build prediction models to mine the articles. The accuracy of predicting a cancer type by SVM using the 29,437 full-text articles was 93.81%. While competing text-mining tools took more than 11 hours, SparkText mined the dataset in approximately 6 minutes.This study demonstrates the potential for mining large-scale scientific articles on a Big Data infrastructure, with real-time update from new articles published daily. SparkText can be extended to other areas of biomedical research.

  13. Inferring relevance in a changing world

    Directory of Open Access Journals (Sweden)

    Robert C Wilson

    2012-01-01

    Full Text Available Reinforcement learning models of human and animal learning usually concentrate on how we learn the relationship between different stimuli or actions and rewards. However, in real world situations stimuli are ill-defined. On the one hand, our immediate environment is extremely multi-dimensional. On the other hand, in every decision-making scenario only a few aspects of the environment are relevant for obtaining reward, while most are irrelevant. Thus a key question is how do we learn these relevant dimensions, that is, how do we learn what to learn about? We investigated this process of representation learning experimentally, using a task in which one stimulus dimension was relevant for determining reward at each point in time. As in real life situations, in our task the relevant dimension can change without warning, adding ever-present uncertainty engendered by a constantly changing environment. We show that human performance on this task is better described by a suboptimal strategy based on selective attention and serial hypothesis testing rather than a normative strategy based on probabilistic inference. From this, we conjecture that the problem of inferring relevance in general scenarios is too computationally demanding for the brain to solve optimally. As a result the brain utilizes approximations, employing these even in simplified scenarios in which optimal representation learning is tractable, such as the one in our experiment.

  14. Incorporating other texts: Intertextuality in Malaysian CSR reports

    Directory of Open Access Journals (Sweden)

    Kumaran Rajandran

    2016-11-01

    Full Text Available In Malaysia, corporate social responsibility (CSR is relatively new but corporations have been required to engage in and disclose their CSR. A typical genre for disclosure is CSR reports and these reports often refer to other texts. The article investigates the act of referencing to other texts or intertextuality in Malaysian CSR reports. It creates an archive of CEO Statements and Environment Sections in CSR reports and studies the archive for keywords, which can identify the incorporated texts. The function of these texts is examined in relation to Malaysia’s corporate context. CSR reports contain explicit references to documents (policies, regulations, reports, research, standards and to individuals/groups (CEOs, stakeholders, expert organizations. The incorporated texts display variation in corporate control, which organizes these texts along an intertextual cline. The cline helps to identify corporate and non-corporate sources among the texts. The selection of incorporated texts may reflect government and stock exchange demands. The texts are not standardized and are relevant for the CSR domain and corporations, where these texts monitor and justify CSR performance. Yet, the incorporated texts may perpetuate inexact reporting because corporations select the texts and the parts of texts to refer to. Since these texts have been employed to scrutinize initiatives and results, CSR reports can claim to represent the “truth” about a corporation’s CSR. Hence, intertextuality serves corporate interests.

  15. Text mining by Tsallis entropy

    Science.gov (United States)

    Jamaati, Maryam; Mehri, Ali

    2018-01-01

    Long-range correlations between the elements of natural languages enable them to convey very complex information. Complex structure of human language, as a manifestation of natural languages, motivates us to apply nonextensive statistical mechanics in text mining. Tsallis entropy appropriately ranks the terms' relevance to document subject, taking advantage of their spatial correlation length. We apply this statistical concept as a new powerful word ranking metric in order to extract keywords of a single document. We carry out an experimental evaluation, which shows capability of the presented method in keyword extraction. We find that, Tsallis entropy has reliable word ranking performance, at the same level of the best previous ranking methods.

  16. Linguistics in Text Interpretation

    DEFF Research Database (Denmark)

    Togeby, Ole

    2011-01-01

    A model for how text interpretation proceeds from what is pronounced, through what is said to what is comunicated, and definition of the concepts 'presupposition' and 'implicature'.......A model for how text interpretation proceeds from what is pronounced, through what is said to what is comunicated, and definition of the concepts 'presupposition' and 'implicature'....

  17. LocText

    DEFF Research Database (Denmark)

    Cejuela, Juan Miguel; Vinchurkar, Shrikant; Goldberg, Tatyana

    2018-01-01

    trees and was trained and evaluated on a newly improved LocTextCorpus. Combined with an automatic named-entity recognizer, LocText achieved high precision (P = 86%±4). After completing development, we mined the latest research publications for three organisms: human (Homo sapiens), budding yeast...

  18. Systematic text condensation

    DEFF Research Database (Denmark)

    Malterud, Kirsti

    2012-01-01

    To present background, principles, and procedures for a strategy for qualitative analysis called systematic text condensation and discuss this approach compared with related strategies.......To present background, principles, and procedures for a strategy for qualitative analysis called systematic text condensation and discuss this approach compared with related strategies....

  19. The Perfect Text.

    Science.gov (United States)

    Russo, Ruth

    1998-01-01

    A chemistry teacher describes the elements of the ideal chemistry textbook. The perfect text is focused and helps students draw a coherent whole out of the myriad fragments of information and interpretation. The text would show chemistry as the central science necessary for understanding other sciences and would also root chemistry firmly in the…

  20. Text 2 Mind Map

    OpenAIRE

    Iona, John

    2017-01-01

    This is a review of the web resource 'Text 2 Mind Map' www.Text2MindMap.com. It covers what the resource is, and how it might be used in Library and education context, in particular for School Librarians.

  1. Text File Comparator

    Science.gov (United States)

    Kotler, R. S.

    1983-01-01

    File Comparator program IFCOMP, is text file comparator for IBM OS/VScompatable systems. IFCOMP accepts as input two text files and produces listing of differences in pseudo-update form. IFCOMP is very useful in monitoring changes made to software at the source code level.

  2. Deep learning relevance

    DEFF Research Database (Denmark)

    Lioma, Christina; Larsen, Birger; Petersen, Casper

    2016-01-01

    train a Recurrent Neural Network (RNN) on existing relevant information to that query. We then use the RNN to "deep learn" a single, synthetic, and we assume, relevant document for that query. We design a crowdsourcing experiment to assess how relevant the "deep learned" document is, compared...... to existing relevant documents. Users are shown a query and four wordclouds (of three existing relevant documents and our deep learned synthetic document). The synthetic document is ranked on average most relevant of all....

  3. Monitoring interaction and collective text production through text mining

    Directory of Open Access Journals (Sweden)

    Macedo, Alexandra Lorandi

    2014-04-01

    Full Text Available This article presents the Concepts Network tool, developed using text mining technology. The main objective of this tool is to extract and relate terms of greatest incidence from a text and exhibit the results in the form of a graph. The Network was implemented in the Collective Text Editor (CTE which is an online tool that allows the production of texts in synchronized or non-synchronized forms. This article describes the application of the Network both in texts produced collectively and texts produced in a forum. The purpose of the tool is to offer support to the teacher in managing the high volume of data generated in the process of interaction amongst students and in the construction of the text. Specifically, the aim is to facilitate the teacher’s job by allowing him/her to process data in a shorter time than is currently demanded. The results suggest that the Concepts Network can aid the teacher, as it provides indicators of the quality of the text produced. Moreover, messages posted in forums can be analyzed without their content necessarily having to be pre-read.

  4. EST: Evading Scientific Text.

    Science.gov (United States)

    Ward, Jeremy

    2001-01-01

    Examines chemical engineering students' attitudes to text and other parts of English language textbooks. A questionnaire was administered to a group of undergraduates. Results reveal one way students get around the problem of textbook reading. (Author/VWL)

  5. nal Sesotho texts

    African Journals Online (AJOL)

    with literary texts written in indigenous South African languages. The project ... Homi Bhabha uses the words of Salman Rushdie to underline the fact that new .... I could not conceptualise an African-language-to-African-language dictionary. An.

  6. Machine Translation from Text

    Science.gov (United States)

    Habash, Nizar; Olive, Joseph; Christianson, Caitlin; McCary, John

    Machine translation (MT) from text, the topic of this chapter, is perhaps the heart of the GALE project. Beyond being a well defined application that stands on its own, MT from text is the link between the automatic speech recognition component and the distillation component. The focus of MT in GALE is on translating from Arabic or Chinese to English. The three languages represent a wide range of linguistic diversity and make the GALE MT task rather challenging and exciting.

  7. Bacterial colonization of psoriasis plaques. Is it relevant?

    Directory of Open Access Journals (Sweden)

    Eva Marcus

    2011-08-01

    Full Text Available Bacterial colonization was investigated retrospectively in patients with plaque psoriasis (n=98 inpatient treatments, n=73 patients. At least one pathogen was found in 46% of all cases. Staphylococcus aureus was the most frequent bacterium. Bacterial colonization of psoriasis plaques could be relevant in individual cases.

  8. The LAILAPS Search Engine: Relevance Ranking in Life Science Databases

    Directory of Open Access Journals (Sweden)

    Lange Matthias

    2010-06-01

    Full Text Available Search engines and retrieval systems are popular tools at a life science desktop. The manual inspection of hundreds of database entries, that reflect a life science concept or fact, is a time intensive daily work. Hereby, not the number of query results matters, but the relevance does. In this paper, we present the LAILAPS search engine for life science databases. The concept is to combine a novel feature model for relevance ranking, a machine learning approach to model user relevance profiles, ranking improvement by user feedback tracking and an intuitive and slim web user interface, that estimates relevance rank by tracking user interactions. Queries are formulated as simple keyword lists and will be expanded by synonyms. Supporting a flexible text index and a simple data import format, LAILAPS can easily be used both as search engine for comprehensive integrated life science databases and for small in-house project databases.

  9. Extracting of implicit information in English advertising texts with phonetic and lexical-morphological means

    Directory of Open Access Journals (Sweden)

    Traikovskaya Natalya Petrovna

    2015-12-01

    Full Text Available The article deals with phonetic and lexical-morphological language means participating in the process of extracting implicit information in English-speaking advertising texts for men and women. The functioning of phonetic means of the English language is not the basis for implication of information in advertising texts. Lexical and morphological means play the role of markers of relevant information, playing the role of the activator ofimplicit information in the texts of advertising.

  10. Biomarker Identification Using Text Mining

    Directory of Open Access Journals (Sweden)

    Hui Li

    2012-01-01

    Full Text Available Identifying molecular biomarkers has become one of the important tasks for scientists to assess the different phenotypic states of cells or organisms correlated to the genotypes of diseases from large-scale biological data. In this paper, we proposed a text-mining-based method to discover biomarkers from PubMed. First, we construct a database based on a dictionary, and then we used a finite state machine to identify the biomarkers. Our method of text mining provides a highly reliable approach to discover the biomarkers in the PubMed database.

  11. TEXT Energy Storage System

    International Nuclear Information System (INIS)

    Weldon, W.F.; Rylander, H.G.; Woodson, H.H.

    1977-01-01

    The Texas Experimental Tokamak (TEXT) Enery Storage System, designed by the Center for Electromechanics (CEM), consists of four 50 MJ, 125 V homopolar generators and their auxiliaries and is designed to power the toroidal and poloidal field coils of TEXT on a two-minute duty cycle. The four 50 MJ generators connected in series were chosen because they represent the minimum cost configuration and also represent a minimal scale up from the successful 5.0 MJ homopolar generator designed, built, and operated by the CEM

  12. Text and ideology: text-oriented discourse analysis

    Directory of Open Access Journals (Sweden)

    Maria Eduarda Gonçalves Peixoto

    2018-04-01

    Full Text Available The article aims to contribute to the understanding of the connection between text and ideology articulated by the text-oriented analysis of discourse (ADTO. Based on the reflections of Fairclough (1989, 2001, 2003 and Fairclough and Chouliaraki (1999, the debate presents the social ontology that ADTO uses to base its conception of social life as an open system and textually mediated; the article then explains the chronological-narrative development of the main critical theories of ideology, by virtue of which ADTO organizes the assumptions that underpin the particular use it makes of the term. Finally, the discussion presents the main aspects of the connection between text and ideology, offering a conceptual framework that can contribute to the domain of the theme according to a critical discourse analysis approach.

  13. Other relevant biological papers

    International Nuclear Information System (INIS)

    Shimizu, M.

    1989-01-01

    A considerable number of CRESP-relevant papers concerning deep-sea biology and radioecology have been published. It is the purpose of this study to call attention to them. They fall into three general categories. The first is papers of general interest. They are mentioned only briefly, and include text references to the global bibliography at the end of the volume. The second are papers that are not only mentioned and referenced, but for various reasons are described in abstract form. The last is a list of papers compiled by H.S.J. Roe specifically for this volume. They are listed in bibliographic form, and are also included in the global bibliography at the end of the volume

  14. New mathematical cuneiform texts

    CERN Document Server

    Friberg, Jöran

    2016-01-01

    This monograph presents in great detail a large number of both unpublished and previously published Babylonian mathematical texts in the cuneiform script. It is a continuation of the work A Remarkable Collection of Babylonian Mathematical Texts (Springer 2007) written by Jöran Friberg, the leading expert on Babylonian mathematics. Focussing on the big picture, Friberg explores in this book several Late Babylonian arithmetical and metro-mathematical table texts from the sites of Babylon, Uruk and Sippar, collections of mathematical exercises from four Old Babylonian sites, as well as a new text from Early Dynastic/Early Sargonic Umma, which is the oldest known collection of mathematical exercises. A table of reciprocals from the end of the third millennium BC, differing radically from well-documented but younger tables of reciprocals from the Neo-Sumerian and Old-Babylonian periods, as well as a fragment of a Neo-Sumerian clay tablet showing a new type of a labyrinth are also discussed. The material is presen...

  15. The Emar Lexical Texts

    NARCIS (Netherlands)

    Gantzert, Merijn

    2011-01-01

    This four-part work provides a philological analysis and a theoretical interpretation of the cuneiform lexical texts found in the Late Bronze Age city of Emar, in present-day Syria. These word and sign lists, commonly dated to around 1100 BC, were almost all found in the archive of a single school.

  16. Text Induced Spelling Correction

    NARCIS (Netherlands)

    Reynaert, M.W.C.

    2004-01-01

    We present TISC, a language-independent and context-sensitive spelling checking and correction system designed to facilitate the automatic removal of non-word spelling errors in large corpora. Its lexicon is derived from a very large corpus of raw text, without supervision, and contains word

  17. Texts and Readers.

    Science.gov (United States)

    Iser, Wolfgang

    1980-01-01

    Notes that, since fictional discourse need not reflect prevailing systems of meaning and norms or values, readers gain detachment from their own presuppositions; by constituting and formulating text-sense, readers are constituting and formulating their own cognition and becoming aware of the operations for doing so. (FL)

  18. Documents and legal texts

    International Nuclear Information System (INIS)

    2017-01-01

    This section treats of the following documents and legal texts: 1 - Belgium 29 June 2014 - Act amending the Act of 22 July 1985 on Third-Party Liability in the Field of Nuclear Energy; 2 - Belgium, 7 December 2016. - Act amending the Act of 22 July 1985 on Third-Party Liability in the Field of Nuclear Energy

  19. SparkText: Biomedical Text Mining on Big Data Framework

    Science.gov (United States)

    He, Karen Y.; Wang, Kai

    2016-01-01

    Background Many new biomedical research articles are published every day, accumulating rich information, such as genetic variants, genes, diseases, and treatments. Rapid yet accurate text mining on large-scale scientific literature can discover novel knowledge to better understand human diseases and to improve the quality of disease diagnosis, prevention, and treatment. Results In this study, we designed and developed an efficient text mining framework called SparkText on a Big Data infrastructure, which is composed of Apache Spark data streaming and machine learning methods, combined with a Cassandra NoSQL database. To demonstrate its performance for classifying cancer types, we extracted information (e.g., breast, prostate, and lung cancers) from tens of thousands of articles downloaded from PubMed, and then employed Naïve Bayes, Support Vector Machine (SVM), and Logistic Regression to build prediction models to mine the articles. The accuracy of predicting a cancer type by SVM using the 29,437 full-text articles was 93.81%. While competing text-mining tools took more than 11 hours, SparkText mined the dataset in approximately 6 minutes. Conclusions This study demonstrates the potential for mining large-scale scientific articles on a Big Data infrastructure, with real-time update from new articles published daily. SparkText can be extended to other areas of biomedical research. PMID:27685652

  20. SparkText: Biomedical Text Mining on Big Data Framework.

    Science.gov (United States)

    Ye, Zhan; Tafti, Ahmad P; He, Karen Y; Wang, Kai; He, Max M

    Many new biomedical research articles are published every day, accumulating rich information, such as genetic variants, genes, diseases, and treatments. Rapid yet accurate text mining on large-scale scientific literature can discover novel knowledge to better understand human diseases and to improve the quality of disease diagnosis, prevention, and treatment. In this study, we designed and developed an efficient text mining framework called SparkText on a Big Data infrastructure, which is composed of Apache Spark data streaming and machine learning methods, combined with a Cassandra NoSQL database. To demonstrate its performance for classifying cancer types, we extracted information (e.g., breast, prostate, and lung cancers) from tens of thousands of articles downloaded from PubMed, and then employed Naïve Bayes, Support Vector Machine (SVM), and Logistic Regression to build prediction models to mine the articles. The accuracy of predicting a cancer type by SVM using the 29,437 full-text articles was 93.81%. While competing text-mining tools took more than 11 hours, SparkText mined the dataset in approximately 6 minutes. This study demonstrates the potential for mining large-scale scientific articles on a Big Data infrastructure, with real-time update from new articles published daily. SparkText can be extended to other areas of biomedical research.

  1. Strategies for Translating Vocative Texts

    Directory of Open Access Journals (Sweden)

    Olga COJOCARU

    2014-12-01

    Full Text Available The paper deals with the linguistic and cultural elements of vocative texts and the techniques used in translating them by giving some examples of texts that are typically vocative (i.e. advertisements and instructions for use. Semantic and communicative strategies are popular in translation studies and each of them has its own advantages and disadvantages in translating vocative texts. The advantage of semantic translation is that it takes more account of the aesthetic value of the SL text, while communicative translation attempts to render the exact contextual meaning of the original text in such a way that both content and language are readily acceptable and comprehensible to the readership. Focus is laid on the strategies used in translating vocative texts, strategies that highlight and introduce a cultural context to the target audience, in order to achieve their overall purpose, that is to sell or persuade the reader to behave in a certain way. Thus, in order to do that, a number of advertisements from the field of cosmetics industry and electronic gadgets were selected for analysis. The aim is to gather insights into vocative text translation and to create new perspectives on this field of research, now considered a process of innovation and diversion, especially in areas as important as economy and marketing.

  2. Strategy as Texts

    DEFF Research Database (Denmark)

    Obed Madsen, Søren

    of the strategy into four categories. Second, the managers produce new texts based on the original strategy document by using four different ways of translation models. The study’s findings contribute to three areas. Firstly, it shows that translation is more than a sociological process. It is also...... a craftsmanship that requires knowledge and skills, which unfortunately seems to be overlooked in both the literature and in practice. Secondly, it shows that even though a strategy text is in singular, the translation makes strategy plural. Thirdly, the article proposes a way to open up the black box of what......This article shows empirically how managers translate a strategy plan at an individual level. By analysing how managers in three organizations translate strategies, it identifies that the translation happens in two steps: First, the managers decipher the strategy by coding the different parts...

  3. THE STUDENTS’ PERCEPTIONS OF AUTHENTIC TEXTS-BASED TRANSLATION

    Directory of Open Access Journals (Sweden)

    Rusiana .

    2017-12-01

    Full Text Available Translation requires lots of practice. As it is generally known, authentic texts provide fruitful experience for students to translate either Indonesian-English or vice versa. Authentic texts give many real uses of language in varied meaningful contexts The texts used were advertisement, abstract, local stories, tourist attraction, community service and project for money. This research is aimed at investigating whether the use of authentic texts benefits the students and describing the students’ perceptions toward the use of authentic texts in Translation class. It is a qualitative research. Questionnaires were used to obtain the students’ perceptions on the use of authentic texts in translation. The findings show that authentic texts-based translation benefits students in experiencing better translation. Advertisement was considered to be the most relevant text. On the contrary, they find it difficult to cope with authentic texts particularly dealing with words/terms/vocabulary, meanings, culture, and grammar. The recommendations are that the students have to be exposed to many authentic texts of varied topics in both English and Indonesian in order that they understand both the SL and TL well. For further researchers, it would be possible to research on the influence of authentic texts based translation on the students’ translation skill.

  4. Determination of the relevant market as a criterion of assessment of concentration effects in the practice of antitrust authorities

    Directory of Open Access Journals (Sweden)

    Daria Kostecka-Jurczyk

    2012-12-01

    Full Text Available Determining the relevant market is the first and the most important step in antimonopoly proceedings due to the fact that the market position of an enterprise is always determined from the relevant market perspective. In respect of mergers, establishing the relevant market is essential in analysing whether the aim of the concentration is to distort and limit competition on the market. The author of the article focuses on describing methods for determining the relevant market applied by antimonopoly authorities.

  5. The Relevant Physical Trace in Criminal Investigation

    Directory of Open Access Journals (Sweden)

    Durdica Hazard

    2016-01-01

    Full Text Available A criminal investigation requires the forensic scientist to search and to interpret vestiges of a criminal act that happened in the past. The forensic scientist is one of the many stakeholders who take part in the information quest within the criminal justice system. She reads the investigation scene in search of physical traces that should enable her to tell the story of the offense/crime that allegedly occurred. The challenge for any investigator is to detect and recognize relevant physical traces in order to provide clues for investigation and intelligence purposes, and that will constitute sound and relevant evidence for the court. This article shows how important it is to consider the relevancy of physical traces from the beginning of the investigation and what might influence the evaluation process. The exchange and management of information between the investigation stakeholders are important. Relevancy is a dimension that needs to be understood from the standpoints of law enforcement personnel and forensic scientists with the aim of strengthening investigation and ultimately the overall judicial process.

  6. The Relevance of Causal Social Construction

    Directory of Open Access Journals (Sweden)

    Marques Teresa

    2017-02-01

    Full Text Available Social constructionist claims are surprising and interesting when they entail that presumably natural kinds are in fact socially constructed. The claims are interesting because of their theoretical and political importance. Authors like Díaz-León argue that constitutive social construction is more relevant for achieving social justice than causal social construction. This paper challenges this claim. Assuming there are socially salient groups that are discriminated against, the paper presents a dilemma: if there were no constitutively constructed social kinds, the causes of the discrimination of existing social groups would have to be addressed, and understanding causal social construction would be relevant to achieve social justice. On the other hand, not all possible constitutively socially constructed kinds are actual social kinds. If an existing social group is constitutively constructed as a social kind K, the fact that it actually exists as a K has social causes. Again, causal social construction is relevant. The paper argues that (i for any actual social kind X, if X is constitutively socially constructed as K, then it is also causally socially constructed; and (ii causal social construction is at least as relevant as constitutive social construction for concerns of social justice. For illustration, I draw upon two phenomena that are presumed to contribute towards the discrimination of women: (i the poor performance effects of stereotype threat, and (ii the silencing effects of gendered language use.

  7. Classifying Written Texts Through Rhythmic Features

    NARCIS (Netherlands)

    Balint, Mihaela; Dascalu, Mihai; Trausan-Matu, Stefan

    2016-01-01

    Rhythm analysis of written texts focuses on literary analysis and it mainly considers poetry. In this paper we investigate the relevance of rhythmic features for categorizing texts in prosaic form pertaining to different genres. Our contribution is threefold. First, we define a set of rhythmic

  8. Text Structure and Retention of Prose.

    Science.gov (United States)

    Zimmer, John W.

    1985-01-01

    The effects of text structure were studied using two kinds of reading materials: a standard text with headings and illustrations, as well as a nonstructured manuscript. The manuscript readers scored higher on delayed tests, generated more relevant ideas, and wrote better essays both immediately and after a delay. (Author/GDC)

  9. English Metafunction Analysis in Chemistry Text: Characterization of Scientific Text

    Directory of Open Access Journals (Sweden)

    Ahmad Amin Dalimunte, M.Hum

    2013-09-01

    Full Text Available The objectives of this research are to identify what Metafunctions are applied in chemistry text and how they characterize a scientific text. It was conducted by applying content analysis. The data for this research was a twelve-paragraph chemistry text. The data were collected by applying a documentary technique. The document was read and analyzed to find out the Metafunction. The data were analyzed by some procedures: identifying the types of process, counting up the number of the processes, categorizing and counting up the cohesion devices, classifying the types of modulation and determining modality value, finally counting up the number of sentences and clauses, then scoring the grammatical intricacy index. The findings of the research show that Material process (71of 100 is mostly used, circumstance of spatial location (26 of 56 is more dominant than the others. Modality (5 is less used in order to avoid from subjectivity. Impersonality is implied through less use of reference either pronouns (7 or demonstrative (7, conjunctions (60 are applied to develop ideas, and the total number of the clauses are found much more dominant (109 than the total number of the sentences (40 which results high grammatical intricacy index. The Metafunction found indicate that the chemistry text has fulfilled the characteristics of scientific or academic text which truly reflects it as a natural science.

  10. Reading Authentic Texts

    DEFF Research Database (Denmark)

    Balling, Laura Winther

    2013-01-01

    Most research on cognates has focused on words presented in isolation that are easily defined as cognate between L1 and L2. In contrast, this study investigates what counts as cognate in authentic texts and how such cognates are read. Participants with L1 Danish read news articles in their highly...... proficient L2, English, while their eye-movements were monitored. The experiment shows a cognate advantage for morphologically simple words, but only when cognateness is defined relative to translation equivalents that are appropriate in the context. For morphologically complex words, a cognate disadvantage...... word predictability indexed by the conditional probability of each word....

  11. Documents and legal texts

    International Nuclear Information System (INIS)

    2016-01-01

    This section treats of the following documents and legal texts: 1 - Brazil: Law No. 13,260 of 16 March 2016 (To regulate the provisions of item XLIII of Article 5 of the Federal Constitution on terrorism, dealing with investigative and procedural provisions and redefining the concept of a terrorist organisation; and amends Laws No. 7,960 of 21 December 1989 and No. 12,850 of 2 August 2013); 2 - India: The Atomic Energy (Amendment) Act, 2015; Department Of Atomic Energy Notification (Civil Liability for Nuclear Damage); 3 - Japan: Act on Subsidisation, etc. for Nuclear Damage Compensation Funds following the implementation of the Convention on Supplementary Compensation for Nuclear Damage

  12. Journalistic Text Production

    DEFF Research Database (Denmark)

    Haugaard, Rikke Hartmann

    , a multiple case study investigated three professional text producers’ practices as they unfolded in their natural setting at the Spanish newspaper, El Mundo. • Results indicate that journalists’ revisions are related to form markedly more often than to content. • Results suggest two writing phases serving...... at the Spanish newspaper, El Mundo, in Madrid. The study applied a combination of quantitative and qualitative methods, i.e. keystroke logging, participant observation and retrospective interview. Results indicate that journalists’ revisions are related to form markedly more often than to content (approx. three...

  13. Utah Text Retrieval Project

    Energy Technology Data Exchange (ETDEWEB)

    Hollaar, L A

    1983-10-01

    The Utah Text Retrieval project seeks well-engineered solutions to the implementation of large, inexpensive, rapid text information retrieval systems. The project has three major components. Perhaps the best known is the work on the specialized processors, particularly search engines, necessary to achieve the desired performance and cost. The other two concern the user interface to the system and the system's internal structure. The work on user interface development is not only concentrating on the syntax and semantics of the query language, but also on the overall environment the system presents to the user. Environmental enhancements include convenient ways to browse through retrieved documents, access to other information retrieval systems through gateways supporting a common command interface, and interfaces to word processing systems. The system's internal structure is based on a high-level data communications protocol linking the user interface, index processor, search processor, and other system modules. This allows them to be easily distributed in a multi- or specialized-processor configuration. It also allows new modules, such as a knowledge-based query reformulator, to be added. 15 references.

  14. Identifying issue frames in text.

    Directory of Open Access Journals (Sweden)

    Eyal Sagi

    Full Text Available Framing, the effect of context on cognitive processes, is a prominent topic of research in psychology and public opinion research. Research on framing has traditionally relied on controlled experiments and manually annotated document collections. In this paper we present a method that allows for quantifying the relative strengths of competing linguistic frames based on corpus analysis. This method requires little human intervention and can therefore be efficiently applied to large bodies of text. We demonstrate its effectiveness by tracking changes in the framing of terror over time and comparing the framing of abortion by Democrats and Republicans in the U.S.

  15. Documents and legal texts

    International Nuclear Information System (INIS)

    2013-01-01

    This section reprints a selection of recently published legislative texts and documents: - Russian Federation: Federal Law No.170 of 21 November 1995 on the use of atomic energy, Adopted by the State Duma on 20 October 1995; - Uruguay: Law No.19.056 On the Radiological Protection and Safety of Persons, Property and the Environment (4 January 2013); - Japan: Third Supplement to Interim Guidelines on Determination of the Scope of Nuclear Damage resulting from the Accident at the Tokyo Electric Power Company Fukushima Daiichi and Daini Nuclear Power Plants (concerning Damages related to Rumour-Related Damage in the Agriculture, Forestry, Fishery and Food Industries), 30 January 2013; - France and the United States: Joint Statement on Liability for Nuclear Damage (Aug 2013); - Franco-Russian Nuclear Power Declaration (1 November 2013)

  16. Documents and legal texts

    International Nuclear Information System (INIS)

    2015-01-01

    This section treats of the following Documents and legal texts: 1 - Canada: Nuclear Liability and Compensation Act (An Act respecting civil liability and compensation for damage in case of a nuclear incident, repealing the Nuclear Liability Act and making consequential amendments to other acts); 2 - Japan: Act on Compensation for Nuclear Damage (The purpose of this act is to protect persons suffering from nuclear damage and to contribute to the sound development of the nuclear industry by establishing a basic system regarding compensation in case of nuclear damage caused by reactor operation etc.); Act on Indemnity Agreements for Compensation of Nuclear Damage; 3 - Slovak Republic: Act on Civil Liability for Nuclear Damage and on its Financial Coverage and on Changes and Amendments to Certain Laws (This Act regulates: a) The civil liability for nuclear damage incurred in the causation of a nuclear incident, b) The scope of powers of the Nuclear Regulatory Authority (hereinafter only as the 'Authority') in relation to the application of this Act, c) The competence of the National Bank of Slovakia in relation to the supervised financial market entities in the financial coverage of liability for nuclear damage; and d) The penalties for violation of this Act)

  17. Documents and legal texts

    International Nuclear Information System (INIS)

    2014-01-01

    This section of the Bulletin presents the recently published documents and legal texts sorted by country: - Brazil: Resolution No. 169 of 30 April 2014. - Japan: Act Concerning Exceptions to Interruption of Prescription Pertaining to Use of Settlement Mediation Procedures by the Dispute Reconciliation Committee for Nuclear Damage Compensation in relation to Nuclear Damage Compensation Disputes Pertaining to the Great East Japan Earthquake (Act No. 32 of 5 June 2013); Act Concerning Measures to Achieve Prompt and Assured Compensation for Nuclear Damage Arising from the Nuclear Plant Accident following the Great East Japan Earthquake and Exceptions to the Extinctive Prescription, etc. of the Right to Claim Compensation for Nuclear Damage (Act No. 97 of 11 December 2013); Fourth Supplement to Interim Guidelines on Determination of the Scope of Nuclear Damage Resulting from the Accident at the Tokyo Electric Power Company Fukushima Daiichi and Daini Nuclear Power Plants (Concerning Damages Associated with the Prolongation of Evacuation Orders, etc.); Outline of 'Fourth Supplement to Interim Guidelines (Concerning Damages Associated with the Prolongation of Evacuation Orders, etc.)'. - OECD Nuclear Energy Agency: Decision and Recommendation of the Steering Committee Concerning the Application of the Paris Convention to Nuclear Installations in the Process of Being Decommissioned; Joint Declaration on the Security of Supply of Medical Radioisotopes. - United Arab Emirates: Federal Decree No. (51) of 2014 Ratifying the Convention on Supplementary Compensation for Nuclear Damage; Ratification of the Federal Supreme Council of Federal Decree No. (51) of 2014 Ratifying the Convention on Supplementary Compensation for Nuclear Damage

  18. Enhancing biomedical text summarization using semantic relation extraction.

    Directory of Open Access Journals (Sweden)

    Yue Shang

    Full Text Available Automatic text summarization for a biomedical concept can help researchers to get the key points of a certain topic from large amount of biomedical literature efficiently. In this paper, we present a method for generating text summary for a given biomedical concept, e.g., H1N1 disease, from multiple documents based on semantic relation extraction. Our approach includes three stages: 1 We extract semantic relations in each sentence using the semantic knowledge representation tool SemRep. 2 We develop a relation-level retrieval method to select the relations most relevant to each query concept and visualize them in a graphic representation. 3 For relations in the relevant set, we extract informative sentences that can interpret them from the document collection to generate text summary using an information retrieval based method. Our major focus in this work is to investigate the contribution of semantic relation extraction to the task of biomedical text summarization. The experimental results on summarization for a set of diseases show that the introduction of semantic knowledge improves the performance and our results are better than the MEAD system, a well-known tool for text summarization.

  19. Arabic Text Categorization Using Improved k-Nearest neighbour Algorithm

    Directory of Open Access Journals (Sweden)

    Wail Hamood KHALED

    2014-10-01

    Full Text Available The quantity of text information published in Arabic language on the net requires the implementation of effective techniques for the extraction and classifying of relevant information contained in large corpus of texts. In this paper we presented an implementation of an enhanced k-NN Arabic text classifier. We apply the traditional k-NN and Naive Bayes from Weka Toolkit for comparison purpose. Our proposed modified k-NN algorithm features an improved decision rule to skip the classes that are less similar and identify the right class from k nearest neighbours which increases the accuracy. The study evaluates the improved decision rule technique using the standard of recall, precision and f-measure as the basis of comparison. We concluded that the effectiveness of the proposed classifier is promising and outperforms the classical k-NN classifier.

  20. Value Relevance of Accounting Information in the United Arab Emirates

    Directory of Open Access Journals (Sweden)

    Jamal Barzegari Khanagha

    2011-01-01

    Full Text Available This paper examines the value relevance of accounting information in per and post-periods of International Financial Reporting Standards implementation using the regression and portfolio approaches for sample of the UAE companies. The results obtained from a combination of regression and portfolio approaches, show accounting information is value relevant in UAE stock market. A comparison of the results for the periods before and after adoption, based on both regression and portfolio approaches, shows a decline in value relevance of accounting information after the reform in accounting standards. It could be interpreted to mean that following to IFRS in UAE didn’t improve value relevancy of accounting information. However, results based on and portfolio approach shows that cash flows’ incremental information content increased for the post-IFRS period.

  1. Assessing the scientific relevance of a single publication over time

    Directory of Open Access Journals (Sweden)

    Philipp A. Bloching

    2013-09-01

    Full Text Available Quantitatively assessing the scientific relevance of a research paper is challenging for two reasons. Firstly, scientific relevance may change over time, and secondly, it is unclear how to evaluate a recently published paper. The temporally averaged paper-specific impact factor is defined as the yearly average of citations to the paper until now including bonus citations equal to the journal impact factor in the publication year. This new measure subsequently allows relevance rankings and annual updates of all (i.e. both recent and older scientific papers of a department, or even a whole scientific field, on a more objective basis. It can also be used to assess both the average and overall time-dependent scientific relevance of researchers in a specific department or scientific field.

  2. Comprehending text in literature class

    Directory of Open Access Journals (Sweden)

    Purić Daliborka S.

    2016-01-01

    Full Text Available The paper discusses the problem of understanding a text and the contribution of methodological apparatus in the reader book to comprehension of a text being read in junior classes of elementary school. By using the technique of content analysis from methodological apparatuses in eight reader books for the fourth grade of elementary school, approved for usage in 2014/2015 academic year, and surveying 350 teachers in 33 elementary schools and 11 administrative districts in the Republic of Serbia we examined: (a to what extent the Serbian language text book contents enable junior students to understand a literary text; (b to what extent teachers accept the suggestions offered in the textbook for preparing literature teaching. The results show that a large number of suggestions relate to reading comprehension, but some of categories of understanding are unevenly distributed in the methodological apparatus. On the other hand, the majority of teachers use the methodological apparatus given in a textbook for preparing classes, not only the textbook he or she selected for teaching but also other textbooks for the same grade.

  3. Heterodox Autonomy Doctrine: realism and purposes, and its relevance

    Directory of Open Access Journals (Sweden)

    Raúl Bernal-Meza

    2013-12-01

    Full Text Available The Autonomy Doctrine, elaborated by Juan Carlos Puig, is a realist point of view of International Relations. It is an analysis, from the periphery, about the structure of world power, and a roadmap (from a theoretical point of view for the longing process of autonomization-regarding hegemonic power-for a country whose ruling class would decide to overcome dependency. The elements its author took into account when analyzing its own context are explained in this text and, afterwards, are reflected over its relevance nowadays. For that purpose, it is necessary to answer certain questions, such as which are the concepts and categories that may explain its relevance, its applicability to regional integration and cooperation models and projects, and what would be the analytical method to compare reality versus ideas, among others. The methodological proposal to analyze the relevance of Puig's doctrine is to compare it to different visions of regionalism that are currently in effect in Latin America.

  4. Are figure legends sufficient? Evaluating the contribution of associated text to biomedical figure comprehension.

    Science.gov (United States)

    Yu, Hong; Agarwal, Shashank; Johnston, Mark; Cohen, Aaron

    2009-01-06

    Biomedical scientists need to access figures to validate research facts and to formulate or to test novel research hypotheses. However, figures are difficult to comprehend without associated text (e.g., figure legend and other reference text). We are developing automated systems to extract the relevant explanatory information along with figures extracted from full text articles. Such systems could be very useful in improving figure retrieval and in reducing the workload of biomedical scientists, who otherwise have to retrieve and read the entire full-text journal article to determine which figures are relevant to their research. As a crucial step, we studied the importance of associated text in biomedical figure comprehension. Twenty subjects evaluated three figure-text combinations: figure+legend, figure+legend+title+abstract, and figure+full-text. Using a Likert scale, each subject scored each figure+text according to the extent to which the subject thought he/she understood the meaning of the figure and the confidence in providing the assigned score. Additionally, each subject entered a free text summary for each figure-text. We identified missing information using indicator words present within the text summaries. Both the Likert scores and the missing information were statistically analyzed for differences among the figure-text types. We also evaluated the quality of text summaries with the text-summarization evaluation method the ROUGE score. Our results showed statistically significant differences in figure comprehension when varying levels of text were provided. When the full-text article is not available, presenting just the figure+legend left biomedical researchers lacking 39-68% of the information about a figure as compared to having complete figure comprehension; adding the title and abstract improved the situation, but still left biomedical researchers missing 30% of the information. When the full-text article is available, figure comprehension

  5. Full closure strategic analysis.

    Science.gov (United States)

    2014-07-01

    The full closure strategic analysis was conducted to create a decision process whereby full roadway : closures for construction and maintenance activities can be evaluated and approved or denied by CDOT : Traffic personnel. The study reviewed current...

  6. The value relevance of environmental emissions

    Directory of Open Access Journals (Sweden)

    Melinda Lydia Nelwan

    2016-07-01

    Full Text Available This study examines whether environmental performance has value relevance by investigating the relations between environmental emissions and stock prices for the U.S. public companies. The previous studies argued that the conjectured relations between accounting performance measures and environmental performance do not have a strong theoretical basis, and the modeling of relations between market per-formance measures and environmental performance do not adequately consider the relevance of accounting performance to market value. Therefore, this study examines whether publicly reported environmental emissions provide incremental information to accounting earnings in pricing companies stocks. It is done among the complete set of industries covered by Toxics Release Inventory (TRI reporting for the period 2007 to 2010. Using Ohlson model but modified to include different types of emis-sions, it is found that ground emissions (underground injection and land emissions are value relevant but other emission types (air and water and transferred-out emis-sions appear to not provide incremental information in the valuation model. The result in this study raise concerns that different types of emissions are assessed differently by the market, confirming that studies should not aggregate such measures.

  7. EOG feature relevance determination for microsleep detection

    Directory of Open Access Journals (Sweden)

    Golz Martin

    2017-09-01

    Full Text Available Automatic relevance determination (ARD was applied to two-channel EOG recordings for microsleep event (MSE recognition. 10 s immediately before MSE and also before counterexamples of fatigued, but attentive driving were analysed. Two type of signal features were extracted: the maximum cross correlation (MaxCC and logarithmic power spectral densities (PSD averaged in spectral bands of 0.5 Hz width ranging between 0 and 8 Hz. Generalised learn-ing vector quantisation (GRLVQ was used as ARD method to show the potential of feature reduction. This is compared to support-vector machines (SVM, in which the feature reduction plays a much smaller role. Cross validation yielded mean normalised relevancies of PSD features in the range of 1.6 – 4.9 % and 1.9 – 10.4 % for horizontal and vertical EOG, respectively. MaxCC relevancies were 0.002 – 0.006 % and 0.002 – 0.06 %, respectively. This shows that PSD features of vertical EOG are indispensable, whereas MaxCC can be neglected. Mean classification accuracies were estimated at 86.6±b 1.3 % and 92.3±b 0.2 % for GRLVQ and SVM, respectively. GRLVQ permits objective feature reduction by inclusion of all processing stages, but is not as accurate as SVM.

  8. EOG feature relevance determination for microsleep detection

    Directory of Open Access Journals (Sweden)

    Golz Martin

    2017-09-01

    Full Text Available Automatic relevance determination (ARD was applied to two-channel EOG recordings for microsleep event (MSE recognition. 10 s immediately before MSE and also before counterexamples of fatigued, but attentive driving were analysed. Two type of signal features were extracted: the maximum cross correlation (MaxCC and logarithmic power spectral densities (PSD averaged in spectral bands of 0.5 Hz width ranging between 0 and 8 Hz. Generalised learn-ing vector quantisation (GRLVQ was used as ARD method to show the potential of feature reduction. This is compared to support-vector machines (SVM, in which the feature reduction plays a much smaller role. Cross validation yielded mean normalised relevancies of PSD features in the range of 1.6 - 4.9 % and 1.9 - 10.4 % for horizontal and vertical EOG, respectively. MaxCC relevancies were 0.002 - 0.006 % and 0.002 - 0.06 %, respectively. This shows that PSD features of vertical EOG are indispensable, whereas MaxCC can be neglected. Mean classification accuracies were estimated at 86.6±b 1.3 % and 92.3±b 0.2 % for GRLVQ and SVM, respec-tively. GRLVQ permits objective feature reduction by inclu-sion of all processing stages, but is not as accurate as SVM.

  9. Valerian: No Evidence for Clinically Relevant Interactions

    Directory of Open Access Journals (Sweden)

    Olaf Kelber

    2014-01-01

    Full Text Available In recent popular publications as well as in widely used information websites directed to cancer patients, valerian is claimed to have a potential of adverse interactions with anticancer drugs. This questions its use as a safe replacement for, for example, benzodiazepines. A review on the interaction potential of preparations from valerian root (Valeriana officinalis L. root was therefore conducted. A data base search and search in a clinical drug interaction data base were conducted. Thereafter, a systematic assessment of publications was performed. Seven in vitro studies on six CYP 450 isoenzymes, on p-glycoprotein, and on two UGT isoenzymes were identified. However, the methodological assessment of these studies did not support their suitability for the prediction of clinically relevant interactions. In addition, clinical studies on various valerian preparations did not reveal any relevant interaction potential concerning CYP 1A2, 2D6, 2E1, and 3A4. Available animal and human pharmacodynamic studies did not verify any interaction potential. The interaction potential of valerian preparations therefore seems to be low and thereby without clinical relevance. We conclude that there is no specific evidence questioning their safety, also in cancer patients.

  10. Making Deferred Taxes Relevant

    NARCIS (Netherlands)

    Brouwer, Arjan; Naarding, Ewout

    2018-01-01

    We analyse the conceptual problems in current accounting for deferred taxes and provide solutions derived from the literature in order to make International Financial Reporting Standards (IFRS) deferred tax numbers value-relevant. In our view, the empirical results concerning the value relevance of

  11. Parsimonious relevance models

    NARCIS (Netherlands)

    Meij, E.; Weerkamp, W.; Balog, K.; de Rijke, M.; Myang, S.-H.; Oard, D.W.; Sebastiani, F.; Chua, T.-S.; Leong, M.-K.

    2008-01-01

    We describe a method for applying parsimonious language models to re-estimate the term probabilities assigned by relevance models. We apply our method to six topic sets from test collections in five different genres. Our parsimonious relevance models (i) improve retrieval effectiveness in terms of

  12. A Guide Text or Many Texts? "That is the Question”

    Directory of Open Access Journals (Sweden)

    Delgado de Valencia Sonia

    2001-08-01

    Full Text Available The use of supplementary materials in the classroom has always been an essential part of the teaching and learning process. To restrict our teaching to the scope of one single textbook means to stand behind the advances of knowledge, in any area and context. Young learners appreciate any new and varied support that expands their knowledge of the world: diaries, letters, panels, free texts, magazines, short stories, poems or literary excerpts, and articles taken from Internet are materials that will allow learnersto share more and work more collaboratively. In this article we are going to deal with some of these materials, with the criteria to select, adapt, and create them that may be of interest to the learner and that may promote reading and writing processes. Since no text can entirely satisfy the needs of students and teachers, the creativity of both parties will be necessary to improve the quality of teaching through the adequate use and adaptation of supplementary materials.

  13. VALUE RELEVANCE DAN IFRS ADOPTION DI INDONESIA: INVESTIGASI PADA PERUSAHAAN LQ-45 BURSA EFEK INDONESIA

    Directory of Open Access Journals (Sweden)

    Triandi ,

    2017-02-01

    Full Text Available Value relevance is being defined as the ability of information disclosed by financial statements to capture and summarize firm value. earnings per share (EPS and book value of shares (BVS and stock market price (SMP, both before and after IFRS adoption. Based on theresult,test there the valuerelevancebefore and after IFRS adoptio. The value relevance after IFRS adoption decreased. These findings differ from the findings in several countries have adopted IFRS. In many countries have adopted IFRS tends to increase the value relevance. Keyword : value relevance, earning per share, book value of equity, stock price, IFRS adoption, LQ-45

  14. Text

    International Nuclear Information System (INIS)

    Anon.

    2009-01-01

    The purpose of this act is to safeguard against the dangers and harmful effects of radioactive waste and to contribute to public safety and environmental protection by laying down requirements for the safe and efficient management of radioactive waste. We will find definitions, interrelation with other legislation, responsibilities of the state and local governments, responsibilities of radioactive waste management companies and generators, formulation of the basic plan for the control of radioactive waste, radioactive waste management ( with public information, financing and part of spent fuel management), Korea radioactive waste management corporation ( business activities, budget), establishment of a radioactive waste fund in order to secure the financial resources required for radioactive waste management, and penalties in case of improper operation of radioactive waste management. (N.C.)

  15. Categorising Resources of Historical Memory in Researching Publicistic Text

    Directory of Open Access Journals (Sweden)

    Konyk Anastasiya

    2016-12-01

    Full Text Available The article focuses on the allocation and analysis of the main resources of historical memory which are considered as peculiar indicators for studying publicist content and conceptual reading of discourses of historical memory in contemporary publications. It is relevant insofar as researching the use and intensification of these or other resources of historical memory allows us to observe changes in world landmarks, socio-political moods, ideological references and temperament and the dynamics of re-interpretation of historical facts and events by authors.

  16. Full page insight

    DEFF Research Database (Denmark)

    Cortsen, Rikke Platz

    2014-01-01

    Alan Moore and his collaborating artists often manipulate time and space by drawing upon the formal elements of comics and making alternative constellations. This article looks at an element that is used frequently in comics of all kinds – the full page – and discusses how it helps shape spatio......, something that it shares with the full page in comics. Through an analysis of several full pages from Moore titles like Swamp Thing, From Hell, Watchmen and Promethea, it is made clear why the full page provides an apt vehicle for an apocalypse in comics....

  17. Teaching Text Structure: Examining the Affordances of Children's Informational Texts

    Science.gov (United States)

    Jones, Cindy D.; Clark, Sarah K.; Reutzel, D. Ray

    2016-01-01

    This study investigated the affordances of informational texts to serve as model texts for teaching text structure to elementary school children. Content analysis of a random sampling of children's informational texts from top publishers was conducted on text structure organization and on the inclusion of text features as signals of text…

  18. Culturally Relevant Cyberbullying Prevention

    OpenAIRE

    Phillips, Gregory John

    2017-01-01

    In this action research study, I, along with a student intervention committee of 14 members, developed a cyberbullying intervention for a large urban high school on the west coast. This high school contained a predominantly African American student population. I aimed to discover culturally relevant cyberbullying prevention strategies for African American students. The intervention committee selected video safety messages featuring African American actors as the most culturally relevant cyber...

  19. A quick survey of text categorization algorithms

    Directory of Open Access Journals (Sweden)

    Dan MUNTEANU

    2007-12-01

    Full Text Available This paper contains an overview of basic formulations and approaches to text classification. This paper surveys the algorithms used in text categorization: handcrafted rules, decision trees, decision rules, on-line learning, linear classifier, Rocchio’s algorithm, k Nearest Neighbor (kNN, Support Vector Machines (SVM.

  20. Full Service Leasing

    OpenAIRE

    Richter, Ján

    2009-01-01

    Aim of this master thesis is to describe the service of Full Service Leasing, as a modern form of financing and management of assets, primarily automobile fleet. Description of full service leasing is designed as a comprehensive and complete guide to support reader's position when deciding to finance and manage a fleet by this service. Whether the reader is an entrepreneur, CFO, fleet manager, new employee of leasing company, or anyone who is interested in this service, this master thesis wil...

  1. Text mining improves prediction of protein functional sites.

    Directory of Open Access Journals (Sweden)

    Karin M Verspoor

    Full Text Available We present an approach that integrates protein structure analysis and text mining for protein functional site prediction, called LEAP-FS (Literature Enhanced Automated Prediction of Functional Sites. The structure analysis was carried out using Dynamics Perturbation Analysis (DPA, which predicts functional sites at control points where interactions greatly perturb protein vibrations. The text mining extracts mentions of residues in the literature, and predicts that residues mentioned are functionally important. We assessed the significance of each of these methods by analyzing their performance in finding known functional sites (specifically, small-molecule binding sites and catalytic sites in about 100,000 publicly available protein structures. The DPA predictions recapitulated many of the functional site annotations and preferentially recovered binding sites annotated as biologically relevant vs. those annotated as potentially spurious. The text-based predictions were also substantially supported by the functional site annotations: compared to other residues, residues mentioned in text were roughly six times more likely to be found in a functional site. The overlap of predictions with annotations improved when the text-based and structure-based methods agreed. Our analysis also yielded new high-quality predictions of many functional site residues that were not catalogued in the curated data sources we inspected. We conclude that both DPA and text mining independently provide valuable high-throughput protein functional site predictions, and that integrating the two methods using LEAP-FS further improves the quality of these predictions.

  2. Important Text Characteristics for Early-Grades Text Complexity

    Science.gov (United States)

    Fitzgerald, Jill; Elmore, Jeff; Koons, Heather; Hiebert, Elfrieda H.; Bowen, Kimberly; Sanford-Moore, Eleanor E.; Stenner, A. Jackson

    2015-01-01

    The Common Core set a standard for all children to read increasingly complex texts throughout schooling. The purpose of the present study was to explore text characteristics specifically in relation to early-grades text complexity. Three hundred fifty primary-grades texts were selected and digitized. Twenty-two text characteristics were identified…

  3. Causal and Epistemic Relevance in Appeals to Authority

    Directory of Open Access Journals (Sweden)

    Sebastiano Lommi

    2015-05-01

    Full Text Available Appeals to authority have a long tradition in the history of argumentation theory. During the Middle Age they were considered legitimate and sound arguments, but after Locke’s treatment in the Essay Concerning Human Understanding their legitimacy has come under question. Traditionally, arguments from authority were considered informal arguments, but since the important work of Charles Hamblin (Hamblin, 1970 many attempts to provide a form for them have been done. The most convincing of them is the presumptive form developed by Douglas Walton and John Woods (Woods, Walton, 1974 that aims at taking into account the relevant contextual aspects in assessing the provisional validity of an appeal to authority. The soundness of an appeal depends on its meeting the adequacy conditions set to scrutinize all the relevant questions. I want to claim that this approach is compatible with the analysis of arguments in terms of relevance advanced by David Hitchcock (Hitchcock, 1992. He claims that relevance is a triadic relation between two items and a context. The first item is relevant to the second one in a given context. Different types of relevance relation exist, namely causal relevance and epistemic relevance. “Something is [causally] relevant to an outcome in a given situation if it helps to cause that outcome in the situation” (Hitchcock, 1992, p. 253, whereas it is epistemically relevant when it helps to achieve an epistemic goal in a given situation. I claim that we can adapt this conception to Walton and Krabbe’s theory of dialogue type (Walton, Krabbe, 1995, seeing the items of a relevance relation as the argument and its consequence and the context as the type of dialogue in which these arguments are advanced. According to this perspective, an argument from authority that meets the adequacy conditions has to be considered legitimate because it is an epistemically relevant relation. Therefore, my conclusion is that an analysis of appeals to

  4. The relevance of segments reports – measurement methodology

    Directory of Open Access Journals (Sweden)

    Tomasz Zimnicki

    2017-09-01

    Full Text Available The segment report is one of the areas of financial statements, and it obliges a company to provide infor-mation about the economic situation in each of its activity areas. The article evaluates the change of segment reporting standards from IAS14R to IFRS8 in the context of feature relevance. It presents the construction of a measure which allows the relevance of segment disclosures to be determined. The created measure was used to study periodical reports published by companies listed on the main market of the Warsaw Stock Exchange from three reporting periods – 2008, 2009 and 2013. Based on the re-search results, it was found that the change of segment reporting standards from IAS14R to IFRS8 in the context of relevance was legitimate.

  5. An Evaluation of Relevance of Computing Curricula to Industry Needs

    Directory of Open Access Journals (Sweden)

    Ioana Chan Mow

    2015-02-01

    Full Text Available The research documented in this paper attempted to answer the question of how relevant the content of the Computing courses offered within programs of the Computing Department at the National University of Samoa (NUS were to meet the needs of industry and the workforce. The RINCCII study which was conducted in 2013 to 2014, surveyed 13 institutions and 19 graduates from the Computing programs. Findings from the survey indicated that the current course offerings within the Computing department are relevant to the needs of industry and the workplace. However there are aspects or topics which need inclusion or better coverage. The study also recommended regular surveys to gauge relevance of curricula to needs of industry.

  6. Reach and Relevance of Prison Research

    Directory of Open Access Journals (Sweden)

    Hilde Tubex

    2015-04-01

    Full Text Available In this contribution I reflect on the changes in the penal landscape and how they impact on prison research. I do this from my experiences as a prison researcher in a variety of roles, in both Europe and Australia. The growing dominance of managerialism has impacted on both corrective services and universities, in ways that have changed the relationship between current prison practices and academically oriented research. Therefore, academics have to question how their contemporary prison research can bridge the emerging gap: how they can not only produce research that adheres to the roots of criminology and provides a base for a rational penal policy, but also how they can develop strategies to get recognition of and funding for this broader contextual work which, although it might not produce results that are immediately identifiable, can be of relevance in indirect ways and in the longer term.

  7. The Balinese Unicode Text Processing

    Directory of Open Access Journals (Sweden)

    Imam Habibi

    2009-06-01

    Full Text Available In principal, the computer only recognizes numbers as the representation of a character. Therefore, there are many encoding systems to allocate these numbers although not all characters are covered. In Europe, every single language even needs more than one encoding system. Hence, a new encoding system known as Unicode has been established to overcome this problem. Unicode provides unique id for each different characters which does not depend on platform, program, and language. Unicode standard has been applied in a number of industries, such as Apple, HP, IBM, JustSystem, Microsoft, Oracle, SAP, Sun, Sybase, and Unisys. In addition, language standards and modern information exchanges such as XML, Java, ECMA Script (JavaScript, LDAP, CORBA 3.0, and WML make use of Unicode as an official tool for implementing ISO/IEC 10646. There are four things to do according to Balinese script: the algorithm of transliteration, searching, sorting, and word boundary analysis (spell checking. To verify the truth of algorithm, some applications are made. These applications can run on Linux/Windows OS platform using J2SDK 1.5 and J2ME WTK2 library. The input and output of the algorithm/application are character sequence that is obtained from keyboard punch and external file. This research produces a module or a library which is able to process the Balinese text based on Unicode standard. The output of this research is the ability, skill, and mastering of 1. Unicode standard (21-bit as a substitution to ASCII (7-bit and ISO8859-1 (8-bit as the former default character set in many applications. 2. The Balinese Unicode text processing algorithm. 3. An experience of working with and learning from an international team that consists of the foremost experts in the area: Michael Everson (Ireland, Peter Constable (Microsoft US, I Made Suatjana, and Ida Bagus Adi Sudewa.

  8. Compressive full waveform lidar

    Science.gov (United States)

    Yang, Weiyi; Ke, Jun

    2017-05-01

    To avoid high bandwidth detector, fast speed A/D converter, and large size memory disk, a compressive full waveform LIDAR system, which uses a temporally modulated laser instead of a pulsed laser, is studied in this paper. Full waveform data from NEON (National Ecological Observatory Network) are used. Random binary patterns are used to modulate the source. To achieve 0.15 m ranging resolution, a 100 MSPS A/D converter is assumed to make measurements. SPIRAL algorithm with canonical basis is employed when Poisson noise is considered in the low illuminated condition.

  9. Multimodal Diversity of Postmodernist Fiction Text

    Directory of Open Access Journals (Sweden)

    U. I. Tykha

    2016-12-01

    Full Text Available The article is devoted to the analysis of structural and functional manifestations of multimodal diversity in postmodernist fiction texts. Multimodality is defined as the coexistence of more than one semiotic mode within a certain context. Multimodal texts feature a diversity of semiotic modes in the communication and development of their narrative. Such experimental texts subvert conventional patterns by introducing various semiotic resources – verbal or non-verbal.

  10. PathText: a text mining integrator for biological pathway visualizations

    Science.gov (United States)

    Kemper, Brian; Matsuzaki, Takuya; Matsuoka, Yukiko; Tsuruoka, Yoshimasa; Kitano, Hiroaki; Ananiadou, Sophia; Tsujii, Jun'ichi

    2010-01-01

    Motivation: Metabolic and signaling pathways are an increasingly important part of organizing knowledge in systems biology. They serve to integrate collective interpretations of facts scattered throughout literature. Biologists construct a pathway by reading a large number of articles and interpreting them as a consistent network, but most of the models constructed currently lack direct links to those articles. Biologists who want to check the original articles have to spend substantial amounts of time to collect relevant articles and identify the sections relevant to the pathway. Furthermore, with the scientific literature expanding by several thousand papers per week, keeping a model relevant requires a continuous curation effort. In this article, we present a system designed to integrate a pathway visualizer, text mining systems and annotation tools into a seamless environment. This will enable biologists to freely move between parts of a pathway and relevant sections of articles, as well as identify relevant papers from large text bases. The system, PathText, is developed by Systems Biology Institute, Okinawa Institute of Science and Technology, National Centre for Text Mining (University of Manchester) and the University of Tokyo, and is being used by groups of biologists from these locations. Contact: brian@monrovian.com. PMID:20529930

  11. Opposing Subjective Temporal Experiences in Response to Unpredictable and Predictable Fear-Relevant Stimuli

    Directory of Open Access Journals (Sweden)

    Qian Cui

    2018-03-01

    Full Text Available Previous studies have found that the durations of fear-relevant stimuli were overestimated compared to those of neutral stimuli, even when the fear-relevant stimuli were only anticipated. The current study aimed to investigate the effect of the predictability of fear-relevant stimuli on sub-second temporal estimations. In Experiments 1a and 1b, a randomized design was employed to render the emotional valence of each trial unpredictable. In Experiments 2a and 2b, we incorporated a block design and a cueing paradigm, respectively, to render the emotional stimuli predictable. Compared with the neutral condition, the estimated blank interval was judged as being shorter under the unpredictable fear-relevant condition, while it was judged as being longer under the predictable fear-relevant condition. In other words, the unpredictable and predictable fear-relevant stimuli led to opposing temporal distortions. These results demonstrated that emotions modulate interval perception during different time processing stages.

  12. PENGARUH UKURAN KAP DAN AUDITOR TENURE TERHADAP VALUE RELEVANCE DARI NILAI WAJAR

    Directory of Open Access Journals (Sweden)

    Taufik Hidayat

    2012-12-01

    Full Text Available This study examines the value relevance of fair value and whether the value relevance of fair value measured at quoted active market is higher than value measured at valuation techniques. We also examine whether the value relevance offair value with valuation techniques improves if auditor tenure is longer or financial statements are audited by a Big Four Firm. Hypothesis testing was conducted by a panel data based on model of Ohlson (1995. Using a sample of 147 companies listed on the Indonesia Stock Exchange from 2008-2011, resulting the general conclusion that fair value has value relevance, where the value relevance of fair value measured at quoted active market is higher than the valuation technique. Value relevance of fair value measured at valuation techniques will increase as auditor tenure increases or financial statements were audited by a Big Four Firm.

  13. Tagging narrator's names in Hadith text | Rahman | Journal of ...

    African Journals Online (AJOL)

    N.A. Rahman, N.K. Ismail, Z.M. Nor, M.N. Alias, M.S. Kamis, N Alias. Abstract. No Abstract. Keywords: tagging; hadith text; name. Full Text: EMAIL FREE FULL TEXT EMAIL FREE FULL TEXT · DOWNLOAD FULL TEXT DOWNLOAD FULL TEXT · http://dx.doi.org/10.4314/jfas.v9i5s.21 · AJOL African Journals Online. HOW TO ...

  14. The Limits to Relevance

    Science.gov (United States)

    Averill, M.; Briggle, A.

    2006-12-01

    Science policy and knowledge production lately have taken a pragmatic turn. Funding agencies increasingly are requiring scientists to explain the relevance of their work to society. This stems in part from mounting critiques of the "linear model" of knowledge production in which scientists operating according to their own interests or disciplinary standards are presumed to automatically produce knowledge that is of relevance outside of their narrow communities. Many contend that funded scientific research should be linked more directly to societal goals, which implies a shift in the kind of research that will be funded. While both authors support the concept of useful science, we question the exact meaning of "relevance" and the wisdom of allowing it to control research agendas. We hope to contribute to the conversation by thinking more critically about the meaning and limits of the term "relevance" and the trade-offs implicit in a narrow utilitarian approach. The paper will consider which interests tend to be privileged by an emphasis on relevance and address issues such as whose goals ought to be pursued and why, and who gets to decide. We will consider how relevance, narrowly construed, may actually limit the ultimate utility of scientific research. The paper also will reflect on the worthiness of research goals themselves and their relationship to a broader view of what it means to be human and to live in society. Just as there is more to being human than the pragmatic demands of daily life, there is more at issue with knowledge production than finding the most efficient ways to satisfy consumer preferences or fix near-term policy problems. We will conclude by calling for a balanced approach to funding research that addresses society's most pressing needs but also supports innovative research with less immediately apparent application.

  15. Full faith in myself

    Indian Academy of Sciences (India)

    Lawrence

    Full faith in myself. Meenakshi Banerjee. 12. Ihad my schooling at the Irish Convent, Loreto, in Asansol,. West Bengal. Perhaps the earliest memories I have are of myself as a very determined child with a deep appreciation of and inquisitiveness regarding nature although not understanding most of it at that tender age.

  16. Relevant Subspace Clustering

    DEFF Research Database (Denmark)

    Müller, Emmanuel; Assent, Ira; Günnemann, Stephan

    2009-01-01

    Subspace clustering aims at detecting clusters in any subspace projection of a high dimensional space. As the number of possible subspace projections is exponential in the number of dimensions, the result is often tremendously large. Recent approaches fail to reduce results to relevant subspace...... clusters. Their results are typically highly redundant, i.e. many clusters are detected multiple times in several projections. In this work, we propose a novel model for relevant subspace clustering (RESCU). We present a global optimization which detects the most interesting non-redundant subspace clusters...... achieves top clustering quality while competing approaches show greatly varying performance....

  17. Text mining resources for the life sciences.

    Science.gov (United States)

    Przybyła, Piotr; Shardlow, Matthew; Aubin, Sophie; Bossy, Robert; Eckart de Castilho, Richard; Piperidis, Stelios; McNaught, John; Ananiadou, Sophia

    2016-01-01

    Text mining is a powerful technology for quickly distilling key information from vast quantities of biomedical literature. However, to harness this power the researcher must be well versed in the availability, suitability, adaptability, interoperability and comparative accuracy of current text mining resources. In this survey, we give an overview of the text mining resources that exist in the life sciences to help researchers, especially those employed in biocuration, to engage with text mining in their own work. We categorize the various resources under three sections: Content Discovery looks at where and how to find biomedical publications for text mining; Knowledge Encoding describes the formats used to represent the different levels of information associated with content that enable text mining, including those formats used to carry such information between processes; Tools and Services gives an overview of workflow management systems that can be used to rapidly configure and compare domain- and task-specific processes, via access to a wide range of pre-built tools. We also provide links to relevant repositories in each section to enable the reader to find resources relevant to their own area of interest. Throughout this work we give a special focus to resources that are interoperable-those that have the crucial ability to share information, enabling smooth integration and reusability. © The Author(s) 2016. Published by Oxford University Press.

  18. Text mining resources for the life sciences

    Science.gov (United States)

    Shardlow, Matthew; Aubin, Sophie; Bossy, Robert; Eckart de Castilho, Richard; Piperidis, Stelios; McNaught, John; Ananiadou, Sophia

    2016-01-01

    Text mining is a powerful technology for quickly distilling key information from vast quantities of biomedical literature. However, to harness this power the researcher must be well versed in the availability, suitability, adaptability, interoperability and comparative accuracy of current text mining resources. In this survey, we give an overview of the text mining resources that exist in the life sciences to help researchers, especially those employed in biocuration, to engage with text mining in their own work. We categorize the various resources under three sections: Content Discovery looks at where and how to find biomedical publications for text mining; Knowledge Encoding describes the formats used to represent the different levels of information associated with content that enable text mining, including those formats used to carry such information between processes; Tools and Services gives an overview of workflow management systems that can be used to rapidly configure and compare domain- and task-specific processes, via access to a wide range of pre-built tools. We also provide links to relevant repositories in each section to enable the reader to find resources relevant to their own area of interest. Throughout this work we give a special focus to resources that are interoperable—those that have the crucial ability to share information, enabling smooth integration and reusability. PMID:27888231

  19. New Historicism: Text and Context

    Directory of Open Access Journals (Sweden)

    Violeta M. Vesić

    2016-02-01

    Full Text Available During most of the twentieth century history was seen as a phenomenon outside of literature that guaranteed the veracity of literary interpretation. History was unique and it functioned as a basis for reading literary works. During the seventies of the twentieth century there occurred a change of attitude towards history in American literary theory, and there appeared a new theoretical approach which soon became known as New Historicism. Since its inception, New Historicism has been identified with the study of Renaissance and Romanticism, but nowadays it has been increasingly involved in other literary trends. Although there are great differences in the arguments and practices at various representatives of this school, New Historicism has clearly recognizable features and many new historicists will agree with the statement of Walter Cohen that New Historicism, when it appeared in the eighties, represented something quite new in reference to the studies of theory, criticism and history (Cohen 1987, 33. Theoretical connection with Bakhtin, Foucault and Marx is clear, as well as a kind of uneasy tie with deconstruction and the work of Paul de Man. At the center of this approach is a renewed interest in the study of literary works in the light of historical and political circumstances in which they were created. Foucault encouraged readers to begin to move literary texts and to link them with discourses and representations that are not literary, as well as to examine the sociological aspects of the texts in order to take part in the social struggles of today. The study of literary works using New Historicism is the study of politics, history, culture and circumstances in which these works were created. With regard to one of the main fact which is located in the center of the criticism, that history cannot be viewed objectively and that reality can only be understood through a cultural context that reveals the work, re-reading and interpretation of

  20. Text analysis methods, text analysis apparatuses, and articles of manufacture

    Science.gov (United States)

    Whitney, Paul D; Willse, Alan R; Lopresti, Charles A; White, Amanda M

    2014-10-28

    Text analysis methods, text analysis apparatuses, and articles of manufacture are described according to some aspects. In one aspect, a text analysis method includes accessing information indicative of data content of a collection of text comprising a plurality of different topics, using a computing device, analyzing the information indicative of the data content, and using results of the analysis, identifying a presence of a new topic in the collection of text.

  1. Classroom Texting in College Students

    Science.gov (United States)

    Pettijohn, Terry F.; Frazier, Erik; Rieser, Elizabeth; Vaughn, Nicholas; Hupp-Wilds, Bobbi

    2015-01-01

    A 21-item survey on texting in the classroom was given to 235 college students. Overall, 99.6% of students owned a cellphone and 98% texted daily. Of the 138 students who texted in the classroom, most texted friends or significant others, and indicate the reason for classroom texting is boredom or work. Students who texted sent a mean of 12.21…

  2. Plate Full of Color

    Centers for Disease Control (CDC) Podcasts

    The Eagle Books are a series of four books that are brought to life by wise animal characters - Mr. Eagle, Miss Rabbit, and Coyote - who engage Rain That Dances and his young friends in the joy of physical activity, eating healthy foods, and learning from their elders about health and diabetes prevention. Plate Full of Color teaches the value of eating a variety of colorful and healthy foods.

  3. Is Information Still Relevant?

    Science.gov (United States)

    Ma, Lia

    2013-01-01

    Introduction: The term "information" in information science does not share the characteristics of those of a nomenclature: it does not bear a generally accepted definition and it does not serve as the bases and assumptions for research studies. As the data deluge has arrived, is the concept of information still relevant for information…

  4. PaperBLAST: Text Mining Papers for Information about Homologs.

    Science.gov (United States)

    Price, Morgan N; Arkin, Adam P

    2017-01-01

    Large-scale genome sequencing has identified millions of protein-coding genes whose function is unknown. Many of these proteins are similar to characterized proteins from other organisms, but much of this information is missing from annotation databases and is hidden in the scientific literature. To make this information accessible, PaperBLAST uses EuropePMC to search the full text of scientific articles for references to genes. PaperBLAST also takes advantage of curated resources (Swiss-Prot, GeneRIF, and EcoCyc) that link protein sequences to scientific articles. PaperBLAST's database includes over 700,000 scientific articles that mention over 400,000 different proteins. Given a protein of interest, PaperBLAST quickly finds similar proteins that are discussed in the literature and presents snippets of text from relevant articles or from the curators. PaperBLAST is available at http://papers.genomics.lbl.gov/. IMPORTANCE With the recent explosion of genome sequencing data, there are now millions of uncharacterized proteins. If a scientist becomes interested in one of these proteins, it can be very difficult to find information as to its likely function. Often a protein whose sequence is similar, and which is likely to have a similar function, has been studied already, but this information is not available in any database. To help find articles about similar proteins, PaperBLAST searches the full text of scientific articles for protein identifiers or gene identifiers, and it links these articles to protein sequences. Then, given a protein of interest, it can quickly find similar proteins in its database by using standard software (BLAST), and it can show snippets of text from relevant papers. We hope that PaperBLAST will make it easier for biologists to predict proteins' functions.

  5. PaperBLAST: Text Mining Papers for Information about Homologs

    International Nuclear Information System (INIS)

    Price, Morgan N.; Arkin, Adam P.

    2017-01-01

    Large-scale genome sequencing has identified millions of protein-coding genes whose function is unknown. Many of these proteins are similar to characterized proteins from other organisms, but much of this information is missing from annotation databases and is hidden in the scientific literature. To make this information accessible, PaperBLAST uses EuropePMC to search the full text of scientific articles for references to genes. PaperBLAST also takes advantage of curated resources (Swiss-Prot, GeneRIF, and EcoCyc) that link protein sequences to scientific articles. PaperBLAST’s database includes over 700,000 scientific articles that mention over 400,000 different proteins. Given a protein of interest, PaperBLAST quickly finds similar proteins that are discussed in the literature and presents snippets of text from relevant articles or from the curators. With the recent explosion of genome sequencing data, there are now millions of uncharacterized proteins. If a scientist becomes interested in one of these proteins, it can be very difficult to find information as to its likely function. Often a protein whose sequence is similar, and which is likely to have a similar function, has been studied already, but this information is not available in any database. To help find articles about similar proteins, PaperBLAST searches the full text of scientific articles for protein identifiers or gene identifiers, and it links these articles to protein sequences. Then, given a protein of interest, it can quickly find similar proteins in its database by using standard software (BLAST), and it can show snippets of text from relevant papers. We hope that PaperBLAST will make it easier for biologists to predict proteins’ functions.

  6. Prospects and limitations of full-text index structures in genome analysis

    Science.gov (United States)

    Vyverman, Michaël; De Baets, Bernard; Fack, Veerle; Dawyndt, Peter

    2012-01-01

    The combination of incessant advances in sequencing technology producing large amounts of data and innovative bioinformatics approaches, designed to cope with this data flood, has led to new interesting results in the life sciences. Given the magnitude of sequence data to be processed, many bioinformatics tools rely on efficient solutions to a variety of complex string problems. These solutions include fast heuristic algorithms and advanced data structures, generally referred to as index structures. Although the importance of index structures is generally known to the bioinformatics community, the design and potency of these data structures, as well as their properties and limitations, are less understood. Moreover, the last decade has seen a boom in the number of variant index structures featuring complex and diverse memory-time trade-offs. This article brings a comprehensive state-of-the-art overview of the most popular index structures and their recently developed variants. Their features, interrelationships, the trade-offs they impose, but also their practical limitations, are explained and compared. PMID:22584621

  7. Partial index replicated and distributed scheme for full-text search on ...

    Indian Academy of Sciences (India)

    2Department of Computer Application, Krishna Institute of Engineering & .... In the wireless data broadcast system, many researchers have used the ..... Graph 3 depicts decrement of tuning time as the repeatability increases in the proposed.

  8. Sådan kombinerer du søgninger i SocINDEX with Full Text

    DEFF Research Database (Denmark)

    2017-01-01

    Valg af materiale/medie/form: YouTube Valg af arbejdsform: E-læring Begrundelse for valg af materiale/medie/form/arbejdsform: Flipped Classroom......Valg af materiale/medie/form: YouTube Valg af arbejdsform: E-læring Begrundelse for valg af materiale/medie/form/arbejdsform: Flipped Classroom...

  9. On-line access to the full-texts of non periodical documents

    International Nuclear Information System (INIS)

    Svrsek, L.

    2004-01-01

    This article describes several options how electronic books (technical handbooks, scientific books, reference works, etc.) are published and available on-line on the Internet. There is a short description of some of the major services provided by worldwide publishers. As a part of the presentation there will be a live demonstration of selected services and work slices of the most interested systems. (author)

  10. Europe PMC: a full-text literature database for the life sciences and platform for innovation

    Science.gov (United States)

    2015-01-01

    This article describes recent developments of Europe PMC (http://europepmc.org), the leading database for life science literature. Formerly known as UKPMC, the service was rebranded in November 2012 as Europe PMC to reflect the scope of the funding agencies that support it. Several new developments have enriched Europe PMC considerably since then. Europe PMC now offers RESTful web services to access both articles and grants, powerful search tools such as citation-count sort order and data citation features, a service to add publications to your ORCID, a variety of export formats, and an External Links service that enables any related resource to be linked from Europe PMC content. PMID:25378340

  11. Electronic books. On-line access to the full-texts of non periodical documents

    International Nuclear Information System (INIS)

    Svrsek, L.

    2004-01-01

    This presentation describes several options how electronic books (technical handbooks, scientific books, reference works, etc.) are published and available on-line on the Internet. There is a short description of some of the major services provided by worldwide publishers. As a part of the presentation there will be a live demonstration of selected services and work slices of the most interested systems. (author)

  12. Full Inclusion: Understanding the Role of Gay and Lesbian Texts and Films in Teacher Education Classrooms

    Science.gov (United States)

    Hermann-Wilmarth, Jill M.

    2007-01-01

    This paper identifies some of the resources the author has found and used to help future teachers become fully inclusive teachers, particularly of early elementary students. Through sharing these resources--children's literature, a children's literature textbook, edited books for teacher educators and pre- and inservice teachers, and a video--the…

  13. Early Career Researchers Demand Full-text and Rely on Google to Find Scholarly Sources

    OpenAIRE

    Richard Hayman

    2017-01-01

    A Review of: Nicholas, D., Boukacem-Zeghmouri, C., Rodríguez-Bravo, B., Xu, J., Watkinson, A., Abrizah, A., Herman, E., & Świgoń, M. (2017). Where and how early career researchers find scholarly information. Learned Publishing, 30(1), 19-29. http://dx.doi.org/10.1002/leap.1087 Abstract Objective – To examine the attitudes and information behaviours of early career researchers (ECRs) when locating scholarly information. Design – Qualitative longitudinal study. Setting – R...

  14. Observation of [Formula: see text] and [Formula: see text] decays.

    Science.gov (United States)

    Aaij, R; Adeva, B; Adinolfi, M; Ajaltouni, Z; Akar, S; Albrecht, J; Alessio, F; Alexander, M; Ali, S; Alkhazov, G; Alvarez Cartelle, P; Alves, A A; Amato, S; Amerio, S; Amhis, Y; An, L; Anderlini, L; Andreassi, G; Andreotti, M; Andrews, J E; Appleby, R B; Archilli, F; d'Argent, P; Arnau Romeu, J; Artamonov, A; Artuso, M; Aslanides, E; Auriemma, G; Baalouch, M; Babuschkin, I; Bachmann, S; Back, J J; Badalov, A; Baesso, C; Baker, S; Baldini, W; Barlow, R J; Barschel, C; Barsuk, S; Barter, W; Baszczyk, M; Batozskaya, V; Batsukh, B; Battista, V; Bay, A; Beaucourt, L; Beddow, J; Bedeschi, F; Bediaga, I; Bel, L J; Bellee, V; Belloli, N; Belous, K; Belyaev, I; Ben-Haim, E; Bencivenni, G; Benson, S; Benton, J; Berezhnoy, A; Bernet, R; Bertolin, A; Betancourt, C; Betti, F; Bettler, M-O; van Beuzekom, M; Bezshyiko, Ia; Bifani, S; Billoir, P; Bird, T; Birnkraut, A; Bitadze, A; Bizzeti, A; Blake, T; Blanc, F; Blouw, J; Blusk, S; Bocci, V; Boettcher, T; Bondar, A; Bondar, N; Bonivento, W; Bordyuzhin, I; Borgheresi, A; Borghi, S; Borisyak, M; Borsato, M; Bossu, F; Boubdir, M; Bowcock, T J V; Bowen, E; Bozzi, C; Braun, S; Britsch, M; Britton, T; Brodzicka, J; Buchanan, E; Burr, C; Bursche, A; Buytaert, J; Cadeddu, S; Calabrese, R; Calvi, M; Calvo Gomez, M; Camboni, A; Campana, P; Campora Perez, D H; Capriotti, L; Carbone, A; Carboni, G; Cardinale, R; Cardini, A; Carniti, P; Carson, L; Carvalho Akiba, K; Casse, G; Cassina, L; Castillo Garcia, L; Cattaneo, M; Cauet, Ch; Cavallero, G; Cenci, R; Charles, M; Charpentier, Ph; Chatzikonstantinidis, G; Chefdeville, M; Chen, S; Cheung, S-F; Chobanova, V; Chrzaszcz, M; Cid Vidal, X; Ciezarek, G; Clarke, P E L; Clemencic, M; Cliff, H V; Closier, J; Coco, V; Cogan, J; Cogneras, E; Cogoni, V; Cojocariu, L; Collazuol, G; Collins, P; Comerma-Montells, A; Contu, A; Cook, A; Coombs, G; Coquereau, S; Corti, G; Corvo, M; Costa Sobral, C M; Couturier, B; Cowan, G A; Craik, D C; Crocombe, A; Cruz Torres, M; Cunliffe, S; Currie, R; D'Ambrosio, C; Da Cunha Marinho, F; Dall'Occo, E; Dalseno, J; David, P N Y; Davis, A; De Aguiar Francisco, O; De Bruyn, K; De Capua, S; De Cian, M; De Miranda, J M; De Paula, L; De Serio, M; De Simone, P; Dean, C-T; Decamp, D; Deckenhoff, M; Del Buono, L; Demmer, M; Dendek, A; Derkach, D; Deschamps, O; Dettori, F; Dey, B; Di Canto, A; Dijkstra, H; Dordei, F; Dorigo, M; Dosil Suárez, A; Dovbnya, A; Dreimanis, K; Dufour, L; Dujany, G; Dungs, K; Durante, P; Dzhelyadin, R; Dziurda, A; Dzyuba, A; Déléage, N; Easo, S; Ebert, M; Egede, U; Egorychev, V; Eidelman, S; Eisenhardt, S; Eitschberger, U; Ekelhof, R; Eklund, L; Ely, S; Esen, S; Evans, H M; Evans, T; Falabella, A; Farley, N; Farry, S; Fay, R; Fazzini, D; Ferguson, D; Fernandez Prieto, A; Ferrari, F; Ferreira Rodrigues, F; Ferro-Luzzi, M; Filippov, S; Fini, R A; Fiore, M; Fiorini, M; Firlej, M; Fitzpatrick, C; Fiutowski, T; Fleuret, F; Fohl, K; Fontana, M; Fontanelli, F; Forshaw, D C; Forty, R; Franco Lima, V; Frank, M; Frei, C; Fu, J; Furfaro, E; Färber, C; Gallas Torreira, A; Galli, D; Gallorini, S; Gambetta, S; Gandelman, M; Gandini, P; Gao, Y; Garcia Martin, L M; García Pardiñas, J; Garra Tico, J; Garrido, L; Garsed, P J; Gascon, D; Gaspar, C; Gavardi, L; Gazzoni, G; Gerick, D; Gersabeck, E; Gersabeck, M; Gershon, T; Ghez, Ph; Gianì, S; Gibson, V; Girard, O G; Giubega, L; Gizdov, K; Gligorov, V V; Golubkov, D; Golutvin, A; Gomes, A; Gorelov, I V; Gotti, C; Govorkova, E; Grabalosa Gándara, M; Graciani Diaz, R; Granado Cardoso, L A; Graugés, E; Graverini, E; Graziani, G; Grecu, A; Griffith, P; Grillo, L; Gruberg Cazon, B R; Grünberg, O; Gushchin, E; Guz, Yu; Gys, T; Göbel, C; Hadavizadeh, T; Hadjivasiliou, C; Haefeli, G; Haen, C; Haines, S C; Hall, S; Hamilton, B; Han, X; Hansmann-Menzemer, S; Harnew, N; Harnew, S T; Harrison, J; Hatch, M; He, J; Head, T; Heister, A; Hennessy, K; Henrard, P; Henry, L; Hernando Morata, J A; van Herwijnen, E; Heß, M; Hicheur, A; Hill, D; Hombach, C; Hopchev, H; Hulsbergen, W; Humair, T; Hushchyn, M; Hussain, N; Hutchcroft, D; Idzik, M; Ilten, P; Jacobsson, R; Jaeger, A; Jalocha, J; Jans, E; Jawahery, A; Jiang, F; John, M; Johnson, D; Jones, C R; Joram, C; Jost, B; Jurik, N; Kandybei, S; Kanso, W; Karacson, M; Kariuki, J M; Karodia, S; Kecke, M; Kelsey, M; Kenyon, I R; Kenzie, M; Ketel, T; Khairullin, E; Khanji, B; Khurewathanakul, C; Kirn, T; Klaver, S; Klimaszewski, K; Koliiev, S; Kolpin, M; Komarov, I; Koopman, R F; Koppenburg, P; Kosmyntseva, A; Kozachuk, A; Kozeiha, M; Kravchuk, L; Kreplin, K; Kreps, M; Krokovny, P; Kruse, F; Krzemien, W; Kucewicz, W; Kucharczyk, M; Kudryavtsev, V; Kuonen, A K; Kurek, K; Kvaratskheliya, T; Lacarrere, D; Lafferty, G; Lai, A; Lanfranchi, G; Langenbruch, C; Latham, T; Lazzeroni, C; Le Gac, R; van Leerdam, J; Lees, J-P; Leflat, A; Lefrançois, J; Lefèvre, R; Lemaitre, F; Lemos Cid, E; Leroy, O; Lesiak, T; Leverington, B; Li, Y; Likhomanenko, T; Lindner, R; Linn, C; Lionetto, F; Liu, B; Liu, X; Loh, D; Longstaff, I; Lopes, J H; Lucchesi, D; Lucio Martinez, M; Luo, H; Lupato, A; Luppi, E; Lupton, O; Lusiani, A; Lyu, X; Machefert, F; Maciuc, F; Maev, O; Maguire, K; Malde, S; Malinin, A; Maltsev, T; Manca, G; Mancinelli, G; Manning, P; Maratas, J; Marchand, J F; Marconi, U; Marin Benito, C; Marino, P; Marks, J; Martellotti, G; Martin, M; Martinelli, M; Martinez Santos, D; Martinez Vidal, F; Martins Tostes, D; Massacrier, L M; Massafferri, A; Matev, R; Mathad, A; Mathe, Z; Matteuzzi, C; Mauri, A; Maurin, B; Mazurov, A; McCann, M; McCarthy, J; McNab, A; McNulty, R; Meadows, B; Meier, F; Meissner, M; Melnychuk, D; Merk, M; Merli, A; Michielin, E; Milanes, D A; Minard, M-N; Mitzel, D S; Mogini, A; Molina Rodriguez, J; Monroy, I A; Monteil, S; Morandin, M; Morawski, P; Mordà, A; Morello, M J; Moron, J; Morris, A B; Mountain, R; Muheim, F; Mulder, M; Mussini, M; Müller, D; Müller, J; Müller, K; Müller, V; Naik, P; Nakada, T; Nandakumar, R; Nandi, A; Nasteva, I; Needham, M; Neri, N; Neubert, S; Neufeld, N; Neuner, M; Nguyen, A D; Nguyen, T D; Nguyen-Mau, C; Nieswand, S; Niet, R; Nikitin, N; Nikodem, T; Novoselov, A; O'Hanlon, D P; Oblakowska-Mucha, A; Obraztsov, V; Ogilvy, S; Oldeman, R; Onderwater, C J G; Otalora Goicochea, J M; Otto, A; Owen, P; Oyanguren, A; Pais, P R; Palano, A; Palombo, F; Palutan, M; Panman, J; Papanestis, A; Pappagallo, M; Pappalardo, L L; Parker, W; Parkes, C; Passaleva, G; Pastore, A; Patel, G D; Patel, M; Patrignani, C; Pearce, A; Pellegrino, A; Penso, G; Pepe Altarelli, M; Perazzini, S; Perret, P; Pescatore, L; Petridis, K; Petrolini, A; Petrov, A; Petruzzo, M; Picatoste Olloqui, E; Pietrzyk, B; Pikies, M; Pinci, D; Pistone, A; Piucci, A; Playfer, S; Plo Casasus, M; Poikela, T; Polci, F; Poluektov, A; Polyakov, I; Polycarpo, E; Pomery, G J; Popov, A; Popov, D; Popovici, B; Poslavskii, S; Potterat, C; Price, E; Price, J D; Prisciandaro, J; Pritchard, A; Prouve, C; Pugatch, V; Puig Navarro, A; Punzi, G; Qian, W; Quagliani, R; Rachwal, B; Rademacker, J H; Rama, M; Ramos Pernas, M; Rangel, M S; Raniuk, I; Ratnikov, F; Raven, G; Redi, F; Reichert, S; Dos Reis, A C; Remon Alepuz, C; Renaudin, V; Ricciardi, S; Richards, S; Rihl, M; Rinnert, K; Rives Molina, V; Robbe, P; Rodrigues, A B; Rodrigues, E; Rodriguez Lopez, J A; Rodriguez Perez, P; Rogozhnikov, A; Roiser, S; Rollings, A; Romanovskiy, V; Romero Vidal, A; Ronayne, J W; Rotondo, M; Rudolph, M S; Ruf, T; Ruiz Valls, P; Saborido Silva, J J; Sadykhov, E; Sagidova, N; Saitta, B; Salustino Guimaraes, V; Sanchez Mayordomo, C; Sanmartin Sedes, B; Santacesaria, R; Santamarina Rios, C; Santimaria, M; Santovetti, E; Sarti, A; Satriano, C; Satta, A; Saunders, D M; Savrina, D; Schael, S; Schellenberg, M; Schiller, M; Schindler, H; Schlupp, M; Schmelling, M; Schmelzer, T; Schmidt, B; Schneider, O; Schopper, A; Schubert, K; Schubiger, M; Schune, M-H; Schwemmer, R; Sciascia, B; Sciubba, A; Semennikov, A; Sergi, A; Serra, N; Serrano, J; Sestini, L; Seyfert, P; Shapkin, M; Shapoval, I; Shcheglov, Y; Shears, T; Shekhtman, L; Shevchenko, V; Siddi, B G; Silva Coutinho, R; Silva de Oliveira, L; Simi, G; Simone, S; Sirendi, M; Skidmore, N; Skwarnicki, T; Smith, E; Smith, I T; Smith, J; Smith, M; Snoek, H; Sokoloff, M D; Soler, F J P; Souza De Paula, B; Spaan, B; Spradlin, P; Sridharan, S; Stagni, F; Stahl, M; Stahl, S; Stefko, P; Stefkova, S; Steinkamp, O; Stemmle, S; Stenyakin, O; Stevenson, S; Stoica, S; Stone, S; Storaci, B; Stracka, S; Straticiuc, M; Straumann, U; Sun, L; Sutcliffe, W; Swientek, K; Syropoulos, V; Szczekowski, M; Szumlak, T; T'Jampens, S; Tayduganov, A; Tekampe, T; Tellarini, G; Teubert, F; Thomas, E; van Tilburg, J; Tilley, M J; Tisserand, V; Tobin, M; Tolk, S; Tomassetti, L; Tonelli, D; Topp-Joergensen, S; Toriello, F; Tournefier, E; Tourneur, S; Trabelsi, K; Traill, M; Tran, M T; Tresch, M; Trisovic, A; Tsaregorodtsev, A; Tsopelas, P; Tully, A; Tuning, N; Ukleja, A; Ustyuzhanin, A; Uwer, U; Vacca, C; Vagnoni, V; Valassi, A; Valat, S; Valenti, G; Vallier, A; Vazquez Gomez, R; Vazquez Regueiro, P; Vecchi, S; van Veghel, M; Velthuis, J J; Veltri, M; Veneziano, G; Venkateswaran, A; Vernet, M; Vesterinen, M; Viaud, B; Vieira, D; Vieites Diaz, M; Viemann, H; Vilasis-Cardona, X; Vitti, M; Volkov, V; Vollhardt, A; Voneki, B; Vorobyev, A; Vorobyev, V; Voß, C; de Vries, J A; Vázquez Sierra, C; Waldi, R; Wallace, C; Wallace, R; Walsh, J; Wang, J; Ward, D R; Wark, H M; Watson, N K; Websdale, D; Weiden, A; Whitehead, M; Wicht, J; Wilkinson, G; Wilkinson, M; Williams, M; Williams, M P; Williams, M; Williams, T; Wilson, F F; Wimberley, J; Wishahi, J; Wislicki, W; Witek, M; Wormser, G; Wotton, S A; Wraight, K; Wyllie, K; Xie, Y; Xing, Z; Xu, Z; Yang, Z; Yin, H; Yu, J; Yuan, X; Yushchenko, O; Zarebski, K A; Zavertyaev, M; Zhang, L; Zhang, Y; Zhang, Y; Zhelezov, A; Zheng, Y; Zhokhov, A; Zhu, X; Zhukov, V; Zucchelli, S

    2017-01-01

    The decays [Formula: see text] and [Formula: see text] are observed for the first time using a data sample corresponding to an integrated luminosity of 3.0 fb[Formula: see text], collected by the LHCb experiment in proton-proton collisions at the centre-of-mass energies of 7 and 8[Formula: see text]. The branching fractions relative to that of [Formula: see text] are measured to be [Formula: see text]where the first uncertainties are statistical and the second are systematic.

  15. Mining the Text: 34 Text Features that Can Ease or Obstruct Text Comprehension and Use

    Science.gov (United States)

    White, Sheida

    2012-01-01

    This article presents 34 characteristics of texts and tasks ("text features") that can make continuous (prose), noncontinuous (document), and quantitative texts easier or more difficult for adolescents and adults to comprehend and use. The text features were identified by examining the assessment tasks and associated texts in the national…

  16. Plate Full of Color

    Centers for Disease Control (CDC) Podcasts

    2008-08-04

    The Eagle Books are a series of four books that are brought to life by wise animal characters - Mr. Eagle, Miss Rabbit, and Coyote - who engage Rain That Dances and his young friends in the joy of physical activity, eating healthy foods, and learning from their elders about health and diabetes prevention. Plate Full of Color teaches the value of eating a variety of colorful and healthy foods.  Created: 8/4/2008 by National Center for Chronic Disease Prevention and Health Promotion (NCCDPHP).   Date Released: 8/5/2008.

  17. Figure text extraction in biomedical literature.

    Directory of Open Access Journals (Sweden)

    Daehyun Kim

    2011-01-01

    Full Text Available Figures are ubiquitous in biomedical full-text articles, and they represent important biomedical knowledge. However, the sheer volume of biomedical publications has made it necessary to develop computational approaches for accessing figures. Therefore, we are developing the Biomedical Figure Search engine (http://figuresearch.askHERMES.org to allow bioscientists to access figures efficiently. Since text frequently appears in figures, automatically extracting such text may assist the task of mining information from figures. Little research, however, has been conducted exploring text extraction from biomedical figures.We first evaluated an off-the-shelf Optical Character Recognition (OCR tool on its ability to extract text from figures appearing in biomedical full-text articles. We then developed a Figure Text Extraction Tool (FigTExT to improve the performance of the OCR tool for figure text extraction through the use of three innovative components: image preprocessing, character recognition, and text correction. We first developed image preprocessing to enhance image quality and to improve text localization. Then we adapted the off-the-shelf OCR tool on the improved text localization for character recognition. Finally, we developed and evaluated a novel text correction framework by taking advantage of figure-specific lexicons.The evaluation on 382 figures (9,643 figure texts in total randomly selected from PubMed Central full-text articles shows that FigTExT performed with 84% precision, 98% recall, and 90% F1-score for text localization and with 62.5% precision, 51.0% recall and 56.2% F1-score for figure text extraction. When limiting figure texts to those judged by domain experts to be important content, FigTExT performed with 87.3% precision, 68.8% recall, and 77% F1-score. FigTExT significantly improved the performance of the off-the-shelf OCR tool we used, which on its own performed with 36.6% precision, 19.3% recall, and 25.3% F1-score for

  18. From Text to Political Positions: Text analysis across disciplines

    NARCIS (Netherlands)

    Kaal, A.R.; Maks, I.; van Elfrinkhof, A.M.E.

    2014-01-01

    ABSTRACT From Text to Political Positions addresses cross-disciplinary innovation in political text analysis for party positioning. Drawing on political science, computational methods and discourse analysis, it presents a diverse collection of analytical models including pure quantitative and

  19. An Embedded Application for Degraded Text Recognition

    Directory of Open Access Journals (Sweden)

    Thillou Céline

    2005-01-01

    Full Text Available This paper describes a mobile device which tries to give the blind or visually impaired access to text information. Three key technologies are required for this system: text detection, optical character recognition, and speech synthesis. Blind users and the mobile environment imply two strong constraints. First, pictures will be taken without control on camera settings and a priori information on text (font or size and background. The second issue is to link several techniques together with an optimal compromise between computational constraints and recognition efficiency. We will present the overall description of the system from text detection to OCR error correction.

  20. The Instructional Text like a Textual Genre

    Directory of Open Access Journals (Sweden)

    Adiane Fogali Marinello

    2011-07-01

    Full Text Available This article analyses the instructional text as a textual genre and is part of the research called Reading and text production from the textual genre perspective, done at Universidade de Caxias do Sul, Campus Universitário da Região dos Vinhedos. Firstly, some theoretical assumptions about textual genre are presented, then, the instructional text is characterized. After that an instructional text is analyzed and, finally, some activities related to reading and writing of the mentioned genre directed to High School and University students are suggested.

  1. Emptiness and Fullness

    DEFF Research Database (Denmark)

    Bregnbæk, Susanne; Bunkenborg, Mikkel

    As critical voices question the quality, authenticity, and value of people, goods, and words in post-Mao China, accusations of emptiness render things open to new investments of meaning, substance, and value. Exploring the production of lack and desire through fine-grained ethnography, this volume...... examines how diagnoses of emptiness operate in a range of very different domains in contemporary China: In the ostensibly meritocratic exam system and the rhetoric of officials, in underground churches, housing bubbles, and nationalist fantasies, in bodies possessed by spirits and evaluations of jade......, there is a pervasive concern with states of lack and emptiness and the contributions suggest that this play of emptiness and fullness is crucial to ongoing constructions of quality, value, and subjectivity in China....

  2. Information Needs/Relevance

    OpenAIRE

    Wildemuth, Barbara M.

    2009-01-01

    A user's interaction with a DL is often initiated as the result of the user experiencing an information need of some kind. Aspects of that experience and how it might affect the user's interactions with the DL are discussed in this module. In addition, users continuously make decisions about and evaluations of the materials retrieved from a DL, relative to their information needs. Relevance judgments, and their relationship to the user's information needs, are discussed in this module. Draft

  3. A Proposed Arabic Handwritten Text Normalization Method

    Directory of Open Access Journals (Sweden)

    Tarik Abu-Ain

    2014-11-01

    Full Text Available Text normalization is an important technique in document image analysis and recognition. It consists of many preprocessing stages, which include slope correction, text padding, skew correction, and straight the writing line. In this side, text normalization has an important role in many procedures such as text segmentation, feature extraction and characters recognition. In the present article, a new method for text baseline detection, straightening, and slant correction for Arabic handwritten texts is proposed. The method comprises a set of sequential steps: first components segmentation is done followed by components text thinning; then, the direction features of the skeletons are extracted, and the candidate baseline regions are determined. After that, selection of the correct baseline region is done, and finally, the baselines of all components are aligned with the writing line.  The experiments are conducted on IFN/ENIT benchmark Arabic dataset. The results show that the proposed method has a promising and encouraging performance.

  4. Arabic text classification using Polynomial Networks

    Directory of Open Access Journals (Sweden)

    Mayy M. Al-Tahrawi

    2015-10-01

    Full Text Available In this paper, an Arabic statistical learning-based text classification system has been developed using Polynomial Neural Networks. Polynomial Networks have been recently applied to English text classification, but they were never used for Arabic text classification. In this research, we investigate the performance of Polynomial Networks in classifying Arabic texts. Experiments are conducted on a widely used Arabic dataset in text classification: Al-Jazeera News dataset. We chose this dataset to enable direct comparisons of the performance of Polynomial Networks classifier versus other well-known classifiers on this dataset in the literature of Arabic text classification. Results of experiments show that Polynomial Networks classifier is a competitive algorithm to the state-of-the-art ones in the field of Arabic text classification.

  5. Text mining from ontology learning to automated text processing applications

    CERN Document Server

    Biemann, Chris

    2014-01-01

    This book comprises a set of articles that specify the methodology of text mining, describe the creation of lexical resources in the framework of text mining and use text mining for various tasks in natural language processing (NLP). The analysis of large amounts of textual data is a prerequisite to build lexical resources such as dictionaries and ontologies and also has direct applications in automated text processing in fields such as history, healthcare and mobile applications, just to name a few. This volume gives an update in terms of the recent gains in text mining methods and reflects

  6. SELECT COMPANY BUSINESS STRATEGIES IN FULL OF UNCERTAINTY

    Directory of Open Access Journals (Sweden)

    Mikhail N. Konotopov

    2013-01-01

    Full Text Available This article proposes an approach to the choice of the business strategies of the company in the conditions of complete market demand uncertainty. It is based on using the so-called strategic statistical games - market becomes the first player and the head of the organization, receiving the administrative decision, the second. Formed matrix losses of the company, the relevant combinations of possible states of a market demand and selected business strategies. To select the optimal business strategies, corresponding to the concrete possibilities of the company, Wald, Savage and Hurwitz rules are used.

  7. A Text-Mining Framework for Supporting Systematic Reviews.

    Science.gov (United States)

    Li, Dingcheng; Wang, Zhen; Wang, Liwei; Sohn, Sunghwan; Shen, Feichen; Murad, Mohammad Hassan; Liu, Hongfang

    2016-11-01

    Systematic reviews (SRs) involve the identification, appraisal, and synthesis of all relevant studies for focused questions in a structured reproducible manner. High-quality SRs follow strict procedures and require significant resources and time. We investigated advanced text-mining approaches to reduce the burden associated with abstract screening in SRs and provide high-level information summary. A text-mining SR supporting framework consisting of three self-defined semantics-based ranking metrics was proposed, including keyword relevance, indexed-term relevance and topic relevance. Keyword relevance is based on the user-defined keyword list used in the search strategy. Indexed-term relevance is derived from indexed vocabulary developed by domain experts used for indexing journal articles and books. Topic relevance is defined as the semantic similarity among retrieved abstracts in terms of topics generated by latent Dirichlet allocation, a Bayesian-based model for discovering topics. We tested the proposed framework using three published SRs addressing a variety of topics (Mass Media Interventions, Rectal Cancer and Influenza Vaccine). The results showed that when 91.8%, 85.7%, and 49.3% of the abstract screening labor was saved, the recalls were as high as 100% for the three cases; respectively. Relevant studies identified manually showed strong topic similarity through topic analysis, which supported the inclusion of topic analysis as relevance metric. It was demonstrated that advanced text mining approaches can significantly reduce the abstract screening labor of SRs and provide an informative summary of relevant studies.

  8. Flexible frontiers for text division into rows

    Directory of Open Access Journals (Sweden)

    Dan L. Lacrămă

    2009-01-01

    Full Text Available This paper presents an original solution for flexible hand-written text division into rows. Unlike the standard procedure, the proposed method avoids the isolated characters extensions amputation and reduces the recognition error rate in the final stage.

  9. Ontology Assisted Formal Specification Extraction from Text

    Directory of Open Access Journals (Sweden)

    Andreea Mihis

    2010-12-01

    Full Text Available In the field of knowledge processing, the ontologies are the most important mean. They make possible for the computer to understand better the natural language and to make judgments. In this paper, a method which use ontologies in the semi-automatic extraction of formal specifications from a natural language text is proposed.

  10. Working with text tools, techniques and approaches for text mining

    CERN Document Server

    Tourte, Gregory J L

    2016-01-01

    Text mining tools and technologies have long been a part of the repository world, where they have been applied to a variety of purposes, from pragmatic aims to support tools. Research areas as diverse as biology, chemistry, sociology and criminology have seen effective use made of text mining technologies. Working With Text collects a subset of the best contributions from the 'Working with text: Tools, techniques and approaches for text mining' workshop, alongside contributions from experts in the area. Text mining tools and technologies in support of academic research include supporting research on the basis of a large body of documents, facilitating access to and reuse of extant work, and bridging between the formal academic world and areas such as traditional and social media. Jisc have funded a number of projects, including NaCTem (the National Centre for Text Mining) and the ResDis programme. Contents are developed from workshop submissions and invited contributions, including: Legal considerations in te...

  11. STYLISTIC FEATURES OF ADVERTISING TEXTS OF INFORMATIVE AND COMPARATIVE TYPES

    Directory of Open Access Journals (Sweden)

    Poddubskaya, O.N.

    2016-06-01

    Full Text Available The relevance of this article is related to the fact that nowadays advertising has a very strong impact both on the consumer market, political and cultural life of society, and on the language and its development as a system. Advertising has given rise to the development of a special set of stylistic features of a text, formed under the influence of reviving advertising traditions in the Russian language and under the active impact of energetic and pushy European advertising. The purpose of this study is to explore stylistic features of informative and comparative advertising texts. The object of research is Russian-language advertising in printed media and on television. In the end of the article we made conclusions about groups of language means used for different stylistic devices in informative and comparative advertising texts. Analysis of stylistic features of modern informative and comparative advertising texts can be of great interest to specialists in the field of theoretical studies of modern advertising.

  12. Identity text: an educational intervention to foster cultural interaction

    Directory of Open Access Journals (Sweden)

    Zareen Zaidi

    2016-11-01

    Full Text Available Background: Sociocultural theories state that learning results from people participating in contexts where social interaction is facilitated. There is a need to create such facilitated pedagogical spaces where participants can share their ways of knowing and doing. The aim of this exploratory study was to introduce pedagogical space for sociocultural interaction using ‘Identity Text’. Methods: Identity Texts are sociocultural artifacts produced by participants, which can be written, spoken, visual, musical, or multimodal. In 2013, participants of an international medical education fellowship program were asked to create their own Identity Texts to promote discussion about participants’ cultural backgrounds. Thematic analysis was used to make the analysis relevant to studying the pedagogical utility of the intervention. Result: The Identity Text intervention created two spaces: a ‘reflective space’, which helped participants reflect on sensitive topics such as institutional environments, roles in interdisciplinary teams, and gender discrimination, and a ‘narrative space’, which allowed participants to tell powerful stories that provided cultural insights and challenged cultural hegemony; they described the conscious and subconscious transformation in identity that evolved secondary to struggles with local power dynamics and social demands involving the impact of family, peers, and country of origin. Conclusion: While the impact of providing pedagogical space using Identity Text on cognitive engagement and enhanced learning requires further research, the findings of this study suggest that it is a useful pedagogical strategy to support cross-cultural education.

  13. Linguistics and the Literary Text.

    Science.gov (United States)

    Ferrar, Madeleine

    1984-01-01

    Discusses the opposing viewpoints of the two most influential linguists of this century--Saussure and Chomsky--suggesting that while both are interested in form as opposed to substance, Saussure sees linguistics as a branch of semiotics and Chomsky sees it as part of cognitive psychology. Evaluates the relevance of these two viewpoints to the…

  14. The Value Relevance of IFRS: The Case of Turkey

    Directory of Open Access Journals (Sweden)

    Ahmet Turel

    2009-10-01

    Full Text Available Accounting standards that are mostly compatible with International Financial Reporting Standards (IFRS are required for consolidatedfinancial statements of all Turkish listed firms starting from 2005 fiscal year end. Before that, financial reporting in Turkey was closely related to taxreporting. Until 2003, all Turkish listed firms were preparing their financial statements in accordance with the local standards issued by the CapitalMarket Board of Turkey. In this study, I examine the relative and incremental value relevance of earnings and the book value of equity under CapitalMarket Board (CMB Accounting Standards (2001-2002 and under IFRS (2005-2006 for Turkish listed firms. I compare these two periods toinvestigate whether the mandatory adoption of IFRS increase relevance of earnings and book value of equity which are accepted as proxies ofaccounting quality. I find evidence that the value relevance of earnings and book value of equity has increased significantly after adopting IFRS. Inaddition, I find that the incremental value relevance of earnings increased between the CMB accounting standards period and the IFRS period.However, the incremental value relevance of book value of equity degreased in the same period.

  15. Full metal jacket!

    CERN Document Server

    Laëtitia Pedroso

    2011-01-01

    Ten years ago, standard issue clothing only gave CERN firemen partial protection but today our fire-fighters are equipped with state-of-the-art, full personal protective equipment.   CERN's Fire Brigade team. For many years, the members of CERN's Fire Brigade went on call-outs clad in their work trousers and fire-rescue coats, which only afforded them partial protection. Today, textile manufacturing techniques have moved on a long way and CERN's firemen are now kitted out with state-of-the-art personal protective equipment. The coat and trousers are three-layered, comprising fire-resistant aramide, a protective membrane and a thermal lining. The CERN Fire Brigade' new state-of-the-art personal protection equipment. "This equipment is fully compliant with the standards in force and is therefore resistant to cuts, abrasion, electrical arcs with thermal effects and, of course, fire," explains Patrick Berlinghi, the CERN Fire Brigade's Logistics Officer. You might think that su...

  16. Policies for full employment

    DEFF Research Database (Denmark)

    de Koning, Jaap; Layard, Richard; Nickel, Stephen

    European unemployment is too high, and employment is too low. Over 7½ per cent of Europe's workforce is unemployed, and only two thirds of people aged 15-64 are in work. At the Lisbon summit two years ago the heads of government set the target that by 2010 the employment rate should rise from 64...... per cent to at least 70 per cent. And for older workers between 55 and 64 the employment rate should rise from 38 per cent to at least one half. These are ambitious targets. They will require two big changes: more people must seek work, and among those seeking work a higher proportion must get a job....... So we need higher participation, and (for full employment) we need a much lower unemployment rate. Can it be done? A mere glance at the experience of different European countries shows that it can. As Table 1 shows, four E.U. countries already exceed the overall target for 2010 (Britain, Denmark...

  17. Relevance as process: judgements in the context of scholarly research

    Directory of Open Access Journals (Sweden)

    Theresa D. Anderson

    2005-01-01

    Full Text Available Introduction. This paper discusses how exploring the research process in-depth and over time contributes to a fuller understanding of interactions with various representations of information. Method. A longitudinal ethnographic study explored decisions made by two informants involved in scholarly research. Relevance assessment and information seeking were observed as part of informants' own ongoing research projects. Fieldwork used methods of discovery that allowed informants to shape the exploration of the practices surrounding the evolving understandings of their topics. Analysis. Inductive analysis was carried out on the qualitative data collected over a two-year period of judgements observed on a document-by-document basis. The paper introduces broad categories that point to the variability and richness of the ways that informants used representations of information resources to make relevance judgements. Results. Relevance judgements appear to be drivers of the search and research processes informants moved through during the observations. Focusing on research goals rather than on retrieval tasks brings us to a fuller understanding of the relationship between ultimate research goals and the articulation of those goals in interactions with information systems. Conclusion. Relevance assessment is a process that unfolds in the doing of a search, the making of judgements and the using of texts and representations of information.

  18. NAMED ENTITY RECOGNITION FROM BIOMEDICAL TEXT -AN INFORMATION EXTRACTION TASK

    Directory of Open Access Journals (Sweden)

    N. Kanya

    2016-07-01

    Full Text Available Biomedical Text Mining targets the Extraction of significant information from biomedical archives. Bio TM encompasses Information Retrieval (IR and Information Extraction (IE. The Information Retrieval will retrieve the relevant Biomedical Literature documents from the various Repositories like PubMed, MedLine etc., based on a search query. The IR Process ends up with the generation of corpus with the relevant document retrieved from the Publication databases based on the query. The IE task includes the process of Preprocessing of the document, Named Entity Recognition (NER from the documents and Relationship Extraction. This process includes Natural Language Processing, Data Mining techniques and machine Language algorithm. The preprocessing task includes tokenization, stop word Removal, shallow parsing, and Parts-Of-Speech tagging. NER phase involves recognition of well-defined objects such as genes, proteins or cell-lines etc. This process leads to the next phase that is extraction of relationships (IE. The work was based on machine learning algorithm Conditional Random Field (CRF.

  19. Informational Text and the CCSS

    Science.gov (United States)

    Aspen Institute, 2012

    2012-01-01

    What constitutes an informational text covers a broad swath of different types of texts. Biographies & memoirs, speeches, opinion pieces & argumentative essays, and historical, scientific or technical accounts of a non-narrative nature are all included in what the Common Core State Standards (CCSS) envisions as informational text. Also included…

  20. The Only Safe SMS Texting Is No SMS Texting.

    Science.gov (United States)

    Toth, Cheryl; Sacopulos, Michael J

    2015-01-01

    Many physicians and practice staff use short messaging service (SMS) text messaging to communicate with patients. But SMS text messaging is unencrypted, insecure, and does not meet HIPAA requirements. In addition, the short and abbreviated nature of text messages creates opportunities for misinterpretation, and can negatively impact patient safety and care. Until recently, asking patients to sign a statement that they understand and accept these risks--as well as having policies, device encryption, and cyber insurance in place--would have been enough to mitigate the risk of using SMS text in a medical practice. But new trends and policies have made SMS text messaging unsafe under any circumstance. This article explains these trends and policies, as well as why only secure texting or secure messaging should be used for physician-patient communication.

  1. Using the Characteristics of Documents, Users and Tasks to Predict the Situational Relevance of Health Web Documents

    Directory of Open Access Journals (Sweden)

    Melinda Oroszlányová

    2017-09-01

    Full Text Available Relevance is usually estimated by search engines using document content, disregarding the user behind the search and the characteristics of the task. In this work, we look at relevance as framed in a situational context, calling it situational relevance, and analyze whether it is possible to predict it using documents, users and tasks characteristics. Using an existing dataset composed of health web documents, relevance judgments for information needs, user and task characteristics, we build a multivariate prediction model for situational relevance. Our model has an accuracy of 77.17%. Our findings provide insights into features that could improve the estimation of relevance by search engines, helping to conciliate the systemic and situational views of relevance. In a near future we will work on the automatic assessment of document, user and task characteristics.

  2. Predicting Prosody from Text for Text-to-Speech Synthesis

    CERN Document Server

    Rao, K Sreenivasa

    2012-01-01

    Predicting Prosody from Text for Text-to-Speech Synthesis covers the specific aspects of prosody, mainly focusing on how to predict the prosodic information from linguistic text, and then how to exploit the predicted prosodic knowledge for various speech applications. Author K. Sreenivasa Rao discusses proposed methods along with state-of-the-art techniques for the acquisition and incorporation of prosodic knowledge for developing speech systems. Positional, contextual and phonological features are proposed for representing the linguistic and production constraints of the sound units present in the text. This book is intended for graduate students and researchers working in the area of speech processing.

  3. Text recycling: acceptable or misconduct?

    Science.gov (United States)

    Harriman, Stephanie; Patel, Jigisha

    2014-08-16

    Text recycling, also referred to as self-plagiarism, is the reproduction of an author's own text from a previous publication in a new publication. Opinions on the acceptability of this practice vary, with some viewing it as acceptable and efficient, and others as misleading and unacceptable. In light of the lack of consensus, journal editors often have difficulty deciding how to act upon the discovery of text recycling. In response to these difficulties, we have created a set of guidelines for journal editors on how to deal with text recycling. In this editorial, we discuss some of the challenges of developing these guidelines, and how authors can avoid undisclosed text recycling.

  4. Text against Text: Counterbalancing the Hegemony of Assessment.

    Science.gov (United States)

    Cosgrove, Cornelius

    A study examined whether composition specialists can counterbalance the potential privileging of the assessment perspective, or of self-appointed interpreters of that perspective, through the study of assessment discourse as text. Fourteen assessment texts were examined, most of them journal articles and most of them featuring the common…

  5. [Relevant public health enteropathogens].

    Science.gov (United States)

    Riveros, Maribel; Ochoa, Theresa J

    2015-01-01

    Diarrhea remains the third leading cause of death in children under five years, despite recent advances in the management and prevention of this disease. It is caused by multiple pathogens, however, the prevalence of each varies by age group, geographical area and the scenario where cases (community vs hospital) are recorded. The most relevant pathogens in public health are those associated with the highest burden of disease, severity, complications and mortality. In our country, norovirus, Campylobacter and diarrheagenic E. coli are the most prevalent pathogens at the community level in children. In this paper we review the local epidemiology and potential areas of development in five selected pathogens: rotavirus, norovirus, Shiga toxin-producing E. coli (STEC), Shigella and Salmonella. Of these, rotavirus is the most important in the pediatric population and the main agent responsible for child mortality from diarrhea. The introduction of rotavirus vaccination in Peru will have a significant impact on disease burden and mortality from diarrhea. However, surveillance studies are needed to determine the impact of vaccination and changes in the epidemiology of diarrhea in Peru following the introduction of new vaccines, as well as antibiotic resistance surveillance of clinical relevant bacteria.

  6. Effects of Text Messaging on Academic Performance

    Directory of Open Access Journals (Sweden)

    Barks Amanda

    2011-12-01

    Full Text Available University students frequently send and receive cellular phone text messages during classroominstruction. Cognitive psychology research indicates that multi-tasking is frequently associatedwith performance cost. However, university students often have considerable experience withelectronic multi-tasking and may believe that they can devote necessary attention to a classroomlecture while sending and receiving text messages. In the current study, university students whoused text messaging were randomly assigned to one of two conditions: 1. a group that sent andreceived text messages during a lecture or, 2. a group that did not engage in text messagingduring the lecture. Participants who engaged in text messaging demonstrated significantlypoorer performance on a test covering lecture content compared with the group that did notsend and receive text messages. Participants exhibiting higher levels of text messaging skill hadsignificantly lower test scores than participants who were less proficient at text messaging. It ishypothesized that in terms of retention of lecture material, more frequent task shifting by thosewith greater text messaging proficiency contributed to poorer performance. Overall, the findingsdo not support the view, held by many university students, that this form of multitasking has littleeffect on the acquisition of lecture content. Results provide empirical support for teachers andprofessors who ban text messaging in the classroom.

  7. [Formula: see text]The statistical crisis in science: how is it relevant to clinical neuropsychology?

    Science.gov (United States)

    Gelman, Andrew; Geurts, Hilde M

    There is currently increased attention to the statistical (and replication) crisis in science. Biomedicine and social psychology have been at the heart of this crisis, but similar problems are evident in a wide range of fields. We discuss three examples of replication challenges from the field of social psychology and some proposed solutions, and then consider the applicability of these ideas to clinical neuropsychology. In addition to procedural developments such as preregistration and open data and criticism, we recommend that data be collected and analyzed with more recognition that each new study is a part of a learning process. The goal of improving neuropsychological assessment, care, and cure is too important to not take good scientific practice seriously.

  8. Control-relevant modeling and simulation of a SOFC-GT hybrid system

    Directory of Open Access Journals (Sweden)

    Rambabu Kandepu

    2006-07-01

    Full Text Available In this paper, control-relevant models of the most important components in a SOFC-GT hybrid system are described. Dynamic simulations are performed on the overall hybrid system. The model is used to develop a simple control structure, but the simulations show that more elaborate control is needed.

  9. Knowledge Representation in Travelling Texts

    DEFF Research Database (Denmark)

    Mousten, Birthe; Locmele, Gunta

    2014-01-01

    Today, information travels fast. Texts travel, too. In a corporate context, the question is how to manage which knowledge elements should travel to a new language area or market and in which form? The decision to let knowledge elements travel or not travel highly depends on the limitation...... and the purpose of the text in a new context as well as on predefined parameters for text travel. For texts used in marketing and in technology, the question is whether culture-bound knowledge representation should be domesticated or kept as foreign elements, or should be mirrored or moulded—or should not travel...... at all! When should semantic and pragmatic elements in a text be replaced and by which other elements? The empirical basis of our work is marketing and technical texts in English, which travel into the Latvian and Danish markets, respectively....

  10. Texting while driving: is speech-based text entry less risky than handheld text entry?

    Science.gov (United States)

    He, J; Chaparro, A; Nguyen, B; Burge, R J; Crandall, J; Chaparro, B; Ni, R; Cao, S

    2014-11-01

    Research indicates that using a cell phone to talk or text while maneuvering a vehicle impairs driving performance. However, few published studies directly compare the distracting effects of texting using a hands-free (i.e., speech-based interface) versus handheld cell phone, which is an important issue for legislation, automotive interface design and driving safety training. This study compared the effect of speech-based versus handheld text entries on simulated driving performance by asking participants to perform a car following task while controlling the duration of a secondary text-entry task. Results showed that both speech-based and handheld text entries impaired driving performance relative to the drive-only condition by causing more variation in speed and lane position. Handheld text entry also increased the brake response time and increased variation in headway distance. Text entry using a speech-based cell phone was less detrimental to driving performance than handheld text entry. Nevertheless, the speech-based text entry task still significantly impaired driving compared to the drive-only condition. These results suggest that speech-based text entry disrupts driving, but reduces the level of performance interference compared to text entry with a handheld device. In addition, the difference in the distraction effect caused by speech-based and handheld text entry is not simply due to the difference in task duration. Copyright © 2014 Elsevier Ltd. All rights reserved.

  11. Texte, Mathématiques, Philosophie et Sujet

    Directory of Open Access Journals (Sweden)

    Jean-Michel Salanskis

    2004-04-01

    Full Text Available Dans cet article sont menées deux réflexions. La première tente de juger du rapport de la philosophie à sa textualisation d’après le rapport des mathématiques à leur textualisation, et ce à trois niveaux : 1 en essayant de tirer des manières dont le texte mathématique excède sa forme logique des enseignements quant à la pertinence et la viabilité d’une réduction du texte philosophique à sa forme logique ; 2 en posant le problème d’une étude externaliste du texte philosophique à la lumière des difficultés particulières que suscite l’approche externaliste du texte mathématique ; 3 en examinant ce qu’il en est de l’hybridation du philosophique et du mathématique dans certains textes. La seconde porte sur un aspect particulier de la textualisation : sur l’intervention du marqueur du sujet de l’énonciation (« Je » dans les textes philosophiques.Two reflexions are carried out in this article. The first one tries to judge the relationship between philosophy and its textualization after the relationship between mathematics and their textualization at three different levels : 1 trying to draw ways in which the mathematical text exceeds the logical form of teaching with regards to the relevance and the viability of a reduction of the philosophical text up to its logical form ; 2 setting the problem of an externalist study of the philosophical text in the light of the peculiar difficulties aroused by the externalist approach of the mathematical text ; 3 examining what concerns the hybridization of the philosophical and the mathematical in certain texts. The second one deals with a particular aspect of textualization : with the intervention of the marker of enunciation subject (« I » in philosophical texts.

  12. Active Learning for Text Classification

    OpenAIRE

    Hu, Rong

    2011-01-01

    Text classification approaches are used extensively to solve real-world challenges. The success or failure of text classification systems hangs on the datasets used to train them, without a good dataset it is impossible to build a quality system. This thesis examines the applicability of active learning in text classification for the rapid and economical creation of labelled training data. Four main contributions are made in this thesis. First, we present two novel selection strategies to cho...

  13. Text Mining Applications and Theory

    CERN Document Server

    Berry, Michael W

    2010-01-01

    Text Mining: Applications and Theory presents the state-of-the-art algorithms for text mining from both the academic and industrial perspectives.  The contributors span several countries and scientific domains: universities, industrial corporations, and government laboratories, and demonstrate the use of techniques from machine learning, knowledge discovery, natural language processing and information retrieval to design computational models for automated text analysis and mining. This volume demonstrates how advancements in the fields of applied mathematics, computer science, machine learning

  14. Communicative Pragmatic Parameters of Manipulative Formulas in the Advertising Text

    Directory of Open Access Journals (Sweden)

    Areshenkova Oleksandra

    2016-12-01

    Full Text Available Background: Advertising communication tends to shorten the use of the language means. This characteristic explains the deliberate usage of such linguistic constructions that primarily influence the potential consumer. The role of advertising has increased in the modern world. Its strengthening intensifies the interest in the study of this social phenomenon among the scientists in various fields. The relevance of the study is obvious due to the fact that the issues of verbal influence on the recipient remains unexplored in modern Ukrainian linguistics. Purpose: clarifying the role of manipulative formulas in pragmatic implementation of advertisement guidelines. Results: This article makes an attempt to define and describe a set of basic communicative and pragmatic properties in modern advertising texts. The research represents main communicative and speech characteristics of advertising texts. The author analyzes the role of evaluation in producing the impact of communicative effect on the recipient. Also it reveals the efficiency of using impact-oriented language means in advertising texts. Discussion: Hidden evaluation of the advertised goods / services is an effective communicative pragmatic tool influencing the buyer. The latent assessment is provided by manipulative formulas (all, million; №1, leader; 100%; professional, expert.

  15. Graphics in Text: A Bibliography. Monograph No. 6.

    Science.gov (United States)

    Macdonald-Ross, Michael; Smith, Eleanor

    This bibliography lists books and articles discussing graphic aspects of human communication. References have been selected for their relevance to the design of self-instructional texts for the adult learner; for the most part, research on younger children, on non-text media, and on non-educational texts is not included. Items are organized into…

  16. Chapter 16: text mining for translational bioinformatics.

    Science.gov (United States)

    Cohen, K Bretonnel; Hunter, Lawrence E

    2013-04-01

    Text mining for translational bioinformatics is a new field with tremendous research potential. It is a subfield of biomedical natural language processing that concerns itself directly with the problem of relating basic biomedical research to clinical practice, and vice versa. Applications of text mining fall both into the category of T1 translational research-translating basic science results into new interventions-and T2 translational research, or translational research for public health. Potential use cases include better phenotyping of research subjects, and pharmacogenomic research. A variety of methods for evaluating text mining applications exist, including corpora, structured test suites, and post hoc judging. Two basic principles of linguistic structure are relevant for building text mining applications. One is that linguistic structure consists of multiple levels. The other is that every level of linguistic structure is characterized by ambiguity. There are two basic approaches to text mining: rule-based, also known as knowledge-based; and machine-learning-based, also known as statistical. Many systems are hybrids of the two approaches. Shared tasks have had a strong effect on the direction of the field. Like all translational bioinformatics software, text mining software for translational bioinformatics can be considered health-critical and should be subject to the strictest standards of quality assurance and software testing.

  17. Figure-associated text summarization and evaluation.

    Directory of Open Access Journals (Sweden)

    Balaji Polepalli Ramesh

    Full Text Available Biomedical literature incorporates millions of figures, which are a rich and important knowledge resource for biomedical researchers. Scientists need access to the figures and the knowledge they represent in order to validate research findings and to generate new hypotheses. By themselves, these figures are nearly always incomprehensible to both humans and machines and their associated texts are therefore essential for full comprehension. The associated text of a figure, however, is scattered throughout its full-text article and contains redundant information content. In this paper, we report the continued development and evaluation of several figure summarization systems, the FigSum+ systems, that automatically identify associated texts, remove redundant information, and generate a text summary for every figure in an article. Using a set of 94 annotated figures selected from 19 different journals, we conducted an intrinsic evaluation of FigSum+. We evaluate the performance by precision, recall, F1, and ROUGE scores. The best FigSum+ system is based on an unsupervised method, achieving F1 score of 0.66 and ROUGE-1 score of 0.97. The annotated data is available at figshare.com (http://figshare.com/articles/Figure_Associated_Text_Summarization_and_Evaluation/858903.

  18. Text segmentation in degraded historical document images

    Directory of Open Access Journals (Sweden)

    A.S. Kavitha

    2016-07-01

    Full Text Available Text segmentation from degraded Historical Indus script images helps Optical Character Recognizer (OCR to achieve good recognition rates for Hindus scripts; however, it is challenging due to complex background in such images. In this paper, we present a new method for segmenting text and non-text in Indus documents based on the fact that text components are less cursive compared to non-text ones. To achieve this, we propose a new combination of Sobel and Laplacian for enhancing degraded low contrast pixels. Then the proposed method generates skeletons for text components in enhanced images to reduce computational burdens, which in turn helps in studying component structures efficiently. We propose to study the cursiveness of components based on branch information to remove false text components. The proposed method introduces the nearest neighbor criterion for grouping components in the same line, which results in clusters. Furthermore, the proposed method classifies these clusters into text and non-text cluster based on characteristics of text components. We evaluate the proposed method on a large dataset containing varieties of images. The results are compared with the existing methods to show that the proposed method is effective in terms of recall and precision.

  19. Text Genres in Information Organization

    Science.gov (United States)

    Nahotko, Marek

    2016-01-01

    Introduction: Text genres used by so-called information organizers in the processes of information organization in information systems were explored in this research. Method: The research employed text genre socio-functional analysis. Five genre groups in information organization were distinguished. Every genre group used in information…

  20. Dynamic effects of self-relevance and task on the neural processing of emotional words in context

    Directory of Open Access Journals (Sweden)

    Eric C. Fields

    2016-01-01

    Full Text Available We used event-related potentials (ERPs to examine the interactions between task, emotion, and contextual self-relevance on processing words in social vignettes. Participants read scenarios that were in either third person (other-relevant or second person (self-relevant and we recorded ERPs to a neutral, pleasant, or unpleasant critical word. In a previously reported study (Fields & Kuperberg, 2012 with these stimuli, participants were tasked with producing a third sentence continuing the scenario. We observed a larger LPC to emotional words than neutral words in both the self-relevant and other-relevant scenarios, but this effect was smaller in the self-relevant scenarios because the LPC was larger on the neutral words (i.e., a larger LPC to self-relevant than other-relevant neutral words. In the present work, participants simply answered comprehension questions that did not refer to the emotional aspects of the scenario. Here we observed quite a different pattern of interaction between self-relevance and emotion: the LPC was larger to emotional versus neutral words in the self-relevant scenarios only, and there was no effect of self-relevance on neutral words. Taken together, these findings suggest that the LPC reflects a dynamic interaction between specific task demands, the emotional properties of a stimulus, and contextual self-relevance. We conclude by discussing implications and future directions for a functional theory of the emotional LPC.

  1. Text mining in livestock animal science: introducing the potential of text mining to animal sciences.

    Science.gov (United States)

    Sahadevan, S; Hofmann-Apitius, M; Schellander, K; Tesfaye, D; Fluck, J; Friedrich, C M

    2012-10-01

    In biological research, establishing the prior art by searching and collecting information already present in the domain has equal importance as the experiments done. To obtain a complete overview about the relevant knowledge, researchers mainly rely on 2 major information sources: i) various biological databases and ii) scientific publications in the field. The major difference between the 2 information sources is that information from databases is available, typically well structured and condensed. The information content in scientific literature is vastly unstructured; that is, dispersed among the many different sections of scientific text. The traditional method of information extraction from scientific literature occurs by generating a list of relevant publications in the field of interest and manually scanning these texts for relevant information, which is very time consuming. It is more than likely that in using this "classical" approach the researcher misses some relevant information mentioned in the literature or has to go through biological databases to extract further information. Text mining and named entity recognition methods have already been used in human genomics and related fields as a solution to this problem. These methods can process and extract information from large volumes of scientific text. Text mining is defined as the automatic extraction of previously unknown and potentially useful information from text. Named entity recognition (NER) is defined as the method of identifying named entities (names of real world objects; for example, gene/protein names, drugs, enzymes) in text. In animal sciences, text mining and related methods have been briefly used in murine genomics and associated fields, leaving behind other fields of animal sciences, such as livestock genomics. The aim of this work was to develop an information retrieval platform in the livestock domain focusing on livestock publications and the recognition of relevant data from

  2. Manipulative Use of Short Messaging Service (SMS Text Messages by Nigerian Telecommunications Companies

    Directory of Open Access Journals (Sweden)

    Ayoola, Kehinde A.

    2014-02-01

    Full Text Available This paper is an application of Relevance Theory for the interpretation of short messaging service (SMS text messages emanating from Nigerian telecommunications companies to their subscribers. The aim of the research was to identify and describe the manipulative strategies employed by Nigerian telecommunications companies to induce subscribers to part with their money through sales promotion lotteries. 100 SMS texts were purposively extracted from the cell phones of randomly selected residents of Lagos Nigeria who had received promotional SMS text messages from three major Nigerian telecommunications companies. Using Sperber and Wilson's Relevance Theory (1995 as its theoretical framework, the paper described the manipulative use of SMS by Nigerian telecommunications companies. The analysis revealed that SMS text messages were encoded to achieve maximization of relevance through explicature and implicature; contextual implication and strengthening; and the reduction of processing effort through violating the maxim of truthfulness and the creative use of graphology. The paper concludes that SMS text-messages were used manipulatively by Nigerian telecommunications companies to earn indirect income from sales promotion lottery.

  3. Linguistic Dating of Biblical Texts

    DEFF Research Database (Denmark)

    Ehrensvärd, Martin Gustaf

    2003-01-01

    For two centuries, scholars have pointed to consistent differences in the Hebrew of certain biblical texts and interpreted these differences as reflecting the date of composition of the texts. Until the 1980s, this was quite uncontroversial as the linguistic findings largely confirmed the chronol......For two centuries, scholars have pointed to consistent differences in the Hebrew of certain biblical texts and interpreted these differences as reflecting the date of composition of the texts. Until the 1980s, this was quite uncontroversial as the linguistic findings largely confirmed...... the chronology of the texts established by other means: the Hebrew of Genesis-2 Kings was judged to be early and that of Esther, Daniel, Ezra, Nehemiah, and Chronicles to be late. In the current debate where revisionists have questioned the traditional dating, linguistic arguments in the dating of texts have...... come more into focus. The study critically examines some linguistic arguments adduced to support the traditional position, and reviewing the arguments it points to weaknesses in the linguistic dating of EBH texts to pre-exilic times. When viewing the linguistic evidence in isolation it will be clear...

  4. INVESTIGATING TEACHERS’ PROFESSIONAL COMPETENCE: A SYSTEMIC FUNCTIONAL LINGUISTIC ANALYSIS OF TEACHERS’ REPORT TEXTS

    Directory of Open Access Journals (Sweden)

    Sudarsono M. I. Sudarsono

    2017-05-01

    Full Text Available This research aims at observing the teachers’ professional competence by investigating the report texts written by three English teachers in a junior high school in terms of their schematic structures and linguistic features. To achieve this aim, a qualitative case study design involving analysis of English teachers’ report texts and interviews with these English teachers was employed in this research. The results of this research showed that generally the three English teachers have demonstrated sufficient ability in applying appropriate schematic structures and linguistic features relevant to the criteria of a report text. However, the results of this research also indicate that some improvements in understanding and writing a report text, especially in terms of schematic structure, linguistic features, and theme progressions, are needed to enhance the teachers’ subject matter content knowledge about report text.

  5. What Do You Do With Hands Like These? Close Reading Facilitates Exploration and Text Creation

    Directory of Open Access Journals (Sweden)

    Lindsey Moses

    2013-05-01

    Full Text Available This article shares instructional ideas to enhance language and literacy experiences involving the reading and writing processes of young bilinguals (Spanish and English in Colorado, USA when engaging with informational texts. Informational texts provide language scaffolds for young bilinguals because they build on their background knowledge about the world around them. Drawing on their recognition of real-world concepts found in informational texts, teaching ideas that enrich both academic and social vocabulary are shared. These teaching ideas suggest moving beyond the read aloud and individual reading of informational texts; they suggest instead teaching young learners to ‘read like writers’ and utilize Jenkins and Page’s What Do You Do With a Tail Like This? (2003 as a mentor text. This article includes relevant research, teaching ideas and classroom examples for scaffolding a close reading, ultimately resulting in intercultural explorations as the children share their writing about their home contexts.

  6. Agreement on economic and technological cooperation between the Federal Republic of Germany and the GDR. Project part 3.2, ''NDT and QA''. Project task 2.11. Experiments with the full-size vessel in Stuttgart for selection of practice-relevant non-destructive testing methods for evaluation of the value and performance of recurrent inspections of reactor components. Final report

    International Nuclear Information System (INIS)

    Betzold, K.; Brinette, R.; Bonitz, F.

    1992-01-01

    The efficiency of NDT methods such as ALOK, SAFT, EMUS, LLT, phased array, and multi-frequency eddy current testing which are generally used for reactor components recurrent inspection has been verified with experiments using two test specimens. These are a section of a main coolant pipe and the full-size vessel installed at MPA-Stuttgart, furnished with PWR test bodies with artificial defects and artificially applied natural defects. The defects have been detected with commercial probes as well as with probes optimized for the NDT methods EMUS, LLT, phased array, and multi-frequency eddy current testing. Type, location, orientation and geometry of the defects have been measured, also recording the influence of type of defect on the efficiency of the NDT methods, in order to reveal problems linked with the various methods as well as their advantages. Further tests have been made for evaluation of a combination of ALOK and SAFT using novel, specifically developed test probes, and a combination of ALOK and phased array testing. (orig.) [de

  7. Stemming Malay Text and Its Application in Automatic Text Categorization

    Science.gov (United States)

    Yasukawa, Michiko; Lim, Hui Tian; Yokoo, Hidetoshi

    In Malay language, there are no conjugations and declensions and affixes have important grammatical functions. In Malay, the same word may function as a noun, an adjective, an adverb, or, a verb, depending on its position in the sentence. Although extensively simple root words are used in informal conversations, it is essential to use the precise words in formal speech or written texts. In Malay, to make sentences clear, derivative words are used. Derivation is achieved mainly by the use of affixes. There are approximately a hundred possible derivative forms of a root word in written language of the educated Malay. Therefore, the composition of Malay words may be complicated. Although there are several types of stemming algorithms available for text processing in English and some other languages, they cannot be used to overcome the difficulties in Malay word stemming. Stemming is the process of reducing various words to their root forms in order to improve the effectiveness of text processing in information systems. It is essential to avoid both over-stemming and under-stemming errors. We have developed a new Malay stemmer (stemming algorithm) for removing inflectional and derivational affixes. Our stemmer uses a set of affix rules and two types of dictionaries: a root-word dictionary and a derivative-word dictionary. The use of set of rules is aimed at reducing the occurrence of under-stemming errors, while that of the dictionaries is believed to reduce the occurrence of over-stemming errors. We performed an experiment to evaluate the application of our stemmer in text mining software. For the experiment, text data used were actual web pages collected from the World Wide Web to demonstrate the effectiveness of our Malay stemming algorithm. The experimental results showed that our stemmer can effectively increase the precision of the extracted Boolean expressions for text categorization.

  8. Anomaly Detection with Text Mining

    Data.gov (United States)

    National Aeronautics and Space Administration — Many existing complex space systems have a significant amount of historical maintenance and problem data bases that are stored in unstructured text forms. The...

  9. Social Studies: Texts and Supplements.

    Science.gov (United States)

    Curriculum Review, 1979

    1979-01-01

    This review of selected social studies texts, series, and supplements, mainly for the secondary level, includes a special section examining eight titles on warfare and terrorism for grades 4-12. (SJL)

  10. Blocking Avoidance and Escape Responses: Relations With Clinically Relevant Behaviors

    Directory of Open Access Journals (Sweden)

    Juliana Maria Bubna Popovitz

    Full Text Available Abstract: The current study aims to evaluate the possible effects of interrupting problematic clinically relevant behaviors on the percentage of these responses and of clinical improvement-related responses. Two clients were treated with Functional Analytic Psychotherapy (FAP, alternating two conditions (ABAB. On condition A, procedures to the therapist consisted of responding to the clinical improvement responses, and to description of outside of therapeutic setting behaviors, but therapists were advised to ignore problem behaviors emitted in session. During condition B, therapists followed the same procedures, but they were oriented to block (interrupt problematic responses emitted in session. Results suggest increase in the percentage of problem behaviors during condition B. Results are discussed, highlighting the viability of planning the contingent response the therapist emits to clinically relevant behaviors.

  11. Creativity-Relevant Personal Characteristics among Indonesia Creative Workers

    Directory of Open Access Journals (Sweden)

    Nugroho J. Setiadi

    2014-09-01

    Full Text Available The study aims to identify Creativity-relevant Personal Characteristics among creative workers in Indonesia’s creative industry. Identification of the constituent elements of the nature of the changes needs to be measured. Researchers have advocated replacing creativity-relevant personal characteristics based on the five-factor model to investigate how individual differences stimulate creativity. This study presents data supporting reliability (internal consistency and validity (criterion and construct of the instrument. Validity of the instrument is based on the content validity involving art and design experts. The 220 creative workers from several creative industry firms in Indonesia participated as samples in this research. Results of a factor analysis indicated a five factor solution of creative characteristics and behavior. Discussion of findings and the most important ways in which individuals differ in their enduring emotional, interpersonal, experiential, attitudinal, and motivational styles for stimulating creativity are presented.

  12. Underdevelopment in contemporary world:is structuralism still relevant?

    Directory of Open Access Journals (Sweden)

    ADEMIR PEDRO VILAÇA JUNIOR

    Full Text Available ABSTRACT This paper intends to evaluate if the Latin American structuralist approach is still relevant to understand capital accumulation dynamics of peripheral countries and their insertion in the global value chains. It’s a theoretical paper that strives to improve the building blocks of structuralism with the incorporation of elements from different approaches to establish a nexus to understand capital accumulation dynamics in the periphery. Considering the relevance of technological accumulation, its impacts over the productive structure and over the international insertion, we strive to analyze factors that perpetuate income diversion in relation to the center. Under this perspective, we conclude that the particularities of peripheral economies changed their form of manifestation without effectively overcome the dependence relation.

  13. Strategic approach to branding of nations: Relevancy for Serbia

    Directory of Open Access Journals (Sweden)

    Rakita Branko

    2009-01-01

    Full Text Available Building and managing brands becomes very important marketing tool in nowadays business. Branding is being pulled out from a strictly marketing area and becomes business component of a strategic importance. It is applying to products, services, companies, but also to events, people, ideas, institutions, destinations. Basically, almost everything can be branded. The subject of this paper is strategic approach to branding of nations. The paper contains review of relevant literature for the topic. Specifics of this type of branding have been analyzed. Detailed concept of strategic approach to branding of nations is a vital part of the paper. Relevancy of strategic approach to branding for Serbia is discussed at the end.

  14. THE LEVEL OF KNOWLEDGE IN THE VALUE RELEVANCE LITERATURE

    Directory of Open Access Journals (Sweden)

    Mihaela Alina ROBU

    2014-12-01

    Full Text Available In the last decades, numerous studies have covered the relationship between stock price or stock return and financial information. These studies represent the "value-relevance" literature. Knowledge of this area of interest, through literature and the main ideas, yields scientific progress. The aim of the study is to achieve a qualitative and a quantitative analysis regarding the level of knowledge in the value relevance literature, in an international context. To achieve this aim, a number of 53 scientific articles published between 2001 and 2013 were selected, from the first two journals related to the number of citations in the rankings compiled by Google Scholar, Accounting and Taxation category. Qualitative analysis and quantitative analysis (factorial analysis of multiple correspondences as statistical method were used. The results reflect the importance of existing problems in the financial markets. The studies are focused on solving these problems, to support the investors.

  15. Pierre Robin sequence: case report, the relevance of autopsy

    Directory of Open Access Journals (Sweden)

    Cristiano C. Oliveira

    2015-10-01

    Full Text Available ABSTRACTPierre Robin sequence is a neonatal disorder characterized by micrognathism, glossoptosis and cleft palate. We reported an autopsy case of a child whose malformations of the oropharynx were identified only at birth. The child was extremely preterm with severe neonatal depression and poor recovery, and the orofacial alterations prevented the correct treatment. There was facial disorder characterized by micrognathia associated with cleft palate and posterior displacement of the tongue, compressing the vallecula, structurally compatible with glossoptosis. This autopsy surpassed the scientific and epidemiological relevance, allowing the family genetic counseling and close monitoring of a subsequent pregnancy.

  16. THE RELEVANCE OF ECONOMIC INFORMATION IN ANALYZING THE ECONOMIC PERFORMANCE

    Directory of Open Access Journals (Sweden)

    PATRUTA MIRCEA IOAN

    2017-12-01

    Full Text Available The performance analysis is based on an informational system, which provides financial information in various formatsand with various applicabilities.We intend to formulate a set of important caracteristics of financial information along with identifying a set of relevant financial rates and indicatorsused to appreciate the performance level of a company. Economic performance can be interpreted in different ways at each level of analysis. Generally, it refers to economic growth, increased productivity and profitability. The growth of labor productivity or increased production per worker is a measure of efficient use of resources in value creation.

  17. Models of the Economic Growth and their Relevance

    Directory of Open Access Journals (Sweden)

    Nicolae MOROIANU

    2012-06-01

    Full Text Available Until few years ago, the economic growth was something perfect normal, part of an era marked by the transformation speed. Normality itself has been transformed and we currently are influenced by other rules, unknown yet, which should answer the question: “How do we return to the economic growth?” The economic growth and the models aiming to solve this problem concern the economic history even since its beginnings. In this paper we would like to find out what is the relevance that the well-known macroeconomic models still have and which might be their applicability level in a framework created by a black swan event type.

  18. Text Mining in Organizational Research.

    Science.gov (United States)

    Kobayashi, Vladimer B; Mol, Stefan T; Berkers, Hannah A; Kismihók, Gábor; Den Hartog, Deanne N

    2018-07-01

    Despite the ubiquity of textual data, so far few researchers have applied text mining to answer organizational research questions. Text mining, which essentially entails a quantitative approach to the analysis of (usually) voluminous textual data, helps accelerate knowledge discovery by radically increasing the amount data that can be analyzed. This article aims to acquaint organizational researchers with the fundamental logic underpinning text mining, the analytical stages involved, and contemporary techniques that may be used to achieve different types of objectives. The specific analytical techniques reviewed are (a) dimensionality reduction, (b) distance and similarity computing, (c) clustering, (d) topic modeling, and (e) classification. We describe how text mining may extend contemporary organizational research by allowing the testing of existing or new research questions with data that are likely to be rich, contextualized, and ecologically valid. After an exploration of how evidence for the validity of text mining output may be generated, we conclude the article by illustrating the text mining process in a job analysis setting using a dataset composed of job vacancies.

  19. [Text mining, a method for computer-assisted analysis of scientific texts, demonstrated by an analysis of author networks].

    Science.gov (United States)

    Hahn, P; Dullweber, F; Unglaub, F; Spies, C K

    2014-06-01

    Searching for relevant publications is becoming more difficult with the increasing number of scientific articles. Text mining as a specific form of computer-based data analysis may be helpful in this context. Highlighting relations between authors and finding relevant publications concerning a specific subject using text analysis programs are illustrated graphically by 2 performed examples. © Georg Thieme Verlag KG Stuttgart · New York.

  20. Current Writing: Text and Reception in Southern Africa - Vol 18, No 1 ...

    African Journals Online (AJOL)

    Lions, leopards and liminal spaces:Representations of Biosociality in the Writings of Katy Payne, Linda Tucker and Gillian van Houten · EMAIL FULL TEXT EMAIL FULL TEXT DOWNLOAD FULL TEXT DOWNLOAD FULL TEXT. W Woodward ...

  1. NOTICING AND TEXT-BASED CHAT

    Directory of Open Access Journals (Sweden)

    Chun Lai

    2006-09-01

    Full Text Available This study examined the capacity of text-based online chat to promote learners’ noticing of their problematic language productions and of the interactional feedback from their interlocutors. In this study, twelve ESL learners formed six mixed-proficiency dyads. The same dyads worked on two spot-the-difference tasks, one via online chat and the other through face-to-face conversation. Stimulated recall sessions were held subsequently to identify instances of noticing. It was found that text-based online chat promotes noticing more than face-to-face conversations, especially in terms of learners’ noticing of their own linguistic mistakes.

  2. NOTICING HYBRID RECASTS IN TEXT CHAT

    Directory of Open Access Journals (Sweden)

    Mark J. Oliver

    2016-12-01

    Full Text Available This study examined ten EFL learners’ noticing of the corrective nature of a form of text-based SCMC (text chat feedback that combined a recast of a grammatical error with metalinguistic information. The feedback, termed a hybrid recast, was provided by a native-speaker interlocutor during two text chat activities: a spot-the-difference and picture-ordering task. Data was collected in two ways: analysis of task-based dyadic text chat interaction in which uptake was used as an indicator of learner noticing, and a post-task questionnaire containing questions that identified evidence of learner noticing. Interaction analysis showed that learners responded to almost two thirds of the hybrid recasts with uptake. In addition, every learner provided evidence that they had correctly perceived at least some of the hybrid recasts as corrective in their post-task questionnaire responses.

  3. EXPLORING STUDENTS‟ DIFFICULTIES IN READING ACADEMIC TEXTS

    Directory of Open Access Journals (Sweden)

    Ira Ernawati

    2017-04-01

    Full Text Available Academic texts play an important role for university students. However, those texts are considered difficult. This study is intended to investigate students‘ difficulties in reading academic texts. The qualitative approach was employed in this study. The design was a case study. The participants were ten students from fifth semester of CLS: EE (Classroom Language and Strategy: Explaining and Exemplifying class who were selected by using purposive sampling. The data were gathered from students‘ journal reflections, observation, and interview. The finding shows that the students encountered reading difficulties in area of textual factors, namely vocabulary, comprehending specific information, text organization, and grammar and human factors including background knowledge, mood, laziness, and time constraint.

  4. Text Character Extraction Implementation from Captured Handwritten Image to Text Conversionusing Template Matching Technique

    Directory of Open Access Journals (Sweden)

    Barate Seema

    2016-01-01

    Full Text Available Images contain various types of useful information that should be extracted whenever required. A various algorithms and methods are proposed to extract text from the given image, and by using that user will be able to access the text from any image. Variations in text may occur because of differences in size, style,orientation, alignment of text, and low image contrast, composite backgrounds make the problem during extraction of text. If we develop an application that extracts and recognizes those texts accurately in real time, then it can be applied to many important applications like document analysis, vehicle license plate extraction, text- based image indexing, etc and many applications have become realities in recent years. To overcome the above problems we develop such application that will convert the image into text by using algorithms, such as bounding box, HSV model, blob analysis,template matching, template generation.

  5. GPU-Accelerated Text Mining

    International Nuclear Information System (INIS)

    Cui, X.; Mueller, F.; Zhang, Y.; Potok, Thomas E.

    2009-01-01

    Accelerating hardware devices represent a novel promise for improving the performance for many problem domains but it is not clear for which domains what accelerators are suitable. While there is no room in general-purpose processor design to significantly increase the processor frequency, developers are instead resorting to multi-core chips duplicating conventional computing capabilities on a single die. Yet, accelerators offer more radical designs with a much higher level of parallelism and novel programming environments. This present work assesses the viability of text mining on CUDA. Text mining is one of the key concepts that has become prominent as an effective means to index the Internet, but its applications range beyond this scope and extend to providing document similarity metrics, the subject of this work. We have developed and optimized text search algorithms for GPUs to exploit their potential for massive data processing. We discuss the algorithmic challenges of parallelization for text search problems on GPUs and demonstrate the potential of these devices in experiments by reporting significant speedups. Our study may be one of the first to assess more complex text search problems for suitability for GPU devices, and it may also be one of the first to exploit and report on atomic instruction usage that have recently become available in NVIDIA devices

  6. Sacred texts and mystic meaning: An inquiry into Christian ...

    African Journals Online (AJOL)

    EMAIL FREE FULL TEXT EMAIL FREE FULL TEXT · DOWNLOAD FULL TEXT DOWNLOAD FULL TEXT · http://dx.doi.org/10.4314/actat.v31i2.3 · AJOL African Journals Online. HOW TO USE AJOL... for Researchers · for Librarians · for Authors · FAQ's · More about AJOL · AJOL's Partners · Terms and Conditions of Use ...

  7. DECISION USEFULNESS: TRADE-OFF ANTARA RELIABILITY DAN RELEVANCE

    Directory of Open Access Journals (Sweden)

    AGUS INDRA TENAYA

    2007-07-01

    Full Text Available The purpose of this article is to search for trade-off solution betweenreliability and relevance. Approach that can be used to have more reliable andrelevant financial statement is decision usefulness. This approach suggests thatfinancial statement must be useful to become a base of investors’ decision making.The change function of financial statement from just a tool of responsibility tobecome a tool of decision making has caused historical cost-based financialstatement could not be used to predict future value of a firm. This problem couldbe solved by presenting full disclosure of financial statement. Discussion sessionshows that full disclosure results in more useful and reliable accountinginformation to be used in decision making process of various users.

  8. Autonomia e relevância dos regimes The autonomy and relevance of regimes

    Directory of Open Access Journals (Sweden)

    Gustavo Seignemartin de Carvalho

    2005-12-01

    Full Text Available Teorias institucionalistas na disciplina de relações internacionais usualmente definem regimes como um conjunto de normas e regras formais ou informais que permitem a convergência de expectativas ou a padronização do comportamento de seus participantes em uma determinada área de interesses com o objetivo de resolver problemas de coordenação que tenderiam a resultados não pareto-eficientes. Como estas definições baseadas meramente na "eficiência" dos regimes não parecem suficientes para explicar sua efetividade, o presente artigo propõe uma definição diferente para regimes: a de arranjos políticos que permitem a redistribuição dos ganhos da cooperação pelos participantes em uma determinada área de interesses em um contexto de interdependência. Regimes possuiriam efetividade pela sua autonomia e relevância, ou seja, por possuírem existência objetiva autônoma da de seus participantes e por influenciarem seu comportamento e expectativas de maneiras que não podem ser reduzidas à ação individual de nenhum deles. O artigo inicia-se com uma breve discussão sobre as dificuldades terminológicas associadas ao estudo de regimes e a definição dos conceitos de autonomia e relevância. Em seguida, classifica os diversos autores participantes do debate em duas perspectivas distintas, uma que nega (não-autonomistas e outra que atribui (autonomistas aos regimes autonomia e relevância, e faz uma breve análise dos autores e tradições mais significativos para o debate, aprofundando-se nos autonomistas e nos argumentos que reforçam a hipótese aqui apresentada. Ao final, o artigo propõe uma decomposição analítica dos regimes nos quatro elementos principais que lhes propiciam autonomia e relevância: normatividade, atores, especificidade da área de interesses e interdependência complexa com o contexto.Regimes are defined by institutionalist theories in the discipline of International Relations as formal or informal sets

  9. Individual Profiling Using Text Analysis

    Science.gov (United States)

    2016-04-15

    AFRL-AFOSR-UK-TR-2016-0011 Individual Profiling using Text Analysis 140333 Mark Stevenson UNIVERSITY OF SHEFFIELD, DEPARTMENT OF PSYCHOLOGY Final...REPORT TYPE      Final 3.  DATES COVERED (From - To)      15 Sep 2014 to 14 Sep 2015 4.  TITLE AND SUBTITLE Individual Profiling using Text Analysis ...consisted of collections of tweets for a number of Twitter users whose gender, age and personality scores are known. The task was to construct some system

  10. Finding text in color images

    Science.gov (United States)

    Zhou, Jiangying; Lopresti, Daniel P.; Tasdizen, Tolga

    1998-04-01

    In this paper, we consider the problem of locating and extracting text from WWW images. A previous algorithm based on color clustering and connected components analysis works well as long as the color of each character is relatively uniform and the typography is fairly simple. It breaks down quickly, however, when these assumptions are violated. In this paper, we describe more robust techniques for dealing with this challenging problem. We present an improved color clustering algorithm that measures similarity based on both RGB and spatial proximity. Layout analysis is also incorporated to handle more complex typography. THese changes significantly enhance the performance of our text detection procedure.

  11. Current Writing: Text and Reception in Southern Africa: Advanced ...

    African Journals Online (AJOL)

    Current Writing: Text and Reception in Southern Africa: Advanced Search. Journal Home > Current Writing: Text and Reception in Southern Africa: Advanced Search. Log in or Register to get access to full text downloads.

  12. Intertextuality within the linguistic analysis of a literary text

    Directory of Open Access Journals (Sweden)

    Л Н Лунькова

    2008-12-01

    Full Text Available The article is devoted to the phenomenon of precedent texts in fiction, the ways they are introduced into it and the possibilities of their linguistic interpretation within secondary texts.

  13. Global Prospects for Full Employment

    Directory of Open Access Journals (Sweden)

    Ivo Šlaus

    2011-04-01

    Full Text Available The recent international financial crisis highlights the crucial role of employment in human welfare and social stability. Access to remunerative employment opportunities is essential for economic security in a market-based economic system. As the rise of democracy compelled nations to extend the voting right to all citizens, employment must be recognized as a fundamental human right. In total defiance of conventional wisdom, since 1950 job growth has outpaced the explosive growth of population, the rapid adoption of labor-saving technologies, the manifold expansion of world trade, and the dramatic shift from manual labor to white collar work. In an increasingly globalized labor market, current nation-centric theories and models of employment need to be replaced with a human-centered global perspective complemented by new indicators that recognize the central and essential contribution of employment to human economic welfare. Employment and economy are subsets of society and their growth is driven by the more fundamental process of social development. A vast array of unmet social needs combined with an enormous reservoir of underutilized social resources – technological, scientific, educational, organizational, cultural and psychological – can be harnessed to dramatically expand employment opportunities and achieve full employment on a global basis. This paper examines the theoretical basis, policy issues and strategies required to eradicate unemployment nationally and globally.

  14. Multilingual text induced spelling correction

    NARCIS (Netherlands)

    Reynaert, M.W.C.

    2004-01-01

    We present TISC, a multilingual, language-independent and context-sensitive spelling checking and correction system designed to facilitate the automatic removal of non-word spelling errors in large corpora. Its lexicon is derived from raw text corpora, without supervision, and contains word unigrams

  15. Automated analysis of instructional text

    Energy Technology Data Exchange (ETDEWEB)

    Norton, L.M.

    1983-05-01

    The development of a capability for automated processing of natural language text is a long-range goal of artificial intelligence. This paper discusses an investigation into the issues involved in the comprehension of descriptive, as opposed to illustrative, textual material. The comprehension process is viewed as the conversion of knowledge from one representation into another. The proposed target representation consists of statements of the prolog language, which can be interpreted both declaratively and procedurally, much like production rules. A computer program has been written to model in detail some ideas about this process. The program successfully analyzes several heavily edited paragraphs adapted from an elementary textbook on programming, automatically synthesizing as a result of the analysis a working Prolog program which, when executed, can parse and interpret let commands in the basic language. The paper discusses the motivations and philosophy of the project, the many kinds of prerequisite knowledge which are necessary, and the structure of the text analysis program. A sentence-by-sentence account of the analysis of the sample text is presented, describing the syntactic and semantic processing which is involved. The paper closes with a discussion of lessons learned from the project, possible alternative approaches, and possible extensions for future work. The entire project is presented as illustrative of the nature and complexity of the text analysis process, rather than as providing definitive or optimal solutions to any aspects of the task. 12 references.

  16. Solar Concepts: A Background Text.

    Science.gov (United States)

    Gorham, Jonathan W.

    This text is designed to provide teachers, students, and the general public with an overview of key solar energy concepts. Various energy terms are defined and explained. Basic thermodynamic laws are discussed. Alternative energy production is described in the context of the present energy situation. Described are the principal contemporary solar…

  17. Quality Inspection of Printed Texts

    DEFF Research Database (Denmark)

    Pedersen, Jesper Ballisager; Nasrollahi, Kamal; Moeslund, Thomas B.

    2016-01-01

    -folded: for costumers of the printing and verification system, the overall grade used to verify if the text is of sufficient quality, while for printer's manufacturer, the detailed character/symbols grades and quality measurements are used for the improvement and optimization of the printing task. The proposed system...

  18. Figure-associated text summarization and evaluation.

    Science.gov (United States)

    Polepalli Ramesh, Balaji; Sethi, Ricky J; Yu, Hong

    2015-01-01

    Biomedical literature incorporates millions of figures, which are a rich and important knowledge resource for biomedical researchers. Scientists need access to the figures and the knowledge they represent in order to validate research findings and to generate new hypotheses. By themselves, these figures are nearly always incomprehensible to both humans and machines and their associated texts are therefore essential for full comprehension. The associated text of a figure, however, is scattered throughout its full-text article and contains redundant information content. In this paper, we report the continued development and evaluation of several figure summarization systems, the FigSum+ systems, that automatically identify associated texts, remove redundant information, and generate a text summary for every figure in an article. Using a set of 94 annotated figures selected from 19 different journals, we conducted an intrinsic evaluation of FigSum+. We evaluate the performance by precision, recall, F1, and ROUGE scores. The best FigSum+ system is based on an unsupervised method, achieving F1 score of 0.66 and ROUGE-1 score of 0.97. The annotated data is available at figshare.com (http://figshare.com/articles/Figure_Associated_Text_Summarization_and_Evaluation/858903).

  19. Clinically Relevant Anticancer Polymer Paclitaxel Therapeutics

    Directory of Open Access Journals (Sweden)

    Danbo Yang

    2010-12-01

    Full Text Available The concept of utilizing polymers in drug delivery has been extensively explored for improving the therapeutic index of small molecule drugs. In general, polymers can be used as polymer-drug conjugates or polymeric micelles. Each unique application mandates its own chemistry and controlled release of active drugs. Each polymer exhibits its own intrinsic issues providing the advantage of flexibility. However, none have as yet been approved by the U.S. Food and Drug Administration. General aspects of polymer and nano-particle therapeutics have been reviewed. Here we focus this review on specific clinically relevant anticancer polymer paclitaxel therapeutics. We emphasize their chemistry and formulation, in vitro activity on some human cancer cell lines, plasma pharmacokinetics and tumor accumulation, in vivo efficacy, and clinical outcomes. Furthermore, we include a short review of our recent developments of a novel poly(L-g-glutamylglutamine-paclitaxel nano-conjugate (PGG-PTX. PGG-PTX has its own unique property of forming nano-particles. It has also been shown to possess a favorable profile of pharmacokinetics and to exhibit efficacious potency. This review might shed light on designing new and better polymer paclitaxel therapeutics for potential anticancer applications in the clinic.

  20. Extracellular vesicles: fundamentals and clinical relevance

    Directory of Open Access Journals (Sweden)

    Wael Nassar

    2015-01-01

    Full Text Available All types of cells of eukaryotic organisms produce and release small nanovesicles into their extracellular environment. Early studies have described these vesicles as ′garbage bags′ only to remove obsolete cellular molecules. Valadi and colleagues, in 2007, were the first to discover the capability of circulating extracellular vesicles (EVs to horizontally transfer functioning gene information between cells. These extracellular vesicles express components responsible for angiogenesis promotion, stromal remodeling, chemoresistance, genetic exchange, and signaling pathway activation through growth factor/receptor transfer. EVs represent an important mode of intercellular communication by serving as vehicles for transfer between cells of membrane and cytosolic proteins, lipids, signaling proteins, and RNAs. They contribute to physiology and pathology, and they have a myriad of potential clinical applications in health and disease. Moreover, vesicles can pass the blood-brain barrier and may perhaps even be considered as naturally occurring liposomes. These cell-derived EVs not only represent a central mediator of the disease microenvironment, but their presence in the peripheral circulation may serve as a surrogate for disease biopsies, enabling real-time diagnosis and disease monitoring. In this review, we′ll be addressing the characteristics of different types of extracellular EVs, as well as their clinical relevance and potential as diagnostic markers, and also define therapeutic options.

  1. Media and mental illness: Relevance to India

    Directory of Open Access Journals (Sweden)

    S K Padhy

    2014-01-01

    Full Text Available Media has a complex interrelationship with mental illnesses. This narrative review takes a look at the various ways in which media and mental illnesses interact. Relevant scientific literature and electronic databases were searched, including Pubmed and GoogleScholar, to identify studies, viewpoints and recommendations using keywords related to media and mental illnesses. This review discusses both the positive and the negative portrayals of mental illnesses through the media. The portrayal of mental health professionals and psychiatric treatment is also discussed. The theories explaining the relationship of how media influences the attitudes and behavior are discussed. Media has also been suggested to be a risk factor for the genesis or exacerbation of mental illnesses like eating disorders and substance use disorders. The potential use of media to understand the psychopathology and plight of those with psychiatric disorders is referred to. The manner in which media can be used as a tool for change to reduce the stigma surrounding mental illnesses is explored.

  2. EXTRACELLULAR VESICLES: CLASSIFICATION, FUNCTIONS AND CLINICAL RELEVANCE

    Directory of Open Access Journals (Sweden)

    A. V. Oberemko

    2014-12-01

    Full Text Available This review presents a generalized definition of vesicles as bilayer extracellular organelles of all celular forms of life: not only eu-, but also prokaryotic. The structure and composition of extracellular vesicles, history of research, nomenclature, their impact on life processes in health and disease are discussed. Moreover, vesicles may be useful as clinical instruments for biomarkers, and they are promising as biotechnological drug. However, many questions in this area are still unresolved and need to be addressed in the future. The most interesting from the point of view of practical health care represents a direction to study the effect of exosomes and microvesicles in the development and progression of a particular disease, the possibility of adjusting the pathological process by means of extracellular vesicles of a particular type, acting as an active ingredient. Relevant is the further elucidation of the role and importance of exosomes to the surrounding cells, tissues and organs at the molecular level, the prospects for the use of non-cellular vesicles as biomarkers of disease.

  3. The Effects of Goal Relevance and Perceptual Features on Emotional Items and Associative Memory

    Directory of Open Access Journals (Sweden)

    Wei B. Mao

    2017-07-01

    Full Text Available Showing an emotional item in a neutral background scene often leads to enhanced memory for the emotional item and impaired associative memory for background details. Meanwhile, both top–down goal relevance and bottom–up perceptual features played important roles in memory binding. We conducted two experiments and aimed to further examine the effects of goal relevance and perceptual features on emotional items and associative memory. By manipulating goal relevance (asking participants to categorize only each item image as living or non-living or to categorize each whole composite picture consisted of item image and background scene as natural scene or manufactured scene and perceptual features (controlling visual contrast and visual familiarity in two experiments, we found that both high goal relevance and salient perceptual features (high salience of items vs. high familiarity of items could promote emotional item memory, but they had different effects on associative memory for emotional items and neutral backgrounds. Specifically, high goal relevance and high perceptual-salience of items could jointly impair the associative memory for emotional items and neutral backgrounds, while the effect of item familiarity on associative memory for emotional items would be modulated by goal relevance. High familiarity of items could increase associative memory for negative items and neutral backgrounds only in the low goal relevance condition. These findings suggest the effect of emotion on associative memory is not only related to attentional capture elicited by emotion, but also can be affected by goal relevance and perceptual features of stimulus.

  4. Social and emotional relevance in face processing: Happy faces of future interaction partners enhance the LPP

    Directory of Open Access Journals (Sweden)

    Florian eBublatzky

    2014-07-01

    Full Text Available Human face perception is modulated by both emotional valence and social relevance, but their interaction has rarely been examined. Event-related brain potentials (ERP to happy, neutral, and angry facial expressions with different degrees of social relevance were recorded. Social relevance was manipulated by presenting pictures of two specific face actors as future interaction partners (meet condition, whereas two other face actors remained non-relevant. As a further control condition all stimuli were presented without specific task instructions (passive viewing condition. A within-subject design (Facial Expression x Relevance x Task was implemented, where randomly ordered face stimuli of four actors (2 women, from the KDEF were presented for 1s to 26 participants (16 female. Results showed an augmented N170, early posterior negativity (EPN, and late positive potential (LPP for emotional in contrast to neutral facial expressions. Of particular interest, face processing varied as a function of instructed social relevance. Whereas the meet condition was accompanied with unspecific effects regardless of relevance (P1, EPN, viewing potential interaction partners was associated with increased LPP amplitudes. The LPP was specifically enhanced for happy facial expressions of the future interaction partners. This underscores that social relevance can impact face processing already at an early stage of visual processing. These findings are discussed within the framework of motivated attention and face processing theories.

  5. The text-critical and exegetical value of the Dead Sea Scrolls

    Directory of Open Access Journals (Sweden)

    Johann Cook

    2016-07-01

    Full Text Available This article will analyse a number of Dead Sea manuscripts and/or fragments in order to determine their linguistic and exegetical value. The article will, firstly, address textual material that is largely in agreement with the Massoretic Text – 1QIsaa is a case in point. Secondly, fragmentsthat are seemingly less relevant will be discussed. The less helpful fragments from the Biblical books Proverbs and Job are taken as examples. Finally, highly significant textual differences, such as a fragment from Genesis 1 and one from the complicated books of Jeremiah, will be evaluated.

  6. The linguistic construction of the giftedness discourse in the media texts of historical and digital times

    Directory of Open Access Journals (Sweden)

    Halliki Põlda

    2015-04-01

    Full Text Available The aim of the study is to describe and explicate, using critical text analysis, how the socially weighty discourse of giftedness has been constructed historically and how it manifests in the media texts of the digital era. The diachronic analysis is based on the media texts of the 1890s–1990s stored in the Corpus of Standard Estonian, while the synchronic analysis applies to texts found in Delfi.ee. The results highlight the main media discourses dealing with giftedness, the relevant terms and expressions, and the social relations and meanings brought up in the media in connection with the topic. The study reveals that through history, the giftedness discourse has been subject to changes and, constructed with specific linguistic means, it plays an important role in modern social arrangements.

  7. Cloning in the classroom: an example of the didactical use of popularization of science text

    Directory of Open Access Journals (Sweden)

    Isabel Martins

    2004-03-01

    Full Text Available This paper describes a science lesson in which different texts,such as newspapers, popular science magazines and textbooks were used as didactic resources. Our theoretical framework explores the relevance of communicative approaches to teaching and discusses the relationships between text and discourse. Data were collected through videotapes of a Biology lesson about cloning in an adult education class in Brazil. The analyses focussed on the teacher’s discursive re-elaborations and revealed a variety of roles played by a popular science text in a science lesson, such as motivation and lesson structuring, as well as, helpingorganise explanations, fostering debate, broadening reading practices and establishing relationships between scientific and everyday contexts. Amongst the discursive re-elaborations observed are strategies for adaptation of originals, the emphasis on reading activities and joint use of popular science texts and textbooks.

  8. Roots and Relevance

    DEFF Research Database (Denmark)

    Gumucio-Dagron, Alfonso; Tufte, Thomas

    This anthology, the result of 3 years of review of 1000+ articles, now assembles 150 authors with 200 contributions - full articles, excerpts and quotes - ranging from 1927 to 2005. The articles all have been selected upon the criteria of contributing conceptually to the field of communication fo...... Participation, 3) Power, Media and the Public Sphere, 4) Paradigms in Communication for Development, 5) Information Society & Communication Rights....... for social change. The book is organised in two parts: the first part being cronological, from 1927-1995, and the second part containing 'the contemporary debate' in communication for social change, organised in 5 sub-themes: 1) Popular Culture, Narrative and Identity, 2) Social Movements & Community...

  9. Cluster Based Text Classification Model

    DEFF Research Database (Denmark)

    Nizamani, Sarwat; Memon, Nasrullah; Wiil, Uffe Kock

    2011-01-01

    We propose a cluster based classification model for suspicious email detection and other text classification tasks. The text classification tasks comprise many training examples that require a complex classification model. Using clusters for classification makes the model simpler and increases...... the accuracy at the same time. The test example is classified using simpler and smaller model. The training examples in a particular cluster share the common vocabulary. At the time of clustering, we do not take into account the labels of the training examples. After the clusters have been created......, the classifier is trained on each cluster having reduced dimensionality and less number of examples. The experimental results show that the proposed model outperforms the existing classification models for the task of suspicious email detection and topic categorization on the Reuters-21578 and 20 Newsgroups...

  10. Linguistic dating of biblical texts

    DEFF Research Database (Denmark)

    Young, Ian; Rezetko, Robert; Ehrensvärd, Martin Gustaf

    Since the beginning of critical scholarship biblical texts have been dated using linguistic evidence. In recent years this has become a controversial topic, especially with the publication of Ian Young (ed.), Biblical Hebrew: Studies in Chronology and Typology (2003). However, until now there has...... been no introduction and comprehensive study of the field. Volume 1 introduces the field of linguistic dating of biblical texts, particularly to intermediate and advanced students of biblical Hebrew who have a reasonable background in the language, having completed at least an introductory course...... in this volume are: What is it that makes Archaic Biblical Hebrew archaic , Early Biblical Hebrew early , and Late Biblical Hebrew late ? Does linguistic typology, i.e. different linguistic characteristics, convert easily and neatly into linguistic chronology, i.e. different historical origins? A large amount...

  11. Text as an Autopoietic System

    DEFF Research Database (Denmark)

    Nicolaisen, Maria Skou

    2016-01-01

    The aim of the present research article is to discuss the possibilities and limitations in addressing text as an autopoietic system. The theory of autopoiesis originated in the field of biology in order to explain the dynamic processes entailed in sustaining living organisms at cellular level. Th....... By comparing the biological with the textual account of autopoietic agency, the end conclusion is that a newly derived concept of sociopoiesis might be better suited for discussing the architecture of textual systems....

  12. The TEXT upgrade vertical interferometer

    International Nuclear Information System (INIS)

    Hallock, G.A.; Gartman, M.L.; Li, W.; Chiang, K.; Shin, S.; Castles, R.L.; Chatterjee, R.; Rahman, A.S.

    1992-01-01

    A far-infrared interferometer has been installed on TEXT upgrade to obtain electron density profiles. The primary system views the plasma vertically through a set of large (60-cm radialx7.62-cm toroidal) diagnostic ports. A 1-cm channel spacing (59 channels total) and fast electronic time response is used, to provide high resolution for radial profiles and perturbation experiments. Initial operation of the vertical system was obtained late in 1991, with six operating channels

  13. Reasoning with Annotations of Texts

    OpenAIRE

    Ma , Yue; Lévy , François; Ghimire , Sudeep

    2011-01-01

    International audience; Linguistic and semantic annotations are important features for text-based applications. However, achieving and maintaining a good quality of a set of annotations is known to be a complex task. Many ad hoc approaches have been developed to produce various types of annotations, while comparing those annotations to improve their quality is still rare. In this paper, we propose a framework in which both linguistic and domain information can cooperate to reason with annotat...

  14. LOTUS: Adaptive text search for big linked data

    NARCIS (Netherlands)

    Ilievski, F.; Beek, Wouter; van Erp, Marieke; Rietveld, Laurens; Schlobach, Stefan

    2016-01-01

    Finding relevant resources on the Semantic Web today is a dirty job: no centralized query service exists and the support for natural language access is limited. We present LOTUS: Linked Open Text Un- leaShed, a text-based entry point to a massive subset of today’s Linked Open Data Cloud. Recognizing

  15. The Relationship between Paraphrasing and Text Analysis

    Directory of Open Access Journals (Sweden)

    María Luisa Cepeda Islas

    2013-04-01

    Full Text Available Given the importance of paraphrasing in the process of comprehension for college students, this study assessed the level of implementation of text analysis and paraphrases the response of a sample of senior students of the career psychology. We selected a group of freshmen to the Psychology course, which was asked to answer a questionnaire and carry out the summary of an empirical article. The results showed that participants have a low level of text analysis, at the same time had low levels of paraphrasing. It was seen that the predominant textual copy. They envision some possibilities for the structure of a training workshop not only paraphrasing but on the analysis of text.

  16. A full computation-relevant topological dynamics classification of elementary cellular automata.

    Science.gov (United States)

    Schüle, Martin; Stoop, Ruedi

    2012-12-01

    Cellular automata are both computational and dynamical systems. We give a complete classification of the dynamic behaviour of elementary cellular automata (ECA) in terms of fundamental dynamic system notions such as sensitivity and chaoticity. The "complex" ECA emerge to be sensitive, but not chaotic and not eventually weakly periodic. Based on this classification, we conjecture that elementary cellular automata capable of carrying out complex computations, such as needed for Turing-universality, are at the "edge of chaos."

  17. LITURGICAL TEXT IN RUSSIAN LITERATURE. PROBLEM STATEMENT

    Directory of Open Access Journals (Sweden)

    Avetis Serezhaevich Seropyan

    2012-11-01

    Full Text Available The article analyses artistic expressions of liturgical language in the literary text and its interaction of the Holy Tradition. Many Russian authors knew the liturgical text well. Studying it reveals the crucial meaning of the Gospel and liturgical texts (as part of the Holy Tradition for Russian literature. Authors saw the essence of every phenomenon in the word for it, and the nature of God in His name. Some ideas and sayings of the authors and their characters find their sources in liturgical texts. The article focuses on liturgical sources of some characters' commemorations and invocations, as well as poetical topics of the symbolists, Dostoevsky's famous dictum on beauty which will save the world (The Idiot, etc. De-cyphering this liturgical code will help us learn and comprehend the hidden endless meaning of a literary text. The specific feature of Russian literature is its pursuit of the spiritual liturgical exploration of the world, an exploration when truth takes shape and thus becomes real in both literary text and history.

  18. Application of LSP texts in translator training

    Directory of Open Access Journals (Sweden)

    Larisa Ilynska

    2017-06-01

    Full Text Available The paper presents discussion of the results of extensive empirical research into efficient methods of educating and training translators of LSP (language for special purposes texts. The methodology is based on using popular LSP texts in the respective fields as one of the main media for translator training. The aim of the paper is to investigate the efficiency of this methodology in developing thematic, linguistic and cultural competences of the students, following Bloom’s revised taxonomy and European Master in Translation Network (EMT translator training competences. The methodology has been tested on the students of a professional Master study programme called Technical Translation implemented by the Institute of Applied Linguistics, Riga Technical University, Latvia. The group of students included representatives of different nationalities, translating from English into Latvian, Russian and French. Analysis of popular LSP texts provides an opportunity to structure student background knowledge and expand it to account for linguistic innovation. Application of popular LSP texts instead of purely technical or scientific texts characterised by neutral style and rigid genre conventions provides an opportunity for student translators to develop advanced text processing and decoding skills, to develop awareness of expressive resources of the source and target languages and to develop understanding of socio-pragmatic language use.

  19. Biased limiter experiments on text

    International Nuclear Information System (INIS)

    Phillips, P.E.; Wootton, A.J.; Rowan, W.L.; Ritz, C.P.; Rhodes, T.L.; Bengtson, R.D.; Hodge, W.L.; Durst, R.D.; McCool, S.C.; Richards, B.; Gentle, K.W.; Schoch, P.; Forster, J.C.; Hickok, R.L.; Evans, T.E.

    1987-01-01

    Experiments using an electrically biased limiter have been performed on the Texas Experimental Tokamak (TEXT). A small movable limiter is inserted past the main poloidal ring limiter (which is electrically connected to the vacuum vessel) and biased at V Lim with respect to it. The floating potential, plasma potential and shear layer position can be controlled. With vertical strokeV Lim vertical stroke ≥ 50 V the plasma density increases. For V Lim Lim > 0 the results obtained are inconclusive. Variation of V Lim changes the electrostatic turbulence which may explain the observed total flux changes. (orig.)

  20. Aspects of Text Mining From Computational Semiotics to Systemic Functional Hypertexts

    Directory of Open Access Journals (Sweden)

    Alexander Mehler

    2001-05-01

    Full Text Available The significance of natural language texts as the prime information structure for the management and dissemination of knowledge in organisations is still increasing. Making relevant documents available depending on varying tasks in different contexts is of primary importance for any efficient task completion. Implementing this demand requires the content based processing of texts, which enables to reconstruct or, if necessary, to explore the relationship of task, context and document. Text mining is a technology that is suitable for solving problems of this kind. In the following, semiotic aspects of text mining are investigated. Based on the primary object of text mining - natural language lexis - the specific complexity of this class of signs is outlined and requirements for the implementation of text mining procedures are derived. This is done with reference to text linkage introduced as a special task in text mining. Text linkage refers to the exploration of implicit, content based relations of texts (and their annotation as typed links in corpora possibly organised as hypertexts. In this context, the term systemic functional hypertext is introduced, which distinguishes genre and register layers for the management of links in a poly-level hypertext system.

  1. Vygotsky's Crisis: Argument, context, relevance.

    Science.gov (United States)

    Hyman, Ludmila

    2012-06-01

    Vygotsky's The Historical Significance of the Crisis in Psychology (1926-1927) is an important text in the history and philosophy of psychology that has only become available to scholars in 1982 in Russian, and in 1997 in English. The goal of this paper is to introduce Vygotsky's conception of psychology to a wider audience. I argue that Vygotsky's argument about the "crisis" in psychology and its resolution can be fully understood only in the context of his social and political thinking. Vygotsky shared the enthusiasm, widespread among Russian leftist intelligentsia in the 1920s, that Soviet society had launched an unprecedented social experiment: The socialist revolution opened the way for establishing social conditions that would let the individual flourish. For Vygotsky, this meant that "a new man" of the future would become "the first and only species in biology that would create itself." He envisioned psychology as a science that would serve this humanist teleology. I propose that The Crisis is relevant today insofar as it helps us define a fundamental problem: How can we systematically account for the development of knowledge in psychology? I evaluate how Vygotsky addresses this problem as a historian of the crisis. Copyright © 2011 Elsevier Ltd. All rights reserved.

  2. Transfer Learning beyond Text Classification

    Science.gov (United States)

    Yang, Qiang

    Transfer learning is a new machine learning and data mining framework that allows the training and test data to come from different distributions or feature spaces. We can find many novel applications of machine learning and data mining where transfer learning is necessary. While much has been done in transfer learning in text classification and reinforcement learning, there has been a lack of documented success stories of novel applications of transfer learning in other areas. In this invited article, I will argue that transfer learning is in fact quite ubiquitous in many real world applications. In this article, I will illustrate this point through an overview of a broad spectrum of applications of transfer learning that range from collaborative filtering to sensor based location estimation and logical action model learning for AI planning. I will also discuss some potential future directions of transfer learning.

  3. Towards Technological Approaches for Concept Maps Mining from Text

    Directory of Open Access Journals (Sweden)

    Camila Zacche Aguiar

    2018-04-01

    Full Text Available Concept maps are resources for the representation and construction of knowledge. They allow showing, through concepts and relationships, how knowledge about a subject is organized. Technological advances have boosted the development of approaches for the automatic construction of a concept map, to facilitate and provide the benefits of that resource more broadly. Due to the need to better identify and analyze the functionalities and characteristics of those approaches, we conducted a detailed study on technological approaches for automatic construction of concept maps published between 1994 and 2016 in the IEEE Xplore, ACM and Elsevier Science Direct data bases. From this study, we elaborate a categorization defined on two perspectives, Data Source and Graphic Representation, and fourteen categories. That study collected 30 relevant articles, which were applied to the proposed categorization to identify the main features and limitations of each approach. A detailed view on these approaches, their characteristics and techniques are presented enabling a quantitative analysis. In addition, the categorization has given us objective conditions to establish new specification requirements for a new technological approach aiming at concept maps mining from texts.

  4. Text Entry by Gazing and Smiling

    Directory of Open Access Journals (Sweden)

    Outi Tuisku

    2013-01-01

    Full Text Available Face Interface is a wearable prototype that combines the use of voluntary gaze direction and facial activations, for pointing and selecting objects on a computer screen, respectively. The aim was to investigate the functionality of the prototype for entering text. First, three on-screen keyboard layout designs were developed and tested (n=10 to find a layout that would be more suitable for text entry with the prototype than traditional QWERTY layout. The task was to enter one word ten times with each of the layouts by pointing letters with gaze and select them by smiling. Subjective ratings showed that a layout with large keys on the edge and small keys near the center of the keyboard was rated as the most enjoyable, clearest, and most functional. Second, using this layout, the aim of the second experiment (n=12 was to compare entering text with Face Interface to entering text with mouse. The results showed that text entry rate for Face Interface was 20 characters per minute (cpm and 27 cpm for the mouse. For Face Interface, keystrokes per character (KSPC value was 1.1 and minimum string distance (MSD error rate was 0.12. These values compare especially well with other similar techniques.

  5. Inspiration and the Texts of the Bible

    Directory of Open Access Journals (Sweden)

    Dirk Buchner

    1997-12-01

    Full Text Available This article seeks to explore what the inspired text of the Old Testament was as it existed for the New Testament authors, particularly for the author of the book of Hebrews. A quick look at the facts makes. it clear that there was, at the time, more than one 'inspired' text, among these were the Septuagint and the Masoretic Text 'to name but two'. The latter eventually gained ascendancy which is why it forms the basis of our translated Old Testament today. Yet we have to ask: what do we make of that other text that was the inspired Bible to the early Church, especially to the writer of the book of Hebrews, who ignored the Masoretic text? This article will take a brief look at some suggestions for a doctrine of inspiration that keeps up with the facts of Scripture. Allied to this, the article is something of a bibliographical study of recent developments in textual research following the discovery of the Dead Sea scrolls.

  6. Ancient medical texts, modern reading problems

    Directory of Open Access Journals (Sweden)

    Maria Carlota Rosa

    2006-12-01

    Full Text Available The word tradition has a very specific meaning in linguistics: the passing down of a text, which may have been completed or corrected by different copyists at different times, when the concept of authorship was not the same as it is today. When reading an ancient text the word tradition must be in the reader's mind. To discuss one of the problems an ancient text poses to its modern readers, this work deals with one of the first printed medical texts in Portuguese, the Regimento proueytoso contra ha pestenença, and draws a parallel between it and two related texts, A moche profitable treatise against the pestilence, and the Recopilaçam das cousas que conuem guardar se no modo de preseruar à Cidade de Lixboa E os sãos, & curar os que esteuerem enfermos de Peste. The problems which arise out of the textual structure of those books show how difficult is to establish a tradition of another type, the medical tradition. The linguistic study of the innumerable medieval plague treatises may throw light on the continuities and on the disruptions of the so-called hippocratic-galenical medical tradition.

  7. Reading an ESL Writer’s Text

    Directory of Open Access Journals (Sweden)

    Paul Kei Matsuda

    2011-03-01

    Full Text Available This paper focuses on reading as a central act of communication in the tutorial session. Writing center tutors without extensive experience reading writing by second language writers may have difficulty getting past the many differences in surface-level features, organization, and rhetorical moves. After exploring some of the sources of these differences in writing, the authors present strategies that writing tutors can use to work effectively with second language writers.

  8. Full-scope training simulators

    International Nuclear Information System (INIS)

    Ugedo, E.

    1986-01-01

    The following topics to be covered in this report are: Reasons justifying the use of full-scope simulators for operator qualification. Full-scope simulator description: the control room, the physical models, the computer complex, the instructor's console. Main features of full-scope simulators. Merits of simulator training. The role of full-scope simulators in the training programs. The process of ordering and acquiring a full-scope simulator. Maintaining and updating simulator capabilities. (orig./GL)

  9. No More Provincialism: Art and Text

    Directory of Open Access Journals (Sweden)

    Heather Barker

    2010-11-01

    Full Text Available This essay discusses the writing and personalities surrounding the 1981 establishment of the Australian art magazine, Art & Text, and traces its progression under Paul Taylor’s editorship up to his relocation to New York. During this period, Art & Text published Taylor’s own essays and, more importantly, those of other writers and artists — Meaghan Morris, Paul Foss, Philip Brophy, Imants Tillers, Rex Butler, Edward Colless — all articulating a consistent and complex postmodern position. The magazine’s founder and editor, Paul Taylor, personified the shattering impact of postmodernism upon the Australian art world as well as postmodernism’s limitations. Taylor facilitated a new theoretical framework for the discussion of Australian art, one that continues to dominate the internationalist aspirations of Australian art writers. He produced temporarily convincing solutions to problems that earlier critics had wrestled with unsuccessfully, in particular the twin problems of provincialism, and the relationship of Australian to international art.

  10. A programmed text in statistics

    CERN Document Server

    Hine, J

    1975-01-01

    Exercises for Section 2 42 Physical sciences and engineering 42 43 Biological sciences 45 Social sciences Solutions to Exercises, Section 1 47 Physical sciences and engineering 47 49 Biological sciences 49 Social sciences Solutions to Exercises, Section 2 51 51 PhYSical sciences and engineering 55 Biological sciences 58 Social sciences 62 Tables 2 62 x - tests involving variances 2 63,64 x - one tailed tests 2 65 x - two tailed tests F-distribution 66-69 Preface This project started some years ago when the Nuffield Foundation kindly gave a grant for writing a pro­ grammed text to use with service courses in statistics. The work carried out by Mrs. Joan Hine and Professor G. B. Wetherill at Bath University, together with some other help from time to time by colleagues at Bath University and elsewhere. Testing was done at various colleges and universities, and some helpful comments were received, but we particularly mention King Edwards School, Bath, who provided some sixth formers as 'guinea pigs' for the fir...

  11. The Impact of Texting on Comprehension

    Directory of Open Access Journals (Sweden)

    Jamal K. M. Ali

    2015-07-01

    Full Text Available This paper presents a study of the effects of texting on English language comprehension. The authors believe that English used in texting causes a lack of comprehension for English speakers, learners, and texters. Wei, Xian-hai and Jiang (2008:3 declare “In Netspeak, there are some newly-created vocabularies, which people cannot comprehend them either from their partial pronunciation or from their figures.” Crystal (2007:23 claims; “variation causes problems of comprehension and acceptability. If you speak or write differently from the way I do, we may fail to understand each other.”  In this paper, the authors conducted a questionnaire at Aligarh Muslim University to ninety respondents from five different Faculties and four different levels. To measure respondents’ comprehension of English texting, the authors gave the respondents abbreviations used by texters and asked them to write the full forms of the abbreviations. The authors found that many abbreviations were not understood, which suggested that most of the respondents did not understand and did not use these abbreviations.

  12. Analyzing the positioning of political competitors on relevant policy conflict dimensions within the 2014 EU Europarliamentary elections

    Directory of Open Access Journals (Sweden)

    Todor Arpad

    2014-11-01

    Full Text Available In this paper I analyze the positioning of Romanian political competitor for the European elections 2014 on most relevant dimensions of political conflict relevant to all European Union member countries. I analyze the political programs of the Romanian political parties realized within the euandi project. Even though not all dimensions are considered relevant in the context of political debate in Romania, the mapping provides a detailed picture of the current positioning of the main political competitors in the context of breaking USL and the creation of a new coalition government.

  13. PUBLIC SERVICE ADVERTISING: AN ANALYSIS ON TEXT AND SEMIOTICS

    Directory of Open Access Journals (Sweden)

    Ni Wayan Sukarini

    2012-07-01

    Full Text Available This study concerns with text and semiotics analysis on the use of language in public service advertising (PSA. PSA in this study is the text which is especially on health. There are three problems that are analysed in this research, namely: (1 grammatical structure and the lexical of the text; (2 the relationship of trichotomies (representamen, object, and interpretant with the three components of sign in nonverbal aspect; and (3 ideologies and messages conveyed in the verbal and nonverbal signs. Three methods applied in this research respectively including descriptive, qualitative, and interpretative. The type of data was the written one which was taken from printed media in the forms of poster and brochure. The data was collected through five procedures, they are clipping, numbering, coding, picturing, and documenting. As a scientific writing, a number of theories must be applied for the analysis. The relevant theories are semantics, semiotics, speech act, hermeneutics, language function, and text structure. These six theories were applied eclecticly in analysing the grammatical structure, lexicals, signs, and the structure of texts in order to elaborate the meaning, ideology, and message which were being conveyed through the texts of PSA. The result of the analysis showed that the grammatical structure applied in the PSA of health could be classified into the simple structure in the forms of phrase, clause, and sentence. The use of verbs dominated initially in order to express the imperative meaning but still had the purpose of being persuasive. Kinds of lexicals found were very close to disease, reproduction, and health either the general terms, for example victims, medicine or the specific ones like HIV/AIDS, Odha, perinatal, nifas, jampersal, sadari. From the nonverbal aspect, the relationship of trichotomy with the three of sign components are more realistics in the Object with its three sub components. Triadic relationship of three sub

  14. PUBLIC SERVICE ADVERTISING: AN ANALYSIS ON TEXT AND SEMIOTICS

    Directory of Open Access Journals (Sweden)

    Ni Wayan Sukarini

    2015-07-01

    Full Text Available This study concerns with text and semiotics analysis on the use of language in public service advertising (PSA. PSA in this study is the text which is especially on health. There are three problems that are analysed in this research, namely: (1 grammatical structure and the lexical of the text; (2 the relationship of trichotomies (representamen, object, and interpretant with the three components of sign in nonverbal aspect; and (3 ideologies and messages conveyed in the verbal and nonverbal signs. Three methods applied in this research respectively including descriptive, qualitative, and interpretative. The type of data was the written one which was taken from printed media in the forms of poster and brochure. The data was collected through five procedures, they are clipping, numbering, coding, picturing, and documenting. As a scientific writing, a number of theories must be applied for the analysis. The relevant theories are semantics, semiotics, speech act, hermeneutics, language function, and text structure. These six theories were applied eclecticly in analysing the grammatical structure, lexicals, signs, and the structure of texts in order to elaborate the meaning, ideology, and message which were being conveyed through the texts of PSA. The result of the analysis showed that the grammatical structure applied in the PSA of health could be classified into the simple structure in the forms of phrase, clause, and sentence. The use of verbs dominated initially in order to express the imperative meaning but still had the purpose of being persuasive. Kinds of lexicals found were very close to disease, reproduction, and health either the general terms, for example victims, medicine or the specific ones like HIV/AIDS, Odha, perinatal, nifas, jampersal, sadari. From the nonverbal aspect, the relationship of trichotomy with the three of sign components are more realistics in the Object with its three sub components. Triadic relationship of three sub

  15. THE CURRENT STATE OF KNOWLEDGE IN THE VALUE RELEVANCE RESEARCH FIELD

    Directory of Open Access Journals (Sweden)

    Carmen- Alexandra BALTARIU

    2015-04-01

    Full Text Available The purpose of this paper is to assess the scientific literature referring to the value relevance of reported accounting information over a twelve year period starting from 2002. The approach of the paper is a theoretical (conceptual one. In order to complete the purpose of the paper we selected as research method the longitudinal qualitative analysis. The qualitative analysis carried out presents a deductive character. Our conclusions regarding the general characteristics of the research field pertaining to the value relevance of reported accounting information are drawn based on the main results and scientific contributions identified in the research field of interest.

  16. A identidade nacional portuguesa: conteúdo e relevância

    Directory of Open Access Journals (Sweden)

    Cabral Manuel Villaverde

    2003-01-01

    Full Text Available This article discusses the relative merits of the instrumentalist and primordialist theses concerning the roles of the State and the nation in the production of contemporary national identities, as well as presenting a brief genealogy of Portuguese national identity. However, the attempt to anchor this identity in a purported "national character" reveals the reductionism and normative dimension of identitary ideology. The research is thus reoriented towards the relevance of national sentiment in current Portuguese society, observing that this sentiment constitutes a relevant symbolic resource, particularly for purposes of political mobilization.

  17. Relevant Market in Commercial Aviation of the European Union

    Directory of Open Access Journals (Sweden)

    Jakub Kociubiński

    2011-06-01

    Full Text Available The purpose of this paper is to provide a brief overview of the issue of definition of relevant market in civil aviation within the European Union. The liberalization of the market since the early 1990s has led to a rapid increase in the number of airlines operating in the EU. The increase in the competitiveness of the market has brought many positive changes for passengers, such as lower fares and a better network of connections. At the same time it has created a risk that the airlines, in order to gain a competitive edge, would infringe the rules of competition. This is especially important in the context of the phenomenon that is the development of the airline alliances, which could lead to an abuse of a dominant position. A clear definition of the relevant market is a first step in an assessment of whether such an abuse occurred. This paper focus on the elements that Internal Market regulator, the European Commission, takes into consideration when defining relevant market in the airline industry.

  18. Net Income, Book Value and Cash Flows: The Value Relevance in Jordanian Economic Sectors

    Directory of Open Access Journals (Sweden)

    DHIAA SHAMKI

    2013-07-01

    Full Text Available This paper examines the value relevance of financial statements variables namely net income, book value and cash flows simultaneously relative to Jordanian services and industrial firms for the period from 2000 to 2009. The main findings of this paper are three- dimensional. First, net income is value relevant, while book value and cash flows are irrelevant. Second, net income is more value relevant than book value and cash flows in both sectors. Third, this value relevance is greater in services sector than in industrial sector. The study shows that net income assist more in explaining market values in Jordanian services and industrial firms. Since research on the value relevance of these variables has neglected Jordan (and the Middle Eastern region, the study tries to fill this practical gap. The study is the first in Jordan that examines the value relevance of net income, book value and cash flows simultaneously and compares this value relevance according to Amman Stock Exchange sectors in one study in Jordan.

  19. Perceived Relevance of an Introductory Information Systems Course to Prospective Business Students

    Directory of Open Access Journals (Sweden)

    Irene Govender

    2013-12-01

    Full Text Available The study is designed to examine students’ perceptions of the introductory Information Systems (IS course. It was an exploratory study in which 67 students participated. A quantitative approach was followed making use of questionnaires for the collection of data. Using the theory of reasoned action as a framework, the study explores the factors that influence non-IS major students’ perceived relevance of the IS introductory course. The analysis of collected data included descriptive and inferential statistics. Using multiple regression analysis, the results suggest that overall, the independent variables, relevance of the content, previous IT knowledge, relevance for professional practice, IT preference in courses and peers’ influence may account for 72% of the explanatory power for the dependent variable, perceived relevance of the IS course. In addition, the results have shown some strong predictors (IT preference and peers’ influence that influence students’ perceived relevance of the IS course. Practical work was found to be a strong mediating variable toward positive perceptions of IS. The results of this study suggest that students do indeed perceive the introductory IS course to be relevant and match their professional needs, but more practical work would enhance their learning. Implications for theory and practice are discussed as a result of the behavioural intention to perceive the IS course to be relevant and eventually to recruit more IS students.

  20. Functionally relevant microsatellites in sugarcane unigenes

    Directory of Open Access Journals (Sweden)

    Singh Nagendra K

    2010-11-01

    Full Text Available Abstract Background Unigene sequences constitute a rich source of functionally relevant microsatellites. The present study was undertaken to mine the microsatellites in the available unigene sequences of sugarcane for understanding their constitution in the expressed genic component of its complex polyploid/aneuploid genome, assessing their functional significance in silico, determining the extent of allelic diversity at the microsatellite loci and for evaluating their utility in large-scale genotyping applications in sugarcane. Results The average frequency of perfect microsatellite was 1/10.9 kb, while it was 1/44.3 kb for the long and hypervariable class I repeats. GC-rich trinucleotides coding for alanine and the GA-rich dinucleotides were the most abundant microsatellite classes. Out of 15,594 unigenes mined in the study, 767 contained microsatellite repeats and for 672 of these putative functions were determined in silico. The microsatellite repeats were found in the functional domains of proteins encoded by 364 unigenes. Its significance was assessed by establishing the structure-function relationship for the beta-amylase and protein kinase encoding unigenes having repeats in the catalytic domains. A total of 726 allelic variants (7.42 alleles per locus with different repeat lengths were captured precisely for a set of 47 fluorescent dye labeled primers in 36 sugarcane genotypes and five cereal species using the automated fragment analysis system, which suggested the utility of designed primers for rapid, large-scale and high-throughput genotyping applications in sugarcane. Pair-wise similarity ranging from 0.33 to 0.84 with an average of 0.40 revealed a broad genetic base of the Indian varieties in respect of functionally relevant regions of the large and complex sugarcane genome. Conclusion Microsatellite repeats were present in 4.92% of sugarcane unigenes, for most (87.6% of which functions were determined in silico. High level of

  1. ERRORS AND DIFFICULTIES IN TRANSLATING LEGAL TEXTS

    Directory of Open Access Journals (Sweden)

    Camelia, CHIRILA

    2014-11-01

    Full Text Available Nowadays the accurate translation of legal texts has become highly important as the mistranslation of a passage in a contract, for example, could lead to lawsuits and loss of money. Consequently, the translation of legal texts to other languages faces many difficulties and only professional translators specialised in legal translation should deal with the translation of legal documents and scholarly writings. The purpose of this paper is to analyze translation from three perspectives: translation quality, errors and difficulties encountered in translating legal texts and consequences of such errors in professional translation. First of all, the paper points out the importance of performing a good and correct translation, which is one of the most important elements to be considered when discussing translation. Furthermore, the paper presents an overview of the errors and difficulties in translating texts and of the consequences of errors in professional translation, with applications to the field of law. The paper is also an approach to the differences between languages (English and Romanian that can hinder comprehension for those who have embarked upon the difficult task of translation. The research method that I have used to achieve the objectives of the paper was the content analysis of various Romanian and foreign authors' works.

  2. n-Gram-Based Text Compression

    Directory of Open Access Journals (Sweden)

    Vu H. Nguyen

    2016-01-01

    Full Text Available We propose an efficient method for compressing Vietnamese text using n-gram dictionaries. It has a significant compression ratio in comparison with those of state-of-the-art methods on the same dataset. Given a text, first, the proposed method splits it into n-grams and then encodes them based on n-gram dictionaries. In the encoding phase, we use a sliding window with a size that ranges from bigram to five grams to obtain the best encoding stream. Each n-gram is encoded by two to four bytes accordingly based on its corresponding n-gram dictionary. We collected 2.5 GB text corpus from some Vietnamese news agencies to build n-gram dictionaries from unigram to five grams and achieve dictionaries with a size of 12 GB in total. In order to evaluate our method, we collected a testing set of 10 different text files with different sizes. The experimental results indicate that our method achieves compression ratio around 90% and outperforms state-of-the-art methods.

  3. Full Employment in Industrialized Countries.

    Science.gov (United States)

    Britton, Andrew

    1997-01-01

    Argues that full employment must be acceptable on both social and economic grounds. Examines profound changes in industrialized economies since the 1970s and the diversity of employment contracts. Suggests that difficult policy decisions surround full employment. (SK)

  4. Enriching text with images and colored light

    Science.gov (United States)

    Sekulovski, Dragan; Geleijnse, Gijs; Kater, Bram; Korst, Jan; Pauws, Steffen; Clout, Ramon

    2008-01-01

    We present an unsupervised method to enrich textual applications with relevant images and colors. The images are collected by querying large image repositories and subsequently the colors are computed using image processing. A prototype system based on this method is presented where the method is applied to song lyrics. In combination with a lyrics synchronization algorithm the system produces a rich multimedia experience. In order to identify terms within the text that may be associated with images and colors, we select noun phrases using a part of speech tagger. Large image repositories are queried with these terms. Per term representative colors are extracted using the collected images. Hereto, we either use a histogram-based or a mean shift-based algorithm. The representative color extraction uses the non-uniform distribution of the colors found in the large repositories. The images that are ranked best by the search engine are displayed on a screen, while the extracted representative colors are rendered on controllable lighting devices in the living room. We evaluate our method by comparing the computed colors to standard color representations of a set of English color terms. A second evaluation focuses on the distance in color between a queried term in English and its translation in a foreign language. Based on results from three sets of terms, a measure of suitability of a term for color extraction based on KL Divergence is proposed. Finally, we compare the performance of the algorithm using either the automatically indexed repository of Google Images and the manually annotated Flickr.com. Based on the results of these experiments, we conclude that using the presented method we can compute the relevant color for a term using a large image repository and image processing.

  5. Basic Test Framework for the Evaluation of Text Line Segmentation and Text Parameter Extraction

    Directory of Open Access Journals (Sweden)

    Darko Brodić

    2010-05-01

    Full Text Available Text line segmentation is an essential stage in off-line optical character recognition (OCR systems. It is a key because inaccurately segmented text lines will lead to OCR failure. Text line segmentation of handwritten documents is a complex and diverse problem, complicated by the nature of handwriting. Hence, text line segmentation is a leading challenge in handwritten document image processing. Due to inconsistencies in measurement and evaluation of text segmentation algorithm quality, some basic set of measurement methods is required. Currently, there is no commonly accepted one and all algorithm evaluation is custom oriented. In this paper, a basic test framework for the evaluation of text feature extraction algorithms is proposed. This test framework consists of a few experiments primarily linked to text line segmentation, skew rate and reference text line evaluation. Although they are mutually independent, the results obtained are strongly cross linked. In the end, its suitability for different types of letters and languages as well as its adaptability are its main advantages. Thus, the paper presents an efficient evaluation method for text analysis algorithms.

  6. Difficulties in translation of socio-political texts

    Directory of Open Access Journals (Sweden)

    Артур Нарманович Мамедов

    2013-12-01

    Full Text Available Belonging of Russian socio-political texts to publicistic style assumes being guided by functional approach in order to find most adequate linguistic means by transfer of pragmatic meaning of the source text. Intralinguistic meaning can slightly remain by the interpretation of German texts. Lexical and grammatical transformations help preserving semantic-syntactic structure of the target text which means achievement of the same communicative effect by the translate which is being achieved by the source text.

  7. The home concept in poetic texts: new ways of understanding

    Directory of Open Access Journals (Sweden)

    С А Радзиевская

    2010-03-01

    Full Text Available The article focuses on the analysis of the HOME concept in American poetic texts and on the description of the model of its content. Linguocognitive mechanisms of the formation of the images of home are revealed.

  8. About development of transcultural relevance

    Directory of Open Access Journals (Sweden)

    Victoria BARAGA

    2012-01-01

    Full Text Available Nowadays, it discusses more frequently and more acute the problem of spiritual crisis ofmodern people. A solution for removing our world from deadlock is promoted by BasarabNicolescu, french-roumanian physicist and philosopher. Basaran Nicolescu proposes the concept ofLevels of Reality, starting from researches and discoveries in quantum physics and from the logic ofquantum pshysics. The concept of Levels of Reality is implemented in the direction of the way ofsetting a trandisciplinary culture, with theory of third included and the idea of complexity. Thequantum revolution requests the intelligence revolution. According to the new evaluation ofmultiple reality, the sacred - as a primary source of our values - is rehabilitated, but released fromthe captivity of religiosity. „Trans” signifies what it is between, in and what it transcends them. Inthis case we can speak about transculturation, transreligiosity or transliterarity.

  9. Narrow-Bicliques: Cryptanalysis of Full IDEA

    DEFF Research Database (Denmark)

    Khovratovich, D.; Leurent, G.; Rechberger, C.

    2012-01-01

    We apply and extend the recently introduced biclique framework to IDEA and for the first time describe an approach to noticeably speed-up key-recovery for the full 8.5 round IDEA.We also show that the biclique approach to block cipher cryptanalysis not only obtains results on more rounds, but also...... extended with ways to allow for a significantly reduced data complexity with everything else being equal. For this we use available degrees of freedom as known from hash cryptanalysis to narrow the relevant differential trails. Our cryptanalysis is of high computational complexity, and does not threaten...

  10. Discussion on the Relevant Factors of General Surgery Incision Infection and Prevention Methods

    Directory of Open Access Journals (Sweden)

    Jin Baotao

    2017-01-01

    Full Text Available There are many reasons that can lead to incision infection of general surgical patients. The main reasons include weight, age, body albumin level, surgical time, observation ward, etc. This paper analyzes the clinic data of patients with incision infection after general surgery based on clinic practice and study on the reasons that have impact on general surgical incision infection and gives relevant prevention countermeasures.

  11. THE RELEVANCE OF THE PERFORMANCE INDICATORS IN ECONOMIC AND FINANCIAL DIAGNOSIS

    Directory of Open Access Journals (Sweden)

    MIRELA MONEA

    2011-01-01

    Full Text Available Each company must achieve the objectives to reach performance in order to survive on the market. The paper aims to present the concept of performance as is seen in economic literature, to discuss the relevance of the main performances indicators on economic and financial diagnosis, to answer the question what are the main indicators which reflect economic or financial performances: profit, profitability ratios, economic added value, investments return, liquidity, cash-flows, resources efficiency, productivity, others.

  12. Mining protein function from text using term-based support vector machines

    Science.gov (United States)

    Rice, Simon B; Nenadic, Goran; Stapley, Benjamin J

    2005-01-01

    Background Text mining has spurred huge interest in the domain of biology. The goal of the BioCreAtIvE exercise was to evaluate the performance of current text mining systems. We participated in Task 2, which addressed assigning Gene Ontology terms to human proteins and selecting relevant evidence from full-text documents. We approached it as a modified form of the document classification task. We used a supervised machine-learning approach (based on support vector machines) to assign protein function and select passages that support the assignments. As classification features, we used a protein's co-occurring terms that were automatically extracted from documents. Results The results evaluated by curators were modest, and quite variable for different problems: in many cases we have relatively good assignment of GO terms to proteins, but the selected supporting text was typically non-relevant (precision spanning from 3% to 50%). The method appears to work best when a substantial set of relevant documents is obtained, while it works poorly on single documents and/or short passages. The initial results suggest that our approach can also mine annotations from text even when an explicit statement relating a protein to a GO term is absent. Conclusion A machine learning approach to mining protein function predictions from text can yield good performance only if sufficient training data is available, and significant amount of supporting data is used for prediction. The most promising results are for combined document retrieval and GO term assignment, which calls for the integration of methods developed in BioCreAtIvE Task 1 and Task 2. PMID:15960835

  13. Possessives and relevance | Taylor | Stellenbosch Papers in ...

    African Journals Online (AJOL)

    Stellenbosch Papers in Linguistics. Journal Home · ABOUT THIS JOURNAL · Advanced Search · Current Issue · Archives · Journal Home > Vol 26 (1993) >. Log in or Register to get access to full text downloads.

  14. How Full is Full Employment? : How Tools and Not Theory Explained Full Employment

    NARCIS (Netherlands)

    Rodenburg, P.

    2016-01-01

    The post-war debate on full employment policy was blurred and unclear since the concept of full employment itself was theoretically unclear and un-operational. Unable to theoretically determine the unemployment level of full employment, economists tried to find more empirically based ways to

  15. Using Text Models In Diagnostic Tasks.

    Directory of Open Access Journals (Sweden)

    Korostil Yuriy

    2015-09-01

    Full Text Available This paper contains developing of a method of solving diagnostic tasks for complex technical objects (STO based on using text models (TMi to describe the functioning of STO. A TMi model is a text description, in normalized form, of all fragments of STO functioning process. The description of TMi is for med using semantic vocabularies of different types, which are generated on the basis of usage of information about all the aspects of STO construction and functioning. Such interpretation description is a subject area for tasks of STO diagnostics. Detection of malfunction and deviations of a functioning process of STO from an established functioning mode is implemented on the basis of analysis of semantic parameters of text description of the STO functioning process in order to determine semantic anomalies which occur in the descriptions of the STO functioning process, as well as in the descriptions of fragments of its functioning. Semantic anomalies occur in case when values of semantic parameters go beyond their established limits.

  16. Modeling statistical properties of written text.

    Directory of Open Access Journals (Sweden)

    M Angeles Serrano

    Full Text Available Written text is one of the fundamental manifestations of human language, and the study of its universal regularities can give clues about how our brains process information and how we, as a society, organize and share it. Among these regularities, only Zipf's law has been explored in depth. Other basic properties, such as the existence of bursts of rare words in specific documents, have only been studied independently of each other and mainly by descriptive models. As a consequence, there is a lack of understanding of linguistic processes as complex emergent phenomena. Beyond Zipf's law for word frequencies, here we focus on burstiness, Heaps' law describing the sublinear growth of vocabulary size with the length of a document, and the topicality of document collections, which encode correlations within and across documents absent in random null models. We introduce and validate a generative model that explains the simultaneous emergence of all these patterns from simple rules. As a result, we find a connection between the bursty nature of rare words and the topical organization of texts and identify dynamic word ranking and memory across documents as key mechanisms explaining the non trivial organization of written text. Our research can have broad implications and practical applications in computer science, cognitive science and linguistics.

  17. INNER DIALOGICITY OF MEDICAL SCIENTIFIC TEXTS

    Directory of Open Access Journals (Sweden)

    Efremova Nataliya Vladimirovna

    2015-06-01

    Full Text Available The author studies inner dialogicity as an integral property of a scientist's thinking activity, a way of a scientific idea development, one of the cognitive and discursive mechanisms of new knowledge formation, its crystallization and dementalisation in a text, as a way of search for truth. Such approach to dialogicity in the study of a scientific text makes it possible to analyze the cogitative processes proceeding in human consciousness and cognitive activity, allows to fully understand the stated scientific concept, to define pragmatic strategies of the author, to plunge into his reflexive world. On the material of medical scientific texts of N.M. Amosov and F. G. Uglov, famous scientists in the field of cardio surgery, it is established that traces of internal dialogicity manifestation in the textual space of scientists actualize the origin of new knowledge, the change of author's semantic positions, his ability to reflect, compare, analyze his own thoughts and actions, to estimate oneself and the features of thinking process which are realized in logic of a statement of the scientific concept, an explanation of concepts, terms at judgment of the points of view of contemporaries and predecessors, adherents and scientist's opponents, and also orientation to the addressee's presupposition, activization of his cogitative activity. Linguistic, discursive, verbal analysis singles out the impact on the addressee, his mental activity.

  18. AUTHENTIC TEXTS FOR CRITICAL READING ACTIVITIES

    Directory of Open Access Journals (Sweden)

    Ila Amalia

    2016-03-01

    Full Text Available This research takes an action research aimed at promoting critical reading (“thinking” while reading skills using authentic materials among the students. This research also aims to reveal the students perception on using critical reading skills in reading activities. Nineteen English Education Department students who took Reading IV class, participated in this project. There were three cycles with three different critical reading strategies were applied. Meanwhile, the authentic materials were taken from newspaper and internet articles. The result revealed that the use of critical reading strategies along with the use of authentic materials has improved students’ critical reading skills as seen from the improvement of each cycle - the students critical reading skill was 54% (fair in the cycle 1 improved to 68% (average in cycle 2, and 82% (good in cycle 3.. In addition, based on the critical reading skill criteria, the students’ critical reading skill has improved from 40% (nearly meet to 80% (exceed. Meanwhile, from the students’ perception questionnaire, it was shown that 63% students agreed the critical reading activity using authentic text could improve critical thinking and 58% students agreed that doing critical reading activity could improve reading comprehension. The result had the implication that the use of authentic texts could improve students’ critical reading skills if it was taught by performing not lecturing them. Selectively choosing various strategies and materials can trigger students’ activeness in responding to a text, that eventually shape their critical reading skills.

  19. Prayer in Qumran texts. A brief introduction

    Directory of Open Access Journals (Sweden)

    Zdzisław J. Kapera

    2011-03-01

    Full Text Available Of some three hundred literary texts found in the caves of the Judaean Desert and those close to Khirbet Qumran, 56 are various pieces of poetry and liturgy. Seven specific groups have been distinguished among them: 1. Liturgy on sunshine and sunset and on specific days; 2. Liturgy on specific ceremonies of the community; 3. Eschatological prayers; 4. Magic texts; 5. Collections of psalms (including pseudepigrapha; 6. Thanksgiving hymns; 7. Prose prayers. The issue of how the Qumranians were praying is here briefly touched upon. Then there is a description of morning and evening prayers, Sabbath prayers, specific liturgy of the annual ceremony of entering the New Covenant, the Hodayot (Thanksgiving Hymns, pseudepigraphic Psalms (like Ps 151, and the eschatological prayers. The introduction ends with a summary evaluation of the role of the texts in reconstructing the historical development of the Jewish prayer of the late Second Temple period. The need to study the relationship of the Qumran prayers with the early Christian prayers is also briefly discussed.

  20. Cell Phoning and Texting While Driving

    Directory of Open Access Journals (Sweden)

    Judy Honoria Rosaire Telemaque

    2015-07-01

    Full Text Available A qualitative phenomenological study was conducted on the consequences of cell phone use while operating a vehicle. We discussed why talking and texting on cell phones are so popular through the analysis of our interviews with police officers, driving instructors, and parents of teens and young adults. The participants came from central, northeastern, northwestern, and southeastern Connecticut. All had exposure with respect to the effects of cell phone usage problem. The study reached a point of theoretical saturation or redundancy by which the analysis no longer resulted in new themes. We concluded that the discoveries revealed the necessity for education, expansion of technology, and additional driver education preparation, which may provide a path for leadership to help solve the problem.

  1. Using ontology network structure in text mining.

    Science.gov (United States)

    Berndt, Donald J; McCart, James A; Luther, Stephen L

    2010-11-13

    Statistical text mining treats documents as bags of words, with a focus on term frequencies within documents and across document collections. Unlike natural language processing (NLP) techniques that rely on an engineered vocabulary or a full-featured ontology, statistical approaches do not make use of domain-specific knowledge. The freedom from biases can be an advantage, but at the cost of ignoring potentially valuable knowledge. The approach proposed here investigates a hybrid strategy based on computing graph measures of term importance over an entire ontology and injecting the measures into the statistical text mining process. As a starting point, we adapt existing search engine algorithms such as PageRank and HITS to determine term importance within an ontology graph. The graph-theoretic approach is evaluated using a smoking data set from the i2b2 National Center for Biomedical Computing, cast as a simple binary classification task for categorizing smoking-related documents, demonstrating consistent improvements in accuracy.

  2. Relevance theory: pragmatics and cognition.

    Science.gov (United States)

    Wearing, Catherine J

    2015-01-01

    Relevance Theory is a cognitively oriented theory of pragmatics, i.e., a theory of language use. It builds on the seminal work of H.P. Grice(1) to develop a pragmatic theory which is at once philosophically sensitive and empirically plausible (in both psychological and evolutionary terms). This entry reviews the central commitments and chief contributions of Relevance Theory, including its Gricean commitment to the centrality of intention-reading and inference in communication; the cognitively grounded notion of relevance which provides the mechanism for explaining pragmatic interpretation as an intention-driven, inferential process; and several key applications of the theory (lexical pragmatics, metaphor and irony, procedural meaning). Relevance Theory is an important contribution to our understanding of the pragmatics of communication. © 2014 John Wiley & Sons, Ltd.

  3. Clinical relevance in anesthesia journals

    DEFF Research Database (Denmark)

    Lauritsen, Jakob; Møller, Ann M

    2006-01-01

    The purpose of this review is to present the latest knowledge and research on the definition and distribution of clinically relevant articles in anesthesia journals. It will also discuss the importance of the chosen methodology and outcome of articles.......The purpose of this review is to present the latest knowledge and research on the definition and distribution of clinically relevant articles in anesthesia journals. It will also discuss the importance of the chosen methodology and outcome of articles....

  4. Terminology extraction from medical texts in Polish.

    Science.gov (United States)

    Marciniak, Małgorzata; Mykowiecka, Agnieszka

    2014-01-01

    Hospital documents contain free text describing the most important facts relating to patients and their illnesses. These documents are written in specific language containing medical terminology related to hospital treatment. Their automatic processing can help in verifying the consistency of hospital documentation and obtaining statistical data. To perform this task we need information on the phrases we are looking for. At the moment, clinical Polish resources are sparse. The existing terminologies, such as Polish Medical Subject Headings (MeSH), do not provide sufficient coverage for clinical tasks. It would be helpful therefore if it were possible to automatically prepare, on the basis of a data sample, an initial set of terms which, after manual verification, could be used for the purpose of information extraction. Using a combination of linguistic and statistical methods for processing over 1200 children hospital discharge records, we obtained a list of single and multiword terms used in hospital discharge documents written in Polish. The phrases are ordered according to their presumed importance in domain texts measured by the frequency of use of a phrase and the variety of its contexts. The evaluation showed that the automatically identified phrases cover about 84% of terms in domain texts. At the top of the ranked list, only 4% out of 400 terms were incorrect while out of the final 200, 20% of expressions were either not domain related or syntactically incorrect. We also observed that 70% of the obtained terms are not included in the Polish MeSH. Automatic terminology extraction can give results which are of a quality high enough to be taken as a starting point for building domain related terminological dictionaries or ontologies. This approach can be useful for preparing terminological resources for very specific subdomains for which no relevant terminologies already exist. The evaluation performed showed that none of the tested ranking procedures were

  5. Semi-Spontaneous Oral Text Production: Measurements in Clinical Practice

    Science.gov (United States)

    Lind, Marianne; Kristoffersen, Kristian Emil; Moen, Inger; Simonsen, Hanne Gram

    2009-01-01

    Functionally relevant assessment of the language production of speakers with aphasia should include assessment of connected speech production. Despite the ecological validity of everyday conversations, more controlled and monological types of texts may be easier to obtain and analyse in clinical practice. This article discusses some simple…

  6. THE COMPLEX OF EMOTIONAL EXPERIENCES, RELEVANT MANIFESTATIONS OF INSPIRATION

    Directory of Open Access Journals (Sweden)

    Pavel A. Starikov

    2015-01-01

    Full Text Available The aim of the study is to investigate structure of emotional experiences, relevant manifestations of inspiration creative activities of students.Methods. The proposed methods of mathematical statistics (correlation analysis, factor analysis, multidimensional scaling are applied.Results and scientific novelty. The use of factor analysis, multidimensional scaling allowed to reveal a consistent set of positive experiences of the students, the relevant experience of inspiration in creative activities. «Operational» rueful feelings dedicated by M. Chiksentmihaji («feeling of full involvement, and dilution in what you do», «feeling of concentration, perfect clarity of purpose, complete control and a feeling of total immersion in a job that does not require special efforts» and experiences of the «spiritual» nature, more appropriate to peaks experiences of A. Maslow («feeling of love for all existing, all life»; «a deep sense of self importance, the inner feeling of approval of self»; «feeling of unity with the whole world»; «acute perception of the beauty of the world of nature, “beautiful instant”»; «feeling of lightness, flowing» are included in this complex in accordance with the study results. The interrelation of degree of expressiveness of the given complex of experiences with inspiration experience is considered.Practical significance. The results of the study show structure of emotional experiences, relevant manifestations of inspiration. Research materials can be useful both to psychologists, and experts in the field of pedagogy of creative activity.

  7. Support Vector Machines: Relevance Feedback and Information Retrieval.

    Science.gov (United States)

    Drucker, Harris; Shahrary, Behzad; Gibbon, David C.

    2002-01-01

    Compares support vector machines (SVMs) to Rocchio, Ide regular and Ide dec-hi algorithms in information retrieval (IR) of text documents using relevancy feedback. If the preliminary search is so poor that one has to search through many documents to find at least one relevant document, then SVM is preferred. Includes nine tables. (Contains 24…

  8. Making School Development Credible. Text, Context, Irony

    Directory of Open Access Journals (Sweden)

    Mats Börjesson

    2012-01-01

    Full Text Available

    The article argues for the importance of an open, reflexive-methodological approach when switching between studying text, context and researcher activity. Close linguistic analysis can benefit from being linked with the researcher’s contextualisation of his empirical material as well as with more distanced readings. The more specific starting point for this article is that school development, like other similar terms such as school improvement and the like, makes use of linguistic building blocks with which whole narratives about today’s and tomorrow’s schools can be constructed. The subject of the study is a short text issued by the Swedish Schools Inspectorate (Skolinspektionen. Government language changes according to the authorities’ role in society and their own definitions of their functions, and an important aspect here is the legitimacy of the authorities’ texts. By means of various kinds of close linguistic analysis, the above-mentioned text is studied with regard to choice of categories, hierarchies of modalisation and the rhetorical effects of different types of formulations in a broader political-social landscape. The article concludes with a reflective discussion on the relationship between government language and irony as a stylistic device – a device that is based on the results of the close empirical analysis.[i]



    [i] The article is part of the project ”School  Development as Narrative”, funded by the Swedish Research Council. The author would like to thank the two reviewers for very valuable comments.

  9. Methodological Details and Full Bibliography

    Data.gov (United States)

    U.S. Environmental Protection Agency — This dataset has several components, The first part describes fully our literature review, providing details not included in the text. The second part provides all...

  10. Reconfigurable Full-Page Braille Displays

    Science.gov (United States)

    Garner, H. Douglas

    1994-01-01

    Electrically actuated braille display cells of proposed type arrayed together to form full-page braille displays. Like other braille display cells, these provide changeable patterns of bumps driven by digitally recorded text stored on magnetic tapes or in solid-state electronic memories. Proposed cells contain electrorheological fluid. Viscosity of such fluid increases in strong electrostatic field.

  11. Impact of Non Accounting Information on The Value Relevance of Accounting Information: The Case of Jordan

    Directory of Open Access Journals (Sweden)

    DHIAA SHAMKI

    2013-07-01

    Full Text Available The paper presents empirical evidence about the impact of firm’s shareholders number as non accounting information on the value relevance of its earnings and book value of equity as accounting information for Jordanian industrial firms for the period from 1993 to 2002. Employing the return regression analysis and using shareholders number in two proxies namely local and foreign shareholders number, the findings of the study are fourfold. First, Individual earnings are value relevant while book value is irrelevant. Second, combining earnings with book value leads both of them to be irrelevant. Third, extending local shareholders number has significant impact on the value relevance of individual and combined earnings. Forth, extending foreign shareholders number has significant impact on the value relevance of individual book value and combined earnings. Since studies on the value relevance of these variables have neglected Jordan (and the Middle Eastern region, the study is the first especially in Jordan that tries to fill this gap by examiningthe impact of shareholders numbers on the value relevance of earnings and book valueto indicate firm value.

  12. Relevant Skills for Criminal Accounting Expertise: The Perception of Federal Police Experts and Delegates

    Directory of Open Access Journals (Sweden)

    Carlos Roberto dos Santos Filho

    2017-03-01

    Full Text Available This research aimed to identify which skills are considered most relevant to the practice of criminal accounting expertise in Brazil. As in international research, the skills perceived as most relevant were written communication, deductive analysis and critical thinking. Among the less relevant skills were the interview and the solution and negotiation of conflicts. In the second part, while experts and delegates jointly consider written communication to be the most present skills, delegates diverge from experts in terms of critical thinking and serenity. In addition, the respondents indicated skills that had not been investigated, and the most cited skills were proactivity, objectivity and updating. In the light of forensic accounting, the research method used was the survey, using a predefined questionnaire with open and closed questions, which 144 respondents answered. The study was divided into three parts: the first about the perceived relevance of the skills, the second about the perceived practical application of skills and the third part allowed the respondents to contribute with suggestions of skills that were considered relevant but did not figure among the skills investigated. The study contributes to the establishment of curricular guidelines for undergraduate and postgraduate courses related to the training of skills considered relevant for the training of future professionals and for the improvement of criminal accounting experts. Finally, we observe that the skills investigated and suggested can contribute to all areas of accounting expertise.

  13. Attentional Capture and Inhibition of Saccades after Irrelevant and Relevant Cues

    Directory of Open Access Journals (Sweden)

    Heinz-Werner Priess

    2014-01-01

    Full Text Available Attentional capture is usually stronger for task-relevant than irrelevant stimuli, whereas irrelevant stimuli can trigger equal or even stronger amounts of inhibition than relevant stimuli. Capture and inhibition, however, are typically assessed in separate trials, leaving it open whether or not inhibition of irrelevant stimuli is a consequence of preceding attentional capture by the same stimuli or whether inhibition is the only response to these stimuli. Here, we tested the relationship between capture and inhibition in a setup allowing for estimates of the capture and inhibition based on the very same trials. We recorded saccadic inhibition after relevant and irrelevant stimuli. At the same time, we recorded the N2pc, an event-related potential, reflecting initial capture of attention. We found attentional capture not only for, relevant but importantly also for irrelevant stimuli, although the N2pc was stronger for relevant than irrelevant stimuli. In addition, inhibition of saccades was the same for relevant and irrelevant stimuli. We conclude with a discussion of the mechanisms that are responsible for these effects.

  14. Value Relevance of Investment Properties: Evidence from the Brazilian Capital Market

    Directory of Open Access Journals (Sweden)

    Ketlyn Alves Gonçalves

    2017-04-01

    Full Text Available This study investigates the relevance to the capital market of the assets recognized as investment properties of companies listed on the BM&F BOVESPA, in the period from 2011 to 2014. The research conducted was based on the Ohlson model (1995 and panel analysis was carried out using linear regression with POLS and Fixed and Random Effects estimators. Two hypothesis were made: (i that Earning and Equity generate accounting information relevant to investors; and (2 that Earning, Equity and Investment Property generate accounting information relevant to investors, assuming that investment properties have incremental effect on the relevance of this information relative only to earning and to equity. Both hypotheses were rejected, so it is concluded that Investment Property assets are not of value relevance in the determination of share price and do not influence the decision making of users of accounting information. The study adds to the limited literature on the value relevance of Investment Property, permitting a better understanding of the impact of accounting disclosures used by companies on their market value.

  15. An evidence perspective on topical relevance types and its implications for exploratory and task-based retrieval

    Directory of Open Access Journals (Sweden)

    Xiaoli Huang

    2006-01-01

    Full Text Available Introduction. The concept of relevance lies at the heart of intellectual access and information retrieval, indeed of reasoning and communication in general; in turn, topical relevance lies at the heart of relevance. The common view of topical relevance is limited to topic matching, resulting in information retrieval systems' failure to detect more complex topical connections which are needed to respond to diversified user situations and tasks. Method. Based on the role a piece of information plays in the overall structure of an argument, we have identified four topical relevance types: Direct, Indirect (circumstantial, Context, and Comparison. In the process of creating a speech retrieval test collection, graduate history students made 27,000 topical relevance assessments between Holocaust survivor interview segments and real user topics, using the four relevance types, each on a scale of 0 to 4. They recorded justifications for their assessments and kept detailed Topic Notes. Analysis. We analysed these relevance assessments using a grounded theory approach to arrive at a finer classification of topical relevance types. Results. For example, indirect relevance(a piece of information is connected to the topic indirectly through inference, circumstantial evidence was refined to Generic Indirect Relevance, Backward Inference (abduction, Forward Inference (deduction, and Inference from Cases (induction, with each subtype being further illustrated and explicated by examples. Conclusion. Each of these refined types of topical relevance plays a special role in reasoning, making a conclusive argument, or performing a task. Incorporating them into information retrieval systems allows users more flexibility and a better focus on their tasks. They can also be used in teaching reasoning skills.

  16. A relevance theoretic approach to intertextuality in print advertising

    African Journals Online (AJOL)

    Anonymous vs. acknowledged intertexts: A relevance theoretic approach to intertextuality in print advertising. ... make intertextual references to texts from mass media genres other than advertising as part of an ... AJOL African Journals Online.

  17. The interjection in old Romanian texts

    Directory of Open Access Journals (Sweden)

    Margareta Manu Magda

    2017-09-01

    Full Text Available The paper tries to identify the special problems posed by the study of interjection based on the examination of a corpus of texts from the old Romanian (1600–1780, referring to texts from modern Romanian. We have watched how certain interjectional formations have acquired, through diachronic expansion, new grammatical, semantic and pragmatic values.The structure of the paper is the following: the introduction (§1 summarizes the author’s position on the status of the interjection category at a morphosyntactic, semantic and pragmatic level (§1.1 and on the relation between different linguistic structures and their grammaticalization / pragmaticalization process (§1.2. The second section (§2 refers to the specific routes followed by the evolution of the various categories of the analysed interjections, from the old Romanian to the modern Romanian: the presentatives adecă, iată, ni (§2.1, the hortatives haide, ni (§2.2, the addressing particles bre, măi (§2.3, the connectors with demarcation signal function adevăr, amin (§2.4. The third section (§3 has as objective the description of a species of delocutive derivation, illustrated in Romanian by the lexicalized semantic variants of the secondary interjection Doamne!. The study concludes with several final considerations regarding the results of the research (§4.

  18. Speech Act Classification of German Advertising Texts

    Directory of Open Access Journals (Sweden)

    Артур Нарманович Мамедов

    2015-12-01

    Full Text Available This paper uses the theory of speech acts and the underlying concept of pragmalinguistics to determine the types of speech acts and their classification in the German advertising printed texts. We ascertain that the advertising of cars and accessories, household appliances and computer equipment, watches, fancy goods, food, pharmaceuticals, and financial, insurance, legal services and also airline advertising is dominated by a pragmatic principle, which is based on demonstrating information about the benefits of a product / service. This influences the frequent usage of certain speech acts. The dominant form of exposure is to inform the recipient-user about the characteristics of the advertised product. This information is fore-grounded by means of stylistic and syntactic constructions specific to the advertisement (participial constructions, appositional constructions which contribute to emphasize certain notional components within the framework of the advertising text. Stylistic and syntactic devices of reduction (parceling constructions convey the author's idea. Other means like repetitions, enumerations etc are used by the advertiser to strengthen his selling power. The advertiser focuses the attention of the consumer on the characteristics of the product seeking to convince him of the utility of the product and to influence his/ her buying behavior.

  19. Helios: Understanding Solar Evolution Through Text Analytics

    Energy Technology Data Exchange (ETDEWEB)

    Randazzese, Lucien [SRI International, Menlo Park, CA (United States)

    2016-12-02

    This proof-of-concept project focused on developing, testing, and validating a range of bibliometric, text analytic, and machine-learning based methods to explore the evolution of three photovoltaic (PV) technologies: Cadmium Telluride (CdTe), Dye-Sensitized solar cells (DSSC), and Multi-junction solar cells. The analytical approach to the work was inspired by previous work by the same team to measure and predict the scientific prominence of terms and entities within specific research domains. The goal was to create tools that could assist domain-knowledgeable analysts in investigating the history and path of technological developments in general, with a focus on analyzing step-function changes in performance, or “breakthroughs,” in particular. The text-analytics platform developed during this project was dubbed Helios. The project relied on computational methods for analyzing large corpora of technical documents. For this project we ingested technical documents from the following sources into Helios: Thomson Scientific Web of Science (papers), the U.S. Patent & Trademark Office (patents), the U.S. Department of Energy (technical documents), the U.S. National Science Foundation (project funding summaries), and a hand curated set of full-text documents from Thomson Scientific and other sources.

  20. PEDANT: Parallel Texts in Göteborg

    Directory of Open Access Journals (Sweden)

    Daniel Ridings

    2012-09-01

    Full Text Available

    The article presents the status of the PEDANT project with parallel corpora at the Language Bank at Göteborg University. The solutions for access to the corpus data are presented. Access is provided by way of the internet and standard applications and SGML-aware programming tools. The SGML format for encoding translation pairs is outlined together. The methods allow working with everything from plain text to texts densely encoded with linguistic information.

     

    In hierdie artikel word 'n beskrywing gegee van die stand van die PEDANT-projek met parallelle korpora by die Taalbank by die Universiteit van Göteborg. Oplossings vir die verkryging van toegang tot die korpusdata word aangedui. Toegang word verskaf deur middel van die Internet en standaardtoepassings en SGML-sensitiewe programmeringshulpmiddels. Die SGML-formaat vir die enkodering van vertaalpare word gesamentlik geskets. Hierdie metodes laat toe dat gewerk kan word met enigiets vanaf suiwer teks tot tekste wat taalkundig dig geëtiketteer is.