WorldWideScience

Sample records for Is the computer capable of recognizing different sentence types in a linguistic corpus such as COSMAS II Mannheim, which has not been previously treated by a tagger or a parser

  1. Satzarten unterscheiden - Kann das der Computer? Syntaktische Explorationen anhand von COSMAS II

    Directory of Open Access Journals (Sweden)

    Näf, Anton

    2006-01-01

    Full Text Available Is the computer capable of recognizing different sentence types in a linguistic corpus such as COSMAS II (Mannheim, which has not been previously treated by a tagger or a parser? The answer is in fact no. However, in the present article it is shown that under certain circumstances an automatic distinction is nevertheless possible. Making use of a procedure that we have called Anfragezuspitzung (literally: making a query pointed; encirclement of a grammatical phenomenon by a combination of several specific queries, and taking as a starting point philological prior knowledge that has been gathered "by hand", it proves to be perfectly possible to arrive at a satisfactory result. With the example of sentence types in German, in particular the distinction between interrogative and exclamatory sentences, we demonstrate in this article that such a distinction can be carried out automatically with a high degree of accuracy, e.g. the distinction between War das eine gute Idee? (Was this a good idea? and War das eine gute Idee! (What a good idea this was!.

  2. Evaluating stance-annotated sentences from the Brexit Blog Corpus: A quantitative linguistic analysis

    Directory of Open Access Journals (Sweden)

    Simaki Vasiliki

    2018-03-01

    Full Text Available This paper offers a formally driven quantitative analysis of stance-annotated sentences in the Brexit Blog Corpus (BBC. Our goal is to identify features that determine the formal profiles of six stance categories (contrariety, hypotheticality, necessity, prediction, source of knowledge and uncertainty in a subset of the BBC. The study has two parts: firstly, it examines a large number of formal linguistic features, such as punctuation, words and grammatical categories that occur in the sentences in order to describe the specific characteristics of each category, and secondly, it compares characteristics in the entire data set in order to determine stance similarities in the data set. We show that among the six stance categories in the corpus, contrariety and necessity are the most discriminative ones, with the former using longer sentences, more conjunctions, more repetitions and shorter forms than the sentences expressing other stances. necessity has longer lexical forms but shorter sentences, which are syntactically more complex. We show that stance in our data set is expressed in sentences with around 21 words per sentence. The sentences consist mainly of alphabetical characters forming a varied vocabulary without special forms, such as digits or special characters.

  3. ChemicalTagger: A tool for semantic text-mining in chemistry.

    Science.gov (United States)

    Hawizy, Lezan; Jessop, David M; Adams, Nico; Murray-Rust, Peter

    2011-05-16

    The primary method for scientific communication is in the form of published scientific articles and theses which use natural language combined with domain-specific terminology. As such, they contain free owing unstructured text. Given the usefulness of data extraction from unstructured literature, we aim to show how this can be achieved for the discipline of chemistry. The highly formulaic style of writing most chemists adopt make their contributions well suited to high-throughput Natural Language Processing (NLP) approaches. We have developed the ChemicalTagger parser as a medium-depth, phrase-based semantic NLP tool for the language of chemical experiments. Tagging is based on a modular architecture and uses a combination of OSCAR, domain-specific regex and English taggers to identify parts-of-speech. The ANTLR grammar is used to structure this into tree-based phrases. Using a metric that allows for overlapping annotations, we achieved machine-annotator agreements of 88.9% for phrase recognition and 91.9% for phrase-type identification (Action names). It is possible parse to chemical experimental text using rule-based techniques in conjunction with a formal grammar parser. ChemicalTagger has been deployed for over 10,000 patents and has identified solvents from their linguistic context with >99.5% precision.

  4. ChemicalTagger: A tool for semantic text-mining in chemistry

    Directory of Open Access Journals (Sweden)

    Hawizy Lezan

    2011-05-01

    Full Text Available Abstract Background The primary method for scientific communication is in the form of published scientific articles and theses which use natural language combined with domain-specific terminology. As such, they contain free owing unstructured text. Given the usefulness of data extraction from unstructured literature, we aim to show how this can be achieved for the discipline of chemistry. The highly formulaic style of writing most chemists adopt make their contributions well suited to high-throughput Natural Language Processing (NLP approaches. Results We have developed the ChemicalTagger parser as a medium-depth, phrase-based semantic NLP tool for the language of chemical experiments. Tagging is based on a modular architecture and uses a combination of OSCAR, domain-specific regex and English taggers to identify parts-of-speech. The ANTLR grammar is used to structure this into tree-based phrases. Using a metric that allows for overlapping annotations, we achieved machine-annotator agreements of 88.9% for phrase recognition and 91.9% for phrase-type identification (Action names. Conclusions It is possible parse to chemical experimental text using rule-based techniques in conjunction with a formal grammar parser. ChemicalTagger has been deployed for over 10,000 patents and has identified solvents from their linguistic context with >99.5% precision.

  5. A universal multilingual weightless neural network tagger via quantitative linguistics.

    Science.gov (United States)

    Carneiro, Hugo C C; Pedreira, Carlos E; França, Felipe M G; Lima, Priscila M V

    2017-07-01

    In the last decade, given the availability of corpora in several distinct languages, research on multilingual part-of-speech tagging started to grow. Amongst the novelties there is mWANN-Tagger (multilingual weightless artificial neural network tagger), a weightless neural part-of-speech tagger capable of being used for mostly-suffix-oriented languages. The tagger was subjected to corpora in eight languages of quite distinct natures and had a remarkable accuracy with very low sample deviation in every one of them, indicating the robustness of weightless neural systems for part-of-speech tagging tasks. However, mWANN-Tagger needed to be tuned for every new corpus, since each one required a different parameter configuration. For mWANN-Tagger to be truly multilingual, it should be usable for any new language with no need of parameter tuning. This article proposes a study that aims to find a relation between the lexical diversity of a language and the parameter configuration that would produce the best performing mWANN-Tagger instance. Preliminary analyses suggested that a single parameter configuration may be applied to the eight aforementioned languages. The mWANN-Tagger instance produced by this configuration was as accurate as the language-dependent ones obtained through tuning. Afterwards, the weightless neural tagger was further subjected to new corpora in languages that range from very isolating to polysynthetic ones. The best performing instances of mWANN-Tagger are again the ones produced by the universal parameter configuration. Hence, mWANN-Tagger can be applied to new corpora with no need of parameter tuning, making it a universal multilingual part-of-speech tagger. Further experiments with Universal Dependencies treebanks reveal that mWANN-Tagger may be extended and that it has potential to outperform most state-of-the-art part-of-speech taggers if better word representations are provided. Copyright © 2017 Elsevier Ltd. All rights reserved.

  6. Is human sentence parsing serial or parallel? Evidence from event-related brain potentials.

    Science.gov (United States)

    Hopf, Jens-Max; Bader, Markus; Meng, Michael; Bayer, Josef

    2003-01-01

    In this ERP study we investigate the processes that occur in syntactically ambiguous German sentences at the point of disambiguation. Whereas most psycholinguistic theories agree on the view that processing difficulties arise when parsing preferences are disconfirmed (so-called garden-path effects), important differences exist with respect to theoretical assumptions about the parser's recovery from a misparse. A key distinction can be made between parsers that compute all alternative syntactic structures in parallel (parallel parsers) and parsers that compute only a single preferred analysis (serial parsers). To distinguish empirically between parallel and serial parsing models, we compare ERP responses to garden-path sentences with ERP responses to truly ungrammatical sentences. Garden-path sentences contain a temporary and ultimately curable ungrammaticality, whereas truly ungrammatical sentences remain so permanently--a difference which gives rise to different predictions in the two classes of parsing architectures. At the disambiguating word, ERPs in both sentence types show negative shifts of similar onset latency, amplitude, and scalp distribution in an initial time window between 300 and 500 ms. In a following time window (500-700 ms), the negative shift to garden-path sentences disappears at right central parietal sites, while it continues in permanently ungrammatical sentences. These data are taken as evidence for a strictly serial parser. The absence of a difference in the early time window indicates that temporary and permanent ungrammaticalities trigger the same kind of parsing responses. Later differences can be related to successful reanalysis in garden-path but not in ungrammatical sentences. Copyright 2003 Elsevier Science B.V.

  7. Sentence processing and grammaticality in functional linguistics

    DEFF Research Database (Denmark)

    Poulsen, Mads

    finding from research on sentence processing that sentences are processed incrementally. Empirical methods for establishing grammaticality status are discussed and applied in relation to non-WH extraction phenomena in Danish. In Chapter 2, I discuss the use of the notions of grammaticality......The dissertation presents a functional linguistic model of grammaticality and investigates methods for applying this notion in empirical work. The use of the notion of grammaticality in generative grammar has been criticized by functionalists (Harder, 1996; Lakoff & Johnson, 1999), but attempts...... grammaticality. It is concluded that the intuitions of linguists should in principle be considered hypotheses of grammaticality, and that such hypotheses need to be tested with independent data. Such data can for example take the form of corpus data or acceptability judgment experiments. It is furthermore argued...

  8. Combining shallow and deep processing for a robust, fast, deep-linguistic dependency parser

    OpenAIRE

    Schneider, G

    2004-01-01

    This paper describes Pro3Gres, a fast, robust, broad-coverage parser that delivers deep-linguistic grammatical relation structures as output, which are closer to predicate-argument structures and more informative than pure constituency structures. The parser stays as shallow as is possible for each task, combining shallow and deep-linguistic methods by integrating chunking and by expressing the majority of long-distance dependencies in a context-free way. It combines statistical and rule-base...

  9. A generic method for the evaluation of interval type-2 fuzzy linguistic summaries.

    Science.gov (United States)

    Boran, Fatih Emre; Akay, Diyar

    2014-09-01

    Linguistic summarization has turned out to be an important knowledge discovery technique by providing the most relevant natural language-based sentences in a human consistent manner. While many studies on linguistic summarization have handled ordinary fuzzy sets [type-1 fuzzy set (T1FS)] for modeling words, only few of them have dealt with interval type-2 fuzzy sets (IT2FS) even though IT2FS is better capable of handling uncertainties associated with words. Furthermore, the existent studies work with the scalar cardinality based degree of truth which might lead to inconsistency in the evaluation of interval type-2 fuzzy (IT2F) linguistic summaries. In this paper, to overcome this shortcoming, we propose a novel probabilistic degree of truth for evaluating IT2F linguistic summaries in the forms of type-I and type-II quantified sentences. We also extend the properties that should be fulfilled by any degree of truth on linguistic summarization with T1FS to IT2F environment. We not only prove that our probabilistic degree of truth satisfies the given properties, but also illustrate by examples that it provides more consistent results when compared to the existing degree of truth in the literature. Furthermore, we carry out an application on linguistic summarization of time series data of Europe Brent Spot Price, along with a comparison of the results achieved with our approach and that of the existing degree of truth in the literature.

  10. UNITS OF MEASUREMENT: ORAL TRADITION, TRANSLATION STUDIES AND CORPUS LINGUISTICS

    Directory of Open Access Journals (Sweden)

    John ZEMKE

    2017-06-01

    Full Text Available The study of the world’s verbal arts offers an opportunity to consider ways that computational analysis and modeling of narratives may lead to new understandings of how they are constructed, their dynamics and relationships. Similarly, as corpus linguistics operations must define metrics, it offers an occasion to review basic interpretive concepts such as “units of analysis, context, and genre." My essay begins with an admittedly cursory overview from a novice perspective of what capabilities corpus linguistics currently possesses for the analysis and modeling of narratives. Consideration is given to the epistemological issue in the social sciences with the positivistic prescription or empiricist description of units of analysis and the potential pitfalls or advantages corpus linguistics encounters in searching for adequate equivalent terms. This review leads naturally to reflection on the crucial determinative action of context on meaning and the extent to which current computational interfaces are able to account for and integrate into global analysis of linguistic and performance dimensions such as performer, intonation, gesture, diction, idioms and figurative language, setting, audience, time, and occasion. As a tentative conclusion from this review, it can be stated that artificial intelligence for modeling narratives or devising narrative algorithms must develop capacities to account for performance dimensions in order to fulfill their analytical potential.

  11. A corpus of full-text journal articles is a robust evaluation tool for revealing differences in performance of biomedical natural language processing tools.

    Science.gov (United States)

    Verspoor, Karin; Cohen, Kevin Bretonnel; Lanfranchi, Arrick; Warner, Colin; Johnson, Helen L; Roeder, Christophe; Choi, Jinho D; Funk, Christopher; Malenkiy, Yuriy; Eckert, Miriam; Xue, Nianwen; Baumgartner, William A; Bada, Michael; Palmer, Martha; Hunter, Lawrence E

    2012-08-17

    We introduce the linguistic annotation of a corpus of 97 full-text biomedical publications, known as the Colorado Richly Annotated Full Text (CRAFT) corpus. We further assess the performance of existing tools for performing sentence splitting, tokenization, syntactic parsing, and named entity recognition on this corpus. Many biomedical natural language processing systems demonstrated large differences between their previously published results and their performance on the CRAFT corpus when tested with the publicly available models or rule sets. Trainable systems differed widely with respect to their ability to build high-performing models based on this data. The finding that some systems were able to train high-performing models based on this corpus is additional evidence, beyond high inter-annotator agreement, that the quality of the CRAFT corpus is high. The overall poor performance of various systems indicates that considerable work needs to be done to enable natural language processing systems to work well when the input is full-text journal articles. The CRAFT corpus provides a valuable resource to the biomedical natural language processing community for evaluation and training of new models for biomedical full text publications.

  12. How (not) to train a dependency parser: The curious case of jackknifing part-of-speech taggers

    DEFF Research Database (Denmark)

    Agic, Zeljko; Schluter, Natalie

    2017-01-01

    In dependency parsing, jackknifing taggers is indiscriminately used as a simple adaptation strategy. Here, we empirically evaluate when and how (not) to use jackknifing in parsing. On 26 languages, we reveal a preference that conflicts with, and surpasses the ubiquitous ten-folding. We show no cl...

  13. Asyndetic sentences with a concretiser

    Directory of Open Access Journals (Sweden)

    Tanasić Sreto Z.

    2015-01-01

    Full Text Available The paper discusses asyndetic sentences, compound sentences without a conjunction between the clauses. Slavic scholars pay considerable attention to these sentences. They predominantly consider asyndetic sentences to be a model of compound sentences, apart from the model of compound conjunctional sentences, and plead that they should be described separately. Asyndetic sentences in contemporary Serbian have not been studied sufficiently. There are few specific papers dedicated to asyndetic sentences, and one can say that there are virtually no papers giving them an in-depth treatment. Therefore, we are so far left without a full insight into how widespread that compound sentence model is in contemporary Serbian and in what variants it occurs, not to mention our even lesser knowledge of its distribution in certain functional styles. This paper describes one type of asyndetic sentences in the contemporary Standard Serbian language. It includes such sentences that have a word or a phrase functioning as the verifier of the semantic relation between the clauses of asyndetic sentences. The paper demonstrates that such sentences take up a sizeable portion of the asyndetic sentence corpus, and that a large number of concretisers occur functioning as the verifiers of different meanings which are established between the clauses. The concretisers, similarly to conjunctions in syndetic sentences, serve the purpose of reducing the typical polysemy of asyndetic sentences to monosemy by assigning a monosemic relation between the clauses while foregrounding one of the possible meanings, and suppressing the others. The paper indicates that coordinate asyndetic sentences express a number of different semantic relations between the clauses. Some of them are expressed in complex sentences, some in compound sentences, and there are also those that can be expressed in both types of conjunctional sentences. The paper presents examples of sentences which have in their

  14. Lazy functional parser combinators in Java

    NARCIS (Netherlands)

    Swierstra, D.S.; Dijkstra, A.

    2001-01-01

    A parser is a program that checks if a text is a sentence of the language as described by a grammar. Traditionally, the program text of a parser is generated from a grammar description, after which it is compiled and subsequently run. The language accepted by such a parser is, by the nature of

  15. A grammar-based semantic similarity algorithm for natural language sentences.

    Science.gov (United States)

    Lee, Ming Che; Chang, Jia Wei; Hsieh, Tung Cheng

    2014-01-01

    This paper presents a grammar and semantic corpus based similarity algorithm for natural language sentences. Natural language, in opposition to "artificial language", such as computer programming languages, is the language used by the general public for daily communication. Traditional information retrieval approaches, such as vector models, LSA, HAL, or even the ontology-based approaches that extend to include concept similarity comparison instead of cooccurrence terms/words, may not always determine the perfect matching while there is no obvious relation or concept overlap between two natural language sentences. This paper proposes a sentence similarity algorithm that takes advantage of corpus-based ontology and grammatical rules to overcome the addressed problems. Experiments on two famous benchmarks demonstrate that the proposed algorithm has a significant performance improvement in sentences/short-texts with arbitrary syntax and structure.

  16. Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

    Directory of Open Access Journals (Sweden)

    Jiali Du

    2014-12-01

    Full Text Available This paper discusses the application of computational linguistics in the machine learning (ML system for the processing of garden path sentences. ML is closely related to artificial intelligence and linguistic cognition. The rapid and efficient processing of the complex structures is an effective method to test the system. By means of parsing the garden path sentence, we draw a conclusion that the integration of theoretical and statistical methods is helpful for the development of ML system.

  17. A Grammar-Based Semantic Similarity Algorithm for Natural Language Sentences

    Science.gov (United States)

    Chang, Jia Wei; Hsieh, Tung Cheng

    2014-01-01

    This paper presents a grammar and semantic corpus based similarity algorithm for natural language sentences. Natural language, in opposition to “artificial language”, such as computer programming languages, is the language used by the general public for daily communication. Traditional information retrieval approaches, such as vector models, LSA, HAL, or even the ontology-based approaches that extend to include concept similarity comparison instead of cooccurrence terms/words, may not always determine the perfect matching while there is no obvious relation or concept overlap between two natural language sentences. This paper proposes a sentence similarity algorithm that takes advantage of corpus-based ontology and grammatical rules to overcome the addressed problems. Experiments on two famous benchmarks demonstrate that the proposed algorithm has a significant performance improvement in sentences/short-texts with arbitrary syntax and structure. PMID:24982952

  18. A Grammar-Based Semantic Similarity Algorithm for Natural Language Sentences

    Directory of Open Access Journals (Sweden)

    Ming Che Lee

    2014-01-01

    Full Text Available This paper presents a grammar and semantic corpus based similarity algorithm for natural language sentences. Natural language, in opposition to “artificial language”, such as computer programming languages, is the language used by the general public for daily communication. Traditional information retrieval approaches, such as vector models, LSA, HAL, or even the ontology-based approaches that extend to include concept similarity comparison instead of cooccurrence terms/words, may not always determine the perfect matching while there is no obvious relation or concept overlap between two natural language sentences. This paper proposes a sentence similarity algorithm that takes advantage of corpus-based ontology and grammatical rules to overcome the addressed problems. Experiments on two famous benchmarks demonstrate that the proposed algorithm has a significant performance improvement in sentences/short-texts with arbitrary syntax and structure.

  19. A memory-based shallow parser for spoken Dutch

    NARCIS (Netherlands)

    Canisius, S.V.M.; van den Bosch, A.; Decadt, B.; Hoste, V.; De Pauw, G.

    2004-01-01

    We describe the development of a Dutch memory-based shallow parser. The availability of large treebanks for Dutch, such as the one provided by the Spoken Dutch Corpus, allows memory-based learners to be trained on examples of shallow parsing taken from the treebank, and act as a shallow parser after

  20. Learning a POS tagger for AAVE-like language

    DEFF Research Database (Denmark)

    Jørgensen, Anna; Hovy, Dirk; Søgaard, Anders

    2016-01-01

    tagger for subtitles, lyrics, and tweets associated with African-American Vernacular English from a previously released and manually annotated Twitter corpus. Our approach is to learn from a mixture of this data and unlabeled data, which was automatically and partially labeled using mined tag...

  1. BC4GO: a full-text corpus for the BioCreative IV GO task.

    Science.gov (United States)

    Van Auken, Kimberly; Schaeffer, Mary L; McQuilton, Peter; Laulederkind, Stanley J F; Li, Donghui; Wang, Shur-Jen; Hayman, G Thomas; Tweedie, Susan; Arighi, Cecilia N; Done, James; Müller, Hans-Michael; Sternberg, Paul W; Mao, Yuqing; Wei, Chih-Hsuan; Lu, Zhiyong

    2014-01-01

    Gene function curation via Gene Ontology (GO) annotation is a common task among Model Organism Database groups. Owing to its manual nature, this task is considered one of the bottlenecks in literature curation. There have been many previous attempts at automatic identification of GO terms and supporting information from full text. However, few systems have delivered an accuracy that is comparable with humans. One recognized challenge in developing such systems is the lack of marked sentence-level evidence text that provides the basis for making GO annotations. We aim to create a corpus that includes the GO evidence text along with the three core elements of GO annotations: (i) a gene or gene product, (ii) a GO term and (iii) a GO evidence code. To ensure our results are consistent with real-life GO data, we recruited eight professional GO curators and asked them to follow their routine GO annotation protocols. Our annotators marked up more than 5000 text passages in 200 articles for 1356 distinct GO terms. For evidence sentence selection, the inter-annotator agreement (IAA) results are 9.3% (strict) and 42.7% (relaxed) in F1-measures. For GO term selection, the IAAs are 47% (strict) and 62.9% (hierarchical). Our corpus analysis further shows that abstracts contain ∼ 10% of relevant evidence sentences and 30% distinct GO terms, while the Results/Experiment section has nearly 60% relevant sentences and >70% GO terms. Further, of those evidence sentences found in abstracts, less than one-third contain enough experimental detail to fulfill the three core criteria of a GO annotation. This result demonstrates the need of using full-text articles for text mining GO annotations. Through its use at the BioCreative IV GO (BC4GO) task, we expect our corpus to become a valuable resource for the BioNLP research community. Database URL: http://www.biocreative.org/resources/corpora/bc-iv-go-task-corpus/. Published by Oxford University Press 2014. This work is written by US

  2. Saying What You're Looking For: Linguistics Meets Video Search.

    Science.gov (United States)

    Barrett, Daniel Paul; Barbu, Andrei; Siddharth, N; Siskind, Jeffrey Mark

    2016-10-01

    We present an approach to searching large video corpora for clips which depict a natural-language query in the form of a sentence. Compositional semantics is used to encode subtle meaning differences lost in other approaches, such as the difference between two sentences which have identical words but entirely different meaning: The person rode the horse versus The horse rode the person. Given a sentential query and a natural-language parser, we produce a score indicating how well a video clip depicts that sentence for each clip in a corpus and return a ranked list of clips. Two fundamental problems are addressed simultaneously: detecting and tracking objects, and recognizing whether those tracks depict the query. Because both tracking and object detection are unreliable, our approach uses the sentential query to focus the tracker on the relevant participants and ensures that the resulting tracks are described by the sentential query. While most earlier work was limited to single-word queries which correspond to either verbs or nouns, we search for complex queries which contain multiple phrases, such as prepositional phrases, and modifiers, such as adverbs. We demonstrate this approach by searching for 2,627 naturally elicited sentential queries in 10 Hollywood movies.

  3. Storing files in a parallel computing system based on user-specified parser function

    Science.gov (United States)

    Faibish, Sorin; Bent, John M; Tzelnic, Percy; Grider, Gary; Manzanares, Adam; Torres, Aaron

    2014-10-21

    Techniques are provided for storing files in a parallel computing system based on a user-specified parser function. A plurality of files generated by a distributed application in a parallel computing system are stored by obtaining a parser from the distributed application for processing the plurality of files prior to storage; and storing one or more of the plurality of files in one or more storage nodes of the parallel computing system based on the processing by the parser. The plurality of files comprise one or more of a plurality of complete files and a plurality of sub-files. The parser can optionally store only those files that satisfy one or more semantic requirements of the parser. The parser can also extract metadata from one or more of the files and the extracted metadata can be stored with one or more of the plurality of files and used for searching for files.

  4. SEMSIN SEMANTIC AND SYNTACTIC PARSER

    Directory of Open Access Journals (Sweden)

    K. K. Boyarsky

    2015-09-01

    Full Text Available The paper deals with the principle of operation for SemSin semantic and syntactic parser creating a dependency tree for the Russian language sentences. The parser consists of 4 blocks: a dictionary, morphological analyzer, production rules and lexical analyzer. An important logical part of the parser is pre-syntactical module, which harmonizes and complements morphological analysis results, separates the text paragraphs into individual sentences, and also carries out predisambiguation. Characteristic feature of the presented parser is an open type of control – it is done by means of a set of production rules. A varied set of commands provides the ability to both morphological and semantic-syntactic analysis of the sentence. The paper presents the sequence of rules usage and examples of their work. Specific feature of the rules is the decision making on establishment of syntactic links with simultaneous removal of the morphological and semantic ambiguity. The lexical analyzer provides the execution of commands and rules, and manages the parser in manual or automatic modes of the text analysis. In the first case, the analysis is performed interactively with the possibility of step-by-step execution of the rules and scanning the resulting parse tree. In the second case, analysis results are filed in an xml-file. Active usage of syntactic and semantic dictionary information gives the possibility to reduce significantly the ambiguity of parsing. In addition to marking the text, the parser is also usable as a tool for information extraction from natural language texts.

  5. Computational principles of syntax in the regions specialized for language: integrating theoretical linguistics and functional neuroimaging.

    Science.gov (United States)

    Ohta, Shinri; Fukui, Naoki; Sakai, Kuniyoshi L

    2013-01-01

    The nature of computational principles of syntax remains to be elucidated. One promising approach to this problem would be to construct formal and abstract linguistic models that parametrically predict the activation modulations in the regions specialized for linguistic processes. In this article, we review recent advances in theoretical linguistics and functional neuroimaging in the following respects. First, we introduce the two fundamental linguistic operations: Merge (which combines two words or phrases to form a larger structure) and Search (which searches and establishes a syntactic relation of two words or phrases). We also illustrate certain universal properties of human language, and present hypotheses regarding how sentence structures are processed in the brain. Hypothesis I is that the Degree of Merger (DoM), i.e., the maximum depth of merged subtrees within a given domain, is a key computational concept to properly measure the complexity of tree structures. Hypothesis II is that the basic frame of the syntactic structure of a given linguistic expression is determined essentially by functional elements, which trigger Merge and Search. We then present our recent functional magnetic resonance imaging experiment, demonstrating that the DoM is indeed a key syntactic factor that accounts for syntax-selective activations in the left inferior frontal gyrus and supramarginal gyrus. Hypothesis III is that the DoM domain changes dynamically in accordance with iterative Merge applications, the Search distances, and/or task requirements. We confirm that the DoM accounts for activations in various sentence types. Hypothesis III successfully explains activation differences between object- and subject-relative clauses, as well as activations during explicit syntactic judgment tasks. A future research on the computational principles of syntax will further deepen our understanding of uniquely human mental faculties.

  6. Polish Semantic Parser

    Directory of Open Access Journals (Sweden)

    Agnieszka Grudzinska

    2000-01-01

    Full Text Available Amount of information transferred by computers grows very rapidly thus outgrowing the average man's capability of reception. It implies computer programs increase in the demand for which would be able to perform an introductory classitication or even selection of information directed to a particular receiver. Due to the complexity of the problem, we restricted it to understanding short newspaper notes. Among many conceptions formulated so far, the conceptual dependency worked out by Roger Schank has been chosen. It is a formal language of description of the semantics of pronouncement integrated with a text understanding algorithm. Substantial part of each text transformation system is a semantic parser of the Polish language. It is a module, which as the first and the only one has an access to the text in the Polish language. lt plays the role of an element, which finds relations between words of the Polish language and the formal registration. It translates sentences written in the language used by people into the language theory. The presented structure of knowledge units and the shape of understanding process algorithms are universal by virtue of the theory. On the other hand the defined knowledge units and the rules used in the algorithms ure only examples because they are constructed in order to understand short newspaper notes.

  7. Lancaster Summer School in Corpus Linguistics

    Directory of Open Access Journals (Sweden)

    Jaka Čibej

    2016-11-01

    Full Text Available Med 12. in 15. julijem je na Univerzi v Lancastru potekala poletna šola korpusnega jezikoslovja Lancaster Summer Schools in Corpus Linguistics and Other Digital Methods. Poletno šolo so organizirali UCREL (University Centre for Computer Corpus Research on Language, ERC (Evropski svet za raziskave – European Research Council, CASS (ESRC Centre for Corpus Approaches to Social Science in ESRC (Economic and Social Research Council, razdeljena pa je bila na šest programov, prilagojenih različnim področjem: Korpusno jezikoslovje za proučevanje jezikov (Corpus Linguistics for Language Studies, Korpusno jezikoslovje za družbene vede (Corpus Linguistics for Social Science, Korpusno jezikoslovje za humanistiko (Corpus Linguistics for Humanities, Statistika za korpusno jezikoslovje (Statistics for Corpus Linguistics, Geografski informacijski sistemi za digitalno humanistiko (Geographical Information Systems for the Digital Humanities in Korpusno podprta obdelava naravnih jezikov (Corpus-based Natural Language Processing.

  8. Corpus linguistics, systemic functional grammar and literary meaning: a critical analysis of harry potter and the philosopher’s stone Corpus linguistics, systemic functional grammar and literary meaning: a critical analysis of harry potter and the philosopher’s stone

    Directory of Open Access Journals (Sweden)

    Andrew Goatly

    2008-04-01

    Full Text Available The research reported in this paper has two aims. First, to show how corpus linguistics, using word frequency and concordance data, which is then analysed according to transitivity systems of systemic functional grammar (SFG, can be useful to the enterprise of critical linguistics. Second, to investigate to what extent this critical corpus linguistics (CCL gives a valid representation of the meanings and ideologies of a literary text. The hypothesis tested is that semiotic models of communication, in this case of popular children’s literature, with their emphasis on the encoding and decoding of meanings, lend themselves to a corpus linguistics approach. But that, in fact, these mutually reinforcing approaches (SFG and CCL with their reliance on what is encoded as text cannot entirely succeed in accounting for how literature, in particular, is understood and interpreted, and how ideology works within it and behind it. For a richer critical discourse analysis we need a pragmatic account, for example an analysis of presupposition, inference and propositional attitude. The issues here will be discussed in the light of recent debate between Michael Stubbs and Henry Widdowson on the strengths and limitations of corpus linguistics in critical discourse analysis. The research reported in this paper has two aims. First, to show how corpus linguistics, using word frequency and concordance data, which is then analysed according to transitivity systems of systemic functional grammar (SFG, can be useful to the enterprise of critical linguistics. Second, to investigate to what extent this critical corpus linguistics (CCL gives a valid representation of the meanings and ideologies of a literary text. The hypothesis tested is that semiotic models of communication, in this case of popular children’s literature, with their emphasis on the encoding and decoding of meanings, lend themselves to a corpus linguistics approach. But that, in fact, these

  9. Who is who? Interpretation of multiple occurrences of the Chinese reflexive: evidence from real-time sentence processing.

    Directory of Open Access Journals (Sweden)

    Lan Shuai

    Full Text Available Theoretical linguists claim that the notorious reflexive ziji 'self' in Mandarin Chinese, if occurring more than once in a single sentence, can take distinct antecedents. This study tackles possibly the most interesting puzzle in the linguistic literature, investigating how two occurrences of ziji in a single sentence are interpreted and whether or not there are mixed readings, i.e., these zijis are interpretively bound by distinct antecedents. Using 15 Chinese sentences each having two zijis, we conducted two sentence reading experiments based on a modified self-paced reading paradigm. The general interpretation patterns observed showed that the majority of participants associated both zijis with the same local antecedent, which was consistent with Principle A of the Standard Binding Theory and previous experimental findings involving a single ziji. In addition, mixed readings also occurred, but did not pattern as claimed in the theoretical linguistic literature (i.e., one ziji is bound by a long-distance antecedent and the other by a local antecedent. Based on these results, we argue that: (i mixed readings were due to manifold, interlocking and conflicting perspectives taken by the participants; and (ii cases of multiple occurrences of ziji taking distinct antecedents are illicit in Chinese syntax, since the speaker, when expressing a sentence, can select only one P(erspective-Center that referentially denotes the psychological perspective in which the sentence is situated.

  10. Extracting Various Classes of Data From Biological Text Using the Concept of Existence Dependency.

    Science.gov (United States)

    Taha, Kamal

    2015-11-01

    One of the key goals of biological natural language processing (NLP) is the automatic information extraction from biomedical publications. Most current constituency and dependency parsers overlook the semantic relationships between the constituents comprising a sentence and may not be well suited for capturing complex long-distance dependences. We propose in this paper a hybrid constituency-dependency parser for biological NLP information extraction called EDCC. EDCC aims at enhancing the state of the art of biological text mining by applying novel linguistic computational techniques that overcome the limitations of current constituency and dependency parsers outlined earlier, as follows: 1) it determines the semantic relationship between each pair of constituents in a sentence using novel semantic rules; and 2) it applies a semantic relationship extraction model that extracts information from different structural forms of constituents in sentences. EDCC can be used to extract different types of data from biological texts for purposes such as protein function prediction, genetic network construction, and protein-protein interaction detection. We evaluated the quality of EDCC by comparing it experimentally with six systems. Results showed marked improvement.

  11. Interaction as 'involvement' in writing for students: a corpus linguistic ...

    African Journals Online (AJOL)

    Interaction as 'involvement' in writing for students: a corpus linguistic analysis of a key readability feature. E Hilton Hubbard. Abstract. The rapid change in the demographics of South Africa's tertiary level student population over the last decade — and most specifically the huge increase in those who have to study at a ...

  12. Corpus linguistics and statistics with R introduction to quantitative methods in linguistics

    CERN Document Server

    Desagulier, Guillaume

    2017-01-01

    This textbook examines empirical linguistics from a theoretical linguist’s perspective. It provides both a theoretical discussion of what quantitative corpus linguistics entails and detailed, hands-on, step-by-step instructions to implement the techniques in the field. The statistical methodology and R-based coding from this book teach readers the basic and then more advanced skills to work with large data sets in their linguistics research and studies. Massive data sets are now more than ever the basis for work that ranges from usage-based linguistics to the far reaches of applied linguistics. This book presents much of the methodology in a corpus-based approach. However, the corpus-based methods in this book are also essential components of recent developments in sociolinguistics, historical linguistics, computational linguistics, and psycholinguistics. Material from the book will also be appealing to researchers in digital humanities and the many non-linguistic fields that use textual data analysis and t...

  13. Predicting complex syntactic structure in real time: Processing of negative sentences in Russian.

    Science.gov (United States)

    Kazanina, Nina

    2017-11-01

    In Russian negative sentences the verb's direct object may appear either in the accusative case, which is licensed by the verb (as is common cross-linguistically), or in the genitive case, which is licensed by the negation (Russian-specific "genitive-of-negation" phenomenon). Such sentences were used to investigate whether case marking is employed for anticipating syntactic structure, and whether lexical heads other than the verb can be predicted on the basis of a case-marked noun phrase. Experiment 1, a completion task, confirmed that genitive-of-negation is part of Russian speakers' active grammatical repertoire. In Experiments 2 and 3, the genitive/accusative case manipulation on the preverbal object led to shorter reading times at the negation and verb in the genitive versus accusative condition. Furthermore, Experiment 3 manipulated linear order of the direct object and the negated verb in order to distinguish whether the abovementioned facilitatory effect was predictive or integrative in nature, and concluded that the parser actively predicts a verb and (otherwise optional) negation on the basis of a preceding genitive-marked object. Similarly to a head-final language, case-marking information on preverbal noun phrases (NPs) is used by the parser to enable incremental structure building in a free-word-order language such as Russian.

  14. The Dangling model in the construction of compound sentences with regard to verb tenses

    Directory of Open Access Journals (Sweden)

    Mahmoud Mehravaran

    2016-02-01

    Full Text Available Abstract A sentence is the most complete syntactic unit of a language. The construction of a sentence is the most comprehensive, controversial and fascinating syntactic issue in the language grammar. The message or intention is usually conveyed through a sentence. In fact, the communicative function of a language is carried out via a sentence. A sentence can be classified in to different categories from different perspectives: semantically, constructively or performatively either with a verb or without a verb and also with regard to the construction. With regard to the construction, a sentence is either simple or compound. A simple sentence is the one with a complete meaning which only has one verb. It must be born in mind that a sentence is a complete unit whit meaning and there can be a hesitation after that. There for units of speech that have a verb without a complete meaning and there can be no silence or hesitation after them cannot be regarded as a sentence. Since they are dependent upon another sentence to be completed. They are called phrases. Such phrases can be incorporated in compound sentences make main and subordinate clauses. Compound sentences are widely discussed whit in grammatical constructions, but their types and that how have been built their various constructions are less adequately discussed. With regard to the manner of construction of compound sentences, the widest linguistic amenities can be observed in the sentences. There is not such complexity or disagreement over simple sentences but compound sentences have been less adequately investigated and there is room for more discussion and debate. Because, in some grammars, without considering the construction criteria, all sentences that are connected to one another, whit connectives are called compound sentences. This paper has precisely investigated compound sentences and has elaborated on the construction criteria of compound sentences. The study has also pointed to

  15. Long-Range Correlations in Sentence Series from A Story of the Stone.

    Science.gov (United States)

    Yang, Tianguang; Gu, Changgui; Yang, Huijie

    2016-01-01

    A sentence is the natural unit of language. Patterns embedded in series of sentences can be used to model the formation and evolution of languages, and to solve practical problems such as evaluating linguistic ability. In this paper, we apply de-trended fluctuation analysis to detect long-range correlations embedded in sentence series from A Story of the Stone, one of the greatest masterpieces of Chinese literature. We identified a weak long-range correlation, with a Hurst exponent of 0.575±0.002 up to a scale of 104. We used the structural stability to confirm the behavior of the long-range correlation, and found that different parts of the series had almost identical Hurst exponents. We found that noisy records can lead to false results and conclusions, even if the noise covers a limited proportion of the total records (e.g., less than 1%). Thus, the structural stability test is an essential procedure for confirming the existence of long-range correlations, which has been widely neglected in previous studies. Furthermore, a combination of de-trended fluctuation analysis and diffusion entropy analysis demonstrated that the sentence series was generated by a fractional Brownian motion.

  16. Long-Range Correlations in Sentence Series from A Story of the Stone.

    Directory of Open Access Journals (Sweden)

    Tianguang Yang

    Full Text Available A sentence is the natural unit of language. Patterns embedded in series of sentences can be used to model the formation and evolution of languages, and to solve practical problems such as evaluating linguistic ability. In this paper, we apply de-trended fluctuation analysis to detect long-range correlations embedded in sentence series from A Story of the Stone, one of the greatest masterpieces of Chinese literature. We identified a weak long-range correlation, with a Hurst exponent of 0.575±0.002 up to a scale of 104. We used the structural stability to confirm the behavior of the long-range correlation, and found that different parts of the series had almost identical Hurst exponents. We found that noisy records can lead to false results and conclusions, even if the noise covers a limited proportion of the total records (e.g., less than 1%. Thus, the structural stability test is an essential procedure for confirming the existence of long-range correlations, which has been widely neglected in previous studies. Furthermore, a combination of de-trended fluctuation analysis and diffusion entropy analysis demonstrated that the sentence series was generated by a fractional Brownian motion.

  17. Designing a Constraint Based Parser for Sanskrit

    Science.gov (United States)

    Kulkarni, Amba; Pokar, Sheetal; Shukl, Devanand

    Verbal understanding (śā bdabodha) of any utterance requires the knowledge of how words in that utterance are related to each other. Such knowledge is usually available in the form of cognition of grammatical relations. Generative grammars describe how a language codes these relations. Thus the knowledge of what information various grammatical relations convey is available from the generation point of view and not the analysis point of view. In order to develop a parser based on any grammar one should then know precisely the semantic content of the grammatical relations expressed in a language string, the clues for extracting these relations and finally whether these relations are expressed explicitly or implicitly. Based on the design principles that emerge from this knowledge, we model the parser as finding a directed Tree, given a graph with nodes representing the words and edges representing the possible relations between them. Further, we also use the Mīmā ṃsā constraint of ākā ṅkṣā (expectancy) to rule out non-solutions and sannidhi (proximity) to prioritize the solutions. We have implemented a parser based on these principles and its performance was found to be satisfactory giving us a confidence to extend its functionality to handle the complex sentences.

  18. A neural jet charge tagger for the measurement of the B/s0 anti-B/s0 oscillation frequency at CDF

    International Nuclear Information System (INIS)

    Lecci, Claudia; Karlsruhe U., EKP

    2005-01-01

    A Jet Charge Tagger algorithm for b-flavour tagging for the measurement of Δm s at CDF has been presented. The tagger is based on a b-track probability variable and a b-jet probability variable, both obtained by combining the information available in b(bar b) events with a Neural Network. The tagging power measured on data is 0.917 ± 0.031% e+SVT sample; 0.938 ± 0.029% μ+SVT sample which is ∼30% larger than the cut based Jet Charge Tagger employed for the B s 0 mixing analysis presented by CDF at the Winter Conferences 2005. The improved power of the tagger is due to the selection of the b-jet with a Neural Network variable, which uses correlated jet variables in an optimal way. The development of the track and jet probability has profited from studies performed on simulated events, which allowed to understand better the features of b(bar b) events. For the first time in the CDF B group a Monte Carlo sample comprising flavour creation and additional b(bar b) production processes has been examined and compared to Run II data. It has been demonstrated that a Monte Carlo sample with only flavour creation b(bar b) production processes is not sufficient to describe b(bar b) data collected at CDF. The sample with additional processes introduced in this thesis is thus essential for tagging studies. Although the event description is satisfactory, the flavour information in the Monte Carlo sample differs with respect to data. This difference needs to be clarified by further studies. In addition, the track and the jet probabilities are the first official tools based on Neural Networks for B-Physics at CDF. They have proven that the simulation is understood to such an advanced level that Neural Networks can be employed. Further work is going on in this direction: a Soft Electron and a Soft Muon Tagger based on Neural Networks are under development as of now. Several possible tagger setups have been studied and the Jet Charge Tagger reached a high level of optimization

  19. Towards automated processing of clinical Finnish: sublanguage analysis and a rule-based parser.

    Science.gov (United States)

    Laippala, Veronika; Ginter, Filip; Pyysalo, Sampo; Salakoski, Tapio

    2009-12-01

    In this paper, we present steps taken towards more efficient automated processing of clinical Finnish, focusing on daily nursing notes in a Finnish Intensive Care Unit (ICU). First, we analyze ICU Finnish as a sublanguage, identifying its specific features facilitating, for example, the development of a specialized syntactic analyser. The identified features include frequent omission of finite verbs, limitations in allowed syntactic structures, and domain-specific vocabulary. Second, we develop a formal grammar and a parser for ICU Finnish, thus providing better tools for the development of further applications in the clinical domain. The grammar is implemented in the LKB system in a typed feature structure formalism. The lexicon is automatically generated based on the output of the FinTWOL morphological analyzer adapted to the clinical domain. As an additional experiment, we study the effect of using Finnish constraint grammar to reduce the size of the lexicon. The parser construction thus makes efficient use of existing resources for Finnish. The grammar currently covers 76.6% of ICU Finnish sentences, producing highly accurate best-parse analyzes with F-score of 91.1%. We find that building a parser for the highly specialized domain sublanguage is not only feasible, but also surprisingly efficient, given an existing morphological analyzer with broad vocabulary coverage. The resulting parser enables a deeper analysis of the text than was previously possible.

  20. Microsyntactic Annotation of Corpora and its Use in Computational Linguistics Tasks

    Directory of Open Access Journals (Sweden)

    Iomdin Leonid

    2017-12-01

    Full Text Available Microsyntax is a linguistic discipline dealing with idiomatic elements whose important properties are strongly related to syntax. In a way, these elements may be viewed as transitional entities between the lexicon and the grammar, which explains why they are often underrepresented in both of these resource types: the lexicographer fails to see such elements as full-fledged lexical units, while the grammarian finds them too specific to justify the creation of individual well-developed rules. As a result, such elements are poorly covered by linguistic models used in advanced modern computational linguistic tasks like high-quality machine translation or deep semantic analysis. A possible way to mend the situation and improve the coverage and adequate treatment of microsyntactic units in linguistic resources is to develop corpora with microsyntactic annotation, closely linked to specially designed lexicons. The paper shows how this task is solved in the deeply annotated corpus of Russian, SynTagRus.

  1. Assessment of disease named entity recognition on a corpus of annotated sentences.

    Science.gov (United States)

    Jimeno, Antonio; Jimenez-Ruiz, Ernesto; Lee, Vivian; Gaudan, Sylvain; Berlanga, Rafael; Rebholz-Schuhmann, Dietrich

    2008-04-11

    In recent years, the recognition of semantic types from the biomedical scientific literature has been focused on named entities like protein and gene names (PGNs) and gene ontology terms (GO terms). Other semantic types like diseases have not received the same level of attention. Different solutions have been proposed to identify disease named entities in the scientific literature. While matching the terminology with language patterns suffers from low recall (e.g., Whatizit) other solutions make use of morpho-syntactic features to better cover the full scope of terminological variability (e.g., MetaMap). Currently, MetaMap that is provided from the National Library of Medicine (NLM) is the state of the art solution for the annotation of concepts from UMLS (Unified Medical Language System) in the literature. Nonetheless, its performance has not yet been assessed on an annotated corpus. In addition, little effort has been invested so far to generate an annotated dataset that links disease entities in text to disease entries in a database, thesaurus or ontology and that could serve as a gold standard to benchmark text mining solutions. As part of our research work, we have taken a corpus that has been delivered in the past for the identification of associations of genes to diseases based on the UMLS Metathesaurus and we have reprocessed and re-annotated the corpus. We have gathered annotations for disease entities from two curators, analyzed their disagreement (0.51 in the kappa-statistic) and composed a single annotated corpus for public use. Thereafter, three solutions for disease named entity recognition including MetaMap have been applied to the corpus to automatically annotate it with UMLS Metathesaurus concepts. The resulting annotations have been benchmarked to compare their performance. The annotated corpus is publicly available at ftp://ftp.ebi.ac.uk/pub/software/textmining/corpora/diseases and can serve as a benchmark to other systems. In addition, we found

  2. Characterizing the Google Books Corpus: Strong Limits to Inferences of Socio-Cultural and Linguistic Evolution.

    Science.gov (United States)

    Pechenick, Eitan Adam; Danforth, Christopher M; Dodds, Peter Sheridan

    2015-01-01

    It is tempting to treat frequency trends from the Google Books data sets as indicators of the "true" popularity of various words and phrases. Doing so allows us to draw quantitatively strong conclusions about the evolution of cultural perception of a given topic, such as time or gender. However, the Google Books corpus suffers from a number of limitations which make it an obscure mask of cultural popularity. A primary issue is that the corpus is in effect a library, containing one of each book. A single, prolific author is thereby able to noticeably insert new phrases into the Google Books lexicon, whether the author is widely read or not. With this understood, the Google Books corpus remains an important data set to be considered more lexicon-like than text-like. Here, we show that a distinct problematic feature arises from the inclusion of scientific texts, which have become an increasingly substantive portion of the corpus throughout the 1900 s. The result is a surge of phrases typical to academic articles but less common in general, such as references to time in the form of citations. We use information theoretic methods to highlight these dynamics by examining and comparing major contributions via a divergence measure of English data sets between decades in the period 1800-2000. We find that only the English Fiction data set from the second version of the corpus is not heavily affected by professional texts. Overall, our findings call into question the vast majority of existing claims drawn from the Google Books corpus, and point to the need to fully characterize the dynamics of the corpus before using these data sets to draw broad conclusions about cultural and linguistic evolution.

  3. Language configurations of degree-related denotations in the spoken production of a group of Colombian EFL university students: A corpus-based study

    Directory of Open Access Journals (Sweden)

    Wilder Yesid Escobar

    2015-05-01

    Full Text Available Recognizing that developing the competences needed to appropriately use linguistic resources according to contextual characteristics (pragmatics is as important as the cultural-imbedded linguistic knowledge itself (semantics and that both are equally essential to form competent speakers of English in foreign language contexts, we feel this research relies on corpus linguistics to analyze both the scope and the limitations of the sociolinguistic knowledge and the communicative skills of English students at the university level. To such end, a linguistic corpus was assembled, compared to an existing corpus of native speakers, and analyzed in terms of the frequency, overuse, underuse, misuse, ambiguity, success, and failure of the linguistic parameters used in speech acts. The findings herein describe the linguistic configurations employed to modify levels and degrees of descriptions (salient sematic theme exhibited in the EFL learners´ corpus appealing to the sociolinguistic principles governing meaning making and language use which are constructed under the social conditions of the environments where the language is naturally spoken for sociocultural exchange.

  4. Representing sentence information

    Science.gov (United States)

    Perkins, Walton A., III

    1991-03-01

    This paper describes a computer-oriented representation for sentence information. Whereas many Artificial Intelligence (AI) natural language systems start with a syntactic parse of a sentence into the linguist's components: noun, verb, adjective, preposition, etc., we argue that it is better to parse the input sentence into 'meaning' components: attribute, attribute value, object class, object instance, and relation. AI systems need a representation that will allow rapid storage and retrieval of information and convenient reasoning with that information. The attribute-of-object representation has proven useful for handling information in relational databases (which are well known for their efficiency in storage and retrieval) and for reasoning in knowledge- based systems. On the other hand, the linguist's syntactic representation of the works in sentences has not been shown to be useful for information handling and reasoning. We think it is an unnecessary and misleading intermediate form. Our sentence representation is semantic based in terms of attribute, attribute value, object class, object instance, and relation. Every sentence is segmented into one or more components with the form: 'attribute' of 'object' 'relation' 'attribute value'. Using only one format for all information gives the system simplicity and good performance as a RISC architecture does for hardware. The attribute-of-object representation is not new; it is used extensively in relational databases and knowledge-based systems. However, we will show that it can be used as a meaning representation for natural language sentences with minor extensions. In this paper we describe how a computer system can parse English sentences into this representation and generate English sentences from this representation. Much of this has been tested with computer implementation.

  5. 'Tagger' - a Mac OS X Interactive Graphical Application for Data Inference and Analysis of N-Dimensional Datasets in the Natural Physical Sciences.

    Science.gov (United States)

    Morse, P. E.; Reading, A. M.; Lueg, C.

    2014-12-01

    Pattern-recognition in scientific data is not only a computational problem but a human-observer problem as well. Human observation of - and interaction with - data visualization software can augment, select, interrupt and modify computational routines and facilitate processes of pattern and significant feature recognition for subsequent human analysis, machine learning, expert and artificial intelligence systems.'Tagger' is a Mac OS X interactive data visualisation tool that facilitates Human-Computer interaction for the recognition of patterns and significant structures. It is a graphical application developed using the Quartz Composer framework. 'Tagger' follows a Model-View-Controller (MVC) software architecture: the application problem domain (the model) is to facilitate novel ways of abstractly representing data to a human interlocutor, presenting these via different viewer modalities (e.g. chart representations, particle systems, parametric geometry) to the user (View) and enabling interaction with the data (Controller) via a variety of Human Interface Devices (HID). The software enables the user to create an arbitrary array of tags that may be appended to the visualised data, which are then saved into output files as forms of semantic metadata. Three fundamental problems that are not strongly supported by conventional scientific visualisation software are addressed:1] How to visually animate data over time, 2] How to rapidly deploy unconventional parametrically driven data visualisations, 3] How to construct and explore novel interaction models that capture the activity of the end-user as semantic metadata that can be used to computationally enhance subsequent interrogation. Saved tagged data files may be loaded into Tagger, so that tags may be tagged, if desired. Recursion opens up the possibility of refining or overlapping different types of tags, tagging a variety of different POIs or types of events, and of capturing different types of specialist

  6. Tracking Anglicisms in Domains by the Corpus-Linguistic Method

    DEFF Research Database (Denmark)

    Mousten, Birthe; Laursen, Anne Lise

    2015-01-01

    Lay investors and semi-professionals lean on professional stock bloggers and stock analysts for advice on stock investments; semi-professionals and professionals write about investments globally, and stock information has to be available in many local markets. Using the correct terminology......’s critical sense is not enough to make the right choices. Our corpus-linguistic tool can be a help in this specialized field....

  7. The Dangling model in the construction of compound sentences with regard to verb tenses

    Directory of Open Access Journals (Sweden)

    Mahmoud Mehravaran

    2016-01-01

    Full Text Available Abstract A sentence is the most complete syntactic unit of a language. The construction of a sentence is the most comprehensive, controversial and fascinating syntactic issue in the language grammar. The message or intention is usually conveyed through a sentence. In fact, the communicative function of a language is carried out via a sentence. A sentence can be classified in to different categories from different perspectives: semantically, constructively or performatively either with a verb or without a verb and also with regard to the construction. With regard to the construction, a sentence is either simple or compound. A simple sentence is the one with a complete meaning which only has one verb. It must be born in mind that a sentence is a complete unit whit meaning and there can be a hesitation after that. There for units of speech that have a verb without a complete meaning and there can be no silence or hesitation after them cannot be regarded as a sentence. Since they are dependent upon another sentence to be completed. They are called phrases. Such phrases can be incorporated in compound sentences make main and subordinate clauses. Compound sentences are widely discussed whit in grammatical constructions, but their types and that how have been built their various constructions are less adequately discussed. With regard to the manner of construction of compound sentences, the widest linguistic amenities can be observed in the sentences. There is not such complexity or disagreement over simple sentences but compound sentences have been less adequately investigated and there is room for more discussion and debate. Because, in some grammars, without considering the construction criteria, all sentences that are connected to one another, whit connectives are called compound sentences. This paper has precisely investigated compound sentences and has elaborated on the construction criteria of compound sentences. The study has also pointed to

  8. Structure before meaning: sentence processing, plausibility, and subcategorization.

    Science.gov (United States)

    Kizach, Johannes; Nyvad, Anne Mette; Christensen, Ken Ramshøj

    2013-01-01

    Natural language processing is a fast and automatized process. A crucial part of this process is parsing, the online incremental construction of a syntactic structure. The aim of this study was to test whether a wh-filler extracted from an embedded clause is initially attached as the object of the matrix verb with subsequent reanalysis, and if so, whether the plausibility of such an attachment has an effect on reaction time. Finally, we wanted to examine whether subcategorization plays a role. We used a method called G-Maze to measure response time in a self-paced reading design. The experiments confirmed that there is early attachment of fillers to the matrix verb. When this attachment is implausible, the off-line acceptability of the whole sentence is significantly reduced. The on-line results showed that G-Maze was highly suited for this type of experiment. In accordance with our predictions, the results suggest that the parser ignores (or has no access to information about) implausibility and attaches fillers as soon as possible to the matrix verb. However, the results also show that the parser uses the subcategorization frame of the matrix verb. In short, the parser ignores semantic information and allows implausible attachments but adheres to information about which type of object a verb can take, ensuring that the parser does not make impossible attachments. We argue that the evidence supports a syntactic parser informed by syntactic cues, rather than one guided by semantic cues or one that is blind, or completely autonomous.

  9. Structure before meaning: sentence processing, plausibility, and subcategorization.

    Directory of Open Access Journals (Sweden)

    Johannes Kizach

    Full Text Available Natural language processing is a fast and automatized process. A crucial part of this process is parsing, the online incremental construction of a syntactic structure. The aim of this study was to test whether a wh-filler extracted from an embedded clause is initially attached as the object of the matrix verb with subsequent reanalysis, and if so, whether the plausibility of such an attachment has an effect on reaction time. Finally, we wanted to examine whether subcategorization plays a role. We used a method called G-Maze to measure response time in a self-paced reading design. The experiments confirmed that there is early attachment of fillers to the matrix verb. When this attachment is implausible, the off-line acceptability of the whole sentence is significantly reduced. The on-line results showed that G-Maze was highly suited for this type of experiment. In accordance with our predictions, the results suggest that the parser ignores (or has no access to information about implausibility and attaches fillers as soon as possible to the matrix verb. However, the results also show that the parser uses the subcategorization frame of the matrix verb. In short, the parser ignores semantic information and allows implausible attachments but adheres to information about which type of object a verb can take, ensuring that the parser does not make impossible attachments. We argue that the evidence supports a syntactic parser informed by syntactic cues, rather than one guided by semantic cues or one that is blind, or completely autonomous.

  10. CytometryML: a data standard which has been designed to interface with other standards

    Science.gov (United States)

    Leif, Robert C.

    2007-02-01

    Because of the differences in the requirements, needs, and past histories including existing standards of the creating organizations, a single encompassing cytology-pathology standard will not, in the near future, replace the multiple existing or under development standards. Except for DICOM and FCS, these standardization efforts are all based on XML. CytometryML is a collection of XML schemas, which are based on the Digital Imaging and Communications in Medicine (DICOM) and Flow Cytometry Standard (FCS) datatypes. The CytometryML schemas contain attributes that link them to the DICOM standard and FCS. Interoperability with DICOM has been facilitated by, wherever reasonable, limiting the difference between CytometryML and the previous standards to syntax. In order to permit the Resource Description Framework, RDF, to reference the CytometryML datatypes, id attributes have been added to many CytometryML elements. The Laboratory Digital Imaging Project (LDIP) Data Exchange Specification and the Flowcyt standards development effort employ RDF syntax. Documentation from DICOM has been reused in CytometryML. The unity of analytical cytology was demonstrated by deriving a microscope type and a flow cytometer type from a generic cytometry instrument type. The feasibility of incorporating the Flowcyt gating schemas into CytometryML has been demonstrated. CytometryML is being extended to include many of the new DICOM Working Group 26 datatypes, which describe patients, specimens, and analytes. In situations where multiple standards are being created, interoperability can be facilitated by employing datatypes based on a common set of semantics and building in links to standards that employ different syntax.

  11. Linguistics and the digital humanities

    DEFF Research Database (Denmark)

    Jensen, Kim Ebensgaard

    2014-01-01

    Corpus linguistics has been closely intertwined with digital technology since the introduction of university computer mainframes in the 1960s. Making use of both digitized data in the form of the language corpus and computational methods of analysis involving concordancers and statistics software......, corpus linguistics arguably has a place in the digital humanities. Still, it remains obscure and figures only sporadically in the literature on the digital humanities. This article provides an overview of the main principles of corpus linguistics and the role of computer technology in relation to data...... and method and also offers a bird's-eye view of the history of corpus linguistics with a focus on its intimate relationship with digital technology and how digital technology has impacted the very core of corpus linguistics and shaped the identity of the corpus linguist. Ultimately, the article is oriented...

  12. Sentence processing in anterior superior temporal cortex shows a social-emotional bias.

    Science.gov (United States)

    Mellem, Monika S; Jasmin, Kyle M; Peng, Cynthia; Martin, Alex

    2016-08-01

    The anterior region of the left superior temporal gyrus/superior temporal sulcus (aSTG/STS) has been implicated in two very different cognitive functions: sentence processing and social-emotional processing. However, the vast majority of the sentence stimuli in previous reports have been of a social or social-emotional nature suggesting that sentence processing may be confounded with semantic content. To evaluate this possibility we had subjects read word lists that differed in phrase/constituent size (single words, 3-word phrases, 6-word sentences) and semantic content (social-emotional, social, and inanimate objects) while scanned in a 7T environment. This allowed us to investigate if the aSTG/STS responded to increasing constituent structure (with increased activity as a function of constituent size) with or without regard to a specific domain of concepts, i.e., social and/or social-emotional content. Activity in the left aSTG/STS was found to increase with constituent size. This region was also modulated by content, however, such that social-emotional concepts were preferred over social and object stimuli. Reading also induced content type effects in domain-specific semantic regions. Those preferring social-emotional content included aSTG/STS, inferior frontal gyrus, posterior STS, lateral fusiform, ventromedial prefrontal cortex, and amygdala, regions included in the "social brain", while those preferring object content included parahippocampal gyrus, retrosplenial cortex, and caudate, regions involved in object processing. These results suggest that semantic content affects higher-level linguistic processing and should be taken into account in future studies. Copyright © 2016. Published by Elsevier Ltd.

  13. Boosted H­->bb Tagger In Run II

    CERN Document Server

    Sahinsoy, Merve; The ATLAS collaboration

    2016-01-01

    Several searches for Higgs bosons decaying to b­quark pairs benefit from the increased Run II centre­of­mass energy by exploiting the large transvers­momentum (boosted) Higgs boson regime, where the two b­jets are merged into one large­radius jet. ATLAS uses a boosted H­>bb tagger algorithm to separate the Higgs signal from the background processes (QCD, W and Z bosons, top quarks). The tagger takes as input a large­R=1.0 jet calibrating the pseudorapidity, energy and mass scale. The tagger employs b­tagging, Higgs candidate mass, and substructure information. The performance of several operating points in Higgs boson signal and QCD and ttbar all­hadronic backgrounds are presented. Systematic uncertainties are evaluated so that this tagger can be used in analyses.

  14. MORPOHOLOGICAL POS TAGGING IN ORAL LANGUAGE CORPUS: CHALLENGES FOR AELIUS

    Directory of Open Access Journals (Sweden)

    Gabriel de Ávila Othero

    2014-12-01

    Full Text Available In this paper, we present the results of our work with automatic morphological annotation of excerpts from a corpus of spoken language – belonging to the VARSUL project – using the free morphosyntatic tagger Aelius. We present 20 texts containing 154,530 words, annotated automatically and corrected manually. This paper presents the tagger Aelius and our work of manual review of the texts, as well as our suggestions for improvements of the tool, concerning aspects of oral texts. We verify the performance of morphosyntactic tagging a spoken language corpus, an unprecedented challenge for the tagger. Based on the errors of the tagger, we try to infer certain patterns of annotation to overcome limitations presented by the program, and we propose suggestions for implementations in order to allow Aelius to tag spoken language corpora in a more effective way, specially treating cases such as interjections, apheresis, onomatopeia and conversational markers.

  15. ULTRA: Universal Grammar as a Universal Parser.

    Science.gov (United States)

    Medeiros, David P

    2018-01-01

    A central concern of generative grammar is the relationship between hierarchy and word order, traditionally understood as two dimensions of a single syntactic representation. A related concern is directionality in the grammar. Traditional approaches posit process-neutral grammars, embodying knowledge of language, put to use with infinite facility both for production and comprehension. This has crystallized in the view of Merge as the central property of syntax, perhaps its only novel feature. A growing number of approaches explore grammars with different directionalities, often with more direct connections to performance mechanisms. This paper describes a novel model of universal grammar as a one-directional, universal parser. Mismatch between word order and interpretation order is pervasive in comprehension; in the present model, word order is language-particular and interpretation order (i.e., hierarchy) is universal. These orders are not two dimensions of a unified abstract object (e.g., precedence and dominance in a single tree); rather, both are temporal sequences, and UG is an invariant real-time procedure (based on Knuth's stack-sorting algorithm) transforming word order into hierarchical order. This shift in perspective has several desirable consequences. It collapses linearization, displacement, and composition into a single performance process. The architecture provides a novel source of brackets (labeled unambiguously and without search), which are understood not as part-whole constituency relations, but as storage and retrieval routines in parsing. It also explains why neutral word order within single syntactic cycles avoids 213-like permutations. The model identifies cycles as extended projections of lexical heads, grounding the notion of phase. This is achieved with a universal processor, dispensing with parameters. The empirical focus is word order in noun phrases. This domain provides some of the clearest evidence for 213-avoidance as a cross-linguistic

  16. Improved part-of-speech prediction in suffix analysis.

    Directory of Open Access Journals (Sweden)

    Mario Fruzangohar

    Full Text Available MOTIVATION: Predicting the part of speech (POS tag of an unknown word in a sentence is a significant challenge. This is particularly difficult in biomedicine, where POS tags serve as an input to training sophisticated literature summarization techniques, such as those based on Hidden Markov Models (HMM. Different approaches have been taken to deal with the POS tagger challenge, but with one exception--the TnT POS tagger--previous publications on POS tagging have omitted details of the suffix analysis used for handling unknown words. The suffix of an English word is a strong predictor of a POS tag for that word. As a pre-requisite for an accurate HMM POS tagger for biomedical publications, we present an efficient suffix prediction method for integration into a POS tagger. RESULTS: We have implemented a fully functional HMM POS tagger using experimentally optimised suffix based prediction. Our simple suffix analysis method, significantly outperformed the probability interpolation based TnT method. We have also shown how important suffix analysis can be for probability estimation of a known word (in the training corpus with an unseen POS tag; a common scenario with a small training corpus. We then integrated this simple method in our POS tagger and determined an optimised parameter set for both methods, which can help developers to optimise their current algorithm, based on our results. We also introduce the concept of counting methods in maximum likelihood estimation for the first time and show how counting methods can affect the prediction result. Finally, we describe how machine-learning techniques were applied to identify words, for which prediction of POS tags were always incorrect and propose a method to handle words of this type. AVAILABILITY AND IMPLEMENTATION: Java source code, binaries and setup instructions are freely available at http://genomes.sapac.edu.au/text_mining/pos_tagger.zip.

  17. On Minimizing Training Corpus for Parser Acquisition

    National Research Council Canada - National Science Library

    Hwa, Rebecca

    2001-01-01

    .... In this work, we consider selecting training examples with the it tree-entropy metric. Our goal is to assess how well this selection technique can be applied for training different types of parsers...

  18. A new Soft Muon Tagger for ATLAS Run 2

    CERN Document Server

    Sciandra, Andrea; The ATLAS collaboration

    2017-01-01

    b-tagging plays a fundamental role at LHC, as it helps in the identification of heavy particles that decay to bottom quarks, such as the top quark, Higgs boson or heavy exotic particles. The Soft Muon Tagger (SMT) allows jets from b-quarks to be identified, taking advantage of the presence of a muon coming from semileptonic decays of b-hadrons. The development of this new b-tagger in ATLAS will be described, showing that, despite the low efficiency of the jet-muon association (based on the angular distance), the discriminating power of the associated muon variables is sufficient to reject light jets. An enhanced performance has been reached for all light jet rejection working points by adding the SMT output to the best performing multivariate b-tagger in ATLAS (MV2). A good modeling of input and output variables is shown, comparing simulation with Run 2 data.

  19. A Protocol for Annotating Parser Differences. Research Report. ETS RR-16-02

    Science.gov (United States)

    Bruno, James V.; Cahill, Aoife; Gyawali, Binod

    2016-01-01

    We present an annotation scheme for classifying differences in the outputs of syntactic constituency parsers when a gold standard is unavailable or undesired, as in the case of texts written by nonnative speakers of English. We discuss its automated implementation and the results of a case study that uses the scheme to choose a parser best suited…

  20. A corpus for plant-chemical relationships in the biomedical domain.

    Science.gov (United States)

    Choi, Wonjun; Kim, Baeksoo; Cho, Hyejin; Lee, Doheon; Lee, Hyunju

    2016-09-20

    Plants are natural products that humans consume in various ways including food and medicine. They have a long empirical history of treating diseases with relatively few side effects. Based on these strengths, many studies have been performed to verify the effectiveness of plants in treating diseases. It is crucial to understand the chemicals contained in plants because these chemicals can regulate activities of proteins that are key factors in causing diseases. With the accumulation of a large volume of biomedical literature in various databases such as PubMed, it is possible to automatically extract relationships between plants and chemicals in a large-scale way if we apply a text mining approach. A cornerstone of achieving this task is a corpus of relationships between plants and chemicals. In this study, we first constructed a corpus for plant and chemical entities and for the relationships between them. The corpus contains 267 plant entities, 475 chemical entities, and 1,007 plant-chemical relationships (550 and 457 positive and negative relationships, respectively), which are drawn from 377 sentences in 245 PubMed abstracts. Inter-annotator agreement scores for the corpus among three annotators were measured. The simple percent agreement scores for entities and trigger words for the relationships were 99.6 and 94.8 %, respectively, and the overall kappa score for the classification of positive and negative relationships was 79.8 %. We also developed a rule-based model to automatically extract such plant-chemical relationships. When we evaluated the rule-based model using the corpus and randomly selected biomedical articles, overall F-scores of 68.0 and 61.8 % were achieved, respectively. We expect that the corpus for plant-chemical relationships will be a useful resource for enhancing plant research. The corpus is available at http://combio.gist.ac.kr/plantchemicalcorpus .

  1. The first leg transplant for the treatment of a cancer by Saints Cosmas and Damian.

    Science.gov (United States)

    Androutsos, G; Diamantis, A; Vladimiros, L

    2008-01-01

    The most renowned of all medically inclined saints must have been St. Cosmas and St. Damian. They became the special patrons of surgery, due to their miraculous operations. On one occasion, they amputated a cancerous leg and replaced it with another, taken from a Moor who had just died. This dramatic cure was meat and drink for artists, and has been portrayed many times.

  2. Mining of relations between proteins over biomedical scientific literature using a deep-linguistic approach.

    Science.gov (United States)

    Rinaldi, Fabio; Schneider, Gerold; Kaljurand, Kaarel; Hess, Michael; Andronis, Christos; Konstandi, Ourania; Persidis, Andreas

    2007-02-01

    The amount of new discoveries (as published in the scientific literature) in the biomedical area is growing at an exponential rate. This growth makes it very difficult to filter the most relevant results, and thus the extraction of the core information becomes very expensive. Therefore, there is a growing interest in text processing approaches that can deliver selected information from scientific publications, which can limit the amount of human intervention normally needed to gather those results. This paper presents and evaluates an approach aimed at automating the process of extracting functional relations (e.g. interactions between genes and proteins) from scientific literature in the biomedical domain. The approach, using a novel dependency-based parser, is based on a complete syntactic analysis of the corpus. We have implemented a state-of-the-art text mining system for biomedical literature, based on a deep-linguistic, full-parsing approach. The results are validated on two different corpora: the manually annotated genomics information access (GENIA) corpus and the automatically annotated arabidopsis thaliana circadian rhythms (ATCR) corpus. We show how a deep-linguistic approach (contrary to common belief) can be used in a real world text mining application, offering high-precision relation extraction, while at the same time retaining a sufficient recall.

  3. The sentence wrap-up dogma.

    Science.gov (United States)

    Stowe, Laurie A; Kaan, Edith; Sabourin, Laura; Taylor, Ryan C

    2018-03-30

    Current sentence processing research has focused on early effects of the on-line incremental processes that are performed at each word or constituent during processing. However, less attention has been devoted to what happens at the end of the clause or sentence. More specifically, over the last decade and a half, a lot of effort has been put into avoiding measuring event-related brain potentials (ERPs) at the final word of a sentence, because of the possible effects of sentence wrap-up. This article reviews the evidence on how and when sentence wrap-up impacts behavioral and ERP results. Even though the end of the sentence is associated with a positive-going ERP wave, thus far this effect has not been associated with any factors hypothesized to affect wrap-up. In addition, ERP responses to violations have not been affected by this positivity. "Sentence-final" negativities reported in the literature are not unique to sentence final positions, nor do they obscure or distort ERP effects associated with linguistic manipulations. Finally, the empirical evidence used to argue that sentence-final ERPs are different from those recorded at sentence-medial positions is weak at most. Measuring ERPs at sentence-final positions is therefore certainly not to be avoided at all costs, especially not in cases where the structure of the language under investigation requires it. More importantly, researchers should follow rigorous method in their experimental design, avoid decision tasks which may induce ERP confounds, and ensure all other possible explanations for results are considered. Although this article is directed at a particular dogma from a particular literature, this review shows that it is important to reassess what is regarded as "general knowledge" from time to time. Copyright © 2018 Elsevier B.V. All rights reserved.

  4. A atualidade do conceito de gerações de Karl Mannheim

    Directory of Open Access Journals (Sweden)

    Wivian Weller

    2010-08-01

    Full Text Available O conceito de gerações vem sendo retomado nas análises sociológicas que apontam não somente para as diferenças de classe, mas também para as desigualdades de gênero, étnico-raciais, culturais e geracionais. No entanto, "gerações" aparece como uma espécie de conceito guarda-chuva ou como categoria pouco teorizada. Embora o conceito de gerações de Mannheim represente, para muitos autores, a mais completa tentativa de explicação do tema, o mesmo tem sido, muitas vezes, citado por se tratar de um "clássico": as potencialidades de análise do conceito proposto pelo autor ainda são muitas. O presente artigo apresenta, por um lado, uma reconstrução deste "famoso" ensaio que até hoje não chegou a ser integralmente traduzido e publicado no Brasil e, por outro, discute a atualidade e pertinência do artigo escrito por Mannheim em 1928 para os estudos contemporâneos sobre gerações e suas interfaces com outros campos.The concept of generations has been incorporated into sociological analysis, which points class differences but also gender, ethnic, racial, cultural and generational inequalities. However "generations" seems to be a kind of umbrella concept or category that have been little theorized. Though the concept of generations of Mannheim represent, for many authors, the most comprehensive attempt to explain the issue, it has been frequently quoted since it is a "classic". The possibilities of analysis of the concept proposed by the author still are many. This article presents, first, a reconstruction of this 'famous' essay which has not yet come to be fully translated and published in Brazil, and secondly, discusses the relevance and appropriateness of the article written by Mannheim in 1928 for the contemporary studies of generation and their interfaces with other fields.

  5. The Syntactic Analysis of Pronoun Homofunction Considering Verb Structure and the Function of Connected Pronouns in Passive-Emotional Sentences

    Directory of Open Access Journals (Sweden)

    Mohammad Irani

    2017-04-01

       All these sentences express a kind of passive and emotional reaction; hence, it must certainly be considered to get the meaning of such sentences. Khābash gereft, for example, means he fell asleep.In the other hand, paying attention to the meaning will notably be effective in determining the verb type in structure and the grammatical role of the words in such sentences. According to the authors of this research, unlike some ideas, the verb structure is not compounded in these sentences, but is a nominal/adjectival component before the homo-function, recognized to be a part of the compound verb by some grammarians and linguists, has a subject role; and the homo-function is also a simple verb completely agreeing in suffix with the subject. The role of the pronoun suffixes, also, must be determined paying special attention to their role in traditional grammar because we think sometimes proposing ideas on some grammatical points might not bring about acceptable results regardless of their background and historic relations. Since pronoun suffixes occur only as objects, complements and genitives, the joint pronouns in these sentences are not an exception and have the same roles. The accompanying noun or adjective is the subject. The disjoint initial noun or pronoun, the “pronoun homo-role”, can be replaced by the pronoun suffix in some sentences, but makes the sentence ungrammatical in most cases.

  6. Techniques for Automated Testing of Lola Industrial Robot Language Parser

    Directory of Open Access Journals (Sweden)

    M. M. Lutovac

    2014-06-01

    Full Text Available The accuracy of parsing execution directly affects the accuracy of semantic analysis, optimization and object code generation. Therefore, parser testing represents the basis of compiler testing. It should include tests for correct and expected, but also for unexpected and invalid cases. Techniques for testing the parser, as well as algorithms and tools for test sentences generation, are discussed in this paper. The methodology for initial testing of a newly developed compiler is proposed. Generation of negative test sentences by modifying the original language grammar is described. Positive and negative test cases generated by Grow, Purdom’s algorithm with and without length control, CDRC-P algorithm and CDRC-P algorithm with length control are applied to the testing of L-IRL robot programming language. For this purpose two different tools for generation of test sentences are used. Based on the presented analysis of possible solutions, the appropriate method can be chosen for testing the parser for smaller grammars with many recursive rules.

  7. Aristotle on Sentence Types and Forms of Speech

    Directory of Open Access Journals (Sweden)

    Gábor Bolonyai

    2005-12-01

    Full Text Available According to the Hermeneutics, Ch. 4, the analysis of non-assertoric sentences such as wishes, commands, etc. belongs to rhetoric or poetics. They are, however, examined neither in the Rhetoric nor in the Poetics, where (Ch. 20 their treatment. is explicitly excluded from the art of poetry and referred to that of delivery or performance. The paper gives an explanation for this discrepancy, based on an interpretation of Aristotle's rejection of Protagoras' criticism of Homer. The sophist found fault with the first line of the Iliad where Homer invokes the Muse by the imperative Menin aeide, thea thus uttering a command while believing that he is expressing a prayer. Aristotle's grounds for rejecting this criticism remain implicit, but it appears very likely that he thought that, if uttered or performed in the right manner, the sentence could he taken as a prayer. From this observation, which is certainly valid in this particular case, he drew the conclusion that performative or vocal features in themselves, i.e. rhythm, intonation and volume of sound, are always sufficient to identify particular „figures of speech“, as he calls non-assertoric sentence types in the Poetics. This conclusion is, however, not entirely justified. Performative features are not always enough to differentiate between two `figures of speech'; the possible range of verbal moods and sentence types is likewise determined by morphological marks (e.g. mood signs, syntactical features (word-order, and lexical items (certain adverbs or particles. Aristotle’s decision to dismiss figures of speech altogether from the field of lexis may also have contributed to the later development of keeping linguistics and theory of style apart as two separate branches of inquiry.

  8. Subluxation and semantics: a corpus linguistics study.

    Science.gov (United States)

    Budgell, Brian

    2016-06-01

    The purpose of this study was to analyze the curriculum of one chiropractic college in order to discover if there were any implicit consensus definitions of the term subluxation. Using the software WordSmith Tools, the corpus of an undergraduate chiropractic curriculum was analyzed by reviewing collocated terms and through discourse analysis of text blocks containing words based on the root 'sublux.' It was possible to identify 3 distinct concepts which were each referred to as 'subluxation:' i) an acute or instantaneous injurious event; ii) a clinical syndrome which manifested post-injury; iii) a physical lesion, i.e. an anatomical or physiological derangement which in most instances acted as a pain generator. In fact, coherent implicit definitions of subluxation exist and may enjoy broad but subconscious acceptance. However, confusion likely arises from failure to distinguish which concept an author or speaker is referring to when they employ the term subluxation.

  9. Unsupervised grammar induction of clinical report sublanguage

    Directory of Open Access Journals (Sweden)

    Kate Rohit J

    2012-10-01

    Full Text Available Abstract Background Clinical reports are written using a subset of natural language while employing many domain-specific terms; such a language is also known as a sublanguage for a scientific or a technical domain. Different genres of clinical reports use different sublaguages, and in addition, different medical facilities use different medical language conventions. This makes supervised training of a parser for clinical sentences very difficult as it would require expensive annotation effort to adapt to every type of clinical text. Methods In this paper, we present an unsupervised method which automatically induces a grammar and a parser for the sublanguage of a given genre of clinical reports from a corpus with no annotations. In order to capture sentence structures specific to clinical domains, the grammar is induced in terms of semantic classes of clinical terms in addition to part-of-speech tags. Our method induces grammar by minimizing the combined encoding cost of the grammar and the corresponding sentence derivations. The probabilities for the productions of the induced grammar are then learned from the unannotated corpus using an instance of the expectation-maximization algorithm. Results Our experiments show that the induced grammar is able to parse novel sentences. Using a dataset of discharge summary sentences with no annotations, our method obtains 60.5% F-measure for parse-bracketing on sentences of maximum length 10. By varying a parameter, the method can induce a range of grammars, from very specific to very general, and obtains the best performance in between the two extremes.

  10. Unsupervised grammar induction of clinical report sublanguage.

    Science.gov (United States)

    Kate, Rohit J

    2012-10-05

    Clinical reports are written using a subset of natural language while employing many domain-specific terms; such a language is also known as a sublanguage for a scientific or a technical domain. Different genres of clinical reports use different sublaguages, and in addition, different medical facilities use different medical language conventions. This makes supervised training of a parser for clinical sentences very difficult as it would require expensive annotation effort to adapt to every type of clinical text. In this paper, we present an unsupervised method which automatically induces a grammar and a parser for the sublanguage of a given genre of clinical reports from a corpus with no annotations. In order to capture sentence structures specific to clinical domains, the grammar is induced in terms of semantic classes of clinical terms in addition to part-of-speech tags. Our method induces grammar by minimizing the combined encoding cost of the grammar and the corresponding sentence derivations. The probabilities for the productions of the induced grammar are then learned from the unannotated corpus using an instance of the expectation-maximization algorithm. Our experiments show that the induced grammar is able to parse novel sentences. Using a dataset of discharge summary sentences with no annotations, our method obtains 60.5% F-measure for parse-bracketing on sentences of maximum length 10. By varying a parameter, the method can induce a range of grammars, from very specific to very general, and obtains the best performance in between the two extremes.

  11. The power and limits of a rule-based morpho-semantic parser.

    Science.gov (United States)

    Baud, R H; Rassinoux, A M; Ruch, P; Lovis, C; Scherrer, J R

    1999-01-01

    The venue of Electronic Patient Record (EPR) implies an increasing amount of medical texts readily available for processing, as soon as convenient tools are made available. The chief application is text analysis, from which one can drive other disciplines like indexing for retrieval, knowledge representation, translation and inferencing for medical intelligent systems. Prerequisites for a convenient analyzer of medical texts are: building the lexicon, developing semantic representation of the domain, having a large corpus of texts available for statistical analysis, and finally mastering robust and powerful parsing techniques in order to satisfy the constraints of the medical domain. This article aims at presenting an easy-to-use parser ready to be adapted in different settings. It describes its power together with its practical limitations as experienced by the authors.

  12. Mammographic breast cancer screening for women previously treated with high breast doses for diseases such as Hodgkin's

    International Nuclear Information System (INIS)

    Faulkner, K.; Law, J.

    2005-01-01

    In screening of a general population for breast cancer, benefit/risk ratios are of the order of 100/1. For the very small subgroup of women treated by radiotherapy for Hodgkin's disease below age 35, calculations of this type require different considerations, an overview of which is given in this text. It is concluded that although such previous exposures will increase their radiation risk, their increased risk of carrying an undetected breast cancer means that the potential benefit for them of screening is increased even more. In the United Kingdom, the Dept. of Health has recommended annual screening for these women. (authors)

  13. What is modern standard Arabic NLP? Definition and tools (or how to understand Arabic even if you do not know a word)

    OpenAIRE

    Serrano Morales, Mónica

    2017-01-01

    In the last few years, there has been an increase of the interest on Modern Standard Arabic. There is where computational linguistics fits in. This paper analyses the intersection between Arabic and computational linguistics, focusing on text processing and the tools developed for this purpose. The fundamental functions of Arabic computational processing are: sentence segmentation, tokenization, morphosyntactic tagging, lemmatization, diacritization and base phrase chunking. After analysing e...

  14. Metaphor and Corpus Linguistics Metáfora e linguística de corpus

    Directory of Open Access Journals (Sweden)

    Tony Berber Sardinha

    2011-01-01

    Full Text Available In this paper, I look at four different aspects of metaphor research from a corpus linguistic perspective, namely: (1 the lexicogrammar of metaphors, which refers to the patterning of linguistic metaphor revealed by corpus analysis; (2 metaphor probabilities, which is a facet of metaphor that emerges from frequency-based studies of metaphor; (3 dimensions of metaphor variation, or the search for systematic parameters of variation in metaphor use across different registers; and (4 automated metaphor retrieval, which relates to the development of software to help identify metaphors in corpora. I argue that these four aspects are interrelated, and that advances in one of them can drive changes in the others.Neste artigo discuto quarto aspectos da pesquisa sobre metáfora do ponto de vista da linguística de corpus: (1 a lexicogramática das metáforas, que se refere aos padrões da metáfora linguística revelados pela análise de corpus; (2 probabilidades metafóricas, que é uma faceta da metáfora que emerge a partir dos estudos relacionados à freqüência de metáforas; (3 dimensões da variação de metáforas, ou a busca por parâmetros sistemáticos de variação de uso de metáfora em diferentes gêneros; e (4 captura automática de metáfora, que está relacionada ao desenvolvimento de softwares que auxiliam na identificação de metáforas em corpora. I defendo que esses quatro aspectos são interrelacionados, e que progressos em um deles podem acarretar mudanças nos outros.

  15. How the conceptions of Chinese rhetorical expressions are derived from the corresponding generic sentences

    Science.gov (United States)

    Zhao, Wenhui

    2018-04-01

    Generic sentences are simple and intuitive recognition and objective description to the external world in terms of "class". In the long evolutionary process of human being's language, the concepts represented by generic sentences has been internalized to be the defaulted knowledge in people's minds. In Chinese, some rhetorical expressions supported by corresponding generic sentences can be accepted by people. The derivation of these rhetorical expressions from the corresponding generic sentences is an important way for language to evolution, which reflects human's creative cognitive competence. From the perspective of conceptual blend theory and the theory of categorization of the cognitive linguistics, the goal of this paper is to analysis the process of the derivation of the rhetorical expressions from the corresponding generic sentences, which can facilitate the Chinese metaphorical information processing and the corpus construction of Chinese emotion metaphors.

  16. Linguistic complex networks as a young field of quantitative linguistics. Comment on "Approaching human language with complex networks" by J. Cong and H. Liu

    Science.gov (United States)

    Köhler, Reinhard

    2014-12-01

    We have long been used to the domination of qualitative methods in modern linguistics. Indeed, qualitative methods have advantages such as ease of use and wide applicability to many types of linguistic phenomena. However, this shall not overshadow the fact that a great part of human language is amenable to quantification. Moreover, qualitative methods may lead to over-simplification by employing the rigid yes/no scale. When variability and vagueness of human language must be taken into account, qualitative methods will prove inadequate and give way to quantitative methods [1, p. 11]. In addition to such advantages as exactness and precision, quantitative concepts and methods make it possible to find laws of human language which are just like those in natural sciences. These laws are fundamental elements of linguistic theories in the spirit of the philosophy of science [2,3]. Theorization effort of this type is what quantitative linguistics [1,4,5] is devoted to. The review of Cong and Liu [6] has provided an informative and insightful survey of linguistic complex networks as a young field of quantitative linguistics, including the basic concepts and measures, the major lines of research with linguistic motivation, and suggestions for future research.

  17. The modality-switch effect: Visually and aurally presented prime sentences activate our senses

    Directory of Open Access Journals (Sweden)

    Elisa eScerrati

    2015-10-01

    Full Text Available Verifying different sensory modality properties for concepts results in a processing cost known as the Modality-Switch Effect. It has been argued that this cognitive cost is the result of a perceptual simulation. This paper extends this argument and reports an experiment investigating whether the effect is the result of an activation of sensory information which can also be triggered by perceptual linguistically described stimuli. Participants were first exposed to a prime sentence describing a light or a sound’s perceptual property (e.g. The light is flickering, The sound is echoing, then required to perform a property-verification task on a target sentence (e.g. Butter is yellowish, Leaves rustle. The content modalities of the prime and target sentences could be compatible (i.e. in the same modality: e.g. visual-visual or not (i.e. in different modalities. Crucially, we manipulated the stimuli’s presentation modality such that half of the participants was faced with written sentences while the other half was faced with aurally presented sentences. Results show a cost when two different modalities alternate, compared to when the same modality is repeated with both visual and aural stimuli presentations. This result supports the embodied and grounded cognition view which claims that conceptual knowledge is grounded into the perceptual system. Specifically, this evidence suggests that sensory modalities can be pre-activated through the simulation of either read or listened linguistic stimuli describing visual or acoustic perceptual properties.

  18. Field – Football Expressions Dictionary: a lexicographic resource based on the theoretical-methodological approach of frame semantics and corpus linguistics

    Directory of Open Access Journals (Sweden)

    Rove Luiza de Oliveira Chishman

    2015-01-01

    Full Text Available The present article aims at problematizing the relevance of Frame Semantics (Fillmore, 1982 in the development of Field – Dictionary of Football Expressions – which the configuration allows the access to football language through expressions or through scenarios – or semantic frames. Frame Semantics, a theory developed in the realm of Cognitive Linguistics, is based on empirical data collected from the analysis of electronic corpora. The extraction of the data presented in this study was done with the Sketch Engine concordance, while their analysis was relegated to Frame Semantics. Among the results, it is possible to point out at the manner in which Fillmore´s theory contributes to the analysis of polysemy, presenting the different senses of a lexical unit considering different situations – or different frames – in which they appear. This article also emphasizes the pertinence of corpus linguistics and the processing of corpora as resources that allow the analysis of linguistic constructs present in the texts. It is also important to emphasize the applicability of Frame Semantics to a resource devoted to a non-specialized public, once the theory makes the contextualization of language possible through the everyday routine of the speakers.

  19. LINGUISTIC FEATURES OF THE RUSSIAN TEXTS OF PERSONS WHO COMMITTED SUICIDE AND PERSONS WITH A HIGH RISK OF AUTOAGGRESSIVE BEHAVIOR

    Directory of Open Access Journals (Sweden)

    Zagorovskaya, O.V.

    2017-12-01

    Full Text Available One of the most promising areas of modern research is speech analysis for the purpose of identifying the mental state and assessing the mental health of the speaker / writer. In recent years, there has been an increased interest in solving problems of this kind with the use of methods and tools for computer linguistics and data mining. A separate scientific problem far from its solution and, undoubtedly, requiring consolidation of the efforts of psychologists, linguists and experts in the intellectual analysis of data, is the problem of diagnosing a propensity for autoaggressive behavior (and suicide as an extreme form of it based on linguistic analysis of writing. This problem has not only theoretical, but also obvious practical significance. Using the methods of natural language processing, scientists analyze the texts (mostly English of suiciders and build models that classify the text as belonging or not belonging to the suicider, and reveal the characteristics of such texts. At the same time, if earlier mainly the fiction texts of suiciders were analyzed, then in the newest works scientists study Internet texts (blogs, tweets, Facebook posts etc. of persons who committed suicide or express their intention to commit it. The Russian language has long remained on the periphery of such studies. The article presents the results of studies aimed at identifying the linguistic features of Russian-language texts of persons who committed suicide, as well as persons prone to autoaggressive behavior. The studies used methods and techniques of corpus linguistics, computer linguistics, statistical analysis. Prospects for further research are indicated.

  20. Not an ogre’:adult music learners and their teachers, a corpus-based discourse analysis

    OpenAIRE

    Shirley, Rachel

    2015-01-01

    Adult learners are an under-researched group in music education. Although music education research often uses texts (interviews, autobiographical accounts, survey responses), linguistic analysis has not yet been used in this area. Meanwhile, the internet has become a source of support and expression for adult music learners, through blogs and forums. This presentation describes part of the research undertaken for my MA in English Language, which uses a corpus of online texts to investigate di...

  1. Easy-to-read texts for students with intellectual disability: linguistic factors affecting comprehension.

    Science.gov (United States)

    Fajardo, Inmaculada; Ávila, Vicenta; Ferrer, Antonio; Tavares, Gema; Gómez, Marcos; Hernández, Ana

    2014-05-01

    The use of 'easy-to-read' materials for people with intellectual disabilities has become very widespread but their effectiveness has scarcely been evaluated. In this study, the framework provided by Kintsch's Construction-Integration Model (1988) is used to examine (i) the reading comprehension levels of different passages of the Spanish text that have been designed following easy-to-read guidelines and (ii) the relationships between reading comprehension (literal and inferential) and various linguistic features of these texts. Sixteen students with mild intellectual disability and low levels of reading skills were asked to read easy-to-read texts and then complete a reading comprehension test. The corpus of texts was composed of a set of forty-eight pieces of news selected from www.noticiasfacil.es, a Spanish digital newspaper that publishes daily journalistic texts following international guidelines for the design of easy-to-read documents (IFLA, Tronbacke B. (1997) Guidelines for Easy-to-read Materials. IFLA, The Hague). Participants correctly answered 80% of the comprehension questions, showing significantly higher scores for literal questions than for inferential questions. The analyses of the texts' linguistic features revealed that the number of coreferences was the variable that best predicted literal comprehension, but contrary to what the previous literature seemed to indicate, the relationship between the two variables was inverse. In the case of inferential comprehension, the number of sentences was a significant negative predictor; that is, the higher the sentence density, the lower the ability of these students to find relationships between them. The effects of the rest of linguistic variables, such as word frequency and word length, on comprehension were null. These results provide preliminary empirical support for the use of easy-to-read texts but bring into question the validity of some popular design guidelines (e.g. augmenting word frequency) to

  2. Development and Use of a Corpus Tailored for Legal English Learning

    Science.gov (United States)

    Skier, Jason; Vibulphol, Jutarat

    2016-01-01

    While corpus linguistics has been applied towards many specific academic purposes, reports are few regarding its use to facilitate learning of legal English by non-native English speakers. Specialized corpora are required because legal English often differs significantly from ordinary usage, with words such as bar, motion, and hearing having…

  3. Telugu dependency parsing using different statistical parsers

    Directory of Open Access Journals (Sweden)

    B. Venkata Seshu Kumari

    2017-01-01

    Full Text Available In this paper we explore different statistical dependency parsers for parsing Telugu. We consider five popular dependency parsers namely, MaltParser, MSTParser, TurboParser, ZPar and Easy-First Parser. We experiment with different parser and feature settings and show the impact of different settings. We also provide a detailed analysis of the performance of all the parsers on major dependency labels. We report our results on test data of Telugu dependency treebank provided in the ICON 2010 tools contest on Indian languages dependency parsing. We obtain state-of-the art performance of 91.8% in unlabeled attachment score and 70.0% in labeled attachment score. To the best of our knowledge ours is the only work which explored all the five popular dependency parsers and compared the performance under different feature settings for Telugu.

  4. Towards a Computable Data Corpus of Temporal Correlations between Drug Administration and Lab Value Changes.

    Directory of Open Access Journals (Sweden)

    Axel Newe

    Full Text Available The analysis of electronic health records for an automated detection of adverse drug reactions is an approach to solve the problems that arise from traditional methods like spontaneous reporting or manual chart review. Algorithms addressing this task should be modeled on the criteria for a standardized case causality assessment defined by the World Health Organization. One of these criteria is the temporal relationship between drug intake and the occurrence of a reaction or a laboratory test abnormality. Appropriate data that would allow for developing or validating related algorithms is not publicly available, though.In order to provide such data, retrospective routine data of drug administrations and temporally corresponding laboratory observations from a university clinic were extracted, transformed and evaluated by experts in terms of a reasonable time relationship between drug administration and lab value alteration.The result is a data corpus of 400 episodes of normalized laboratory parameter values in temporal context with drug administrations. Each episode has been manually classified whether it contains data that might indicate a temporal correlation between the drug administration and the change of the lab value course, whether such a change is not observable or whether a decision between those two options is not possible due to the data. In addition, each episode has been assigned a concordance value which indicates how difficult it is to assess. This is the first open data corpus of a computable ground truth of temporal correlations between drug administration and lab value alterations.The main purpose of this data corpus is the provision of data for further research and the provision of a ground truth which allows for comparing the outcome of other assessments of this data with the outcome of assessments made by human experts. It can serve as a contribution towards systematic, computerized ADR detection in retrospective data. With

  5. What Does Corpus Linguistics Have to Offer to Language Assessment?

    Science.gov (United States)

    Xi, Xiaoming

    2017-01-01

    In recent years, continuing advances in technology have increased the capacity to automate the extraction of a range of linguistic features of texts and thus have provided the impetus for the substantial growth of corpus linguistics. While corpus linguistic tools and methods have been used extensively in second language learning research, they…

  6. Development of a new Soft Muon Tagger for the identification of $b$-jets in ATLAS

    CERN Document Server

    Sciandra, Andrea; The ATLAS collaboration

    2017-01-01

    $b$-tagging plays a fundamental role at LHC, as it helps in the identification of heavy particles that decay to bottom quarks, as the top quark and the Higgs boson or heavy exotic particles. The Soft Muon Tagger (SMT) allows to identify jets from $b$-quarks taking advantage of the presence of a muon coming from semileptonic decays of $b$-hadrons. The development of this new $b$-tagger is described, showing that, despite the low efficiency of the jet-muon association based on the angular distance, the discriminating power of the associated muon variables is remarkable to reject light jets. An enhanced performance has been reached for all light jet rejection working points by adding the SMT output to the best performing multivariate $b$-tagger in ATLAS (MV2). A good modeling of input and output variables is shown, comparing simulation with Run 2 data.

  7. Data modelling in corpus linguistics: how low may we go?

    Science.gov (United States)

    van Velzen, Marjolein H; Nanetti, Luca; de Deyn, Peter P

    2014-06-01

    Corpus linguistics allows researchers to process millions of words. However, the more words we analyse, i.e., the more data we acquire, the more urgent the call for correct data interpretation becomes. In recent years, a number of studies saw the light attempting to profile some prolific authors' linguistic decline, linking this decline to pathological conditions such as Alzheimer's Disease (AD). However, in line with the nature of the (literary) work that was analysed, numbers alone do not suffice to 'tell the story'. The one and only objective of using statistical methods for the analysis of research data is to tell a story--what happened, when, and how. In the present study we describe a computerised but individualised approach to linguistic analysis--we propose a unifying approach, with firm grounds in Information Theory, that, independently from the specific parameter being investigated, guarantees to produce a robust model of the temporal dynamics of an author's linguistic richness over his or her lifetime. We applied this methodology to six renowned authors with an active writing life of four decades or more: Iris Murdoch, Gerard Reve, Hugo Claus, Agatha Christie, P.D. James, and Harry Mulisch. The first three were diagnosed with probable Alzheimer Disease, confirmed post-mortem for Iris Murdoch; this same condition was hypothesized for Agatha Christie. Our analysis reveals different evolutive patterns of lexical richness, in turn plausibly correlated with the authors' different conditions. Copyright © 2013 Elsevier Ltd. All rights reserved.

  8. English Writing Teaching Model Dependent on Computer Network Corpus Drive Model

    Directory of Open Access Journals (Sweden)

    Shi Lei

    2018-03-01

    Full Text Available At present, the mainstream lexicalized English writing methods take only the corpus dependence between words into consideration, without introducing the corpus collocation and other issues. “Drive” is a relatively essential feature of words. And once the drive structure of a word is determined, it will be relatively clear what kinds of words to collocate with, hence the structure of the sentence can be derived relatively directly. In this paper, the English writing model that relies on the computer network corpus drive model is put forward. In this model, rich English corpus is introduced in the decomposition of the rules and the calculation of the probability, which includes not only the corpus dependence information, but also the drive structure and other corpus collocation information. Improved computer network corpus drive model is used to carry out the English writing teaching experiment. The experimental results show that the precision and the recall rate are 88.76% and 87.43%, respectively. The F value of the comprehensive index is improved by 6.65% compared with the Collins headword driven English modes of writing.

  9. Organic chemistry as a language and the implications of chemical linguistics for structural and retrosynthetic analyses.

    Science.gov (United States)

    Cadeddu, Andrea; Wylie, Elizabeth K; Jurczak, Janusz; Wampler-Doty, Matthew; Grzybowski, Bartosz A

    2014-07-28

    Methods of computational linguistics are used to demonstrate that a natural language such as English and organic chemistry have the same structure in terms of the frequency of, respectively, text fragments and molecular fragments. This quantitative correspondence suggests that it is possible to extend the methods of computational corpus linguistics to the analysis of organic molecules. It is shown that within organic molecules bonds that have highest information content are the ones that 1) define repeat/symmetry subunits and 2) in asymmetric molecules, define the loci of potential retrosynthetic disconnections. Linguistics-based analysis appears well-suited to the analysis of complex structural and reactivity patterns within organic molecules. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  10. Processing the ITU vocabulary: revisions and adaptations to the Pisa syntactic-semantic parser

    OpenAIRE

    Peters, Carol; Federici, Stefano; Montemagni, Simonetta; Calzolari, Nicoletta

    1993-01-01

    The first version of the Pisa syntactic-semantic parser was described in detail in Deliverable 4, Section 2 and Appendices 2,3, and 4. The scope of this report is to discuss the testing of the parser on the sample set of vocabulary which has been selected from the ITU Corpus (see Deliverable 6.1) and to illustrate the revisions and extensions that are now being implemented. The report therefore concentrates on presenting analysis and extraction activities. We need to specify clearly all the k...

  11. LEXICAL ANALYSIS OF THE VERB COOK AND LEARNING VOCABULARY: A CORPUS STUDY

    Directory of Open Access Journals (Sweden)

    Priyono Priyono

    2011-07-01

    Full Text Available English verbs have built-in properties that determine how they behave syntactically and generate appropriate meaning associated. With these inherent properties some verbs can fill in only in certain syntactic structures and some in others. The observation of the verb COOK using English corpus has revealed its lexical properties covering the area of syntax, semantics, and collocation suggesting uniqueness of its behaviours that are distinguishable from other verbs. Having found the lexical properties of COOK, this article concludes that the acquisition of lexicon should include lexical properties that reflect their level of competence. It also argues that the acquisition of lexical properties should be implicit, not through meta-linguistic knowledge. This would render early grammar teaching unnecessary. The acquisition of lexical properties should take place through subconscious process, not explicit grammar instruction. Many of these are grammatical aspects such as word order, sentence construction, grammatical and lexical collocations.

  12. Boosted $H\\rightarrow b \\bar{b}$ Tagger in Run II

    CERN Document Server

    AUTHOR|(INSPIRE)INSPIRE-00394595; The ATLAS collaboration

    2016-01-01

    Many searches for Higgs bosons decaying to b quark pairs benefit from the increased Run II centre of mass energy by exploiting the large transverse momentum (boosted) Higgs boson regime, where the two b-jets are merged into one large radius jet. ATLAS uses a boosted $H \\rightarrow b\\bar{b}$ tagger algorithm to separate Higgs signal from background processes (QCD, W and Z bosons, top quarks). The tagger takes as input a large R=1.0 jet with calibrated pseudorapidity, energy and mass scale. It employs b-tagging, Higgs candidate mass, and substructure information. The performance of several operating points in Higgs boson signal, QCD and $t\\bar{t}$ all-hadronic backgrounds are presented. Systematic uncertainties are evaluated so that this tagger can be used in analyses.

  13. A Frequency-List of Sentence Structures: Distribution of Kernel Sentences

    Science.gov (United States)

    Geens, Dirk

    1974-01-01

    A corpus of 10,000 sentences extracted from British theatrical texts was used to construct a frequency list of kernel sentence structures. Thirty-one charts illustrate the analyzed results. The procedures used and an interpretation of the frequencies are given. Such lists might aid foreign language teachers in course organization. Available from…

  14. Advantages and Disadvantages in the Use of Internet as a Corpus

    DEFF Research Database (Denmark)

    Tarp, Sven; Fuertes-Olivera, Pedro A.

    2016-01-01

    This paper initially discusses some of the consequences which the technological development has for lexicography, especially in terms of the different types of empirical basis which can be used in dictionary projects. The most important advantages and disadvantages of using the Internet as a corpus...

  15. A primer in macromolecular linguistics.

    Science.gov (United States)

    Searls, David B

    2013-03-01

    Polymeric macromolecules, when viewed abstractly as strings of symbols, can be treated in terms of formal language theory, providing a mathematical foundation for characterizing such strings both as collections and in terms of their individual structures. In addition this approach offers a framework for analysis of macromolecules by tools and conventions widely used in computational linguistics. This article introduces the ways that linguistics can be and has been applied to molecular biology, covering the relevant formal language theory at a relatively nontechnical level. Analogies between macromolecules and human natural language are used to provide intuitive insights into the relevance of grammars, parsing, and analysis of language complexity to biology. Copyright © 2012 Wiley Periodicals, Inc.

  16. IIIB or not IIIB: a previously unanswered question.

    Science.gov (United States)

    Thibodeau, Jennifer T; Mishkin, Joseph D; Patel, Parag C; Mammen, Pradeep P A; Markham, David W; Drazner, Mark H

    2012-05-01

    The term New York Heart Association (NYHA) class IIIB has been used increasingly in clinical medicine, including as an inclusion criteria for many clinical trials assessing left ventricular assist devices (LVADs). Indeed, NYHA class IIIB is incorporated in the Food and Drug Administration's approved indication for the Heartmate II. However, on review of the medical literature, we found that there is no consensus definition of NYHA class IIIB. Until the ambiguity is resolved, we suggest that this designation not be used in clinical practice or by investigators leading clinical trials assessing therapies which convey substantial risk to patients and therefore require clarity in describing the enrolled patient population. With ongoing improvements in LVADs, this therapy will increasingly be considered in patients less sick than those who require inotropic support, providing urgency to establish a consensus system of classifying such patients who nevertheless fall within the spectrum of advanced heart failure. Herein we propose a modification of the standard NYHA classification system which can be used to fill this void. Copyright © 2012 Elsevier Inc. All rights reserved.

  17. Policy-Based Management Natural Language Parser

    Science.gov (United States)

    James, Mark

    2009-01-01

    The Policy-Based Management Natural Language Parser (PBEM) is a rules-based approach to enterprise management that can be used to automate certain management tasks. This parser simplifies the management of a given endeavor by establishing policies to deal with situations that are likely to occur. Policies are operating rules that can be referred to as a means of maintaining order, security, consistency, or other ways of successfully furthering a goal or mission. PBEM provides a way of managing configuration of network elements, applications, and processes via a set of high-level rules or business policies rather than managing individual elements, thus switching the control to a higher level. This software allows unique management rules (or commands) to be specified and applied to a cross-section of the Global Information Grid (GIG). This software embodies a parser that is capable of recognizing and understanding conversational English. Because all possible dialect variants cannot be anticipated, a unique capability was developed that parses passed on conversation intent rather than the exact way the words are used. This software can increase productivity by enabling a user to converse with the system in conversational English to define network policies. PBEM can be used in both manned and unmanned science-gathering programs. Because policy statements can be domain-independent, this software can be applied equally to a wide variety of applications.

  18. Introduction to HOBIT, a b-jet identification tagger at the CDF experiment optimized for light Higgs boson searches

    Energy Technology Data Exchange (ETDEWEB)

    Freeman, J.; Junk, T.; Kirby, M.; Oksuzian, Y.; Phillips, T. J.; Snider, F. D.; Trovato, M.; Vizan, J.; Yao, W. M.

    2013-01-01

    We present the development and validation of the Higgs Optimized b Identification Tagger (HOBIT), a multivariate b-jet identification algorithm optimized for Higgs boson searches at the CDF experiment at the Fermilab Tevatron. At collider experiments, b taggers allow one to distinguish particle jets containing B hadrons from other jets; these algorithms have been used for many years with great success at CDF. HOBIT has been designed specifically for use in searches for light Higgs bosons decaying via H ! b\\bar{b}. This fact combined with the extent to which HOBIT synthesizes and extends the best ideas of previous taggers makes HOBIT unique among CDF b-tagging algorithms. Employing feed-forward neural network architectures, HOBIT provides an output value ranging from approximately -1 ("light-jet like") to 1 ("b-jet like"); this continuous output value has been tuned to provide maximum sensitivity in light Higgs boson search analyses. When tuned to the equivalent light jet rejection rate, HOBIT tags 54% of b jets in simulated 120 GeV/c2 Higgs boson events compared to 39% for SecVtx, the most commonly used b tagger at CDF. We present features of the tagger as well as its characterization in the form of b-jet finding efficiencies and false (light-jet) tag rates.

  19. Quantitative Research in Systemic Functional Linguistics

    Science.gov (United States)

    He, Qingshun

    2018-01-01

    The research of Systemic Functional Linguistics has been quite in-depth in both theory and practice. However, many linguists hold that Systemic Functional Linguistics has no hypothesis testing or experiments and its research is only qualitative. Analyses of the corpus, intelligent computing and language evolution on the ideological background of…

  20. The acute mania of King George III: A computational linguistic analysis.

    Directory of Open Access Journals (Sweden)

    Vassiliki Rentoumi

    Full Text Available We used a computational linguistic approach, exploiting machine learning techniques, to examine the letters written by King George III during mentally healthy and apparently mentally ill periods of his life. The aims of the study were: first, to establish the existence of alterations in the King's written language at the onset of his first manic episode; and secondly to identify salient sources of variation contributing to the changes. Effects on language were sought in two control conditions (politically stressful vs. politically tranquil periods and seasonal variation. We found clear differences in the letter corpus, across a range of different features, in association with the onset of mental derangement, which were driven by a combination of linguistic and information theory features that appeared to be specific to the contrast between acute mania and mental stability. The paucity of existing data relevant to changes in written language in the presence of acute mania suggests that lexical, syntactic and stylometric descriptions of written discourse produced by a cohort of patients with a diagnosis of acute mania will be necessary to support the diagnosis independently and to look for other periods of mental illness of the course of the King's life, and in other historically significant figures with similarly large archives of handwritten documents.

  1. Corpus methods and their reflection in linguistic theories of the 20th century

    Directory of Open Access Journals (Sweden)

    Simon Krek

    2013-05-01

    Full Text Available In the 20th century structuralism established itself as the central linguistic theory, in the first half mainly through its originator Ferdinand de Saussure, and in the second half with the figure of Noam Chomsky. The latter consistently refused to acknowledge analysis of extensive quantity of texts as a valuable method, and favoured linguistic intuition of a native speaker instead. In parallel with structuralism other trends in linguistics emerged which pointed to the inadequateness of the prevailing linguistic paradigm and to theoretical insights which were only possible after the systematic analysis of large quantities of texts. The paper discusses some of the dilemmas stemming from this dichotomy and places corpus linguistics in a broader linguistic context.

  2. Resilience in families in which a member has been diagnosed with schizophrenia.

    Science.gov (United States)

    Bishop, M; Greeff, A P

    2015-09-01

    Due to the extensive focus of the literature on the burden placed on families in which a member has been diagnosed with a mental illness such as schizophrenia, there is a need to identify factors that may help these families to be resilient and adapt to their crisis. The aim of this study was to identify family resilience qualities in families in which a member has been diagnosed with schizophrenia. The study comprised 42 families, represented by 33 parents and 9 siblings of the diagnosed family member. Families were recruited from three support groups within the Cape Metropolitan area, Western Cape, South Africa. Qualitative data were obtained through an open-ended question and quantitative data were collected with seven self-report questionnaires. The following family resilience qualities were identified: family income; finding support in their community; family togetherness; family communication style during crises; affirming and supportive communication patterns; family hardiness; commitment to the family; reframing crises as a challenge; and an internal locus of control within the family. The findings may be used by professionals and support group facilitators to enhance the resilience and functioning of families living with a member with schizophrenia. With approximately 1% of the world's population diagnosed with schizophrenia, it is clear that many families are affected when a member has been diagnosed. There is a need to identify factors that may help these families to be resilient. The aim of this study was to identify family resilience qualities in families in which a member has been diagnosed with schizophrenia. The following family resilience qualities were identified as resources that helped them to adapt to the many challenges put to them: family income, finding support in their community, the availability of hospitals, churches and professionals, family togetherness, family communication, family hardiness, commitment to the family, reframing crises

  3. Advantages and disadvantages in the use of internet as a corpus ...

    African Journals Online (AJOL)

    This paper initially discusses some of the consequences which the technological development has for lexicography, especially in terms of the different types of empirical basis which can be used in dictionary projects. The most important advantages and disadvantages of using the Internet as a corpus are then listed and ...

  4. Emotion Recognition of Weblog Sentences Based on an Ensemble Algorithm of Multi-label Classification and Word Emotions

    Science.gov (United States)

    Li, Ji; Ren, Fuji

    Weblogs have greatly changed the communication ways of mankind. Affective analysis of blog posts is found valuable for many applications such as text-to-speech synthesis or computer-assisted recommendation. Traditional emotion recognition in text based on single-label classification can not satisfy higher requirements of affective computing. In this paper, the automatic identification of sentence emotion in weblogs is modeled as a multi-label text categorization task. Experiments are carried out on 12273 blog sentences from the Chinese emotion corpus Ren_CECps with 8-dimension emotion annotation. An ensemble algorithm RAKEL is used to recognize dominant emotions from the writer's perspective. Our emotion feature using detailed intensity representation for word emotions outperforms the other main features such as the word frequency feature and the traditional lexicon-based feature. In order to deal with relatively complex sentences, we integrate grammatical characteristics of punctuations, disjunctive connectives, modification relations and negation into features. It achieves 13.51% and 12.49% increases for Micro-averaged F1 and Macro-averaged F1 respectively compared to the traditional lexicon-based feature. Result shows that multiple-dimension emotion representation with grammatical features can efficiently classify sentence emotion in a multi-label problem.

  5. Verb-second word order after German weil ‘because’: Psycholinguistic theory from corpus-linguistic data

    Directory of Open Access Journals (Sweden)

    Gerard Kempen

    2016-04-01

    Full Text Available In present-day spoken German, subordinate clauses introduced by the connector 'weil '‘because’ occur with two orders of subject, finite verb, and object(s. In addition to 'weil 'clauses with verb-final word order (“VF”; standard in subordinate clauses one often hears 'weil 'clauses with SVO, the standard order of main clauses (“verb-second”, V2. The “'weil'-V2” phenomenon is restricted to sentences where the 'weil 'clause follows the main clause, and is virtually absent from formal (written, edited German, occurring only in extemporaneous speech. Extant accounts of 'weil'-V2 focus on the interpretation of 'weil'-V2 clauses by the hearer, in particular on the type of discourse relation licensed by 'weil'-V2 vs. 'weil'-VF: causal/propositional or inferential/epistemic. Focusing instead on the production of 'weil 'clauses by the speaker, we examine a collection of about 1,000 sentences featuring a causal connector ('weil, da 'or 'denn' after the main clause, all extracted from a corpus of spoken German dialogues and annotated with tags denoting major prosodic and syntactic boundaries, and various types of disfluencies (pauses, hesitations. Based on the observed frequency patterns and on known linguistic properties of the connectors, we propose that 'weil'-V2 is caused by miscoordination between the mechanisms for lexical retrieval and grammatical encoding: Due to its high frequency, the lexical item 'weil 'is often selected prematurely, while the grammatical encoder is still working on the syntactic shape of the 'weil 'clause. 'Weil'-V2 arises when pragmatic and processing factors drive the encoder to discontinue the current sentence, and to plan the clause following 'weil 'in the form of the main clause of an independent, new sentence. Thus, the speaker continues with a V2 clause, seemingly in violation of the VF constraint imposed by the preceding 'weil'. We also explore implications of the model regarding the interpretation of

  6. From Utterance to Example Sentence

    DEFF Research Database (Denmark)

    Kristoffersen, Jette Hedegaard

    This poster will address some of the problems on excerption of example sentences for the online dictionary of Danish Sign Language (DTS) from a raw corpus of dialogues and monologues. In the Danish Sign Language Dictionary every meaning is illustrated by one or more sentences showing the sign...... lexicographers. The sentences were excerpted by hand from a raw corpus of dialogues and monologues – given to us by our group of consultants. The poster describes the process from utterance in a corpus in a larger context to an example sentence in a dictionary, where the purpose of having examples sentences...... for use in the dictionary consists of 11 stages in the DTS dictionary project. Special focus will be on the stage in the process where the sentence is judged suitable for dictionary use. A set of guidelines for what makes up a good example sentence has been developed for the DTS dictionary project...

  7. Linguistic Markers of Stance in Early and Advanced Academic Writing: A Corpus-Based Comparison

    Science.gov (United States)

    Aull, Laura L.; Lancaster, Zak

    2014-01-01

    This article uses corpus methods to examine linguistic expressions of stance in over 4,000 argumentative essays written by incoming first-year university students in comparison with the writing of upper-level undergraduate students and published academics. The findings reveal linguistic stance markers shared across the first-year essays despite…

  8. Type-II Weyl semimetals.

    Science.gov (United States)

    Soluyanov, Alexey A; Gresch, Dominik; Wang, Zhijun; Wu, QuanSheng; Troyer, Matthias; Dai, Xi; Bernevig, B Andrei

    2015-11-26

    Fermions--elementary particles such as electrons--are classified as Dirac, Majorana or Weyl. Majorana and Weyl fermions had not been observed experimentally until the recent discovery of condensed matter systems such as topological superconductors and semimetals, in which they arise as low-energy excitations. Here we propose the existence of a previously overlooked type of Weyl fermion that emerges at the boundary between electron and hole pockets in a new phase of matter. This particle was missed by Weyl because it breaks the stringent Lorentz symmetry in high-energy physics. Lorentz invariance, however, is not present in condensed matter physics, and by generalizing the Dirac equation, we find the new type of Weyl fermion. In particular, whereas Weyl semimetals--materials hosting Weyl fermions--were previously thought to have standard Weyl points with a point-like Fermi surface (which we refer to as type-I), we discover a type-II Weyl point, which is still a protected crossing, but appears at the contact of electron and hole pockets in type-II Weyl semimetals. We predict that WTe2 is an example of a topological semimetal hosting the new particle as a low-energy excitation around such a type-II Weyl point. The existence of type-II Weyl points in WTe2 means that many of its physical properties are very different to those of standard Weyl semimetals with point-like Fermi surfaces.

  9. Hemoperitoneum from Corpus Luteal Cyst Rupture: A Practical Approach in Emergency Room

    Directory of Open Access Journals (Sweden)

    Valeria Fiaschetti

    2014-01-01

    Full Text Available Corpus luteum cyst rupture with consequent hemoperitoneum is a common disorder in women in their reproductive age. This condition should be promptly recognized and treated because a delayed diagnosis may significantly reduce women’s fertility and intra-abdominal bleeding may be life-threatening. Many imaging modalities play a key role in the diagnosis of acute pelvic pain from gynecological causes. Ultrasound study (USS is usually the first imaging technique for initial evaluation. USS is used to confirm or to exclude the presence of intraperitoneal fluid but it has some limitations in the identification of the bleeding source. Contrast-enhanced computed tomography (CT is the imaging modality which could be used in the acute setting in order to recognize gynecological emergencies and to establish a correct management. Magnetic resonance imaging (MRI nowadays is the most useful technique for studying the pelvis but its low availability and the long acquisition time of the images limit its usefulness in characterization of acute gynecological complications. We report a case of a young patient with hemoperitoneum from hemorrhagic corpus luteum correctly identified by transabdominal USS and contrast-enhanced CT.

  10. Quantum field theory and the linguistic Minimalist Program: a remarkable isomorphism

    Science.gov (United States)

    Piattelli-Palmarini, M.; Vitiello, G.

    2017-08-01

    By resorting to recent results, we show that an isomorphism exist between linguistic features of the Minimalist Program and the quantum field theory formalism of condensed matter physics. Specific linguistic features which admit a representation in terms of the many-body algebraic formalism are the unconstrained nature of recursive Merge, the operation of the Labeling Algorithm, the difference between pronounced and un-pronounced copies of elements in a sentence and the build-up of the Fibonacci sequence in the syntactic derivation of sentence structures. The collective dynamical nature of the formation process of Logical Forms leading to the individuation of the manifold of concepts and the computational self-consistency of languages are also discussed.

  11. Describing linguistic information in a behavioural framework: Possible or not?

    Energy Technology Data Exchange (ETDEWEB)

    De Cooman, G. [Universiteit Gent, Zwijnaarde (Belgium)

    1996-12-31

    The paper discusses important aspects of the representation of linguistic information, using imprecise probabilities with a behavioural interpretation. We define linguistic information as the information conveyed by statements in natural language, but restrict ourselves to simple affirmative statements of the type {open_quote}subject-is-predicate{close_quote}. Taking the behavioural stance, as it is described in detail, we investigate whether it is possible to give a mathematical model for this kind of information. In particular, we evaluate Zadeli`s suggestion that we should use possibility measures to this end. We come to tile conclusion that, generally speaking, possibility measures are possibility models for linguistic information, but that more work should be done in order to evaluate the suggestion that they may be the only ones.

  12. MASCOT HTML and XML parser: an implementation of a novel object model for protein identification data.

    Science.gov (United States)

    Yang, Chunguang G; Granite, Stephen J; Van Eyk, Jennifer E; Winslow, Raimond L

    2006-11-01

    Protein identification using MS is an important technique in proteomics as well as a major generator of proteomics data. We have designed the protein identification data object model (PDOM) and developed a parser based on this model to facilitate the analysis and storage of these data. The parser works with HTML or XML files saved or exported from MASCOT MS/MS ions search in peptide summary report or MASCOT PMF search in protein summary report. The program creates PDOM objects, eliminates redundancy in the input file, and has the capability to output any PDOM object to a relational database. This program facilitates additional analysis of MASCOT search results and aids the storage of protein identification information. The implementation is extensible and can serve as a template to develop parsers for other search engines. The parser can be used as a stand-alone application or can be driven by other Java programs. It is currently being used as the front end for a system that loads HTML and XML result files of MASCOT searches into a relational database. The source code is freely available at http://www.ccbm.jhu.edu and the program uses only free and open-source Java libraries.

  13. Probabilistic modeling of discourse-aware sentence processing.

    Science.gov (United States)

    Dubey, Amit; Keller, Frank; Sturt, Patrick

    2013-07-01

    Probabilistic models of sentence comprehension are increasingly relevant to questions concerning human language processing. However, such models are often limited to syntactic factors. This restriction is unrealistic in light of experimental results suggesting interactions between syntax and other forms of linguistic information in human sentence processing. To address this limitation, this article introduces two sentence processing models that augment a syntactic component with information about discourse co-reference. The novel combination of probabilistic syntactic components with co-reference classifiers permits them to more closely mimic human behavior than existing models. The first model uses a deep model of linguistics, based in part on probabilistic logic, allowing it to make qualitative predictions on experimental data; the second model uses shallow processing to make quantitative predictions on a broad-coverage reading-time corpus. Copyright © 2013 Cognitive Science Society, Inc.

  14. Studying text coherence in Czech – a corpus-based analysis

    Directory of Open Access Journals (Sweden)

    Rysová Magdaléna

    2017-12-01

    Full Text Available The paper deals with the field of Czech corpus linguistics and represents one of various current studies analysing text coherence through language interactions. It presents a corpusbased analysis of grammatical coreference and sentence information structure (in terms of contextual boundness in Czech. It focuses on examining the interaction of these two language phenomena and observes where they meet to participate in text structuring. Specifically, the paper analyses contextually bound and non-bound sentence items and examines whether (and how often they are involved in relations of grammatical coreference in Czech newspaper articles. The analysis is carried out on the language data of the Prague Dependency Treebank (PDT containing 3,165 Czech texts. The results of the analysis are helpful in automatic text annotation - the paper presents how (or to what extent the annotation of grammatical coreference may be used in automatic (pre-annotation of sentence information structure in Czech. It demonstrates how accurately we may (automatically assume the value of contextual boundness for the antecedent and anaphor (as the two participants of a grammatical coreference relation. The results of the paper demonstrate that the anaphor of grammatical coreference is automatically predictable - it is a non-contrastive contextually bound sentence item in 99.18% of cases. On the other hand, the value of contextual boundness of the antecedent is not so easy to estimate (according to the PDT, the antecedent is contextually non-bound in 37% of cases, non-contrastive contextually bound in 50% and contrastive contextually bound in 13% of cases.

  15. Elaboration over a discourse facilitates retrieval in sentence processing

    Directory of Open Access Journals (Sweden)

    Melissa eTroyer

    2016-03-01

    Full Text Available Language comprehension requires access to stored knowledge and the ability to combine knowledge in new, meaningful ways. Previous work has shown that processing linguistically more complex expressions (‘Texas cattle rancher’ vs. ‘rancher’ leads to slow-downs in reading during initial processing, possibly reflecting effort in combining information. Conversely, when this information must subsequently be retrieved (as in filler-gap constructions, processing is facilitated for more complex expressions, possibly because more semantic cues are available during retrieval. To follow up on this hypothesis, we tested whether information distributed across a short discourse can similarly provide effective cues for retrieval. Participants read texts introducing two referents (e.g., two senators, one of whom was described in greater detail than the other (e.g., ‘The Democrat had voted for one of the senators, and the Republican had voted for the other, a man from Ohio who was running for president’. The final sentence (e.g., ‘The senator who the {Republican / Democrat} had voted for…’ contained a relative clause picking out either the Many-Cue referent (with ‘Republican’ or the One-Cue referent (with ‘Democrat’. We predicted facilitated retrieval (faster reading times for the Many-Cue condition at the verb region (‘had voted for’, where readers could understand that ‘The senator’ is the object of the verb. As predicted, this pattern was observed at the retrieval region and continued throughout the rest of the sentence. Participants also completed the Author/Magazine Recognition Tests (ART/MRT; Stanovich & West, 1989, providing a proxy for world knowledge. Since higher ART/MRT scores may index (a greater experience accessing relevant knowledge and/or (b richer/more highly-structured representations in semantic memory, we predicted it would be positively associated with effects of elaboration on retrieval. We did not observe

  16. The Comparative Power of Type/Token and Hapax legomena/Type Ratios: A Corpus-based Study of Authorial Differentiation

    Directory of Open Access Journals (Sweden)

    Sundus Muhsin Ali

    2014-06-01

    Full Text Available This paper presents an attempt to verify the comparative power of two statistical features: Type/Token, and Hapax legomena/Token ratios (henceforth TTR and HTR. A corpus of ten novels is compiled. Then sixteen samples (each is 5,000 tokens in length are taken randomly out of these novels as representative blocks. The researchers observe the way TTR and HTR behave in discriminating four novelists: Joyce, Woolf, Faulkner and Hemingway. When compared to the traditional statistical features (e.g. word length average, Sentence length average, etc., TTR and HTR are by far more competent in comparing the distinctive quantitative behavior of each novelist. It turns out that TTR and HTR contribute more or less in creating a sort of statistical identity which can be used in giving a vivid comparison and discrimination of the four novelists involved in this paper. Nevertheless, HTR sounds more viable in achieving the discriminating task than TTR.

  17. Corpus Linguistics, Network Analysis and Co-occurrence Matrices Corpus Linguistics, Network Analysis and Co-occurrence Matrices

    Directory of Open Access Journals (Sweden)

    Keith Stuart

    2009-12-01

    Full Text Available This article describes research undertaken in order to design a methodology for the reticular representation of knowledge of a specific discourse community. To achieve this goal, a representative corpus of the scientific production of the members of this discourse community (Universidad Politécnica de Valencia, UPV was created. The article presents the practical analysis (frequency, keyword, collocation and cluster analysis that was carried out in the initial phases of the study aimed at establishing the theoretical and practical background and framework for our matrix and network analysis of the scientific discourse of the UPV. In the methodology section, the processes that have allowed us to extract from the corpus the linguistic elements needed to develop co-occurrence matrices, as well as the computer tools used in the research, are described. From these co-occurrence matrices, semantic networks of subject and discipline knowledge were generated. Finally, based on the results obtained, we suggest that it may be viable to extract and to represent the intellectual capital of an academic institution using corpus linguistics methods in combination with the formulations of network theory.En este artículo describimos la investigación que se ha desarrollado en el diseño de una metodología para la representación reticular del conocimiento que se genera en el seno de una institución a partir de un corpus representativo de la producción científica de los integrantes de dicha comunidad discursiva, la Universidad Politécnica de Valencia.. Para ello, presentamos las acciones que se realizaron en las fases iniciales del estudio encaminadas a establecer el marco teórico y práctico en el que se inscribe nuestro análisis. En la sección de metodología se describen las herramientas informáticas utilizadas, así como los procesos que nos permitieron disponer de aquellos elementos presentes en el corpus, que nos llevarían al desarrollo de

  18. Dental and skeletal changes in mild to moderate Class II malocclusions treated by either a Twin-block or Xbow appliance followed by full fixed orthodontic treatment.

    Science.gov (United States)

    Ehsani, Sayeh; Nebbe, Brian; Normando, David; Lagravere, Manuel O; Flores-Mir, Carlos

    2015-11-01

    To compare the short-term skeletal and dental effects of two-phase orthodontic treatment including either a Twin-block or an XBow appliance. This was a retrospective clinical trial of 50 consecutive Class II cases treated in a private practice with either a Twin-block (25) or XBow (25) appliance followed by full fixed orthodontic treatment. To factor out growth, an untreated Class II control group (25) was considered. A MANOVA of treatment/observation changes followed by univariate pairwise comparisons showed that the maxilla moved forward less in the treatment groups than in the control group. As for mandibular changes, the corpus length increase was larger in the Twin-block group by 3.9 mm. Dentally, mesial movement of mandibular molars was greater in both treatment groups. Although no distalization of maxillary molars was found in either treatment group, restriction of mesial movement of these teeth was seen in both treatment groups. Both treatment groups demonstrated increased mandibular incisor proclination with larger increases for the XBow group by 3.3°. The Wits value was decreased by 1.6 mm more in the Twin-block group. No sex-related differences were observed. Class II correction using an XBow or Twin-block followed by fixed appliances occurs through a relatively similar combination of dental and skeletal effects. An increase in mandibular incisor inclination for the XBow group and an increased corpus length for the Twin-block group were notable exceptions. No overall treatment length differences were seen.

  19. Developing a tagset and tagger for the African languages of South ...

    African Journals Online (AJOL)

    annotations in the form of linguistic tags and annotations. That is, the annotations are used to direct the searches to specific grammatical and lexical phenomena in a corpus. In this article, we propose a corpus-based approach and a tagset to be used on a corpus of spoken language for the African languages of South Africa.

  20. Web corpus construction

    CERN Document Server

    Schafer, Roland

    2013-01-01

    The World Wide Web constitutes the largest existing source of texts written in a great variety of languages. A feasible and sound way of exploiting this data for linguistic research is to compile a static corpus for a given language. There are several adavantages of this approach: (i) Working with such corpora obviates the problems encountered when using Internet search engines in quantitative linguistic research (such as non-transparent ranking algorithms). (ii) Creating a corpus from web data is virtually free. (iii) The size of corpora compiled from the WWW may exceed by several orders of magnitudes the size of language resources offered elsewhere. (iv) The data is locally available to the user, and it can be linguistically post-processed and queried with the tools preferred by her/him. This book addresses the main practical tasks in the creation of web corpora up to giga-token size. Among these tasks are the sampling process (i.e., web crawling) and the usual cleanups including boilerplate removal and rem...

  1. The monoclonal antibody SM5-1 recognizes a fibronectin variant which is widely expressed in melanoma

    Directory of Open Access Journals (Sweden)

    Guo Yajun

    2006-01-01

    Full Text Available Abstract Background Previously we have generated the monoclonal antibody SM5-1 by using a subtractive immunization protocol of human melanoma. This antibody exhibits a high sensitivity for primary melanomas of 99% (248/250 tested and for metastatic melanoma of 96% (146/151 tested in paraffin embedded sections. This reactivity is superior to the one obtained by HMB-45, anti-MelanA or anti-Tyrosinase and is comparable to anti-S100. However, as compared to anti-S100, the antibody SM5-1 is highly specific for melanocytic lesions since 40 different neoplasms were found to be negative for SM5-1 by immunohistochemistry. The antigen recognized by SM5-1 is unknown. Methods In order to characterize the antigen recognized by mAb SM5-1, a cDNA library was constructed from the metastatic human melanoma cell line SMMUpos in the Uni-ZAP lambda phage and screened by mAb SM5-1. The cDNA clones identified by this approach were then sequenced and subsequently analyzed. Results Sequence analysis of nine independent overlapping clones (length 3100–5600 bp represent fibronectin cDNA including the ED-A, but not the ED-B region which are produced by alternative splicing. The 89aa splicing variant of the IIICS region was found in 8/9 clones and the 120aa splicing variant in 1/9 clones, both of which are included in the CS1 region of fibronectin being involved in melanoma cell adhesion and spreading. Conclusion The molecule recognized by SM5-1 is a melanoma associated FN variant expressed by virtually all primary and metastatic melanomas and may play an important role in melanoma formation and progression. This antibody is therefore not only of value in immunohistochemistry, but potentially also for diagnostic imaging and immunotherapy.

  2. Studies of top tagging identification methods and development of a new heavy object tagger

    International Nuclear Information System (INIS)

    Lapsien, Tobias

    2016-05-01

    At the Large Hadron Collider (LHC), precision tests of the standard model of particle physics and searches for new phenomena are performed. To make optimal use of the proton-proton collisions delivered by the LHC and its increasing collision rate, both the detectors and the reconstruction algorithms have to be optimized. The identification of heavy quarks is a key component in many measurements. This thesis describes a hardware and a software project which both aim at improving the identification of heavy quarks. In the first part of this thesis, the Phase 1 upgrade of the CMS pixel detector is introduced. One of the main motivations of the replacement of the Pixel detector is the improved b jet identification at large collision rates. The Phase 1 upgrade involves several production and calibration steps. An X-ray calibration procedure has been developed and the corresponding experimental setup is described. Measurements show that the calibration of the pixel modules is temperature independent and can be performed at room temperature. The stability of the setup is tested in order to fulfill the requirements for mass production of the pixel modules. A method to stabilize the calibration is introduced which is shown to reduce the systematic uncertainty. In the second part, algorithms to identify heavily boosted top quarks (''top tagger'') are described and their performance is compared. The OptimalR HEP top tagger and the shower deconstruction tagger show a better performance than existing tagging algorithms. They can be used in Run II with increased centre-of-mass energies of 13 and 14 TeV. It is also shown that existing top tagging algorithms can be improved by the usage of multivariate analysis methods. New algorithms are commissioned using CMS data with a centre-of-mass energy of 8 TeV, corresponding to an integrated luminosity of 19.7 fb -1 . In order to validate these new algorithms in data, two selections are made to measure the efficiency

  3. Sentence-Level Attachment Prediction

    Science.gov (United States)

    Albakour, M.-Dyaa; Kruschwitz, Udo; Lucas, Simon

    Attachment prediction is the task of automatically identifying email messages that should contain an attachment. This can be useful to tackle the problem of sending out emails but forgetting to include the relevant attachment (something that happens all too often). A common Information Retrieval (IR) approach in analyzing documents such as emails is to treat the entire document as a bag of words. Here we propose a finer-grained analysis to address the problem. We aim at identifying individual sentences within an email that refer to an attachment. If we detect any such sentence, we predict that the email should have an attachment. Using part of the Enron corpus for evaluation we find that our finer-grained approach outperforms previously reported document-level attachment prediction in similar evaluation settings.

  4. O wrażliwośco kontekstowej zdań typu 'S wie, że p' (ON CONTEXTUAL SENSITIVITY OF SENTENCE TYPE 'S KNOWS THAT P'

    Directory of Open Access Journals (Sweden)

    Rafał Palczewski

    2005-12-01

    Full Text Available Semantic contextualism is recently one of the most discussed epistemological theories. So far, the main part of discussion concerns contextualist solution of the skeptical problem. Nowadays it has become more clear that this theory needs strong and independent justification from a linguistic and language-philosophical point of view. In this paper The author outlines several treads concerning linguistic basis for contextualism. In part one there are presented some fundamental contextualism thesis and an example proposed by S. Cohen which has to support it. Next the following question is considered: which semantic feature is responsible for context dependence of knowledge ascription sentences? Is it indexicality, vagueness, ambiguity, ellipticity or unspecificity? Debate sketched in part three is concentrated on an analogy between linguistic behavior of knowledge ascription sentences and other context-dependent expressions, especially indexicals and gradable adjectives. The last part of this paper contains a new argument for contextualism proposed by K. De Rose. In addition such argument points out that contextualism does not confuse a truth conditions of knowledge ascribing sentences with their condition of warranted assertability.

  5. Parsley: a Command-Line Parser for Astronomical Applications

    Science.gov (United States)

    Deich, William

    Parsley is a sophisticated keyword + value parser, packaged as a library of routines that offers an easy method for providing command-line arguments to programs. It makes it easy for the user to enter values, and it makes it easy for the programmer to collect and validate the user's entries. Parsley is tuned for astronomical applications: for example, dates entered in Julian, Modified Julian, calendar, or several other formats are all recognized without special effort by the user or by the programmer; angles can be entered using decimal degrees or dd:mm:ss; time-like intervals as decimal hours, hh:mm:ss, or a variety of other units. Vectors of data are accepted as readily as scalars.

  6. Health adaptation policy for climate vulnerable groups: a 'critical computational linguistics' analysis.

    Science.gov (United States)

    Seidel, Bastian M; Bell, Erica

    2014-11-28

    Many countries are developing or reviewing national adaptation policy for climate change but the extent to which these meet the health needs of vulnerable groups has not been assessed. This study examines the adequacy of such policies for nine known climate-vulnerable groups: people with mental health conditions, Aboriginal people, culturally and linguistically diverse groups, aged people, people with disabilities, rural communities, children, women, and socioeconomically disadvantaged people. The study analyses an exhaustive sample of national adaptation policy documents from Annex 1 ('developed') countries of the United Nations Framework Convention on Climate Change: 20 documents from 12 countries. A 'critical computational linguistics' method was used involving novel software-driven quantitative mapping and traditional critical discourse analysis. The study finds that references to vulnerable groups are relatively little present or non-existent, as well as poorly connected to language about practical strategies and socio-economic contexts, both also little present. The conclusions offer strategies for developing policy that is better informed by a 'social determinants of health' definition of climate vulnerability, consistent with best practice in the literature and global policy prescriptions.

  7. A Corpus of Writing, Pronunciation, Reading, and Listening by Learners of English as a Foreign Language

    Science.gov (United States)

    Kotani, Katsunori; Yoshimi, Takehiko; Nanjo, Hiroaki; Isahara, Hitoshi

    2016-01-01

    In order to develop effective teaching methods and computer-assisted language teaching systems for learners of English as a foreign language who need to study the basic linguistic competences for writing, pronunciation, reading, and listening, it is necessary to first investigate which vocabulary and grammar they have or have not yet learned.…

  8. Is the comprehension of idiomatic sentences indeed impaired in paranoid Schizophrenia? A window into semantic processing deficits

    Science.gov (United States)

    Pesciarelli, Francesca; Gamberoni, Tania; Ferlazzo, Fabio; Lo Russo, Leo; Pedrazzi, Francesca; Melati, Ermanno; Cacciari, Cristina

    2014-01-01

    Schizophrenia patients have been reported to be more impaired in comprehending non-literal than literal language since early studies on proverbs. Preference for literal rather than figurative interpretations continues to be documented. The main aim of this study was to establish whether patients are indeed able to use combinatorial semantic processing to comprehend literal sentences and both combinatorial analysis, and retrieval of pre-stored meanings to comprehend idiomatic sentences. The study employed a sentence continuation task in which subjects were asked to decide whether a target word was a sensible continuation of a previous sentence fragment to investigate idiomatic and literal sentence comprehension in patients with paranoid schizophrenia. Patients and healthy controls were faster in accepting sensible continuations than in rejecting non-sensible ones in both literal and idiomatic sentences. Patients were as accurate as controls in comprehending literal and idiomatic sentences, but they were overall slower than controls in all conditions. Once the contribution of cognitive covariates was partialled out, the response times (RTs) to sensible idiomatic continuations of patients did not significantly differ from those of controls. This suggests that the state of residual schizophrenia did not contribute to slower processing of sensible idioms above and beyond the cognitive deficits that are typically associated with schizophrenia. PMID:25346676

  9. The CALBC Silver Standard Corpus for Biomedical Named Entities - A Study in Harmonizing the Contributions from Four Independent Named Entity Taggers

    NARCIS (Netherlands)

    D. Rebholz-Schuhmann (Dietrich); A.J. Jimeno-Yepes (Antonio José); E.M. van Mulligen (Erik); N. Kang (Ning); J.A. Kors (Jan); D. Milward (David); P. Corbett (Peter); E. Buyko (Ekaterina); Tomanek (Katrin); E. Beisswanger (Elena); U. Hahn (Udo)

    2010-01-01

    textabstractThe production of gold standard corpora is time-consuming and costly. We propose an alternative: the 'silver standard corpus' (SSC), a corpus that has been generated by the harmonisation of the annotations that have been delivered from a selection of annotation systems. The systems have

  10. JaSlo: Integration of a Japanese-Slovene Bilingual Dictionary with a Corpus Search System

    Directory of Open Access Journals (Sweden)

    Kristina HMELJAK SANGAWA

    2012-12-01

    Full Text Available The paper presents a set of integrated on-line language resources targeted at Japanese language learners, primarily those whose mother tongue is Slovene. The resources consist of the on-line Japanese-Slovene learners’ dictionary jaSlo and two corpora, a 1 million word Japanese-Slovene parallel corpus and a 300 million word corpus of web pages, where each word and sentence is marked by its difficulty level; this corpus is furthermore available as a set of five distinct corpora, each one containing sentences of the particular level. The corpora are available for exploration through NoSketch Engine, the open source version of the commercial state-of-the-art corpus analysis software Sketch Engine. The dictionary is available for Web searching, and dictionary entries have direct links to examples from the corpora, thus offering a wider picture of a possible translations in concrete contextualised examples, and b monolingual Japanese usage examples of different difficulty levels to support language learning.

  11. Sulfatide-activated type II NKT cells prevent allergic airway inflammation by inhibiting type I NKT cell function in a mouse model of asthma.

    Science.gov (United States)

    Zhang, Guqin; Nie, Hanxiang; Yang, Jiong; Ding, Xuhong; Huang, Yi; Yu, Hongying; Li, Ruyou; Yuan, Zhuqing; Hu, Suping

    2011-12-01

    Asthma is a common chronic inflammatory disease involving many different cell types. Recently, type I natural killer T (NKT) cells have been demonstrated to play a crucial role in the development of asthma. However, the roles of type II NKT cells in asthma have not been investigated before. Interestingly, type I and type II NKT cells have been shown to have opposing roles in antitumor immunity, antiparasite immunity, and autoimmunity. We hypothesized that sulfatide-activated type II NKT cells could prevent allergic airway inflammation by inhibiting type I NKT cell function in asthma. Strikingly, in our mouse model, activation of type II NKT cells by sulfatide administration and adoptive transfer of sulfatide-activated type II NKT cells result in reduced-inflammation cell infiltration in the lung and bronchoalveolar lavage fluid, decreased levels of IL-4 and IL-5 in the BALF; and decreased serum levels of ovalbumin-specific IgE and IgG1. Furthermore, it is found that the activation of sulfatide-reactive type II NKT cells leads to the functional inactivation of type I NKT cells, including the proliferation and cytokine secretion. Our data reveal that type II NKT cells activated by glycolipids, such as sulfatide, may serve as a novel approach to treat allergic diseases and other disorders characterized by inappropriate type I NKT cell activation.

  12. Grammaticality, Acceptability, and Probability: A Probabilistic View of Linguistic Knowledge.

    Science.gov (United States)

    Lau, Jey Han; Clark, Alexander; Lappin, Shalom

    2017-07-01

    The question of whether humans represent grammatical knowledge as a binary condition on membership in a set of well-formed sentences, or as a probabilistic property has been the subject of debate among linguists, psychologists, and cognitive scientists for many decades. Acceptability judgments present a serious problem for both classical binary and probabilistic theories of grammaticality. These judgements are gradient in nature, and so cannot be directly accommodated in a binary formal grammar. However, it is also not possible to simply reduce acceptability to probability. The acceptability of a sentence is not the same as the likelihood of its occurrence, which is, in part, determined by factors like sentence length and lexical frequency. In this paper, we present the results of a set of large-scale experiments using crowd-sourced acceptability judgments that demonstrate gradience to be a pervasive feature in acceptability judgments. We then show how one can predict acceptability judgments on the basis of probability by augmenting probabilistic language models with an acceptability measure. This is a function that normalizes probability values to eliminate the confounding factors of length and lexical frequency. We describe a sequence of modeling experiments with unsupervised language models drawn from state-of-the-art machine learning methods in natural language processing. Several of these models achieve very encouraging levels of accuracy in the acceptability prediction task, as measured by the correlation between the acceptability measure scores and mean human acceptability values. We consider the relevance of these results to the debate on the nature of grammatical competence, and we argue that they support the view that linguistic knowledge can be intrinsically probabilistic. Copyright © 2016 Cognitive Science Society, Inc.

  13. A Study of the Speed of Understanding Sentences as a Function of Sentence Structure. Final Report.

    Science.gov (United States)

    Halamandaris, Pandelis G.

    On the basis of the grammatical theory developed by Noam Chomsky, it is reasonable to presume that the different parts of a sentence may not all be understood with equal facility and speed. One purpose of this study was to determine whether some of the grammatical relations within a sentence were understood more readily than others. Sentences of…

  14. The Effects of Linguistic Context on Word Recognition in Noise by Elderly Listeners Using Spanish Sentence Lists (SSL)

    Science.gov (United States)

    Cervera, Teresa; Rosell, Vicente

    2015-01-01

    This study evaluated the effects of the linguistic context on the recognition of words in noise in older listeners using the Spanish Sentence Lists. These sentences were developed based on the approach of the SPIN test for the English language, which contains high and low predictability (HP and LP) sentences. In addition, the relative contribution…

  15. REPETITION AS A SPECIAL TYPE OF SYNTACTIC RELATIONS IN REPRESENTED SPEECH: BASED ON THE PROSE BY MARINA TSVETAEVA

    Directory of Open Access Journals (Sweden)

    Olga Pavlovna Puchinina

    2017-03-01

    Full Text Available The author studies the peculiarities of repetition, being one of the ways to build syntactic relations in the represented speech structure, based on the prose by Marina Tsvetaeva. The article describes the different types of repetitions, their distinctive features and frequency of usage in the studied tests. The author describes different types of repetitions in the studied cases of represented speech from the point of view of their position. Repetition occurs at the beginning, end of sentences, beginning and end of a statement or paragraph, end of a statement and beginning of the next one; lexical items may be repeated in the middle of a statement. The morphology of repetitions, i.e., the way by what parts of speech the repeated words and structures are expressed, is of interest from the point of view of functional grammar. The author notes that Tsvetaeva repeats different parts of speech: conjunctions, prepositions, particles, nouns, pronouns, adverbs, numerals, verbs, modal words or a combination of two words. Moreover, due to her special intention, Tsvetaeva intensifies repetition through particular phonetic devices, such as alliteration, rhyme and rhythm, which make her prosaic works sound poetic. Purpose. The article is devoted to the topic of rendering another person’s speech, as it continues to be one of the most important issues of modern linguistics. The subject of analysis is repetition and its different types in the structure of represented speech on the material of prose texts by Marina Tsvetaeva. The author’s aims is to reveal the way these types of repetition (lexical, syntactic, semantic ones function in the structure of represented speech and what effect is achieved with their help. Methodology. The research has been conducted using the continuous sampling method and the quantitative estimation method, aimed to identify the frequency of using different types of repetition and repeated parts of speech and constructions in the

  16. Smart responsive microcapsules capable of recognizing heavy metal ions.

    Science.gov (United States)

    Pi, Shuo-Wei; Ju, Xiao-Jie; Wu, Han-Guang; Xie, Rui; Chu, Liang-Yin

    2010-09-15

    Smart responsive microcapsules capable of recognizing heavy metal ions are successfully prepared with oil-in-water-in-oil double emulsions as templates for polymerization in this study. The microcapsules are featured with thin poly(N-isopropylacrylamide-co-benzo-18-crown-6-acrylamide) (P(NIPAM-co-BCAm)) membranes, and they can selectively recognize special heavy metal ions such as barium(II) or lead(II) ions very well due to the "host-guest" complexation between the BCAm receptors and barium(II) or lead(II) ions. The stable BCAm/Ba(2+) or BCAm/Pb(2+) complexes in the P(NIPAM-co-BCAm) membrane cause a positive shift of the volume phase transition temperature of the crosslinked P(NIPAM-co-BCAm) hydrogel to a higher temperature, and the repulsion among the charged BCAm/Ba(2+) or BCAm/Pb(2+) complexes and the osmotic pressure within the P(NIPAM-co-BCAm) membranes result in the swelling of microcapsules. Induced by recognizing barium(II) or lead(II) ions, the prepared microcapsules with P(NIPAM-co-BCAm) membranes exhibit isothermal and significant swelling not only in outer and inner diameters but also in the membrane thickness. The proposed microcapsules in this study are highly attractive for developing smart sensors and/or carriers for detection and/or elimination of heavy metal ions. Copyright 2010 Elsevier Inc. All rights reserved.

  17. Modification as a propositional act

    DEFF Research Database (Denmark)

    Rijkhoff, Jan

    2014-01-01

    A complete functional theory of grammar should be able to specify the communicative function of each constituent in a sentence (or rather ‘Discourse Act’). Whereas predicating and referring have been given due attention in functional approaches to grammar, the modifying function of linguistic...... material in a Discourse Act has been largely ignored. This chapter argues (i) that modifying is no less actional than predicating or referring in the process of verbal communication and (ii) that the modifying function of linguistic material should be specified at the highest level of representation...

  18. Study of Applying Cognitive Linguistic Theory into Japanese Grammar Teaching——Taking Causative Sentence as an Example

    Institute of Scientific and Technical Information of China (English)

    Zhang Yu[1

    2016-01-01

    The traditional Japanese grammar teaching often only pays attention to the interpretation of syntax and the integrity of grammar structure. This violates the cultivation of communicative competence, and is not in conformity with the society’s requirements of applied foreign language talents. Cognitive linguistics theory, which links language form with semantic concept, reveals the internal relation of man’s thinking and language. If we can subtly apply cognitive linguistic theory into Japanese grammar teaching to explore the cognitive process in the speakers’ brain while expressing, we can get a good understanding of diffi cult points and “special case”. This paper explores the introductory methods and efficacy of the cognitive linguistics theory applied in Japanese grammar teaching method, by lecturing causative sentences an example.

  19. Integrated Spreadsheets as a Paradigm of Type II Technology Applications in Mathematics Teacher Education

    Science.gov (United States)

    Abramovich, Sergei

    2016-01-01

    The paper presents the use of spreadsheets integrated with digital tools capable of symbolic computations and graphic constructions in a master's level capstone course for secondary mathematics teachers. Such use of spreadsheets is congruent with the Type II technology applications framework aimed at the development of conceptual knowledge in the…

  20. Chronic constipation recognized as a sign of a SOX10 mutation in a patient with Waardenburg syndrome.

    Science.gov (United States)

    Arimoto, Yukiko; Namba, Kazunori; Nakano, Atsuko; Matsunaga, Tatsuo

    2014-05-01

    Waardenburg syndrome is characterized by hearing loss, pigmentation abnormalities, dysmorphologic features, and neurological phenotypes. Waardenburg syndrome consists of four distinct subtypes, and SOX10 mutations have been identified in type II and type IV. Type IV differs from type II owing to the presence of Hirschsprung disease. We identified a de novo nonsense mutation in SOX10 (p.G39X) in a female pediatric patient with Waardenburg syndrome with heterochromia iridis, profound bilateral sensorineural hearing loss, inner ear malformations, and overall hypopigmentation of the hair without dystopia canthorum. This patient has experienced chronic constipation since she was a neonate, but anorectal manometry showed a normal anorectal reflex. Chronic constipation in this patient was likely to be a consequence of a mild intestinal disorder owing to the SOX10 mutation, and this patient was considered to have a clinical phenotype intermediate between type II and type IV of the syndrome. Chronic constipation may be recognized as indicative of a SOX10 mutation in patients with Waardenburg syndrome. Copyright © 2014 Elsevier B.V. All rights reserved.

  1. Anterior screw fixation of a dislocated type II odontoid fracture facilitated by transoral and posterior cervical manual reduction.

    Science.gov (United States)

    Piedra, Mark P; Hunt, Matthew A; Nemecek, Andrew N

    2009-10-01

    Early fixation of type II odontoid fractures has been shown to provide high rates of long-term stabilization and osteosynthesis. In this report, the authors present the case of a patient with a locked type II odontoid fracture treated by anterior screw fixation facilitated by closed transoral and posterior cervical manual reduction. While transoral intraoperative reduction of a partially displaced odontoid fracture has previously been described, the authors present the first case utilizing this technique in the treatment of a completely dislocated type II odontoid fracture.

  2. What is being heard is not necessarily the same as what has been said

    International Nuclear Information System (INIS)

    KraIj, Metka

    2002-01-01

    Public and especially young people opinion polls in Slovenia indicate a rather high tolerance to the existing nuclear power plant but low acceptance of new nuclear facilities, including a waste disposal facility. Radioactivity and radioactive waste is generally considered a very problematic issue. Although more than 50% Slovenes declared that they do not need additional information about these issues and therefore expressed they were satisfied with the knowledge they had, the arguments against nuclear facilities expressed in public discussions and even in the media, differ from the scientific and technological accuracy in many cases. Public attitudes in Slovenia often reflect ideas that can be found in 'para-scientific' literature such as new age and alternative health, where terms 'energy' and 'radiation' are used in a metaphorical and quiet different sense than in the information materials about nuclear energy. This makes the communication between the public and nuclear professionals very difficult and even conflicting because both sides speak in fact a different language and are not even aware of this. The quality of and scientific accuracy information on web pages retrieved by the key words 'energy', 'radiation' and 'radioactivity' were analysed. In Slovenian language, the key word 'energy' produced the greatest number of hits, but the pages dealing with 'bioenergy', 'life energy', 'positive and negative energy', and the like were twice as frequent as the pages dealing with energy in a physical science sense. Under the key word r adiation approximately half of the pages could be classified as 'new age' pages and the other half gave an adequate physical explanation. Searching with the key word 'radioactivity' gave only a few pages but they contained exact physical explanations, mainly on measurements and radioactive decay but not on the effects of radioactive materials on human health which is the main interest of the public. The situation was similar on web

  3. Anglophonic Influence in the Use of Sound Symbolism in Italian Disney Comics: A Corpus-based Analysis

    Directory of Open Access Journals (Sweden)

    Pischedda Pier Simone

    2017-12-01

    Full Text Available This article will explore the linguistic implications of employing and creating sound symbolism (ideophones, onomatopoeia and interjections in Italian Disney comics. It will endeavour to investigate the way sound symbolic forms in both imported Disney US comics and original Italian stories have profoundly influenced the development of Italian sound symbolism in the last century. The diachronic analysis is carried out thanks to the creation of a corpus of ideophones and interjections from 210 Disney stories published between 1932 and 2013. The corpus will allow the author to investigate how these forms have changed diachronically throughout the eighty years under investigation with the final aim of highlighting changes and patterns in both original and translated Italian stories. The unique status of ideophones, confirmed by language, sociological and neurological studies, has led to interesting experimentations but also to complicated dynamics. Certain linguistic settings seem to foster a better affinity towards the device- particularly if compared to Romance languages, such as Italian and Spanish, that often have to rely on Anglophone renditions. Anglicisation has indeed overshadowed previous original attempts. Nevertheless, recent creations, particularly from cartoonists, bear witness to a willingness to stretch language again in order to enhance language iconicity.

  4. The Perilous Life of a Linguistic Genre Convention

    DEFF Research Database (Denmark)

    Borchmann, Simon

    2014-01-01

    , the descriptions are more informative than the structures hitherto described by text linguistics. Secondly, as historical norms, they are a testimony to the development and change of language use. Thirdly, the descriptions contribute to language users’ awareness of the origin of standards, their understanding......The primary, theoretical aim of the article is to present a linguistic text analysis that differs from standard text linguistic approaches by being informative with regard to the linguistic choices and textual organisation that characterise a text as a social act. The analysis is exemplified...... by using texts of a relatively new Danish journalistic genre nyhedsanalyse (news analysis). The secondary, empirical aim of the article is to present a corpus-based, linguistic analysis of central elements of the genre nyhedsanalyse within the Danish system of newspaper genres. Text linguistics is based...

  5. The development of an automated sentence generator for the assessment of reading speed

    Directory of Open Access Journals (Sweden)

    Legge Gordon E

    2008-03-01

    Full Text Available Abstract Reading speed is an important outcome measure for many studies in neuroscience and psychology. Conventional reading speed tests have a limited corpus of sentences and usually require observers to read sentences aloud. Here we describe an automated sentence generator which can create over 100,000 unique sentences, scored using a true/false response. We propose that an estimate of the minimum exposure time required for observers to categorise the truth of such sentences is a good alternative to reading speed measures that guarantees comprehension of the printed material. Removing one word from the sentence reduces performance to chance, indicating minimal redundancy. Reading speed assessed using rapid serial visual presentation (RSVP of these sentences is not statistically different from using MNREAD sentences. The automated sentence generator would be useful for measuring reading speed with button-press response (such as within MRI scanners and for studies requiring many repeated measures of reading speed.

  6. Study of burden distribution characteristics (IV): the development of a distribution predicting model in which coke collapse has been taken into account

    Energy Technology Data Exchange (ETDEWEB)

    Kamisaka, E; Okuno, Y; Irita, T; Matsuzaki, M; Isoyama, T; Kunitomo, K

    1984-01-01

    Using results quoted in a previous report (see Tetsu To Hagane, Vol. 68, page S 701, 1982), coke collapse has been quantified by means of landslide theory, according to which the stability of the burden is given by a safety factor which equals resistance moment/sliding moment. This has enabled coke collapse to be introduced in a model for predicting burden distribution. Application of this model has resulted in more accurate predictions of burden distribution, the computed values being in close agreement with the results of distribution experiments. 1 reference.

  7. Holistic corpus-based dialectology Dialetologia holística baseada em corpus

    Directory of Open Access Journals (Sweden)

    Benedikt Szmrecsanyi

    2011-01-01

    Full Text Available This paper is concerned with sketching future directions for corpus-based dialectology. We advocate a holistic approach to the study of geographically conditioned linguistic variability, and we present a suitable methodology, 'corpusbased dialectometry', in exactly this spirit. Specifically, we argue that in order to live up to the potential of the corpus-based method, practitioners need to (i abandon their exclusive focus on individual linguistic features in favor of the study of feature aggregates, (ii draw on computationally advanced multivariate analysis techniques (such as multidimensional scaling, cluster analysis, and principal component analysis, and (iii aid interpretation of empirical results by marshalling state-of-the-art data visualization techniques. To exemplify this line of analysis, we present a case study which explores joint frequency variability of 57 morphosyntax features in 34 dialects all over Great Britain.Este artigo debruça-se sobre o esboço propositivo de futuras direções para a dialetologia baseada em corpus. Defendemos uma abordagem holística para o estudo da variabilidade linguística geograficamente condicionada, e apresentamos uma metodologia adequada para tal - a dialetometria baseada em corpus. Mais especificamente, defendemos que para que se obtenham todos os resultados esperados da metodologia de corpus, pesquisadores devem: (i abandonar seu foco exclusivo em traços linguísticos individuais em favor do estudo dos agregados de traços, (ii amparar-se em métodos computacionais avançados de técnicas de análise multivariada (tais como escalagem multidimensional, análise de clusters, e análise de componente principal, e (iii auxiliar a interpretação de resultados empíricos através da utilização do estado da arte em técnicas de visualização. A fim de exemplificarmos essa linha de análise, apresentamos um estudo de caso que explora a variabilidade da frequência agregada de 57 tra

  8. Lexical Properties of Slovene Sign Language: A Corpus-Based Study

    Science.gov (United States)

    Vintar, Špela

    2015-01-01

    Slovene Sign Language (SZJ) has as yet received little attention from linguists. This article presents some basic facts about SZJ, its history, current status, and a description of the Slovene Sign Language Corpus and Pilot Grammar (SIGNOR) project, which compiled and annotated a representative corpus of SZJ. Finally, selected quantitative data…

  9. Translation as a Paradigm Shift: A Corpus Study of Academic Writing

    Directory of Open Access Journals (Sweden)

    Agnes Pisanski Peterlin

    2013-05-01

    Full Text Available In recent decades the increasing reliance on computer technology and the emergence of electronic publishing have precipitated changes in both the production and reception of academic writing. At the same time, the dominance of English as the medium of academic communication has been asserted in all fields of study. While many scholars write their own texts in English, it is not exceptional for others to have their papers translated into English. It is interesting, however, that translation of academic discourse has received relatively little research attention so far. In the study presented here, the question how translated academic texts differ from comparable original English academic texts is addressed. To explore this question, a 700,000-word corpus comprising 104 research articles (Slovene-English translations and comparable English originals is analyzed in terms of references to the entire text itself. The results show considerable differences between the translated texts and the comparable English-language originals.

  10. Recognizing emotional speech in Persian: a validated database of Persian emotional speech (Persian ESD).

    Science.gov (United States)

    Keshtiari, Niloofar; Kuhlmann, Michael; Eslami, Moharram; Klann-Delius, Gisela

    2015-03-01

    Research on emotional speech often requires valid stimuli for assessing perceived emotion through prosody and lexical content. To date, no comprehensive emotional speech database for Persian is officially available. The present article reports the process of designing, compiling, and evaluating a comprehensive emotional speech database for colloquial Persian. The database contains a set of 90 validated novel Persian sentences classified in five basic emotional categories (anger, disgust, fear, happiness, and sadness), as well as a neutral category. These sentences were validated in two experiments by a group of 1,126 native Persian speakers. The sentences were articulated by two native Persian speakers (one male, one female) in three conditions: (1) congruent (emotional lexical content articulated in a congruent emotional voice), (2) incongruent (neutral sentences articulated in an emotional voice), and (3) baseline (all emotional and neutral sentences articulated in neutral voice). The speech materials comprise about 470 sentences. The validity of the database was evaluated by a group of 34 native speakers in a perception test. Utterances recognized better than five times chance performance (71.4 %) were regarded as valid portrayals of the target emotions. Acoustic analysis of the valid emotional utterances revealed differences in pitch, intensity, and duration, attributes that may help listeners to correctly classify the intended emotion. The database is designed to be used as a reliable material source (for both text and speech) in future cross-cultural or cross-linguistic studies of emotional speech, and it is available for academic research purposes free of charge. To access the database, please contact the first author.

  11. A data-driven study of temporal adverbials as discourse segmentation markers

    Directory of Open Access Journals (Sweden)

    Marie-Paule Péry-Woodley

    2009-06-01

    Full Text Available De nombreuses études en linguistique et en psycholinguistique considèrent les adverbiaux temporels situés en position initiale de phrase comme de “bons” marqueurs de segmentation du discours. Cet article propose une analyse en corpus de leur capacité à signaler une discontinuité dans le discours (déplacement ou rupture. Nous portons ici une attention toute particulière aux configurations discursives dans lesquelles apparaissent les adverbiaux temporels à l'initiale en considérant deux autres traits : la position textuelle (relative à la structure du document et le type d'expressions sujet apparaissant dans la même phrase que l'adverbial. Notre méthodologie se base sur un corpus diversifié et étiqueté, représentant un volume de données relativement important pour le domaine. Le va-et-vient entre analyses quantitatives et qualitatives nous permet d'explorer comparativement selon différents types de textes les configurations discursives auxquelles participent les adverbiaux temporels à l'initiale. Au vu de nos résultats, il semble que les adverbiaux temporels signalent une discontinuité uniquement dans certaines configurations, ce qui signifie qu'ils ne constituent pas en eux-mêmes des marqueurs de segmentation.Many linguistic and psycholinguistic studies present sentence-initial temporal adverbials as “good” markers of discourse segmentation. This paper proposes a corpus-based evaluation of their potential to signal discontinuity such as a discourse break or shift. We look in particular at how temporal adverbials interact with other features, such as position in the text (in relation to document structure, and type of referring expression as grammatical subject in the host sentence. Our methodology calls upon a large diversified tagged corpus, and combines quantitative and qualitative approaches in order to systematically explore these configurations and their relation to text-type. Temporal adverbials are shown in

  12. Language as a whole - A new framework for linguistic knowledge integration. Comment on "Approaching human language with complex networks" by Cong and Liu

    Science.gov (United States)

    Chen, Xinying

    2014-12-01

    Researchers have been talking about the language system theoretically for many years [1]. A well accepted assumption is that language is a complex adaptive system [2] which is hierarchical [3] and contains multiple levels along the meaning-form dimension [4]. Over the last decade or so, driven by the availability of digital language data and the popularity of statistical approach, many researchers interested in theoretical questions have started to try to quantitatively describe microscopic linguistic features in a certain level of a language system by using authentic language data. Despite the fruitful findings, one question remains unclear. That is, how does a whole language system look like? For answering this question, network approach, an analysis method emphasizes the macro features of structures, has been introduced into linguistic studies [5]. By analyzing the static and dynamic linguistics networks constructed from authentic language data, many macro and micro linguistic features, such as lexical, syntactic or semantic features have been discovered and successfully applied in linguistic typographical studies so that the huge potential of linguistic networks research has revealed [6].

  13. Comparison of the effect of different intensity exercise on a bicycle ergometer on postprandial lipidemia in type II diabetic patients

    Directory of Open Access Journals (Sweden)

    Narges Argani

    2014-05-01

    Full Text Available BACKGROUND: Postprandial lipid clearance failure and lipoprotein disorders, which are independent risk factors for cardiovascular diseases are well-recognized in type II diabetes. Reduction of fats through exercise has been proved, though the mechanism is not well-defined, and the effects of different intensity exercise on postprandial lipidemia in diabetes type II is unknown. This study aims to find these effects using a cycle ergometer. METHODS: On three different days, 15 type II diabetics (10 women and 5 men, with a mean age 42.07 ± 6.05 years, weight 94.64 ± 4.37 kg, height 159.78 ± 9.09 cm, and body mass index 29.83 ± 3.93 kg/m2, consumed a full fat breakfast (750-800 kcal, 85% fat, and 150 min later, blood samples were taken from them to measure their lipid profile. The 1st day was the control day, without any exercises. Seven days later, 90 min after enriched breakfast, they did 30 min of exercise on the cycle ergometer with intensity of 55-70% of maximum heart rate (HRmax, and 14 days later, 90 min after enriched breakfast, they did 30 min of exercise with intensity of 70-85% of HRmax. RESULTS: According to Friedman non-parametric test, high-density lipoprotein (HDL cholesterol serum level significantly increased after 30 min of moderate intensity exercise (P > 0.05, from 39.4 ± 5.2 to 48.6 ± 9.3, while this increase was insignificant after a higher intensity exercise. Neither intensity levels had any significant effects on triglyceride or on low-density lipoprotein cholesterol. CONCLUSION: Results showed that moderate intensity exercise was more effective in increasing HDL cholesterol level in type II diabetics.   Keywords: Postprandial Lipidemia, Resistance Exercise, Bicycle Ergometer, Type II Diabetes 

  14. The suspended sentence in German criminal law

    Directory of Open Access Journals (Sweden)

    Jovašević Dragan

    2017-01-01

    Full Text Available From the ancient times until today, criminal law in all countries has provided different criminal sanctions as social control measures. These are court-imposed coercive measures that take away or limit certain rights and freedoms of criminal offenders. Sanctions are applied to natural or legal persons who violate the norms of the legal order and cause damage or endanger other legal goods that enjoy legal protection. In order to effectively protect social values jeopardized by the commission of crime, state legislations prescribe several kinds of criminal sanctions: 1 penalties, 2 precautions, 3 safety measures, 4 penalties for juvenile offenders, and 5 sanctions for legal persons. Penalties are the basic, the oldest and the most important type of criminal sanctions. They are prescribed for the largest number of criminal offences. Imposed instead of or alongside with penalties, warning measures have particularly important role in jurisprudence. Since they were introduced in the system of criminal sanctions in the early 20th century, there has been a notable increase in the application of these measures, particularly in cases involving negligent and accidental offences, and minor offences that do not cause serious consequences, whose perpetrators are not persons with criminal characteristics. Warning measures (suspended sentence are envisaged in all contemporary criminal legislations, including the German legislation. Suspended sentence is a conditional stay of execution of the sentence of imprisonment for a specified time, provided that the convicted person fulfills the imposed obligations and does not commit another criminal offense. Two conditions must be fulfilled for the application of these sanctions: a the formal requirement, which is attached to the sentence of imprisonment; and b the substantive requirement, which implies the court assessment that the application of these sanctions is justified and necessary in a particular case. Many

  15. A rare polyglycine type II-like helix motif in naturally occurring proteins.

    Science.gov (United States)

    Warkentin, Eberhard; Weidenweber, Sina; Schühle, Karola; Demmer, Ulrike; Heider, Johann; Ermler, Ulrich

    2017-11-01

    Common structural elements in proteins such as α-helices or β-sheets are characterized by uniformly repeating, energetically favorable main chain conformations which additionally exhibit a completely saturated hydrogen-bonding network of the main chain NH and CO groups. Although polyproline or polyglycine type II helices (PP II or PG II ) are frequently found in proteins, they are not considered as equivalent secondary structure elements because they do not form a similar self-contained hydrogen-bonding network of the main chain atoms. In this context our finding of an unusual motif of glycine-rich PG II -like helices in the structure of the acetophenone carboxylase core complex is of relevance. These PG II -like helices form hexagonal bundles which appear to fulfill the criterion of a (largely) saturated hydrogen-bonding network of the main-chain groups and therefore may be regarded in this sense as a new secondary structure element. It consists of a central PG II -like helix surrounded by six nearly parallel PG II -like helices in a hexagonal array, plus an additional PG II -like helix extending the array outwards. Very related structural elements have previously been found in synthetic polyglycine fibers. In both cases, all main chain NH and CO groups of the central PG II -helix are saturated by either intra- or intermolecular hydrogen-bonds, resulting in a self-contained hydrogen-bonding network. Similar, but incomplete PG II -helix patterns were also previously identified in a GTP-binding protein and an antifreeze protein. © 2017 Wiley Periodicals, Inc.

  16. Hero or has-been: is there a future for altruism in medical education?

    Science.gov (United States)

    Bishop, Jeffrey P; Rees, Charlotte E

    2007-08-01

    The term 'altruism' is often used without definition, leading to contradictions in what we expect from medical students. In this reflection paper, we critique the concept of 'altruism' from the perspective of moral philosophy and social psychology and challenge its unquestioned usage within the medical education literature, especially that emerging from the USA. We will argue that 'altruism' is a social construction with a particular history, stemming from Kantian philosophy and perpetuated within newer disciplines such as social psychology. As it currently stands, 'altruism' seems to mean utter self-sacrifice--a position contradictory to recent recommendations by regulatory bodies in the UK, which suggest that graduates should look after the 'self' and achieve a work-life balance. In this article, we argue that it is undesirable to have 'altruism' as a learning outcome for medical students and we also argue that 'altruism' is not an observable behavior that can be measured. Instead, we suggest that medical educators should employ a more balanced term, borrowed from the social psychology literature i.e. pro-social behavior. We argue that whilst 'pro-social behavior' focuses on actions that benefit others, it does not do so at the expense of the self. In addition, it focuses on students' observable behaviors rather than their inner motivations, so is measurable. We conclude our article by discussing the formation of physicians based upon a virtue ethics, where society and the profession are in dialogue about the telos of medicine and its virtues, and where the character of the young physician is formed within the crucible of that dialogue. Thus, central to this pro-social behavior is the concept of phronesis or prudence, including the balancing of self-interest such as self-care, and the interests of the other.

  17. Historical Post Office Directory Parser (POD Parser Software From the AddressingHistory Project

    Directory of Open Access Journals (Sweden)

    Nicola Osborne

    2014-07-01

    Full Text Available The POD Parser is Python software for parsing the OCR’d (optical character recognised text of digitised historical Scottish Post Office Directories (PODs to produce a consistent structured format for the data and for geocoding each address. The software was developed as part of the AddressingHistory project which sought to combine digitised historic directories with digitised and georeferenced historic maps.  The software has potential for reuse in multiple research contexts where historical post office directory data is relevant, and is therefore particularly of use in historical research into social, economic or demographic trends. The POD Parser is currently designed for use with Scottish directories but is extensible, perhaps with some adaptation, to use with other similarly formatted materials such as the English Trade Directories.

  18. Towards an integrated corpus stylistics

    Directory of Open Access Journals (Sweden)

    McIntyre Dan

    2015-12-01

    Full Text Available Over recent years, the use of corpora in stylistic analysis has grown in popularity. However, questions still remain over the remit of corpus stylistics, its distinction from corpus linguistics generally and its capacity to explain complex stylistic effects. This article argues in favour of an integrated corpus stylistics; that is, an approach to corpus stylistics that integrates it with other stylistic methods and analytical frameworks. I suggest that this approach is needed for two main reasons: (i it is analytically necessary in order to fully explain stylistic effects in texts, and (ii integrating corpus methods with other stylistic tools is what will distinguish corpus stylistics from corpus linguistics. My argument is supported by reference to examples from Mark Haddon’s no vel The Curious Incident of the Dog in the Night-time and the HBO TV series Deadwood. Both these examples rely for their explanation on a combination of corpus stylistic analytical techniques and other stylistic methods of analysis.

  19. Oral carcinogenesis is not achieved in different carcinogen-treated PAI-1 transgenic and wild-type mouse models.

    Science.gov (United States)

    Avgoustidis, Dimitris; Nisyrios, Themistoklis; Nkenke, Emeka; Lijnen, Roger; Ragos, Vassilis; Perrea, Despina; Donta, Ismini; Vaena, Apostolia; Yapijakis, Christos; Vairaktaris, Eleftherios

    2012-01-01

    In an effort to assess the role of plasminogen activator inhibitor-1 (PAI-1) in oral squamous cancer development and progression, two different carcinogen treatment protocols were conducted. Protocol I included mice from a PAI-1 transgenic (Tg) breed (n=56) and their wild-type (WT) counterparts (n=56), divided into one control group and two main experimental groups, treated with 7,12-dimethylbenz[a]anthracene (DMBA) for 8 and 16 weeks, respectively. Protocol II included the same number and types of animals and groups, which were similarly treated with 4-Nitroquinoline 1-oxide (4-NQO) in drinking water. Two drugs that affect plasma PAI-1 levels, enalapril and pravastatin, were administered to certain subgroups of animals in both protocols. None of the animals developed macroscopically-visible oral cancer lesions. Eleven animals under Protocol I and 52 animals under Protocol II died. Skin lesions were noted only in DMBA-treated animals (n=9). Almost all animals administered with 4-NQO developed alopecia and lost weight, while two of them developed stomach tumours, and one female mouse developed a large ovarian cyst. Transgenic mice may respond differently when used in well-established carcinogen models and oral carcinogenesis is hard to achieve in these rodents.

  20. Syntactic analysis in sentence comprehension: effects of dependency types and grammatical constraints.

    Science.gov (United States)

    De Vincenzi, M

    1996-01-01

    This paper presents three experiments on the parsing of Italian wh-questions that manipulate the wh-type (who vs. which-N) and the wh extraction site (main clause, dependent clause with or without complementizer). The aim of these manipulations is to see whether the parser is sensitive to the type of dependencies being processed and whether the processing effects can be explained by a unique processing principle, the minimal chain principle (MCP; De Vincenzi, 1991). The results show that the parser, following the MCP, prefers structures with fewer and less complex chains. In particular: (1) There is a processing advantage for the wh-subject extractions, the structures with less complex chains; (2) there is a processing dissociation between the who and which questions; (3) the parser respects the principle that governs the well-formedness of the empty categories (ECP).

  1. Publishing a Quality Context-aware Annotated Corpus and Lexicon for Harassment Research

    OpenAIRE

    Rezvan, Mohammadreza; Shekarpour, Saeedeh; Balasuriya, Lakshika; Thirunarayan, Krishnaprasad; Shalin, Valerie; Sheth, Amit

    2018-01-01

    Having a quality annotated corpus is essential especially for applied research. Despite the recent focus of Web science community on researching about cyberbullying, the community dose not still have standard benchmarks. In this paper, we publish first, a quality annotated corpus and second, an offensive words lexicon capturing different types type of harassment as (i) sexual harassment, (ii) racial harassment, (iii) appearance-related harassment, (iv) intellectual harassment, and (v) politic...

  2. Calibration of the ZEUS 6 m tagger

    Energy Technology Data Exchange (ETDEWEB)

    Schroeder, Matthias

    2008-11-15

    In the last period of the HERA running, data were taken with the ZEUS detector to measure directly the longitudinal structure function of the proton and to determine the total photoproduction cross section. The identification of electrons from photoproduction events, in which an electron interacts with a proton and is scattered at very small angles, is essential for both the measurements. The electrons were detected using a small electromagnetic calorimeter, the 6 m tagger, which as located near the beam pipe at {approx} 6m from the ep interaction point. A method for the calibration and reconstruction of electromagnetic showers in the 6m tagger was developed and the energy acceptance range and the resolution of the 6 m tagger was measured. Furthermore, the photon acceptance and the energy resolution of a different electromagnetic calorimeter used for the luminosity measurement at ZEUS were analyzed using the 6 m tagger. (orig.)

  3. Calibration of the ZEUS 6 m tagger

    International Nuclear Information System (INIS)

    Schroeder, Matthias

    2008-11-01

    In the last period of the HERA running, data were taken with the ZEUS detector to measure directly the longitudinal structure function of the proton and to determine the total photoproduction cross section. The identification of electrons from photoproduction events, in which an electron interacts with a proton and is scattered at very small angles, is essential for both the measurements. The electrons were detected using a small electromagnetic calorimeter, the 6 m tagger, which as located near the beam pipe at ∼ 6m from the ep interaction point. A method for the calibration and reconstruction of electromagnetic showers in the 6m tagger was developed and the energy acceptance range and the resolution of the 6 m tagger was measured. Furthermore, the photon acceptance and the energy resolution of a different electromagnetic calorimeter used for the luminosity measurement at ZEUS were analyzed using the 6 m tagger. (orig.)

  4. The usage of amount, quantity and body in a corpus of biology

    Directory of Open Access Journals (Sweden)

    Purificación Sánchez Hernández

    2002-04-01

    Full Text Available Grammars and dictionaries usually offer relevant and accurate information to students of a second language. However, the meaning of a textual element is often dynamic and that information is not always based on real usage patterns. New occurrences on the object level in new contexts can introduce novel semantic potentials, so that existing interpretations may be superseded by new ones. Concordancing has been shown to be one of the most important tools to facilitate the understanding of the usage patterns of a language. In this paper we examine the differences between amount, quantity and body as terms expressing magnitude, sum and size in a corpus of Biology. According to some popular dictionaries and grammars, the terms amount and quantity have always been considered synonymous terms for expressing magnitude, size and sum. We demonstrate that, according to our records, they cannot be always used as synonymous terms since they have different patterns of usage. On the other hand there are other forms, such as body, that appear in our Corpus, implying magnitude, size and sum, that are not usually described as having such meanings in dictionaries.

  5. Textual, Genre and Social Features of Spoken Grammar: A Corpus-Based Approach

    Directory of Open Access Journals (Sweden)

    Carmen Pérez-Llantada

    2009-02-01

    Full Text Available This paper describes a corpus-based approach to teaching and learning spoken grammar for English for Academic Purposes with reference to Bhatia’s (2002 multi-perspective model for discourse analysis: a textual perspective, a genre perspective and a social perspective. From a textual perspective, corpus-informed instruction helps students identify grammar items through statistical frequencies, collocational patterns, context-sensitive meanings and discoursal uses of words. From a genre perspective, corpus observation provides students with exposure to recurrent lexico-grammatical patterns across different academic text types (genres. From a social perspective, corpus models can be used to raise learners’ awareness of how speakers’ different discourse roles, discourse privileges and power statuses are enacted in their grammar choices. The paper describes corpus-based instructional procedures, gives samples of learners’ linguistic output, and provides comments on the students’ response to this method of instruction. Data resulting from the assessment process and student production suggest that corpus-informed instruction grounded in Bhatia’s multi-perspective model can constitute a pedagogical approach in order to i obtain positive student responses from input and authentic samples of grammar use, ii help students identify and understand the textual, genre and social aspects of grammar in real contexts of use, and therefore iii help develop students’ ability to use grammar accurately and appropriately.

  6. Priming sentence planning

    OpenAIRE

    Konopka, A.; Meyer, A.

    2014-01-01

    Sentence production requires mapping preverbal messages onto linguistic structures. Because sentences are normally built incrementally, the information encoded in a sentence-initial increment is critical for explaining how the mapping process starts and for predicting its timecourse. Two experiments tested whether and when speakers prioritize encoding of different types of information at the outset of formulation by comparing production of descriptions of transitive events (e.g., A dog is cha...

  7. Computer simulation as an important approach to explore language universal. Comment on "Dependency distance: a new perspective on syntactic patterns in natural languages" by Haitao Liu et al.

    Science.gov (United States)

    Lu, Qian

    2017-07-01

    Exploring language universal is one of the major goals of linguistic researches, which are largely devoted to answering the ;Platonic questions; in linguistics, that is, what is the language knowledge, how to get and use this knowledge. However, if solely guided by linguistic intuition, it is very difficult for syntactic studies to answer these questions, or to achieve abstractions in the scientific sense. This suggests that linguistic analyses based on the probability theory may provide effective ways to investigate into language universals in terms of biological motivations or cognitive psychological mechanisms. With the view that ;Language is a human-driven system;, Liu, Xu & Liang's review [1] pointed out that dependency distance minimization (DDM), which has been corroborated by big data analysis of corpus, may be a language universal shaped in language evolution, a universal that has profound effect on syntactic patterns.

  8. The suspended sentence in French Criminal Law

    Directory of Open Access Journals (Sweden)

    Jovašević Dragan

    2016-01-01

    Full Text Available From the ancient times until today, criminal law has provided different criminal sanctions as measures of social control. These coercive measures are imposed on the criminal offender by the competent court and aimed at limitting the offender's rights and freedoms or depriving the offender of certain rights and freedoms. These sanctions are applied to the natural or legal persons who violate the norms of the legal order and injure or endanger other legal goods that enjoy legal protection. In order to effectively protect social values, criminal legislations in all countries predict a number of criminal sanctions. These are: 1 imprisonment, 2 precautions, 3 safety measures, 4 penalties for juveniles, and 5 sanctions for legal persons. Apart and instead of punishment, warning measures have a significant role in the jurisprudence. Since they emerged in the early 20th century in the system of criminal sanctions, there has been an increase in their application to criminal offenders, especially when it comes to first-time offenders who committed a negligent or accidental criminal act. Warnings are applied in case of crimes that do not have serious consequences, and whose perpetrators are not hardened and incorrigible criminals. All contemporary criminal legislations (including the French legilation provide a warning measure of suspended sentence. Suspended sentence is a conditional stay of execution of sentence of imprisonment for a specified time, provided that the convicted person does not commit another criminal offense and fulfills other obligations. This sanction applies if the following two conditions are fulfilled: a forma! -which is attached to the sentence of imprisonment; and b material -which is the court assessment that the application of this sanction is justified and necessary in a particular case. In many modern criminal legislations, there are two different types of suspended (conditional sentence: 1 ordinary (classical suspended

  9. EEG Correlates of Song Prosody: A New Look at the Relationship between Linguistic and Musical Rhythm

    Science.gov (United States)

    Gordon, Reyna L.; Magne, Cyrille L.; Large, Edward W.

    2011-01-01

    Song composers incorporate linguistic prosody into their music when setting words to melody, a process called “textsetting.” Composers tend to align the expected stress of the lyrics with strong metrical positions in the music. The present study was designed to explore the idea that temporal alignment helps listeners to better understand song lyrics by directing listeners’ attention to instances where strong syllables occur on strong beats. Three types of textsettings were created by aligning metronome clicks with all, some or none of the strong syllables in sung sentences. Electroencephalographic recordings were taken while participants listened to the sung sentences (primes) and performed a lexical decision task on subsequent words and pseudowords (targets, presented visually). Comparison of misaligned and well-aligned sentences showed that temporal alignment between strong/weak syllables and strong/weak musical beats were associated with modulations of induced beta and evoked gamma power, which have been shown to fluctuate with rhythmic expectancies. Furthermore, targets that followed well-aligned primes elicited greater induced alpha and beta activity, and better lexical decision task performance, compared with targets that followed misaligned and varied sentences. Overall, these findings suggest that alignment of linguistic stress and musical meter in song enhances musical beat tracking and comprehension of lyrics by synchronizing neural activity with strong syllables. This approach may begin to explain the mechanisms underlying the relationship between linguistic and musical rhythm in songs, and how rhythmic attending facilitates learning and recall of song lyrics. Moreover, the observations reported here coincide with a growing number of studies reporting interactions between the linguistic and musical dimensions of song, which likely stem from shared neural resources for processing music and speech. PMID:22144972

  10. EEG correlates of song prosody: A new look at the relationship between linguistic and musical rhythm

    Directory of Open Access Journals (Sweden)

    Reyna L Gordon

    2011-11-01

    Full Text Available Song composers incorporate linguistic prosody into their music when setting words to melody, a process called textsetting. Composers tend to align the expected stress of the lyrics with strong metrical positions in the music. The present study was designed to explore the idea that temporal alignment helps listeners to better understand song lyrics by directing listeners’ attention to instances where strong syllables occur on strong beats. Three types of textsettings were created by aligning metronome clicks with all, some or none of the strong syllables in sung sentences. Electroencephalographic (EEG recordings were taken while participants listened to the sung sentences (primes and performed a lexical decision task on subsequent words and pseudowords (targets, presented visually. Comparison of misaligned and well-aligned sentences showed that temporal alignment between strong/weak syllables and strong/weak musical beats were associated with modulations of induced beta and evoked gamma power, which have been shown to fluctuate with rhythmic expectancies. Furthermore, targets that followed well-aligned primes elicited greater induced alpha and beta activity, and better lexical decision task performance, compared with targets that followed misaligned and varied sentences. Overall, these findings suggest that alignment of linguistic stress and musical meter in song enhances musical beat tracking and comprehension of lyrics by synchronizing neural activity with strong syllables. This approach may begin to explain the mechanisms underlying the relationship between linguistic and musical rhythm in songs, and how rhythmic attending facilitates learning and recall of song lyrics. Moreover, the observations reported here coincide with a growing number of studies reporting interactions between the linguistic and musical dimensions of song, which likely stem from shared neural resources for processing music and speech.

  11. EEG Correlates of Song Prosody: A New Look at the Relationship between Linguistic and Musical Rhythm.

    Science.gov (United States)

    Gordon, Reyna L; Magne, Cyrille L; Large, Edward W

    2011-01-01

    Song composers incorporate linguistic prosody into their music when setting words to melody, a process called "textsetting." Composers tend to align the expected stress of the lyrics with strong metrical positions in the music. The present study was designed to explore the idea that temporal alignment helps listeners to better understand song lyrics by directing listeners' attention to instances where strong syllables occur on strong beats. Three types of textsettings were created by aligning metronome clicks with all, some or none of the strong syllables in sung sentences. Electroencephalographic recordings were taken while participants listened to the sung sentences (primes) and performed a lexical decision task on subsequent words and pseudowords (targets, presented visually). Comparison of misaligned and well-aligned sentences showed that temporal alignment between strong/weak syllables and strong/weak musical beats were associated with modulations of induced beta and evoked gamma power, which have been shown to fluctuate with rhythmic expectancies. Furthermore, targets that followed well-aligned primes elicited greater induced alpha and beta activity, and better lexical decision task performance, compared with targets that followed misaligned and varied sentences. Overall, these findings suggest that alignment of linguistic stress and musical meter in song enhances musical beat tracking and comprehension of lyrics by synchronizing neural activity with strong syllables. This approach may begin to explain the mechanisms underlying the relationship between linguistic and musical rhythm in songs, and how rhythmic attending facilitates learning and recall of song lyrics. Moreover, the observations reported here coincide with a growing number of studies reporting interactions between the linguistic and musical dimensions of song, which likely stem from shared neural resources for processing music and speech.

  12. Options for Energy Upgrade of the Hall B Tagger

    International Nuclear Information System (INIS)

    H. Crannell; D. Sober

    1998-01-01

    Four options for an energy upgrade of the present Hall-B Tagger have been considered. These are: (1) Boost the magnetic field in the present Tagger; (2) Replace the Tagger with a new Tagger system; (3) Install a pre-Tagger magnetic chacain, and (4) Use the present Tagger as part of the beam dump. In this document each of these options is described briefly, the advantages and disadvantages of each are presented, and a very rough cost of implementation is suggested

  13. When novel sentences spoken or heard for the first time in the history of the universe are not enough: toward a dual-process model of language.

    Science.gov (United States)

    Van Lancker Sidtis, Diana

    2004-01-01

    Although interest in the language sciences was previously focused on newly created sentences, more recently much attention has turned to the importance of formulaic expressions in normal and disordered communication. Also referred to as formulaic expressions and made up of speech formulas, idioms, expletives, serial and memorized speech, slang, sayings, clichés, and conventional expressions, non-propositional language forms a large proportion of every speaker's competence, and may be differentially disturbed in neurological disorders. This review aims to examine non-propositional speech with respect to linguistic descriptions, psycholinguistic experiments, sociolinguistic studies, child language development, clinical language disorders, and neurological studies. Evidence from numerous sources reveals differentiated and specialized roles for novel and formulaic verbal functions, and suggests that generation of novel sentences and management of prefabricated expressions represent two legitimate and separable processes in language behaviour. A preliminary model of language behaviour that encompasses unitary and compositional properties and their integration in everyday language use is proposed. Integration and synchronizing of two disparate processes in language behaviour, formulaic and novel, characterizes normal communicative function and contributes to creativity in language. This dichotomy is supported by studies arising from other disciplines in neurology and psychology. Further studies are necessary to determine in what ways the various categories of formulaic expressions are related, and how these categories are processed by the brain. Better understanding of how non-propositional categories of speech are stored and processed in the brain can lead to better informed treatment strategies in language disorders.

  14. From a dictionary to an atlas: segmental mapping of the linguistic picture of the world

    Directory of Open Access Journals (Sweden)

    Sergey N. Gagarin

    2014-01-01

    Full Text Available 'At the heart of any language lies a vision. It embraces the world around us in myriads of complex ways. It is the lifeblood of every people's identity. It is so essential and indispensable that few assets of humankind can rival it for value or timelessness. It is known as the linguistic picture of the world, and it is notorious for being among the knottiest study subjects of language science. No coherent methodology has been proposed to date as to how it should be consistently structured to result in a systemic and navigable map of its core words and concepts. This constitutes a conspicuous gap in contemporary linguistics, which the present article addresses from the perspective of cognitive lexicology and lexicography while engaging the linguistic picture of the world on a segment-by-segment basis. In keeping with the aforesaid approach, one segment at a time is selected, and the discourse that reflects it is analysed with a view to identifying transcendental notions contained therein. The latter are construed as a type of cognitive concepts which epitomise the core ideas inherent in a particular type of spoken or written discourse. Being verbalised by means of relevant verbal fields, these transcendental notions permeate the cognitive and textual fabric of the selected segment of a linguistic picture of the world. By way of demonstrating the feasibility of this approach, a new type of dictionary has been compiled by the author, which captures and reveals in a semantically structured way the verbal side of the transcendental notion "countering" in the socio-political discourse of English-language media. Along with other transcendental notions, such as "facilitation", "communication", "attitude", etc., it is viewed as part of a range of the cognitive pillars which are essential to a limited segment of a linguistic picture of the world, but are by no means reserved to it, stretching far beyond and reaching throughout the vision of the world

  15. Automated Linguistic Personality Description and Recognition Methods

    Directory of Open Access Journals (Sweden)

    Danylyuk Illya

    2016-12-01

    Full Text Available Background: The relevance of our research, above all, is theoretically motivated by the development of extraordinary scientific and practical interest in the possibilities of language processing of huge amount of data generated by people in everyday professional and personal life in the electronic forms of communication (e-mail, sms, voice, audio and video blogs, social networks, etc.. Purpose: The purpose of the article is to describe the theoretical and practical framework of the project "Communicative-pragmatic and discourse-grammatical lingvopersonology: structuring linguistic identity and computer modeling". The description of key techniques is given, such as machine learning for language modeling, speech synthesis, handwriting simulation. Results: Lingvopersonology developed some great theoretical foundations, its methods, tools, and significant achievements let us predict that the newest promising trend is a linguistic identity modeling by means of information technology, including language. We see three aspects of the modeling: 1 modeling the semantic level of linguistic identity – by means of the use of corpus linguistics; 2 sound level formal modeling of linguistic identity – with the help of speech synthesis; 3 formal graphic level modeling of linguistic identity – with the help of image synthesis (handwriting. For the first case, we suppose to use machine learning technics and vector-space (word2vec algorithm for textual speech modeling. Hybrid CUTE method for personality speech modeling will be applied to the second case. Finally, trained with the person handwriting images neural network can be an instrument for the last case. Discussion: The project "Communicative-pragmatic, discourse, and grammatical lingvopersonology: structuring linguistic identity and computer modeling", which is implementing by the Department of General and Applied Linguistics and Slavonic philology, selected a task to model Yuriy Shevelyov (Sherekh

  16. GBParsy: A GenBank flatfile parser library with high speed

    Directory of Open Access Journals (Sweden)

    Kim Yeon-Ki

    2008-07-01

    Full Text Available Abstract Background GenBank flatfile (GBF format is one of the most popular sequence file formats because of its detailed sequence features and ease of readability. To use the data in the file by a computer, a parsing process is required and is performed according to a given grammar for the sequence and the description in a GBF. Currently, several parser libraries for the GBF have been developed. However, with the accumulation of DNA sequence information from eukaryotic chromosomes, parsing a eukaryotic genome sequence with these libraries inevitably takes a long time, due to the large GBF file and its correspondingly large genomic nucleotide sequence and related feature information. Thus, there is significant need to develop a parsing program with high speed and efficient use of system memory. Results We developed a library, GBParsy, which was C language-based and parses GBF files. The parsing speed was maximized by using content-specified functions in place of regular expressions that are flexible but slow. In addition, we optimized an algorithm related to memory usage so that it also increased parsing performance and efficiency of memory usage. GBParsy is at least 5 - 100× faster than current parsers in benchmark tests. Conclusion GBParsy is estimated to extract annotated information from almost 100 Mb of a GenBank flatfile for chromosomal sequence information within a second. Thus, it should be used for a variety of applications such as on-time visualization of a genome at a web site.

  17. Relative Weighting of Semantic and Syntactic Cues in Native and Non-Native Listeners' Recognition of English Sentences.

    Science.gov (United States)

    Shi, Lu-Feng; Koenig, Laura L

    2016-01-01

    Non-native listeners do not recognize English sentences as effectively as native listeners, especially in noise. It is not entirely clear to what extent such group differences arise from differences in relative weight of semantic versus syntactic cues. This study quantified the use and weighting of these contextual cues via Boothroyd and Nittrouer's j and k factors. The j represents the probability of recognizing sentences with or without context, whereas the k represents the degree to which context improves recognition performance. Four groups of 13 normal-hearing young adult listeners participated. One group consisted of native English monolingual (EMN) listeners, whereas the other three consisted of non-native listeners contrasting in their language dominance and first language: English-dominant Russian-English, Russian-dominant Russian-English, and Spanish-dominant Spanish-English bilinguals. All listeners were presented three sets of four-word sentences: high-predictability sentences included both semantic and syntactic cues, low-predictability sentences included syntactic cues only, and zero-predictability sentences included neither semantic nor syntactic cues. Sentences were presented at 65 dB SPL binaurally in the presence of speech-spectrum noise at +3 dB SNR. Listeners orally repeated each sentence and recognition was calculated for individual words as well as the sentence as a whole. Comparable j values across groups for high-predictability, low-predictability, and zero-predictability sentences suggested that all listeners, native and non-native, utilized contextual cues to recognize English sentences. Analysis of the k factor indicated that non-native listeners took advantage of syntax as effectively as EMN listeners. However, only English-dominant bilinguals utilized semantics to the same extent as EMN listeners; semantics did not provide a significant benefit for the two non-English-dominant groups. When combined, semantics and syntax benefitted EMN

  18. Initial evaluation of an interactive test of sentence gist recognition.

    Science.gov (United States)

    Tye-Murray, N; Witt, S; Castelloe, J

    1996-12-01

    The laser videodisc-based Sentence Gist Recognition (SGR) test consists of sets of topically related sentences that are cued by short film clips. Clients respond to test items by selecting picture illustrations and may interact with the talker by using repair strategies when they do not recognize a test item. The two experiments, involving 40 and 35 adult subjects, respectively, indicated that the SGR may better predict subjective measures of speechreading and listening performance than more traditional audiologic sentence and nonsense syllable tests. Data from cochlear implant users indicated that the SGR accounted for a greater percentage of the variance for selected items of the Communication Profile for the Hearing-Impaired and the Speechreading Questionnaire for Cochlear-Implant Users than two other audiologic tests. As in previous work, subjects were most apt to ask the talker to repeat an utterance that they did not recognize than to ask the talker to restructure it. It is suggested that the SGR may reflect the interactive nature of conversation and provide a simulated real-world listening and/or speechreading task. The principles underlaying this test are consistent with the development of other computer technologies and concepts, such as compact discinteractive and virtual reality.

  19. Exploring the Types of SMEs Which Could use Blogs as a Marketing Tool: a Proposed Future Research Agenda

    Directory of Open Access Journals (Sweden)

    Adeline Phaik Harn Chua

    2009-08-01

    Full Text Available Blogs appear to be gaining momentum as a marketing tool which can be used by organisations for such strategies and processes as branding, managing reputation, developing customer trust and loyalty, niche marketing, gathering marketing intelligence and promoting their online presence. There has been limited academic research in this area, and most significantly concerning the types of small and medium enterprises (SMEs for which blogs might have potential as a marketing tool. In an attempt to address the knowledge gap, this paper presents a future research agenda (in the form of research questions which can guide the eBusiness research community in conducting much needed studies in this area. This paper is particularly novel in that it aims to demonstrate how the heterogeneity of SMEs and their specific business uses of eBusiness technology such as blogs can form the central plank of a future research agenda. This is important because the existing eBusiness literature tends to treat eBusiness collectively rather than focusing on the specific business uses of different eBusiness technologies, and to treat SMEs as a homogeneous group. The paper concludes with a discussion of how this research agenda can form the basis of studies which use a range of different research methods, and how this "big picture" agenda approach might help the eBusiness research community build theory which better explains SME adoption and use of eBusiness.

  20. A corpus-based approach to generalising a chatbot system

    OpenAIRE

    Abu Shawar, Bayan; Atwell, Eric

    2003-01-01

    International research in NLP is dominated by work on English. NLP techniques and systems can be ported to other natural languages, but this is generally a labour-intensive task, requiring scarce computational and linguistic expertise; hence minority languages are poorly represented in NLP technology. We present an automated approach to porting an NLP technology, the AIML-based chatbot, to new languages, by using a corpus in the target language to retrain the chatbot. We have s...

  1. A new tagger for the charge identification of b-jets

    CERN Document Server

    The ATLAS collaboration

    2015-01-01

    A newly developed algorithm called the jet vertex charge tagger, aimed at identifying the sign of the charge of jets containing $b$-hadrons, referred to as $b$-jets, is presented in this note. In addition to the well established track-based jet charge determination, this algorithm introduces the so-called jet vertex charge reconstruction, which exploits the charge information associated to the displaced vertices within the jet. Furthermore, the charge of a soft muon contained in the jet is taken into account when available. All available information is combined into a multivariate discriminator produced using an artificial neural network method. The algorithm has been developed on jets matched to truth level $b$-hadrons provided by $t\\bar{t}$ events simulated at $\\sqrt{s}$ = 13 TeV using the full ATLAS detector simulation and reconstruction.

  2. Sentence connexion and global text structures: a case study of a political text, English leader article

    OpenAIRE

    Stein, Dieter; Mattei, Adriana

    1993-01-01

    The paper first gives a brief overview of the history and theoretical status of discourse analysis, or "text linguistics." The main body of the paper consists of a detailed analysis of sentence connexion, i.e. the logical relationship between sentences and larger chunks of text, performed on a newspaper leader article. The results of this local analysis are then related to the global organisation of text structure with components such as macro- and super-structure by way of int...

  3. Translating Linguistic Jokes for Dubbing

    Directory of Open Access Journals (Sweden)

    Elena ALEKSANDROVA

    2012-01-01

    Full Text Available This study has attempted to establish the possible ways of translating linguistic jokes whendubbing. The study is also intended to identify the most problematic cases of screen translation andthe factors which cause these problems. In order to support such an approach a corpus of 7American and British films has been compiled, including as many as 16 as their various dubbingtranslations into Russian. In the films, almost 12 instances of original linguistic jokes have beenidentified.

  4. Meaningful questions: The acquisition of auxiliary inversion in a connectionist model of sentence production.

    Science.gov (United States)

    Fitz, Hartmut; Chang, Franklin

    2017-09-01

    Nativist theories have argued that language involves syntactic principles which are unlearnable from the input children receive. A paradigm case of these innate principles is the structure dependence of auxiliary inversion in complex polar questions (Chomsky, 1968, 1975, 1980). Computational approaches have focused on the properties of the input in explaining how children acquire these questions. In contrast, we argue that messages are structured in a way that supports structure dependence in syntax. We demonstrate this approach within a connectionist model of sentence production (Chang, 2009) which learned to generate a range of complex polar questions from a structured message without positive exemplars in the input. The model also generated different types of error in development that were similar in magnitude to those in children (e.g., auxiliary doubling, Ambridge, Rowland, & Pine, 2008; Crain & Nakayama, 1987). Through model comparisons we trace how meaning constraints and linguistic experience interact during the acquisition of auxiliary inversion. Our results suggest that auxiliary inversion rules in English can be acquired without innate syntactic principles, as long as it is assumed that speakers who ask complex questions express messages that are structured into multiple propositions. Copyright © 2017 Elsevier B.V. All rights reserved.

  5. When Is a Failure to Replicate Not a Type II Error?

    Science.gov (United States)

    Vasconcelos, Marco; Urcuioli, Peter J.; Lionello-DeNolf, Karen M.

    2007-01-01

    Zentall and Singer (2007) challenge our conclusion that the work-ethic effect reported by Clement, Feltus, Kaiser, and Zentall (2000) may have been a Type I error by arguing that (a) the effect has been extensively replicated and (b) the amount of overtraining our pigeons received may not have been sufficient to produce it. We believe that our…

  6. Applying corpus linguistics methodology to psycholinguistics research Aplicando a metodologia da linguística de corpus à pesquisa psicolinguística

    Directory of Open Access Journals (Sweden)

    Luciane Corrêa Ferreira

    2010-01-01

    Full Text Available This study concerns the use of corpus linguistics methodology in psycholinguistics research. Ten linguistic metaphors were selected from English and American newspapers. After that, we identified the underlying conceptual metaphor based on the conceptual metaphor inventory by Lakoff and Johnson (1980, 1999. We seek to investigate what sort of knowledge EFL-learners use when trying to understand a linguistic metaphor. We examined how EFL-learners comprehend linguistic metaphors, firstly without using the context and then using the context. The sample comprised 221 Brazilian students and 16 American students at UCSC. We have also carried out an empirical research using WebCorp.Este estudo investiga o uso de metodologia de lingüística de corpus na pesquisa psicolingüística. Selecionaram-se dez metáforas em jornais ingleses e norte-americanos. Depois, identificou-se a metáfora conceptual subjacente com base no inventário de metáforas conceptuais apresentado por Lakoff e Johnson (1980, 1999. Buscou-se investigar que tipo de conhecimento os aprendizes de LE empregam ao tentar compreender metáforas. Analisou-se a maneira como os aprendizes de LE compreendem metáforas lingüísticas sem usar o contexto e, depois, utilizando o contexto. A amostra incluiu 221 estudantes brasileiros e 16 estudantes norte-americanos da UCSC.

  7. THE FUNCTION OF SIMPLE SENTENCE BETWEEN ALBANIAN AND ENGLISH

    OpenAIRE

    Shkelqim Millaku

    2017-01-01

    In Albanian and English we have same kind of sentences (simple, compound or complex sentence). The major of elements or constituents that can be found in clauses are subject, predicate, object, complement etc. For Albanian and English most linguists agree on the needs to recognize at least the following word classes: noun, verb, adjective, preposition, adverb, determinative and conjunction. Each of these words classes is illustrated in the sentence below. The noun or noun phrase can be subjec...

  8. Evidence in clinical reasoning: a computational linguistics analysis of 789,712 medical case summaries 1983-2012.

    Science.gov (United States)

    Seidel, Bastian M; Campbell, Steven; Bell, Erica

    2015-03-21

    Better understanding of clinical reasoning could reduce diagnostic error linked to 8% of adverse medical events and 30% of malpractice cases. To a greater extent than the evidence-based movement, the clinical reasoning literature asserts the importance of practitioner intuition—unconscious elements of diagnostic reasoning. The study aimed to analyse the content of case report summaries in ways that explored the importance of an evidence concept, not only in relation to research literature but also intuition. The study sample comprised all 789,712 abstracts in English for case reports contained in the database PUBMED for the period 1 January 1983 to 31 December 2012. It was hypothesised that, if evidence and intuition concepts were viewed by these clinical authors as essential to understanding their case reports, they would be more likely to be found in the abstracts. Computational linguistics software was used in 1) concept mapping of 21,631,481 instances of 201 concepts, and 2) specific concept analyses examining 200 paired co-occurrences for 'evidence' and research 'literature' concepts. 'Evidence' is a fundamentally patient-centred, intuitive concept linked to less common concepts about underlying processes, suspected disease mechanisms and diagnostic hunches. In contrast, the use of research literature in clinical reasoning is linked to more common reasoning concepts about specific knowledge and descriptions or presenting features of cases. 'Literature' is by far the most dominant concept, increasing in relevance since 2003, with an overall relevance of 13% versus 5% for 'evidence' which has remained static. The fact that the least present types of reasoning concepts relate to diagnostic hunches to do with underlying processes, such as what is suspected, raises questions about whether intuitive practitioner evidence-making, found in a constellation of dynamic, process concepts, has become less important. The study adds support to the existing corpus of

  9. Capabilities and Equality of Health II

    DEFF Research Database (Denmark)

    Keiding, Hans

    The concept of capabilities, introduced originally by Sen, has inspired many researchers but has not found any simple formal representation which might be instrumental in the construction of a comprehensive theory of equality. In a previous paper (Keiding, 2005), we investigated whether preferences...... of the capability approach to questions of health or equality. In the present paper we extend the notion of rationalizing orderings of capabilities to a dynamical context, in the sense that the utility function is not yet revealed to the individual at the time when the capabilities are ordered. It turns out...... over capabilities as sets of functionings can be rationalized by maximization of a suitable utility function over the set of functionings. Such a rationalization turned out to be possible only in cases which must be considered exceptional and which do not allowfor interesting applications...

  10. A simple DOP model for constituency parsing of Italian sentences

    NARCIS (Netherlands)

    Sangati, F.

    2009-01-01

    We present a simplified Data-Oriented Parsing (DOP) formalism for learning the constituency structure of Italian sentences. In our approach we try to simplify the original DOP methodology by constraining the number and type of fragments we extract from the training corpus. We provide some examples

  11. The evolution of the Faculty of Language from a Chomskyan perspective: bridging linguistics and biology.

    Science.gov (United States)

    Longa, Victor Manuel

    2013-01-01

    While language was traditionally considered a purely cultural trait, the advent of Noam Chomsky's Generative Grammar in the second half of the twentieth century dramatically challenged that view. According to that theory, language is an innate feature, part of the human biological endowment. If language is indeed innate, it had to biologically evolve. This review has two main objectives: firstly, it characterizes from a Chomskyan perspective the evolutionary processes by which language could have come into being. Secondly, it proposes a new method for interpreting the archaeological record that radically differs from the usual types of evidence Paleoanthropology has concentrated on when dealing with language evolution: while archaeological remains have usually been regarded from the view of the behavior they could be associated with, the paper will consider archaeological remains from the view of the computational processes and capabilities at work for their production. This computational approach, illustrated with a computational analysis of prehistoric geometric engravings, will be used to challenge the usual generative thinking on language evolution, based on the high specificity of language. The paper argues that the biological machinery of language is neither specifically linguistic nor specifically human, although language itself can still be considered a species-specific innate trait. From such a view, language would be one of the consequences of a slight modification operated on an ancestral architecture shared with vertebrates.

  12. A unified approach for development of Urdu Corpus for OCR and demographic purpose

    Science.gov (United States)

    Choudhary, Prakash; Nain, Neeta; Ahmed, Mushtaq

    2015-02-01

    This paper presents a methodology for the development of an Urdu handwritten text image Corpus and application of Corpus linguistics in the field of OCR and information retrieval from handwritten document. Compared to other language scripts, Urdu script is little bit complicated for data entry. To enter a single character it requires a combination of multiple keys entry. Here, a mixed approach is proposed and demonstrated for building Urdu Corpus for OCR and Demographic data collection. Demographic part of database could be used to train a system to fetch the data automatically, which will be helpful to simplify existing manual data-processing task involved in the field of data collection such as input forms like Passport, Ration Card, Voting Card, AADHAR, Driving licence, Indian Railway Reservation, Census data etc. This would increase the participation of Urdu language community in understanding and taking benefit of the Government schemes. To make availability and applicability of database in a vast area of corpus linguistics, we propose a methodology for data collection, mark-up, digital transcription, and XML metadata information for benchmarking.

  13. INREM II: a computer implementation of recent models for estimating the dose equivalent to organs of man from an inhaled or ingested radionuclide

    International Nuclear Information System (INIS)

    Killough, G.G.; Dunning, D.E. Jr.; Pleasant, J.C.

    1978-01-01

    This report describes a computer code, INREM II, which calculates the internal radiation dose equivalent to organs of man which results from the intake of a radionuclide by inhalation or ingestion. Deposition and removal of radioactivity from the respiratory tract is represented by the ICRP Task Group Lung Model. A four-segment catenary model of the GI tract is used to estimate movement of radioactive material that is ingested or swallowed after being cleared from the respiratory tract. Retention of radioactivity in other organs is specified by linear combinations of decaying exponential functions. The formation and decay of radioactive daughters is treated explicitly, with each radionuclide species in the chain having its own uptake and retention parameters, as supplied by the user. The dose equivalent to a target organ is computed as the sum of contributions from each source organ in which radioactivity is assumed to be situated. This calculation utilizes a matrix of S-factors (rem/μCi-day) supplied by the user for the particular choice of source and target organs. Output permits the evaluation of crossfire components of dose when penetrating radiations are present. INREM II is coded in FORTRAN IV and has been compiled and executed on an IBM-360 computer

  14. Are computers effective lie detectors? A meta-analysis of linguistic cues to deception.

    Science.gov (United States)

    Hauch, Valerie; Blandón-Gitlin, Iris; Masip, Jaume; Sporer, Siegfried L

    2015-11-01

    This meta-analysis investigates linguistic cues to deception and whether these cues can be detected with computer programs. We integrated operational definitions for 79 cues from 44 studies where software had been used to identify linguistic deception cues. These cues were allocated to six research questions. As expected, the meta-analyses demonstrated that, relative to truth-tellers, liars experienced greater cognitive load, expressed more negative emotions, distanced themselves more from events, expressed fewer sensory-perceptual words, and referred less often to cognitive processes. However, liars were not more uncertain than truth-tellers. These effects were moderated by event type, involvement, emotional valence, intensity of interaction, motivation, and other moderators. Although the overall effect size was small, theory-driven predictions for certain cues received support. These findings not only further our knowledge about the usefulness of linguistic cues to detect deception with computers in applied settings but also elucidate the relationship between language and deception. © 2014 by the Society for Personality and Social Psychology, Inc.

  15. A radioligand immunoassay for 1,25-dihydroxyvitamin D3 receptors using monoclonal antibody: detection of a phenotypic receptor variant in vitamin D-dependency rickets (type II) which does not bind hormone

    International Nuclear Information System (INIS)

    Pike, J.W.; Dokoh, Shigeharu; Liberman, U.A.; Eil, C.; Haussler, M.R.; Marx, S.J.

    1984-01-01

    Vitamin D-dependency rickets, type II (VDDRII), is a well recognized heritable disorder characterized by peripheral target organ resistance to 1,25-dihydroxyvitamin D 3 (1,25(OH) 2 D 3 ), the hormonally active form of the vitamin. Recently, cultured skin fibroblasts obtained from a number of patients with VDDRII have been utilized to characterize the underlying molecular defects associated with this malady. Recently monoclonal antibodies to the vitamin D receptor have been generated, and a radioligand immunoassay (RLIA) for the detection of this molecule has been developed which is independent of its hormone-binding capacity. This report describes the application of the immunoassay in the detection of receptor-like molecules in fibroblasts derived from patients with VDDRII. The results indicate that the molecule is generally present in all patients, and provides a mechanism for individual responsiveness to pharmacologic treatment with vitamin D 3 metabolites. 8 refs.; 3 figs.; 1 table

  16. Toward a universal decoder of linguistic meaning from brain activation.

    Science.gov (United States)

    Pereira, Francisco; Lou, Bin; Pritchett, Brianna; Ritter, Samuel; Gershman, Samuel J; Kanwisher, Nancy; Botvinick, Matthew; Fedorenko, Evelina

    2018-03-06

    Prior work decoding linguistic meaning from imaging data has been largely limited to concrete nouns, using similar stimuli for training and testing, from a relatively small number of semantic categories. Here we present a new approach for building a brain decoding system in which words and sentences are represented as vectors in a semantic space constructed from massive text corpora. By efficiently sampling this space to select training stimuli shown to subjects, we maximize the ability to generalize to new meanings from limited imaging data. To validate this approach, we train the system on imaging data of individual concepts, and show it can decode semantic vector representations from imaging data of sentences about a wide variety of both concrete and abstract topics from two separate datasets. These decoded representations are sufficiently detailed to distinguish even semantically similar sentences, and to capture the similarity structure of meaning relationships between sentences.

  17. A computational linguistics motivated mapping of ICPC-2 PLUS to SNOMED CT.

    Science.gov (United States)

    Wang, Yefeng; Patrick, Jon; Miller, Graeme; O'Hallaran, Julie

    2008-10-27

    A great challenge in sharing data across information systems in general practice is the lack of interoperability between different terminologies or coding schema used in the information systems. Mapping of medical vocabularies to a standardised terminology is needed to solve data interoperability problems. We present a system to automatically map an interface terminology ICPC-2 PLUS to SNOMED CT. Three steps of mapping are proposed in this system. The UMLS metathesaurus mapping utilises explicit relationships between ICPC-2 PLUS and SNOMED CT terms in the UMLS library to perform the first stage of the mapping. Computational linguistic mapping uses natural language processing techniques and lexical similarities for the second stage of mapping between terminologies. Finally, the post-coordination mapping allows one ICPC-2 PLUS term to be mapped into an aggregation of two or more SNOMED CT terms. A total 5,971 of all 7,410 ICPC-2 terms (80.58%) were mapped to SNOMED CT using the three stages but with different levels of accuracy. UMLS mapping achieved the mapping of 53.0% ICPC2 PLUS terms to SNOMED CT with the precision rate of 96.46% and overall recall rate of 44.89%. Lexical mapping increased the result to 60.31% and post-coordination mapping gave an increase of 20.27% in mapped terms. A manual review of a part of the mapping shows that the precision of lexical mappings is around 90%. The accuracy of post-coordination has not been evaluated yet. Unmapped terms and mismatched terms are due to the differences in the structures between ICPC-2 PLUS and SNOMED CT. Terms contained in ICPC-2 PLUS but not in SNOMED CT caused a large proportion of the failures in the mappings. Mapping terminologies to a standard vocabulary is a way to facilitate consistent medical data exchange and achieve system interoperability and data standardisation. Broad scale mapping cannot be achieved by any single method and methods based on computational linguistics can be very useful for the task

  18. Note related to the elaboration of a coding by key sentences for the programming of a document automatic selection system

    International Nuclear Information System (INIS)

    Leroy, A.; Braffort, P.

    1959-01-01

    This note deals with the providing of CEA documentalists with a tool for coding studies. The authors first discuss issues related to code selection criteria (author classification, topic classification, and so on), and propose an overview and a discussion of linguistic models. They also comment how diagrams illustrating relationships between words are built up, and propose a diagram representation example which includes different concepts such as conditions, properties, object, tools or processes (for example hardness for a steel, batch processing for a condition, or sintering for a process), and also the introduction of negation. Then, the authors address how basic concepts can be highlighted, describe how key sentences can be built up, and propose an example analysis in the case of a published article dealing with nuclear reactors (in this case, the study of a liquid-metal neutron absorber for the control of a gas-cooled power reactor). Perspectives of evolution are finally discussed

  19. Causal-implicative relationships in the Serbian hypotaxis (Complex of generative complex sentences

    Directory of Open Access Journals (Sweden)

    Vojvodić Dojčil P.

    2015-01-01

    Full Text Available The paper analyzes the causal-implicative relationships in the segmented complex sentences with a “closed” (asymmetric, mandatory bi-situational generative (conditional semantic structure on the corpus of the Serbian language. The generative (conditional semantic structure consists of meaningfully interconnected antecedents and consequents that are based on the principle of subordination. These sentences are characterized with a general causal link due to the specific implicative relationship between the segments that can be realized within dependent clauses with diverse categorically-differential semantics (i.e. causal, consecutive, final, conditional and concessive. The author reaches a conclusion that the given implicative relationships (P ⇒ Q, P ⇐ Q/Q ⇒ P, P ⇔Q, P ⇒ Q´ Δ P´ ⇒ Q represent, in fact, semantic invariants of generative complex sentences. It is pointed out that the structure, formation and functioning of these relationships (sentences are always determined by the interconnectedness of syntax and lexicon. They are based on a general causal adverbial meaning of the conjunctions in a subordinate clause, which are also used to determine the adverbial semantics of a sentence as a whole. The article discusses in particular the aspectual-temporal correlations that are realized in complex sentences with a generative structure. It has been noted very often in the literature that there is no differentiation made among all of the types of the hypothetical conditionality - real, eventual, potential and unreal. The paper analyzes taxis of simultaneity and succession (anteriority/posteriority of the main and subordinate clause predicates in conditional sentences as a special type of the relative-temporal relationships within the same temporal plan. In order to interpret these correlations, the Serbian data was compared to the data in Russian and Polish. It is noted that the Northern Slavic languages (in this case Russian and

  20. "gnparser": a powerful parser for scientific names based on Parsing Expression Grammar.

    Science.gov (United States)

    Mozzherin, Dmitry Y; Myltsev, Alexander A; Patterson, David J

    2017-05-26

    Scientific names in biology act as universal links. They allow us to cross-reference information about organisms globally. However variations in spelling of scientific names greatly diminish their ability to interconnect data. Such variations may include abbreviations, annotations, misspellings, etc. Authorship is a part of a scientific name and may also differ significantly. To match all possible variations of a name we need to divide them into their elements and classify each element according to its role. We refer to this as 'parsing' the name. Parsing categorizes name's elements into those that are stable and those that are prone to change. Names are matched first by combining them according to their stable elements. Matches are then refined by examining their varying elements. This two stage process dramatically improves the number and quality of matches. It is especially useful for the automatic data exchange within the context of "Big Data" in biology. We introduce Global Names Parser (gnparser). It is a Java tool written in Scala language (a language for Java Virtual Machine) to parse scientific names. It is based on a Parsing Expression Grammar. The parser can be applied to scientific names of any complexity. It assigns a semantic meaning (such as genus name, species epithet, rank, year of publication, authorship, annotations, etc.) to all elements of a name. It is able to work with nested structures as in the names of hybrids. gnparser performs with ≈99% accuracy and processes 30 million name-strings/hour per CPU thread. The gnparser library is compatible with Scala, Java, R, Jython, and JRuby. The parser can be used as a command line application, as a socket server, a web-app or as a RESTful HTTP-service. It is released under an Open source MIT license. Global Names Parser (gnparser) is a fast, high precision tool for biodiversity informaticians and biologists working with large numbers of scientific names. It can replace expensive and error

  1. The New Development of the Study of Discourse Anaphora ------Review of Discourse Anaphora: A Cognitive-Functional Approach

    Directory of Open Access Journals (Sweden)

    Meixia Li

    2012-09-01

    Full Text Available The English word anaphora is derived from the Greek word ἀναφορά, meaning carrying back. For a long time anaphora has been the object of research in a wide range of disciplines, such as rhetoric, philosophy, theoretical linguistics and so on. A great number of remarkable achievements have been made in these fields. In the 1970’s there was a “discourse turn” in the domain of the humanities and the social sciences, which marked the birth and flourishing of such cross-disciplines as psycholinguistics, computational linguistics, cognitive linguistics, corpus linguistics, discourse studies and so on, and which also paved the way for the turn of the study of anaphora from focusing on intrasentential anaphora to intersentential anaphora. Intrasentential anaphora refers to the relationship between a pronoun and its antecedent being contained within one sentence, while intersentential anaphora can also be called discourse anaphora, which refers to “the relationship between a pronoun and its antecedent earlier in the discourse” (Clark & Parikh, 2006, p. 1. From the late 20th century on, discourse anaphora has become one of the hot topics in several fields such as psychology, cognitive science, artificial intelligence, etc. Many fruitful research results (i.e. Huang, 2002; Clark & Parikh, 2006, etc. have been obtained. Ming-Ming Pu’s monograph Discourse Anaphora: A Cognitive-Functional Approach, published by LINCOM GmbH in Muenchen, Germany in 2011 is another important work of the study of discourse anaphora. In this book, the author first proposes a cognitive-functional model to account for how the construction of mental structures determines the use and resolution of discourse anaphora. Afterwards he does a comparative quantitative study of both English and Chinese empirical and text data, which demonstrates that on the one hand the occurrence and distribution of discourse anaphora is more universal in nature than language

  2. From exemplar to grammar: a probabilistic analogy-based model of language learning.

    Science.gov (United States)

    Bod, Rens

    2009-07-01

    While rules and exemplars are usually viewed as opposites, this paper argues that they form end points of the same distribution. By representing both rules and exemplars as (partial) trees, we can take into account the fluid middle ground between the two extremes. This insight is the starting point for a new theory of language learning that is based on the following idea: If a language learner does not know which phrase-structure trees should be assigned to initial sentences, s/he allows (implicitly) for all possible trees and lets linguistic experience decide which is the "best" tree for each sentence. The best tree is obtained by maximizing "structural analogy" between a sentence and previous sentences, which is formalized by the most probable shortest combination of subtrees from all trees of previous sentences. Corpus-based experiments with this model on the Penn Treebank and the Childes database indicate that it can learn both exemplar-based and rule-based aspects of language, ranging from phrasal verbs to auxiliary fronting. By having learned the syntactic structures of sentences, we have also learned the grammar implicit in these structures, which can in turn be used to produce new sentences. We show that our model mimicks children's language development from item-based constructions to abstract constructions, and that the model can simulate some of the errors made by children in producing complex questions. Copyright © 2009 Cognitive Science Society, Inc.

  3. A responsible agenda for applied linguistics: Confessions of a philosopher

    Directory of Open Access Journals (Sweden)

    Albert Weideman

    2011-08-01

    Full Text Available When we undertake academic, disciplinary work, we rely on philosophical starting points. Several straightforward illustrations of this can be found in the history of applied linguistics. It is evident from the history of our field that various historically influential approaches to our discipline base themselves upon different academic confessions. This paper examines the effects of basing our applied linguistic work on the idea that applied linguistics is a discipline concerned with design. Such a characterisation does justice to both modernist and postmodernist emphases in applied linguistics. Conceptualisations of applied linguistics that came with the proposals for communicative language teaching (CLT some thirty to forty years ago propelled the discipline squarely into postmodern times. To account for this, we need to develop a theory of applied linguistics which shows what constitutive and regulative conditions exist for doing applied linguistic designs. A responsible agenda for applied linguistics today has as its first responsibility to free the users of its designs from toil and drudgery, as well as from becoming victims of fashion, ideology or theory. Secondly, it should design solutions to language problems in such a way that the technical imagination of the designer is not restricted but supported by theory and empirical investigation, and that the productive pedagogical fantasy of the implementers of such plans is set free. Thirdly, it must seek to become accountable by designing theoretically and socially defensible solutions to language problems, solutions that relieve some of the suffering, pain, poverty and injustice in our world.

  4. A Computational Model of the SC Multisensory Neurons: Integrative Capabilities, Maturation, and Plasticity

    Directory of Open Access Journals (Sweden)

    Cristiano Cuppini

    2011-10-01

    Full Text Available Different cortical and subcortical structures present neurons able to integrate stimuli of different sensory modalities. Among the others, one of the most investigated integrative regions is the Superior Colliculus (SC, a midbrain structure whose aim is to guide attentive behaviour and motor responses toward external events. Despite the large amount of experimental data in the literature, the neural mechanisms underlying the SC response are not completely understood. Moreover, recent data indicate that multisensory integration ability is the result of maturation after birth, depending on sensory experience. Mathematical models and computer simulations can be of value to investigate and clarify these phenomena. In the last few years, several models have been implemented to shed light on these mechanisms and to gain a deeper comprehension of the SC capabilities. Here, a neural network model (Cuppini et al., 2010 is extensively discussed. The model considers visual-auditory interaction, and is able to reproduce and explain the main physiological features of multisensory integration in SC neurons, and their acquisition during postnatal life. To reproduce a neonatal condition, the model assumes that during early life: 1 cortical-SC synapses are present but not active; 2 in this phase, responses are driven by non-cortical inputs with very large receptive fields (RFs and little spatial tuning; 3 a slight spatial preference for the visual inputs is present. Sensory experience is modeled by a “training phase” in which the network is repeatedly exposed to modality-specific and cross-modal stimuli at different locations. As results, Cortical-SC synapses are crafted during this period thanks to the Hebbian rules of potentiation and depression, RFs are reduced in size, and neurons exhibit integrative capabilities to cross-modal stimuli, such as multisensory enhancement, inverse effectiveness, and multisensory depression. The utility of the modelling

  5. Sentence Syntax and Content in the Human Temporal Lobe: An fMRI Adaptation Study in Auditory and Visual Modalities

    Energy Technology Data Exchange (ETDEWEB)

    Devauchelle, A.D.; Dehaene, S.; Pallier, C. [INSERM, Gif sur Yvette (France); Devauchelle, A.D.; Dehaene, S.; Pallier, C. [CEA, DSV, I2BM, NeuroSpin, F-91191 Gif Sur Yvette (France); Devauchelle, A.D.; Pallier, C. [Univ. Paris 11, Orsay (France); Oppenheim, C. [Univ Paris 05, Ctr Hosp St Anne, Paris (France); Rizzi, L. [Univ Siena, CISCL, I-53100 Siena (Italy); Dehaene, S. [Coll France, F-75231 Paris (France)

    2009-07-01

    Priming effects have been well documented in behavioral psycho-linguistics experiments: The processing of a word or a sentence is typically facilitated when it shares lexico-semantic or syntactic features with a previously encountered stimulus. Here, we used fMRI priming to investigate which brain areas show adaptation to the repetition of a sentence's content or syntax. Participants read or listened to sentences organized in series which could or not share similar syntactic constructions and/or lexico-semantic content. The repetition of lexico-semantic content yielded adaptation in most of the temporal and frontal sentence processing network, both in the visual and the auditory modalities, even when the same lexico-semantic content was expressed using variable syntactic constructions. No fMRI adaptation effect was observed when the same syntactic construction was repeated. Yet behavioral priming was observed at both syntactic and semantic levels in a separate experiment where participants detected sentence endings. We discuss a number of possible explanations for the absence of syntactic priming in the fMRI experiments, including the possibility that the conglomerate of syntactic properties defining 'a construction' is not an actual object assembled during parsing. (authors)

  6. Sentence Syntax and Content in the Human Temporal Lobe: An fMRI Adaptation Study in Auditory and Visual Modalities

    International Nuclear Information System (INIS)

    Devauchelle, A.D.; Dehaene, S.; Pallier, C.; Devauchelle, A.D.; Dehaene, S.; Pallier, C.; Devauchelle, A.D.; Pallier, C.; Oppenheim, C.; Rizzi, L.; Dehaene, S.

    2009-01-01

    Priming effects have been well documented in behavioral psycho-linguistics experiments: The processing of a word or a sentence is typically facilitated when it shares lexico-semantic or syntactic features with a previously encountered stimulus. Here, we used fMRI priming to investigate which brain areas show adaptation to the repetition of a sentence's content or syntax. Participants read or listened to sentences organized in series which could or not share similar syntactic constructions and/or lexico-semantic content. The repetition of lexico-semantic content yielded adaptation in most of the temporal and frontal sentence processing network, both in the visual and the auditory modalities, even when the same lexico-semantic content was expressed using variable syntactic constructions. No fMRI adaptation effect was observed when the same syntactic construction was repeated. Yet behavioral priming was observed at both syntactic and semantic levels in a separate experiment where participants detected sentence endings. We discuss a number of possible explanations for the absence of syntactic priming in the fMRI experiments, including the possibility that the conglomerate of syntactic properties defining 'a construction' is not an actual object assembled during parsing. (authors)

  7. Prediction in a visual language: real-time sentence processing in American Sign Language across development.

    Science.gov (United States)

    Lieberman, Amy M; Borovsky, Arielle; Mayberry, Rachel I

    2018-01-01

    Prediction during sign language comprehension may enable signers to integrate linguistic and non-linguistic information within the visual modality. In two eyetracking experiments, we investigated American Sign language (ASL) semantic prediction in deaf adults and children (aged 4-8 years). Participants viewed ASL sentences in a visual world paradigm in which the sentence-initial verb was either neutral or constrained relative to the sentence-final target noun. Adults and children made anticipatory looks to the target picture before the onset of the target noun in the constrained condition only, showing evidence for semantic prediction. Crucially, signers alternated gaze between the stimulus sign and the target picture only when the sentential object could be predicted from the verb. Signers therefore engage in prediction by optimizing visual attention between divided linguistic and referential signals. These patterns suggest that prediction is a modality-independent process, and theoretical implications are discussed.

  8. Conversation, characterisation and corpus linguistics: Dialogue in Jane Austen’s Sense and Sensibility

    Directory of Open Access Journals (Sweden)

    E.H. Hubbard

    2002-08-01

    Full Text Available This article reports on a corpus-based exploration of the role that fictional dialogue plays in characterisation. The focus is on the two main characters of Austen’s Sense and Sensibility and (a the extent to which certain features of their dialogue can be said to tie in with general perceptions that Elinor represents the “sense” and Marianne the “sensibility” of the novel’s title; and (b the extent to which Austen can be said to have exploited these features to enable the sisters to speak with subtly differing voices. The features themselves were drawn from two linguistic frameworks, namely cohesion in text linguistics (specifically, the category of conjunctive cohesion as originated by Halliday and Hasan (1976, and the category of “involvement” in register analysis (most prominently, Biber 1988. The density of these features in each dialogue was calculated, compared statistically and salient differences considered in relation to the focal issues of the study. Although two of the five hypotheses formulated were not supported, the results overall provided strong indications that Austen successfully distinguishes between the sisters through their dialogue, and often in ways that link with less subtle, more explicit cues to their character that are given in the text. The study thus reveals how certain text-linguistic and register features can underpin characterisation in fiction, and in so doing explicates aspects of what it is that readers and literary critics respond to when they comment on characterisation in a novel.

  9. PTSD as a criminal defense: a review of case law.

    Science.gov (United States)

    Berger, Omri; McNiel, Dale E; Binder, Renée L

    2012-01-01

    Posttraumatic stress disorder (PTSD) has been offered as a basis for criminal defenses, including insanity, unconsciousness, self-defense, diminished capacity, and sentencing mitigation. Examination of case law (e.g., appellate decisions) involving PTSD reveals that when offered as a criminal defense, PTSD has received mixed treatment in the judicial system. Courts have often recognized testimony about PTSD as scientifically reliable. In addition, PTSD has been recognized by appellate courts in U.S. jurisdictions as a valid basis for insanity, unconsciousness, and self-defense. However, the courts have not always found the presentation of PTSD testimony to be relevant, admissible, or compelling in such cases, particularly when expert testimony failed to show how PTSD met the standard for the given defense. In cases that did not meet the standard for one of the complete defenses, PTSD has been presented as a partial defense or mitigating circumstance, again with mixed success.

  10. The Searchbench - Combining Sentence-semantic, Full-text and Bibliographic Search in Digital Libraries

    Directory of Open Access Journals (Sweden)

    Ulrich Schäfer

    2013-02-01

    Full Text Available We describe a novel approach to precise searching in the full content of digital libraries. The Searchbench (for search workbench is based on sentence-wise syntactic and semantic natural language processing (NLP of both born-digital and scanned publications in PDF format. The term born-digital means natively digital, i.e. prepared electronically using typesetting systems such as LaTeX, OpenOffice, and the like. In the Searchbench, queries can be formulated as (possibly underspecified statements, consisting of simple subject-predicate-object constructs such as ‘algorithm improves word alignment’. This reduces the number of false hits in large document collections when the search words happen to appear close to each other, but are not semantically related. The method also abstracts from passive voice and predicate synonyms. Moreover, negated statements can be excluded from the search results, and negated antonym predicates again count as synonyms (e.g. not include = exclude.In the Searchbench, a sentence-semantic search can be combined with search filters for classical full-text, bibliographic metadata and automatically computed domain terms. Auto-suggest fields facilitate text input. Queries can be bookmarked or emailed. Furthermore, a novel citation browser in the Searchbench allows graphical navigation in citation networks. These have been extracted automatically from metadata and paper texts. The citation browser displays short phrases from citation sentences at the edges in the citation graph and thus allows students and researchers to quickly browse publications and immerse into a new research field. By clicking on a citation edge, the original citation sentence is shown in context, and optionally also in the original PDF layout.To showcase the usefulness of our research, we have a applied it to a collection of currently approx. 25,000 open access research papers in the field of computational linguistics and language technology, the ACL

  11. A Lingüística de corpus: história, problemas, legitimidade

    Directory of Open Access Journals (Sweden)

    Jacqueline Léon

    2006-01-01

    Full Text Available During the nineties, the accessibility of large corpora and the possibility of manipulation of enormous quantities of linguistic data was the origin of a renewal of interest in statistical and probability evidences that served to directly question linguistics about its objectives, methods and foundations. This interest gained increasing importance and became important currently under the name of corpus linguistics, a field of dominant research in language science. In this article we will show that the designation corpus linguistics covers considerably heterogeneous theoretical positions and research, topics. We show how corpus linguistics, originally of british origin, was later endowed with historical and theoretical legitimacy while at the same time intending to establish itself as a new paradigm in language science. Finally we distinguish two attitudes inside the british tradition: one, intending to build the studies on a corpus and in a new paradigm based on a retrospective construction of the critical works of chomsky during the years 1959 and 1960, which was intended to legitimize the studies; the other attitude involves the continuity of the tradition of british empirical linguistics.

  12. Evolution of Collective Behaviour in an Artificial World Using Linguistic Fuzzy Rule-Based Systems.

    Directory of Open Access Journals (Sweden)

    Jure Demšar

    Full Text Available Collective behaviour is a fascinating and easily observable phenomenon, attractive to a wide range of researchers. In biology, computational models have been extensively used to investigate various properties of collective behaviour, such as: transfer of information across the group, benefits of grouping (defence against predation, foraging, group decision-making process, and group behaviour types. The question 'why,' however remains largely unanswered. Here the interest goes into which pressures led to the evolution of such behaviour, and evolutionary computational models have already been used to test various biological hypotheses. Most of these models use genetic algorithms to tune the parameters of previously presented non-evolutionary models, but very few attempt to evolve collective behaviour from scratch. Of these last, the successful attempts display clumping or swarming behaviour. Empirical evidence suggests that in fish schools there exist three classes of behaviour; swarming, milling and polarized. In this paper we present a novel, artificial life-like evolutionary model, where individual agents are governed by linguistic fuzzy rule-based systems, which is capable of evolving all three classes of behaviour.

  13. FTA Corpus: a parallel corpus of English and Spanish Free Trade Agreements for the study of specialized collocations

    Directory of Open Access Journals (Sweden)

    Pedro Patiño García

    2013-04-01

    Full Text Available This paper describes the Corpus of Free Trade Agreements (henceforth FTA, a specialized parallel corpus in English and Spanish from Europe and America and a smaller subcorpus in English-Norwegian and Spanish-Norwegian that was prepared and then aligned with Translation Corpus Aligner 2 (Hofland & Johansson, 1998. The data was taken from Free Trade Agreements. These agreements are specialized texts officially signed and ratified by several countries and blocks of countries in the last twenty years. Thus, FTAs are a rich repository for terminology and phraseology that is used in different fields of business activity throughout the world. The corpus contains around 1.37 million words in the English section and 1.48 million words in its Spanish counterpart, plus 60,000 words each in the Spanish-Norwegian and English-Norwegian subcorpus. The corpus is being used primarily to study the terms and specialized collocations that include these terms in this kind of specialized texts.Keywords: specialized collocation, specialized parallel corpus, corpus linguistics, Free Trade Agreement

  14. A Corpus-Based Analysis of the Meaning and Function of Although

    Directory of Open Access Journals (Sweden)

    María Carbonell-Olivares

    2009-12-01

    Full Text Available The extent to which semantic and pragmatic descriptions of linguistic phenomena should be based on naturally occurring language is subject to debate. In the case of discourse markers and connective expressions, most research remains at a theoretical level, despite the increasing interest in the use of corpora for linguistic analysis in the last decades. We undertake a corpus-based analysis of the meaning and function of although, drawing on previous descriptions of this unit formulated from different theoretical perspectives (discourse analysis, pragmatics, semantics. We aim to contribute empirical evidence for certain claims made in the literature, and to complement such descriptions with our findings. The study focuses on the type of contrast expressed by although, the elements of the contrast relation and the structure of the information. First, the theoretical background for the particular issues we study is presented. Then, the objectives and methodology are explained, and results discussed.La medida en que las descripciones semánticas y pragmáticas de fenómenos lingüísticos deberían basarse en el lenguaje natural está abierta a debate. En el caso de los marcadores discursivos y expresiones conectivas, la mayor parte de la investigación se realiza en el plano teórico, a pesar del creciente interés por la utilización de córpora para el análisis lingüístico. Presentamos un análisis del significado y la función de although basado en un corpus, partiendo de las descripciones de esta unidad formuladas desde distintas perspectivas teóricas (análisis del discurso, pragmática, semántica. Pretendemos contrastar empíricamente algunas afirmaciones realizadas en la bibliografía, y complementar tales descripciones con nuestros resultados. Analizamos el tipo de contraste expresado por although, los elementos de la relación de contraste y la estructura de la información. En primer lugar, se presenta el marco teórico de las

  15. Discovery and development of the N-terminal procollagen type II (NPII) biomarker: a tool for measuring collagen type II synthesis.

    Science.gov (United States)

    Nemirovskiy, O V; Sunyer, T; Aggarwal, P; Abrams, M; Hellio Le Graverand, M P; Mathews, W R

    2008-12-01

    Progression of joint damage in osteoarthritis (OA) is likely to result from an imbalance between cartilage degradation and synthesis processes. Markers reflecting these two components appear to be promising in predicting the rate of OA progression. Both N- and C-terminal propeptides of type II collagen reflect the rates of collagen type II synthesis. The ability to quantify the procollagen peptides in biological fluids would enable a better understanding of OA disease pathology and provide means for assessing the proof of mechanism of anabolic disease modifying OA drugs (DMOADs). A polyclonal antibody that recognizes the sequence GPKGQKGEPGDIKDI in the propeptide region of rat, dog, and human type II collagen was raised in chicken and peptide-affinity purified. The immunoaffinity liquid chromatography mass spectrometry (LC-MS/MS) was used to extensively characterize N-terminal procollagen type II (NPII) peptides found in biological fluids. The novel competition enzyme-linked immunosorbent assay (ELISA) assay was developed to quantitatively measure the NPII peptides. Several peptides ranging from 17 to 41 amino acids with various modifications including hydroxylations on proline and lysine residues, oxidation of lysines to allysines, and attachments of glucose and galactose moieties to hydroxylysines were identified in a simple system such as ex vivo cultures of human articular cartilage (HAC) explants as well as in more complex biological fluids such as human urine and plasma. A competitive ELISA assay has been developed and applied to urine, plasma, and synovial fluid matrices in human, rat and dog samples. A novel NPII assay has been developed and applied to OA and normal human subjects to understand the changes in collagen type II synthesis related to the pathology of OA.

  16. Perceptual organization of speech signals by children with and without dyslexia

    Science.gov (United States)

    Nittrouer, Susan; Lowenstein, Joanna H.

    2013-01-01

    Developmental dyslexia is a condition in which children encounter difficulty learning to read in spite of adequate instruction. Although considerable effort has been expended trying to identify the source of the problem, no single solution has been agreed upon. The current study explored a new hypothesis, that developmental dyslexia may be due to faulty perceptual organization of linguistically relevant sensory input. To test that idea, sentence-length speech signals were processed to create either sine-wave or noise-vocoded analogs. Seventy children between 8 and 11 years of age, with and without dyslexia participated. Children with dyslexia were selected to have phonological awareness deficits, although those without such deficits were retained in the study. The processed sentences were presented for recognition, and measures of reading, phonological awareness, and expressive vocabulary were collected. Results showed that children with dyslexia, regardless of phonological subtype, had poorer recognition scores than children without dyslexia for both kinds of degraded sentences. Older children with dyslexia recognized the sine-wave sentences better than younger children with dyslexia, but no such effect of age was found for the vocoded materials. Recognition scores were used as predictor variables in regression analyses with reading, phonological awareness, and vocabulary measures used as dependent variables. Scores for both sorts of sentence materials were strong predictors of performance on all three dependent measures when all children were included, but only performance for the sine-wave materials explained significant proportions of variance when only children with dyslexia were included. Finally, matching young, typical readers with older children with dyslexia on reading abilities did not mitigate the group difference in recognition of vocoded sentences. Conclusions were that children with dyslexia have difficulty organizing linguistically relevant sensory

  17. D-PLACE: A Global Database of Cultural, Linguistic and Environmental Diversity

    Science.gov (United States)

    Kirby, Kathryn R.; Gray, Russell D.; Greenhill, Simon J.; Jordan, Fiona M.; Gomes-Ng, Stephanie; Bibiko, Hans-Jörg; Blasi, Damián E.; Botero, Carlos A.; Bowern, Claire; Ember, Carol R.; Leehr, Dan; Low, Bobbi S.; McCarter, Joe; Divale, William; Gavin, Michael C.

    2016-01-01

    From the foods we eat and the houses we construct, to our religious practices and political organization, to who we can marry and the types of games we teach our children, the diversity of cultural practices in the world is astounding. Yet, our ability to visualize and understand this diversity is limited by the ways it has been documented and shared: on a culture-by-culture basis, in locally-told stories or difficult-to-access repositories. In this paper we introduce D-PLACE, the Database of Places, Language, Culture, and Environment. This expandable and open-access database (accessible at https://d-place.org) brings together a dispersed corpus of information on the geography, language, culture, and environment of over 1400 human societies. We aim to enable researchers to investigate the extent to which patterns in cultural diversity are shaped by different forces, including shared history, demographics, migration/diffusion, cultural innovations, and environmental and ecological conditions. We detail how D-PLACE helps to overcome four common barriers to understanding these forces: i) location of relevant cultural data, (ii) linking data from distinct sources using diverse ethnonyms, (iii) variable time and place foci for data, and (iv) spatial and historical dependencies among cultural groups that present challenges for analysis. D-PLACE facilitates the visualisation of relationships among cultural groups and between people and their environments, with results downloadable as tables, on a map, or on a linguistic tree. We also describe how D-PLACE can be used for exploratory, predictive, and evolutionary analyses of cultural diversity by a range of users, from members of the worldwide public interested in contrasting their own cultural practices with those of other societies, to researchers using large-scale computational phylogenetic analyses to study cultural evolution. In summary, we hope that D-PLACE will enable new lines of investigation into the major drivers

  18. D-PLACE: A Global Database of Cultural, Linguistic and Environmental Diversity.

    Science.gov (United States)

    Kirby, Kathryn R; Gray, Russell D; Greenhill, Simon J; Jordan, Fiona M; Gomes-Ng, Stephanie; Bibiko, Hans-Jörg; Blasi, Damián E; Botero, Carlos A; Bowern, Claire; Ember, Carol R; Leehr, Dan; Low, Bobbi S; McCarter, Joe; Divale, William; Gavin, Michael C

    2016-01-01

    From the foods we eat and the houses we construct, to our religious practices and political organization, to who we can marry and the types of games we teach our children, the diversity of cultural practices in the world is astounding. Yet, our ability to visualize and understand this diversity is limited by the ways it has been documented and shared: on a culture-by-culture basis, in locally-told stories or difficult-to-access repositories. In this paper we introduce D-PLACE, the Database of Places, Language, Culture, and Environment. This expandable and open-access database (accessible at https://d-place.org) brings together a dispersed corpus of information on the geography, language, culture, and environment of over 1400 human societies. We aim to enable researchers to investigate the extent to which patterns in cultural diversity are shaped by different forces, including shared history, demographics, migration/diffusion, cultural innovations, and environmental and ecological conditions. We detail how D-PLACE helps to overcome four common barriers to understanding these forces: i) location of relevant cultural data, (ii) linking data from distinct sources using diverse ethnonyms, (iii) variable time and place foci for data, and (iv) spatial and historical dependencies among cultural groups that present challenges for analysis. D-PLACE facilitates the visualisation of relationships among cultural groups and between people and their environments, with results downloadable as tables, on a map, or on a linguistic tree. We also describe how D-PLACE can be used for exploratory, predictive, and evolutionary analyses of cultural diversity by a range of users, from members of the worldwide public interested in contrasting their own cultural practices with those of other societies, to researchers using large-scale computational phylogenetic analyses to study cultural evolution. In summary, we hope that D-PLACE will enable new lines of investigation into the major drivers

  19. The elaborate plumage in peacocks is not such a drag.

    Science.gov (United States)

    Askew, Graham N

    2014-09-15

    One of the classic examples of an exaggerated sexually selected trait is the elaborate plumage that forms the train in male peafowl Pavo cristatus (peacock). Such ornaments are thought to reduce locomotor performance as a result of their weight and aerodynamic drag, but this cost is unknown. Here, the effect that the train has on take-off flight in peacocks was quantified as the sum of the rates of change of the potential and kinetic energies of the body (P(CoM)) in birds with trains and following the train's removal. There was no significant difference between P(CoM) in birds with and without a train. The train incurs drag during take-off; however, while this produces a twofold increase in parasite drag, parasite power only accounts for 0.1% of the total aerodynamic power. The train represented 6.9% of body weight and is expected to increase induced power. The absence of a detectable effect on take-off performance does not necessarily mean that there is no cost associated with possessing such ornate plumage; rather, it suggests that given the variation in take-off performance per se, the magnitude of any effect of the train has little meaningful functional relevance. © 2014. Published by The Company of Biologists Ltd.

  20. The Cognitive Basis for Sentence Planning Difficulties in Discourse after Traumatic Brain Injury

    Science.gov (United States)

    Peach, Richard K.

    2013-01-01

    Purpose: Analyses of language production of individuals with traumatic brain injury (TBI) place increasing emphasis on microlinguistic (i.e., within-sentence) patterns. It is unknown whether the observed problems involve implementation of well-formed sentence frames or represent a fundamental linguistic disturbance in computing sentence structure.…

  1. CREATING AND PROCESSING A CORPUS

    Directory of Open Access Journals (Sweden)

    Prihantoro

    2015-05-01

    Full Text Available This paper seeks to describe some crucial importance of corpus and text processing. Corpus is a projection of how language is used by its speakers. Technology support has improved corpus for easier maintenance, made it space-saving, and it may electronically structure its data. The latest offers much freedom for corpus users to access and exploit it for language teaching, analysis or other specified tasks. This paper will demonstrate how to use open-access corpus on internet such as Corpus of Contemporary American English (COCA and British National Corpus (BNC. Besides how to use a corpus, another crucial importance that this paper seeks to describe is how to build a corpus. In this paper, the writer will use UNITEX, a corpus (text-based processing software. This software will demonstrate steps of corpus building, ranging from text collection, annotation, electronic dictionary application to some natural language based operations ranging from pattern matching, concordance, to simple extraction. It will show how graph technology may outperform regular expression, a retrieval method exploited by other corpus processor, in terms of writing output.

  2. Variation of Linguistic Form of News Titles of Indonesian National Newspapers

    Directory of Open Access Journals (Sweden)

    Atiqa Sabardila

    2017-09-01

    Full Text Available This descriptive-qualitative study aims at describing the linguistic form on news title. Related to data classification, this study uses segmenting immediate constituents technique. This technique is also utilized to separate the elements of the sentence function. The data is combined based on the similarities of their pattern and category. Furthermore, in order to classify the similarity of the meaning on the phrase or clause on title, this study provides the paraphrase and substitutes the constituent position of function in clause to recognize the arrangement. Moreover, in identifying the constituent of omitted clause, the technique is by inserting or expanding the structure of the title. The variation of the linguistic form is also based on whether or not the figurative language of the title appears. The news title is written by considering these several aspects: phonology, lexicon, grammatical, sentence modus, cohesion, and figurative language. Those aspects are intended to fulfill several functions, such as informing, influencing, preserving the social connection between the server and the receiver of the information or delivering the response about a phenomenon. Based on these several aspects, the characteristics of the linguistics of the titles give the practically ease of understanding for those who are learning to write the straight news discourse.

  3. When "growth" is not such a good thing…

    CERN Multimedia

    2009-01-01

    The Medical Service is launching a campaign to make people aware of the negative effects of sedentary lifestyles and poorly balanced diets and encourage them to change their habits. Take your health into your own hands: eat healthily and keep fit! At times of economic crisis, growth is welcomed as a positive development but when it’s the average BMI (body mass index or weight in kg/size in m2) of the personnel that’s expanding the news is not so good for a company’s health. "The average BMI has been regularly increasing in most countries for the last ten years, and CERN is unfortunately no exception to the rule", says Dr Reymond of the CERN Medical Service (GS-ME). The percentage of members of the personnel with a BMI over 30 has risen by 3% in three years, which is why we’ve decided to launch a health campaign this year." The initiative has the support of the Management, the Staff Association, the CERN Fitness Club, the three...

  4. A Balanced and Representative Corpus: The Effects of Strict Corpus ...

    African Journals Online (AJOL)

    Theoretically the Northern Sotho language is made up of almost 30 dialects while practically it is not so, because the standard language was formed from very few of its dialects. As a result, even today the language has no corpus which is balanced or representative owing to the fact that almost all of the available corpora ...

  5. A need for closer examination of FASD by the criminal justice system: has the call been answered?

    Science.gov (United States)

    Gagnier, Karina Royer; Moore, Timothy E; Green, Melvyn

    2011-01-01

    Individuals with FASD exhibit deficits in many domains that can include memory, learning, behavioural inhibition, executive functioning, interpersonal skills, and language. These deficits have serious implications for affected persons when they become engaged in the legal system. In 2004, Moore and Green reviewed case law and psychological literature which suggested that FASD-related deficits placed affected individuals at a significant disadvantage in the justice system. According to them, this disadvantage stemmed from the limited awareness and knowledge of FASD demonstrated by key players in the justice system, as well as the scarcity of effective interventions in place to rehabilitate affected defendants. The aim of the current paper is to assess the extent to which awareness of FASD-related issues in the Canadian justice system has advanced since the publication of Moore and Green's conclusions. First, the deficits associated with FASD and their implications for the justice system are described. Next, recent case law and psychological evidence are reviewed as we consider issues of witness reliability and false confessions. The significance of FASD for sentencing, fitness to stand trial, and the Not Criminally Responsible by Reason of Mental Disorder defence are also briefly discussed. Finally, emerging system wide responses to FASD-related issues are presented. Overall, it appears that the call for closer examination of FASD by the justice system has been answered, but a need for increased education and awareness remains.

  6. Right Lateral Cerebellum Represents Linguistic Predictability.

    Science.gov (United States)

    Lesage, Elise; Hansen, Peter C; Miall, R Chris

    2017-06-28

    Mounting evidence indicates that posterolateral portions of the cerebellum (right Crus I/II) contribute to language processing, but the nature of this role remains unclear. Based on a well-supported theory of cerebellar motor function, which ascribes to the cerebellum a role in short-term prediction through internal modeling, we hypothesize that right cerebellar Crus I/II supports prediction of upcoming sentence content. We tested this hypothesis using event-related fMRI in male and female human subjects by manipulating the predictability of written sentences. Our design controlled for motor planning and execution, as well as for linguistic features and working memory load; it also allowed separation of the prediction interval from the presentation of the final sentence item. In addition, three further fMRI tasks captured semantic, phonological, and orthographic processing to shed light on the nature of the information processed. As hypothesized, activity in right posterolateral cerebellum correlated with the predictability of the upcoming target word. This cerebellar region also responded to prediction error during the outcome of the trial. Further, this region was engaged in phonological, but not semantic or orthographic, processing. This is the first imaging study to demonstrate a right cerebellar contribution in language comprehension independently from motor, cognitive, and linguistic confounds. These results complement our work using other methodologies showing cerebellar engagement in linguistic prediction and suggest that internal modeling of phonological representations aids language production and comprehension. SIGNIFICANCE STATEMENT The cerebellum is traditionally seen as a motor structure that allows for smooth movement by predicting upcoming signals. However, the cerebellum is also consistently implicated in nonmotor functions such as language and working memory. Using fMRI, we identify a cerebellar area that is active when words are predicted and

  7. Combining language and space: sentence bisection in unilateral spatial neglect.

    Science.gov (United States)

    Veronelli, Laura; Guasti, Maria T; Arduino, Lisa S; Vallar, Giuseppe

    2014-10-01

    In line bisection right-brain-damaged patients with left spatial neglect show a rightward deviation, with respect to the line's physical center. In word bisection ortho-phonological features of the stimulus' final (right-sided) part modulate performance of both patients and healthy participants (Veronelli, Vallar, Marinelli, Primativo, & Arduino, 2014). We investigated the role of linguistic factors in sentence bisection, in patients with and without neglect, and control participants. The effects of information in the right-sided part of the sentence (Experiment #1), and of lexical and syntactic violations (Experiment #2) were assessed. Neglect patients showed an overall rightward bias, larger than those of patients without neglect and controls. The neglect patients' bias was modulated by stimulus type, decreasing from lines, to letter strings and to all types of sentences. In sum, in visuo-manual sentence bisection a basic linguistic mechanism, such as sentence readability, brings about a more leftward appreciation of the stimulus, reducing the neglect patients' rightward bias. Copyright © 2014 Elsevier Inc. All rights reserved.

  8. "RecognizeCane" : The new concept of a cane which recognizes the most common objects and safety clues.

    Science.gov (United States)

    Scherlen, Anne-Catherine; Dumas, Jean Claude; Guedj, Benjamin; Vignot, Alexandre

    2007-01-01

    This paper introduces the new concept of an electronic cane for blind people. While some systems inform the subject only of the presence of the object and its relative distance, RecognizeCane is also able to recognize most common objects and environment clues to increase the safety and confidence of the navigation process. The originality of RecognizeCane is the use of simple sensors, such as infrared, brilliance or water sensors to inform the subject of the presence, for example, of a stairway, a water puddle, a zebra crossing or a trash can. This cane does not use an embedded vision system. RecognizeCane is equipped with several sensors and microprocessors to collect sensor data and extract the desired information about the close environment by means of a dynamic analysis of output signals.

  9. Linguistic Corpora and Language Teaching.

    Science.gov (United States)

    Murison-Bowie, Simon

    1996-01-01

    Examines issues raised by corpus linguistics concerning the description of language. The article argues that it is necessary to start from correct descriptions of linguistic units and the contexts in which they occur. Corpus linguistics has joined with language teaching by sharing a recognition of the importance of a larger, schematic view of…

  10. A General Overview of Motivation in Linguistics

    Institute of Scientific and Technical Information of China (English)

    王航

    2014-01-01

    In recent years, the term of motivation in linguistics study has aroused the interests of scholars. Different studies of mo -tivation have been produced by different scholars. In this paper, the writer organizes the recent studies on motivation in linguistics. the paper is divided into three parts, the introduction of the term motivation, different types of motivation, and theories of moti -vation.

  11. How a Corpus-Based Study of the Factors which Influence Collocation Can Help in the Teaching of Business English

    Science.gov (United States)

    Walker, Crayton

    2011-01-01

    In this paper I use two case studies to show how corpus linguistics can be used to help in the teaching of business English. Senior managers in global companies often find themselves having to do their job in a foreign language. Given that language is one of the key tools of management, the senior managers are normally very keen to develop a…

  12. English and Persian Cognates/Pseudo Cognates-A Cross-Linguistic Investigation

    Directory of Open Access Journals (Sweden)

    Amin Marzban

    2015-03-01

    Full Text Available Different world languages have a lot of contact with each other and have had different influences on one another. Cognates, words which are similar across two or more languages in some aspects, especially with regard to pronunciation, portray an interesting and relevant aspect of foreign/second language translation and research. This study intended to identify a type of cognate words called false cognates in Persian and English, words which have the same form in two languages but represent different meanings, and tried to study and determine the historical relations between English and Persian words. It also aimed to trace their route back from Proto Indo-European languages to the modern languages. To this end, the related literature was gone through and a sufficient corpus of information has been revealed. Afterwards, authentic monolingual dictionaries in both Farsi and English languages were consulted to provide definitions for the cognates and allow for their cross-linguistic comparison. The findings revealed that most of such problematic cognates with different meanings in Farsi and English as found through this study are likely to be confusing and deceptive for Farsi-speaking EFL learners. The findings also pointed to the need for this line of research to receive more scholarly attention. The study of true and false cognates has some implications for contrastive analysts, error analysts, translators, translation theorists, foreign language teachers, curriculum designers, as well as lexicographers and lexicologists.

  13. CD1d-Restricted Type II NKT Cells Reactive With Endogenous Hydrophobic Peptides.

    Science.gov (United States)

    Nishioka, Yusuke; Masuda, Sakiko; Tomaru, Utano; Ishizu, Akihiro

    2018-01-01

    NKT cells belong to a distinct subset of T cells that recognize hydrophobic antigens presented by major histocompatibility complex class I-like molecules, such as CD1d. Because NKT cells stimulated by antigens can activate or suppress other immunocompetent cells through an immediate production of a large amount of cytokines, they are regarded as immunological modulators. CD1d-restricted NKT cells are classified into two subsets, namely, type I and type II. CD1d-restricted type I NKT cells express invariant T cell receptors (TCRs) and react with lipid antigens, including the marine sponge-derived glycolipid α-galactosylceramide. On the contrary, CD1d-restricted type II NKT cells recognize a wide variety of antigens, including glycolipids, phospholipids, and hydrophobic peptides, by their diverse TCRs. In this review, we focus particularly on CD1d-restricted type II NKT cells that recognize endogenous hydrophobic peptides presented by CD1d. Previous studies have demonstrated that CD1d-restricted type I NKT cells usually act as pro-inflammatory cells but sometimes behave as anti-inflammatory cells. It has been also demonstrated that CD1d-restricted type II NKT cells play opposite roles to CD1d-restricted type I NKT cells; thus, they function as anti-inflammatory or pro-inflammatory cells depending on the situation. In line with this, CD1d-restricted type II NKT cells that recognize type II collagen peptide have been demonstrated to act as anti-inflammatory cells in diverse inflammation-induction models in mice, whereas pro-inflammatory CD1d-restricted type II NKT cells reactive with sterol carrier protein 2 peptide have been demonstrated to be involved in the development of small vessel vasculitis in rats.

  14. Infarction of the entire corpus callosum as a complication in subarachnoid hemorrhage: A case report

    Directory of Open Access Journals (Sweden)

    Satoru Takahashi, M.D.

    2017-03-01

    Full Text Available The corpus callosum is the major commissural pathway connecting the cerebral hemispheres. This pathway receives its blood supply from anterior communicating artery, pericallosal artery, and posterior pericallosal artery. However, in some cases, the entire corpus callosum is supplied by median callosal artery; thus, occlusion of this artery can lead to infarction of the entire corpus callosum. Few reports have described this type of infarction, and no reports after subarachnoid hemorrhage (SAH exist. Here, we report on a 42-year-old female who was diagnosed with SAH after two aneurysms were discovered in bifurcation of left anterior cerebral artery (A1-A2. After successful clipping was performed, the patient was alert and had no neurological deficits; moreover, the computed tomography images that were acquired after the operation showed no evidence of infarction. Nine days after admittance to the hospital, drowsiness and weakness of the left limbs with brain swelling appeared and decompressive hemi-craniectomy was performed. Diagnostic cerebral angiography revealed vasospasms in both anterior and middle cerebral arteries, thus fasudil hydrochloride was administered intra-arterially. While blood flow in all arteries improved, diffusion-weighted magnetic resonance imaging detected infarction along the entire length of the corpus callosum and in the medial region of the right frontal lobe. We believe this infarction was due to secondary ischemia of median callosal artery. This case reminded us of the anatomical variation wherein median callosal artery is the sole blood supply line for the corpus callosum and demonstrated that infarction of the entire corpus callosum is possible.

  15. Sadhana | Indian Academy of Sciences

    Indian Academy of Sciences (India)

    It also describes the working of Punjabi Shallow Parser used for the processing of the input sentence, which performs the tasks of Tokenizer, Morph-analyzer, Part-of-Speech Tagger and Chunker. This paper also considers the seven phases used in the process of EnConversion of input Punjabi text to UNL representation.

  16. The Ndebele Language Corpus: A Review of Some Factors Influencing the Content of the Corpus*

    Directory of Open Access Journals (Sweden)

    Samukele Hadebe

    2011-10-01

    Full Text Available

    Abstract: The Ndebele language corpus described here is that compiled by the ALLEX Project (now ALRI at the University of Zimbabwe. It is intended to reflect as much as possible the Ndebele language as spoken in Zimbabwe. The Ndebele language corpus was built in order to provide much-needed material for the study of the Ndebele language with a special focus on dictionarymaking and research. Like most corpora, the Ndebele language corpus may in future be used for other purposes not thought of at the time of its inception. It has been designed to meet generally acceptable standards so that it can be adaptable to various possible uses by various researchers. The article wants to outline the building process of the Ndebele language corpus with special emphasis on the challenges that faced compilers, and possible solutions. It is assumed that some of these challenges might not be peculiar to Ndebele alone but could also affect related African languages in a more or less similar situation. The main focus of the discussion will be the composition of the Ndebele language corpus, i.e. the type of texts that constitute the corpus. The corpus is composed of published texts, unpublished texts and oral material gathered from Ndebele-speaking districts of Zimbabwe. It will be argued that the use of the corpus and its reliability for research depends among other factors on its contents. It will also be shown that the contents of a corpus depend on a number of factors, some of which include sociolinguistic, political and economic considerations. These considerations have implications on both the content and quality of published and oral texts that constitute the Ndebele language corpus.

    Keywords: CORPUS, ORAL MATERIALS, CODE-MIXING, CODE-SWITCHING, MOTHER- TONGUE, NDEBELE

    Opsomming: Die Ndebeletaalkorpus: 'n Oorsig van sommige faktore wat die inhoud van die korpus be?nvloed. Die Ndebeletaalkorpus wat hier beskryf word, is di? saamgestel deur die

  17. A case for the sentence in reading comprehension.

    Science.gov (United States)

    Scott, Cheryl M

    2009-04-01

    This article addresses sentence comprehension as a requirement of reading comprehension within the framework of the narrow view of reading that was advocated in the prologue to this forum. The focus is on the comprehension requirements of complex sentences, which are characteristic of school texts. Topics included in this discussion are (a) evidence linking sentence comprehension and syntax with reading, (b) syntactic properties of sentences that make them difficult to understand, (c) clinical applications for the assessment of sentence comprehension as it relates to reading, and (d) evidence and methods for addressing sentence complexity in treatment. Sentence complexity can create comprehension problems for struggling readers. The contribution of sentence comprehension to successful reading has been overlooked in models that emphasize domain-general comprehension strategies at the text level. The author calls for the evaluation of sentence comprehension within the context of content domains where complex sentences are found.

  18. On possible linguistic correlates to brain lateralization

    Directory of Open Access Journals (Sweden)

    Tania Kouteva/Kuteva

    2014-04-01

    The present paper compares the two modes of processing proposed by Van Lancker Sidtis (2009 in her dual process model and the two domains of discourse organization distinguished in the framework of Discourse Grammar (Heine et al. 2013; Kaltenböck et al. 2011. These two frameworks were developed on different kinds of data. In the dual process model it is observations on patients with left or right hemisphere damage that marked the starting point of analysis. Central to the dual process model is the distinction between novel speech (or novel language, or newly created language, or propositional speech and formulaic speech (or formulaic expressions or automatic speech. Easily identified instances of formulaic speech are swear words, interjections, pause fillers, discourse elements, non-literal lexical meanings for idioms, proverbs. Unlike the dual process model, in the Discourse Grammar model it is linguistic discontinuities that provided the basis of analysis. Discourse grammar in this model is understood as all the linguistic resources that are available for constructing spoken and written (and signed texts. We argue that Discourse Grammar can be divided into two distinct domains, namely Sentence Grammar and Thetical Grammar. Whereas Sentence Grammar has been at the centre of interest in mainstream linguistics, Thetical Grammar encompasses linguistic phenomena – such as formulae of social exchange, imperatives, vocatives, interjections, including hesitation markers and pause fillers and what is traditionally known as “parenthetical” constructions – that pose a problem to orthodox grammatical analysis. We show that the findings made within the two frameworks are largely compatible with one another: both models converge on claiming that there is a significant correlation between linguistic categorization and hemisphere-based brain activity. In the dual process model it is hypothesized that there is a significant correlation between certain kinds of speech

  19. Children's and adults' on-line processing of syntactically ambiguous sentences during reading.

    Directory of Open Access Journals (Sweden)

    Holly S S L Joseph

    Full Text Available While there has been a fair amount of research investigating children's syntactic processing during spoken language comprehension, and a wealth of research examining adults' syntactic processing during reading, as yet very little research has focused on syntactic processing during text reading in children. In two experiments, children and adults read sentences containing a temporary syntactic ambiguity while their eye movements were monitored. In Experiment 1, participants read sentences such as, 'The boy poked the elephant with the long stick/trunk from outside the cage' in which the attachment of a prepositional phrase was manipulated. In Experiment 2, participants read sentences such as, 'I think I'll wear the new skirt I bought tomorrow/yesterday. It's really nice' in which the attachment of an adverbial phrase was manipulated. Results showed that adults and children exhibited similar processing preferences, but that children were delayed relative to adults in their detection of initial syntactic misanalysis. It is concluded that children and adults have the same sentence-parsing mechanism in place, but that it operates with a slightly different time course. In addition, the data support the hypothesis that the visual processing system develops at a different rate than the linguistic processing system in children.

  20. A linguistic analysis of sexist statements by Janusz Korwin-Mikke

    Directory of Open Access Journals (Sweden)

    Kamila Kuros

    2011-06-01

    Full Text Available In the following paper the author has performed a linguistic analysis of several chosen sexist remarks by Janusz Korwin-Mikke in order to prove how stereotypically women are regarded by the famous politician. This conservative man treats stereotypes as a proof of good order and fights to prevent women from trying to change what for thousands of years has given the general order and happiness for male sex and, in his opinion, for female sex. Significant for the author’s paper has been the importance of antifeminist’s texts within pragmatic and semantic aspect. The paper has concentrated on solely the linguistic aspect - in detail there have been discussed among other things: aphorisms and neologisms which are used by the controversial politician. In the paper the author has tried to prove that J. Korwin- Mikke’s language provokes and insults the opposite sex.

  1. Angiotensin II increases phosphodiesterase 5A expression in vascular smooth muscle cells: A mechanism by which angiotensin II antagonizes cGMP signaling

    Science.gov (United States)

    Kim, Dongsoo; Aizawa, Toru; Wei, Heng; Pi, Xinchun; Rybalkin, Sergei D.; Berk, Bradford C.; Yan, Chen

    2014-01-01

    Angiotensin II (Ang II) and nitric oxide (NO)/natriuretic peptide (NP) signaling pathways mutually regulate each other. Imbalance of Ang II and NO/NP has been implicated in the pathophysiology of many vascular diseases. cGMP functions as a key mediator in the interaction between Ang II and NO/NP. Cyclic nucleotide phosphodiesterase 5A (PDE5A) is important in modulating cGMP signaling by hydrolyzing cGMP in vascular smooth muscle cells (VSMC). Therefore, we examined whether Ang II negatively modulates intracellular cGMP signaling in VSMC by regulating PDE5A. Ang II rapidly and transiently increased PDE5A mRNA levels in rat aortic VSMC. Upregulation of PDE5A mRNA was associated with a time-dependent increase of both PDE5 protein expression and activity. Increased PDE5A mRNA level was transcription-dependent and mediated by the Ang II type 1 receptor. Ang II-mediated activation of extracellular signal-regulated kinases 1/2 (ERK1/2) was essential for Ang II-induced PDE5A upregulation. Pretreatment of VSMC with Ang II inhibited C-type NP (CNP) stimulated cGMP signaling, such as cGMP dependent protein kinase (PKG)-mediated phosphorylation of vasodilator-stimulated-phosphoprotein (VASP). Ang II-mediated inhibition of PKG was blocked when PDE5 activity was decreased by selective PDE5 inhibitors, suggesting that upregulation of PDE5A expression is an important mechanism for Ang II to attenuate cGMP signaling. PDE5A may also play a critical role in the growth promoting effects of Ang II because inhibition of PDE5A activity significantly decreased Ang II-stimulated VSMC growth. These observations establish a new mechanism by which Ang II antagonizes cGMP signaling and stimulates VSMC growth. PMID:15623434

  2. Type II restriction endonucleases--a historical perspective and more.

    Science.gov (United States)

    Pingoud, Alfred; Wilson, Geoffrey G; Wende, Wolfgang

    2014-07-01

    This article continues the series of Surveys and Summaries on restriction endonucleases (REases) begun this year in Nucleic Acids Research. Here we discuss 'Type II' REases, the kind used for DNA analysis and cloning. We focus on their biochemistry: what they are, what they do, and how they do it. Type II REases are produced by prokaryotes to combat bacteriophages. With extreme accuracy, each recognizes a particular sequence in double-stranded DNA and cleaves at a fixed position within or nearby. The discoveries of these enzymes in the 1970s, and of the uses to which they could be put, have since impacted every corner of the life sciences. They became the enabling tools of molecular biology, genetics and biotechnology, and made analysis at the most fundamental levels routine. Hundreds of different REases have been discovered and are available commercially. Their genes have been cloned, sequenced and overexpressed. Most have been characterized to some extent, but few have been studied in depth. Here, we describe the original discoveries in this field, and the properties of the first Type II REases investigated. We discuss the mechanisms of sequence recognition and catalysis, and the varied oligomeric modes in which Type II REases act. We describe the surprising heterogeneity revealed by comparisons of their sequences and structures. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  3. Performance of a Lexical and POS Tagger for Sanskrit

    Science.gov (United States)

    Hellwig, Oliver

    Due to the phonetic, morphological, and lexical complexity of Sanskrit, the automatic analysis of this language is a real challenge in the area of natural language processing. The paper describes a series of tests that were performed to assess the accuracy of the tagging program SanskritTagger. To our knowlegde, it offers the first reliable benchmark data for evaluating the quality of taggers for Sanskrit using an unrestricted dictionary and texts from different domains. Based on a detailed analysis of the test results, the paper points out possible directions for future improvements of statistical tagging procedures for Sanskrit.

  4. THE CHILD JUSTICE ACT: A DETAILED CONSIDERATION OF SECTION 68 AS POINT OF DEPARTURE WITH RESPECT TO THE SENTENCING OF YOUNG OFFENDERS

    Directory of Open Access Journals (Sweden)

    Stephan Terblanche

    2012-12-01

    Full Text Available The Child Justice Act 75 of 2008 establishes a criminal justice system for child accused, separate from the criminal justice system which continues to apply for adult accused in South Africa. The Act aims to keep children out of detention and away from the formal criminal justice system, mainly through diversion. When these interventions would be inadequate or unsuccessful, the Act provides for child offenders to the tried and sentenced in child justice courts. Until now there has been little discussion of the details of the provisions dealing with sentencing.Sentencing in a child justice court is regulated by chapter 10 of the Act and section 68 is the first section in this chapter. This section effectively amounts to the “jurisdictional” provision of the new child sentencing system: it not only mandates child justice courts to impose their sentences in terms of the Act, but also provides the first set of boundaries (or the first part of the framework within which sentencing should take place. Despite its brevity, section 68 is not without interpretative challenges. Of course, it has to be interpreted within the context of the entire Act. Explaining this context is the first function of this article. The various aspects of section 68 are further critically explored and discussed.

  5. ENVIRONMENTS and EOL: identification of Environment Ontology terms in text and the annotation of the Encyclopedia of Life.

    Science.gov (United States)

    Pafilis, Evangelos; Frankild, Sune P; Schnetzer, Julia; Fanini, Lucia; Faulwetter, Sarah; Pavloudi, Christina; Vasileiadou, Katerina; Leary, Patrick; Hammock, Jennifer; Schulz, Katja; Parr, Cynthia Sims; Arvanitidis, Christos; Jensen, Lars Juhl

    2015-06-01

    The association of organisms to their environments is a key issue in exploring biodiversity patterns. This knowledge has traditionally been scattered, but textual descriptions of taxa and their habitats are now being consolidated in centralized resources. However, structured annotations are needed to facilitate large-scale analyses. Therefore, we developed ENVIRONMENTS, a fast dictionary-based tagger capable of identifying Environment Ontology (ENVO) terms in text. We evaluate the accuracy of the tagger on a new manually curated corpus of 600 Encyclopedia of Life (EOL) species pages. We use the tagger to associate taxa with environments by tagging EOL text content monthly, and integrate the results into the EOL to disseminate them to a broad audience of users. The software and the corpus are available under the open-source BSD and the CC-BY-NC-SA 3.0 licenses, respectively, at http://environments.hcmr.gr. © The Author 2015. Published by Oxford University Press.

  6. Reduction of As(V) to As(III) by commercial ZVI or As(0) with acid-treated ZVI

    International Nuclear Information System (INIS)

    Sun, Fenglong; Osseo-Asare, Kwadwo A.; Chen, Yongsheng; Dempsey, Brian A.

    2011-01-01

    Graphical abstract: Acid-treated ZVI (right) reduced As(V) to the less toxic As(0) while commercial ZVI reduced As(V) to As(III). There was no lag phase in either reaction when rigorously anoxic conditions were maintained. Highlights: ► Acid-treated ZVI reduced As(V) to the less toxic As(0). ► Commercial ZVI reduced As(V) to As(III), which was not released to solution. ► Magnetite/Fe(II) did not reduce As(V) even though the reaction is thermodynamically spontaneous. ► No lag phase was observed in any of the reactions, contrary to prior reports. ► Rigorously anoxic conditions seem necessary for observing these reactions. - Abstract: Zero-valent iron (ZVI) consists of an elemental iron core surrounded by a shell of corrosion products, especially magnetite. ZVI is used for in situ removal or immobilization of a variety of contaminants but the mechanisms for removal of arsenic remain controversial and the mobility of arsenic after reaction with ZVI is uncertain. These issues were addressed by separately studying reactions of As(V) with magnetite, commercial ZVI, and acid-treated ZVI. Strictly anoxic conditions were used. Adsorption of As(V) on magnetite was fast with pH dependence similar to previous reports using oxic conditions. As(V) was not reduced by magnetite and Fe(II) although the reaction is thermodynamically spontaneous. As(V) reactions with ZVI were also fast and no lag phase was observed which was contrary to previous reports. Commercial ZVI reduced As(V) to As(III) only when As(V) was adsorbed, i.e., for pH < 7. As(III) was not released to solution. Acid-treated ZVI reduced As(V) to As(0), shown using wet chemical analyses and XANES/EXAFS. Comparisons were drawn between reactivity of acid-treated ZVI and nano-ZVI; if true then acid-treated ZVI could provide similar reactive benefits at lower cost.

  7. As metáforas do presidente lula na perspectiva da linguística de corpus: o caso do desenvolvimento President Lula's metaphors in a corpus linguistic perspective: The case of 'development'

    Directory of Open Access Journals (Sweden)

    Tony Berber Sardinha

    2010-01-01

    Full Text Available Um dos grandes fenômenos linguísticos da vida política brasileira recente é o que a mídia vem chamando de 'metáforas do presidente Lula'. O ponto de partida deste trabalho é o fato de que deve haver muitas metáforas que passam despercebidas no discurso do presidente e que podem ser descobertas por meio de pesquisa com corpora eletrônicos. Investigamos a presença de metáforas conceptuais relacionadas a 'desenvolvimento' em um corpus composto por pronunciamentos emitidos ao longo de três anos pelo presidente Luís Inácio Lula da Silva. Os resultados indicam que há uso sistemático de três conceitos metafóricos que definem a noção de desenvolvimento do chefe de Estado: VIAGEM, CONSTRUÇÃO e ORGANISMO. Esses três conceitos, em geral, equacionam desenvolvimento com um processo longo, construído, planejado e gerado pelo governo.One of the main linguistic phenomena in recent Brazilian politics is what the media has called 'President Lula's metaphors'. The starting point for the present investigation is that there must be lots of metaphors that go unnoticed in the president's discourse and that these may be uncovered by corpus-based research. We looked at the presence of conceptual metaphors related to 'development' in a corpus of three years of official presidential speeches. The results indicated the systematic use of three metaphorical concepts that together define the notion of development for the head of State: JOURNEY, BUILDING and ORGANISM. These three concepts together equate development with a long process that is generated, planned and carried out by the government.

  8. Verbal Semantics Drives Early Anticipatory Eye Movements during the Comprehension of Verb-Initial Sentences.

    Science.gov (United States)

    Sauppe, Sebastian

    2016-01-01

    Studies on anticipatory processes during sentence comprehension often focus on the prediction of postverbal direct objects. In subject-initial languages (the target of most studies so far), however, the position in the sentence, the syntactic function, and the semantic role of arguments are often conflated. For example, in the sentence "The frog will eat the fly" the syntactic object ("fly") is at the same time also the last word and the patient argument of the verb. It is therefore not apparent which kind of information listeners orient to for predictive processing during sentence comprehension. A visual world eye tracking study on the verb-initial language Tagalog (Austronesian) tested what kind of information listeners use to anticipate upcoming postverbal linguistic input. The grammatical structure of Tagalog allows to test whether listeners' anticipatory gaze behavior is guided by predictions of the linear order of words, by syntactic functions (e.g., subject/object), or by semantic roles (agent/patient). Participants heard sentences of the type "Eat frog fly" or "Eat fly frog" (both meaning "The frog will eat the fly") while looking at displays containing an agent referent ("frog"), a patient referent ("fly") and a distractor. The verb carried morphological marking that allowed the order and syntactic function of agent and patient to be inferred. After having heard the verb, listeners fixated on the agent irrespective of its syntactic function or position in the sentence. While hearing the first-mentioned argument, listeners fixated on the corresponding referent in the display accordingly and then initiated saccades to the last-mentioned referent before it was encountered. The results indicate that listeners used verbal semantics to identify referents and their semantic roles early; information about word order or syntactic functions did not influence anticipatory gaze behavior directly after the verb was heard. In this verb-initial language, event semantics

  9. Verbal semantics drives early anticipatory eye movements during the comprehension of verb-initial sentences

    Directory of Open Access Journals (Sweden)

    Sebastian eSauppe

    2016-02-01

    Full Text Available Studies on anticipatory processes during sentence comprehension often focus on the prediction of postverbal direct objects. In subject-initial languages (the target of most studies so far, however, the position in the sentence, the syntactic function, and the semantic role of arguments are often conflated. For example, in the sentence The frog will eat the fly the syntactic object (fly is at the same time also the last word and the patient argument of the verb. It is therefore not apparent which kind of information listeners orient to for predictive processing during sentence comprehension. A visual world eye tracking study on the verb-initial language Tagalog (Austronesian tested what kind of information listeners use to anticipate upcoming postverbal linguistic input. The grammatical structure of Tagalog allows to test whether listeners' anticipatory gaze behavior is guided by predictions of the linear order of words, by syntactic functions (e.g., subject/object, or by semantic roles (agent/patient. Participants heard sentences of the type Eat frog fly or Eat fly frog (both meaning The frog will eat the fly while looking at displays containing an agent referent (frog, a patient referent (fly and a distractor. The verb carried morphological marking that allowed the order and syntactic function of agent and patient to be inferred. After having heard the verb, listeners fixated on the agent irrespective of its syntactic function or position in the sentence. While hearing the first-mentioned argument, listeners fixated on the corresponding referent in the display accordingly and then initiated saccades to the last-mentioned referent before it was encountered. The results indicate that listeners used verbal semantics to identify referents and their semantic roles early; information about word order or syntactic functions did not influence anticipatory gaze behavior directly after the verb was heard. In this verb-initial language, event semantics

  10. A Computerized Version of the Scrambled Sentences Test

    Directory of Open Access Journals (Sweden)

    Roberto Viviani

    2018-01-01

    Full Text Available The scrambled sentences test (SST, an experimental procedure that involves participants writing down their cognitions, has been used to elicit individual differences in depressiveness and vulnerability to depression. We describe here a modification of the SST to adapt it to computerized administration, with a particular view of its use in large samples and functional neuroimaging applications. In a first study with the computerized version, we reproduce the preponderance of positive cognitions in the healthy and the inverse association of these cognitions with individual measures of depressiveness. We also report a tendency of self-referential cognitions to elicit higher positive cognition rates. In a second study, we describe the patterns of neural activations elicited by emotional and neutral sentences in a functional neuroimaging study, showing that it replicates and extends previous findings obtained with the original version of the SST. During the formation of emotional cognitions, ventral areas such as the ventral anterior cingulus and the supramarginal gyrus were relatively activated. This activation pattern speaks for the recruitment of mechanisms coordinating motivational and associative processes in the formation of value-based decisions.

  11. Measures of speech rhythm and the role of corpus-based word frequency: a multifactorial comparison of Spanish(-English speakers

    Directory of Open Access Journals (Sweden)

    Michael J. Harris

    2011-12-01

    Full Text Available In this study, we address various measures that have been employed to distinguish between syllable and stress- timed languages. This study differs from all previous ones by (i exploring and comparing multiple metrics within a quantitative and multifactorial perspective and by (ii also documenting the impact of corpus-based word frequency. We begin with the basic distinctions of speech rhythms, dealing with the differences between syllable-timed languages and stress-timed languages and several methods that have been used to attempt to distinguish between the two. We then describe how these metrics were used in the current study comparing the speech rhythms of Mexican Spanish speakers and bilingual English/Spanish speakers (speakers born to Mexican parents in California. More specifically, we evaluate how well various metrics of vowel duration variability as well as the so far understudied factor of corpus-based frequency allow to classify speakers as monolingual or bilingual. A binary logistic regression identifies several main effects and interactions. Most importantly, our results call the utility of a particular rhythm metric, the PVI, into question and indicate that corpus data in the form of lemma frequencies interact with two metrics of durational variability, suggesting that durational variability metrics should ideally be studied in conjunction with corpus-based frequency data.

  12. Linguistic and Cognitive Effects of Bilingualism with Regional Minority Languages: A Study of Sardinian–Italian Adult Speakers

    Science.gov (United States)

    Garraffa, Maria; Obregon, Mateo; Sorace, Antonella

    2017-01-01

    This study explores the effects of bilingualism in Sardinian as a regional minority language on the linguistic competence in Italian as the dominant language and on non-linguistic cognitive abilities. Sardinian/Italian adult speakers and monolingual Italian speakers living in the same geographical area of Sardinia were compared in two kinds of tasks: (a) verbal and non-verbal cognitive tasks targeting working memory and attentional control and (b) tasks of linguistic abilities in Italian focused on the comprehension of sentences differing in grammatical complexity. Although no difference was found between bilinguals and monolinguals in the cognitive control of attention, bilinguals performed better on working memory tasks. Bilinguals with lower formal education were found to be faster at comprehension of one type of complex sentence (center embedded object relative clauses). In contrast, bilinguals and monolinguals with higher education showed comparable slower processing of complex sentences. These results show that the effects of bilingualism are modulated by type of language experience and education background: positive effects of active bilingualism on the dominant language are visible in bilinguals with lower education, whereas the effects of higher literacy in Italian obliterate those of active bilingualism in bilinguals and monolinguals with higher education. PMID:29163288

  13. Can a linguistic serial founder effect originating in Africa explain the worldwide phonemic cline?

    Science.gov (United States)

    Fort, Joaquim; Pérez-Losada, Joaquim

    2016-04-01

    It has been proposed that a serial founder effect could have caused the present observed pattern of global phonemic diversity. Here we present a model that simulates the human range expansion out of Africa and the subsequent spatial linguistic dynamics until today. It does not assume copying errors, Darwinian competition, reduced contrastive possibilities or any other specific linguistic mechanism. We show that the decrease of linguistic diversity with distance (from the presumed origin of the expansion) arises under three assumptions, previously introduced by other authors: (i) an accumulation rate for phonemes; (ii) small phonemic inventories for the languages spoken before the out-of-Africa dispersal; (iii) an increase in the phonemic accumulation rate with the number of speakers per unit area. Numerical simulations show that the predictions of the model agree with the observed decrease of linguistic diversity with increasing distance from the most likely origin of the out-of-Africa dispersal. Thus, the proposal that a serial founder effect could have caused the present observed pattern of global phonemic diversity is viable, if three strong assumptions are satisfied. © 2016 The Authors.

  14. Can a linguistic serial founder effect originating in Africa explain the worldwide phonemic cline?

    Science.gov (United States)

    2016-01-01

    It has been proposed that a serial founder effect could have caused the present observed pattern of global phonemic diversity. Here we present a model that simulates the human range expansion out of Africa and the subsequent spatial linguistic dynamics until today. It does not assume copying errors, Darwinian competition, reduced contrastive possibilities or any other specific linguistic mechanism. We show that the decrease of linguistic diversity with distance (from the presumed origin of the expansion) arises under three assumptions, previously introduced by other authors: (i) an accumulation rate for phonemes; (ii) small phonemic inventories for the languages spoken before the out-of-Africa dispersal; (iii) an increase in the phonemic accumulation rate with the number of speakers per unit area. Numerical simulations show that the predictions of the model agree with the observed decrease of linguistic diversity with increasing distance from the most likely origin of the out-of-Africa dispersal. Thus, the proposal that a serial founder effect could have caused the present observed pattern of global phonemic diversity is viable, if three strong assumptions are satisfied. PMID:27122180

  15. "cba to check the spelling" investigating parser performance on discussion forum posts

    OpenAIRE

    Foster, Jennifer

    2010-01-01

    We evaluate the Berkeley parser on text from an online discussion forum. We evaluate the parser output with and without gold tokens and spellings (using Sparseval and Parseval), and we compile a list of problematic phenomena for this domain. The Parseval f-score for a small development set is 77.56. This increases to 80.27 when we apply a set of simple transformations to the input sentences and to the Wall Street Journal (WSJ) training sections.

  16. Persian Linguistic and Rhetorical Pathology in The Telegram Communication Network

    Directory of Open Access Journals (Sweden)

    M. Hedayat Mofidi

    2017-12-01

    Full Text Available Today, the mobile phone, with its wide range of features, is an inexpensive, easy to use, and most modern communication tool. One of the special applications of smart phones is providing spaces and facilities such as SMS services, virtual networks, and interactive groups that link people and groups from different linguistic and cultural background. In this paper, we tried to study the messages of the Farsi-speaking users of the Telegram communication network from the linguistics perspective. In this regard, the linguistics and rhetorical correspondence of short messages in Persian language was studied by the content-analysis method. The statistical population consisted of 150 messages which contained 327 sentences. These messages were randomly selected from 5 different groups. Persian language used in cyberspace differs from standard Persian language. The first step in correcting communication language errors in cyberspace is to inform the users about the type of errors.

  17. A person is not a number: discourse involvement in subject-verb agreement computation.

    Science.gov (United States)

    Mancini, Simona; Molinaro, Nicola; Rizzi, Luigi; Carreiras, Manuel

    2011-09-02

    Agreement is a very important mechanism for language processing. Mainstream psycholinguistic research on subject-verb agreement processing has emphasized the purely formal and encapsulated nature of this phenomenon, positing an equivalent access to person and number features. However, person and number are intrinsically different, because person conveys extra-syntactic information concerning the participants in the speech act. To test the person-number dissociation hypothesis we investigated the neural correlates of subject-verb agreement in Spanish, using person and number violations. While number agreement violations produced a left-anterior negativity followed by a P600 with a posterior distribution, the negativity elicited by person anomalies had a centro-posterior maximum and was followed by a P600 effect that was frontally distributed in the early phase and posteriorly distributed in the late phase. These data reveal that the parser is differentially sensitive to the two features and that it deals with the two anomalies by adopting different strategies, due to the different levels of analysis affected by the person and number violations. Copyright © 2011 Elsevier B.V. All rights reserved.

  18. Effects of Icariside II on Corpus Cavernosum and Major Pelvic Ganglion Neuropathy in Streptozotocin-Induced Diabetic Rats

    Directory of Open Access Journals (Sweden)

    Guang-Yi Bai

    2014-12-01

    Full Text Available Diabetic erectile dysfunction is associated with penile dorsal nerve bundle neuropathy in the corpus cavernosum and the mechanism is not well understood. We investigated the neuropathy changes in the corpus cavernosum of rats with streptozotocin-induced diabetes and the effects of Icariside II (ICA II on improving neuropathy. Thirty-six 8-week-old Sprague-Dawley rats were randomly distributed into normal control group, diabetic group and ICA-II treated group. Diabetes was induced by a one-time intraperitoneal injection of streptozotocin (60 mg/kg. Three days later, the diabetic rats were randomly divided into 2 groups including a saline treated placebo group and an ICA II-treated group (5 mg/kg/day, by intragastric administration daily. Twelve weeks later, erectile function was measured by cavernous nerve electrostimulation with real time intracorporal pressure assessment. The penis was harvested for the histological examination (immunofluorescence and immunohistochemical staining and transmission electron microscopy detecting. Diabetic animals exhibited a decreased density of dorsal nerve bundle in penis. The neurofilament of the dorsal nerve bundle was fragmented in the diabetic rats. There was a decreased expression of nNOS and NGF in the diabetic group. The ICA II group had higher density of dorsal nerve bundle, higher expression of NGF and nNOS in the penis. The pathological change of major pelvic nerve ganglion (including the microstructure by transmission electron microscope and the neurite outgrowth length of major pelvic nerve ganglion tissue cultured in vitro was greatly attenuated in the ICA II-treated group (p < 0.01. ICA II treatment attenuates the diabetes-related impairment of corpus cavernosum and major pelvic ganglion neuropathy in rats with Streptozotocin-Induced Diabetes.

  19. Impact of Background Noise and Sentence Complexity on Processing Demands during Sentence Comprehension

    DEFF Research Database (Denmark)

    Wendt, Dorothea; Dau, Torsten; Hjortkjær, Jens

    2016-01-01

    Speech comprehension in adverse listening conditions can be effortful even when speech is fully intelligible. Acoustical distortions typically make speech comprehension more effortful, but effort also depends on linguistic aspects of the speech signal, such as its syntactic complexity....... In the present study, pupil dilations, and subjective effort ratings were recorded in 20 normal-hearing participants while performing a sentence comprehension task. The sentences were either syntactically simple (subject-first sentence structure) or complex (object-first sentence structure) and were presented...... and less by syntactic complexity. Conversely, pupil dilations increased with syntactic complexity but only showed a small effect of the noise level. Participants with higher WMC showed increased pupil responses in the higher-level noise condition but rated sentence comprehension as being less effortful...

  20. Sex Differences in Music: A Female Advantage at Recognizing Familiar Melodies

    Directory of Open Access Journals (Sweden)

    Michael T Ullman

    2016-03-01

    Full Text Available Although sex differences have been observed in various cognitive domains, there has been little work examining sex differences in the cognition of music. We tested the prediction that women would be better than men at recognizing familiar melodies, since memories of specific melodies are likely to be learned (at least in part by declarative memory, which shows female advantages. Participants were 24 men and 24 women, with half musicians and half non-musicians in each group. The two groups were matched on age, education, and various measures of musical training. Participants were presented with well-known and novel melodies, and were asked to indicate their recognition of familiar melodies as rapidly as possible. The women were significantly faster than the men in responding, with a large effect size. The female advantage held across musicians and non-musicians, and across melodies with and without commonly associated lyrics, as evidenced by an absence of interactions between sex and these factors. Additionally, the results did not seem to be explained by sex differences in response biases, or in basic auditory or motor processes as tested in a control task. Though caution is warranted given that this is the first study to examine sex differences in familiar melody recognition, the results are consistent with the hypothesis motivating our prediction, namely that declarative memory underlies knowledge about music (particularly about familiar melodies, and that the female advantage at declarative memory may thus lead to female advantages in music cognition (particularly at familiar melody recognition. Additionally, the findings argue against the view that female advantages at tasks involving verbal (or verbalizable material are due solely to a sex difference specific to the verbal domain. Further, the results may help explain previously-reported cognitive commonalities between music and language: since declarative memory also underlies language

  1. Laser-based additive manufacturing: where it has been, where it needs to go

    Science.gov (United States)

    Cooper, Khershed P.

    2014-03-01

    It is no secret that the laser was the driver for additive manufacturing (AM) of 3D objects since such objects were first demonstrated in the mid-1980s. A myriad of techniques utilizing the directed energy of lasers were invented. Lasers are used to selectively sinter or fuse incremental layers in powder-beds, melt streaming powder following a programmed path, and polymerize photopolymers in a liquid vat layer-by-layer. The laser is an energy source of choice for repair of damaged components, for manufacture of new or replacement parts, and for rapid prototyping of concept designs. Lasers enable microstructure gradients and heterogeneous structures designed to exhibit unique properties and behavior. Laserbased additive manufacturing has been successful in producing relatively simple near net-shape metallic parts saving material and cost, but requiring finish-machining and in repair and refurbishment of worn components. It has been routinely used to produce polymer parts. These capabilities have been widely recognized as evidenced by the explosion in interest in AM technology, nationally. These successes are, however, tempered by challenges facing practitioners such as process and part qualification and verification, which are needed to bring AM as a true manufacturing technology. The ONR manufacturing science program, in collaboration with other agencies, invested in basic R&D in AM since its beginnings. It continues to invest, currently focusing on developing cyber-enabled manufacturing systems for AM. It is believed that such computation, communication and control approaches will help in validating AM and moving it to the factory floor along side CNC machines.

  2. Computed tomographic evaluation of the proximal femur: A predictive classification in displaced femoral neck fracture management

    Directory of Open Access Journals (Sweden)

    Narender Kumar Magu

    2014-01-01

    Full Text Available Background: Femoral neck fracture is truly an enigma due to the high incidence of avascular necrosis and nonunion. Different methods have been described to determine the size of the femoral head fragment, as a small head has been said to be associated with poor outcome and nonunion due to inadequate implant purchase in the proximal fragment. These methods were two dimensional and were affected by radiography techniques, therefore did not determine true head size. Computed tomography (CT is an important option to measure true head size as images can be obtained in three dimensions. Henceforth, we subjected patients to CT scan of hip in cases with displaced fracture neck of femur. The study aims to define the term "small head or inadequate size femoral head" objectively for its prognostic significance. Materials and Methods: 70 cases of displaced femoral neck fractures underwent CT scan preoperatively for proximal femoral geometric measurements of both hips. Dual energy X-ray absorptiometry scan was done in all cases. Patients were treated with either intertrochanteric osteotomy or lag screw osteosynthesis based on the size of the head fragment on plain radiographs. Results: The average femoral head fragment volume was 57 cu cm (range 28.3-84.91 cu cm; standard deviation 14 cu cm. Proximal fragment volume of >43 cu cm was termed adequate size (type I and of ≤43 cu cm as small femoral head (type II. Fractures which united (n = 54 had a relatively large average head size (59 cu cm when compared to fractures that did not (n = 16, which had a small average head size (49 cu cm and this difference was statistically significant. In type I fractures union rate was comparable in both osteotomy and lag screw groups (P > 0.05. Lag screw fixation failed invariably, while osteotomy showed good results in type II fractures (P < 0.05. Conclusion: Computed tomography scan of the proximal femur is advisable for measuring true size of head fragment. An objective

  3. Acute aortic dissection type A discloses Corpus alienum

    Directory of Open Access Journals (Sweden)

    Kolat Philipp

    2009-01-01

    Full Text Available Abstract We report an unusual case of an aortic type A dissection with a corpus alienum which compresses the right ventricle. The patient successfully underwent an aortic root replacement in deep hypothermia with re-implantation of the coronary arteries using a modified Bentall procedure and the resection of the corpus alienum. Intraoperative finding reveals 3 greatly adhered gauze compresses, which were most likely forgotten in the operation 34 years ago.

  4. Psychotherapy Is Chaotic—(Not Only in a Computational World

    Directory of Open Access Journals (Sweden)

    Günter K. Schiepek

    2017-04-01

    Full Text Available Objective: The aim of this article is to outline the role of chaotic dynamics in psychotherapy. Besides some empirical findings of chaos at different time scales, the focus is on theoretical modeling of change processes explaining and simulating chaotic dynamics. It will be illustrated how some common factors of psychotherapeutic change and psychological hypotheses on motivation, emotion regulation, and information processing of the client's functioning can be integrated into a comprehensive nonlinear model of human change processes.Methods: The model combines 5 variables (intensity of emotions, problem intensity, motivation to change, insight and new perspectives, therapeutic success and 4 parameters into a set of 5 coupled nonlinear difference equations. The results of these simulations are presented as time series, as phase space embedding of these time series (i.e., attractors, and as bifurcation diagrams.Results: The model creates chaotic dynamics, phase transition-like phenomena, bi- or multi-stability, and sensibility of the dynamic patterns on parameter drift. These features are predicted by chaos theory and by Synergetics and correspond to empirical findings. The spectrum of these behaviors illustrates the complexity of psychotherapeutic processes.Conclusion: The model contributes to the development of an integrative conceptualization of psychotherapy. It is consistent with the state of scientific knowledge of common factors, as well as other psychological topics, such as: motivation, emotion regulation, and cognitive processing. The role of chaos theory is underpinned, not only in the world of computer simulations, but also in practice. In practice, chaos demands technologies capable of real-time monitoring and reporting on the nonlinear features of the ongoing process (e.g., its stability or instability. Based on this monitoring, a client-centered, continuous, and cooperative process of feedback and control becomes possible. By

  5. Psychotherapy Is Chaotic—(Not Only) in a Computational World

    Science.gov (United States)

    Schiepek, Günter K.; Viol, Kathrin; Aichhorn, Wolfgang; Hütt, Marc-Thorsten; Sungler, Katharina; Pincus, David; Schöller, Helmut J.

    2017-01-01

    Objective: The aim of this article is to outline the role of chaotic dynamics in psychotherapy. Besides some empirical findings of chaos at different time scales, the focus is on theoretical modeling of change processes explaining and simulating chaotic dynamics. It will be illustrated how some common factors of psychotherapeutic change and psychological hypotheses on motivation, emotion regulation, and information processing of the client's functioning can be integrated into a comprehensive nonlinear model of human change processes. Methods: The model combines 5 variables (intensity of emotions, problem intensity, motivation to change, insight and new perspectives, therapeutic success) and 4 parameters into a set of 5 coupled nonlinear difference equations. The results of these simulations are presented as time series, as phase space embedding of these time series (i.e., attractors), and as bifurcation diagrams. Results: The model creates chaotic dynamics, phase transition-like phenomena, bi- or multi-stability, and sensibility of the dynamic patterns on parameter drift. These features are predicted by chaos theory and by Synergetics and correspond to empirical findings. The spectrum of these behaviors illustrates the complexity of psychotherapeutic processes. Conclusion: The model contributes to the development of an integrative conceptualization of psychotherapy. It is consistent with the state of scientific knowledge of common factors, as well as other psychological topics, such as: motivation, emotion regulation, and cognitive processing. The role of chaos theory is underpinned, not only in the world of computer simulations, but also in practice. In practice, chaos demands technologies capable of real-time monitoring and reporting on the nonlinear features of the ongoing process (e.g., its stability or instability). Based on this monitoring, a client-centered, continuous, and cooperative process of feedback and control becomes possible. By contrast, restricted

  6. Psychotherapy Is Chaotic-(Not Only) in a Computational World.

    Science.gov (United States)

    Schiepek, Günter K; Viol, Kathrin; Aichhorn, Wolfgang; Hütt, Marc-Thorsten; Sungler, Katharina; Pincus, David; Schöller, Helmut J

    2017-01-01

    Objective: The aim of this article is to outline the role of chaotic dynamics in psychotherapy. Besides some empirical findings of chaos at different time scales, the focus is on theoretical modeling of change processes explaining and simulating chaotic dynamics. It will be illustrated how some common factors of psychotherapeutic change and psychological hypotheses on motivation, emotion regulation, and information processing of the client's functioning can be integrated into a comprehensive nonlinear model of human change processes. Methods: The model combines 5 variables (intensity of emotions, problem intensity, motivation to change, insight and new perspectives, therapeutic success) and 4 parameters into a set of 5 coupled nonlinear difference equations. The results of these simulations are presented as time series, as phase space embedding of these time series (i.e., attractors), and as bifurcation diagrams. Results: The model creates chaotic dynamics, phase transition-like phenomena, bi- or multi-stability, and sensibility of the dynamic patterns on parameter drift. These features are predicted by chaos theory and by Synergetics and correspond to empirical findings. The spectrum of these behaviors illustrates the complexity of psychotherapeutic processes. Conclusion: The model contributes to the development of an integrative conceptualization of psychotherapy. It is consistent with the state of scientific knowledge of common factors, as well as other psychological topics, such as: motivation, emotion regulation, and cognitive processing. The role of chaos theory is underpinned, not only in the world of computer simulations, but also in practice. In practice, chaos demands technologies capable of real-time monitoring and reporting on the nonlinear features of the ongoing process (e.g., its stability or instability). Based on this monitoring, a client-centered, continuous, and cooperative process of feedback and control becomes possible. By contrast, restricted

  7. Charm photoproduction using electron taggers with the Zeus detector at HERA

    International Nuclear Information System (INIS)

    Aghuzumtsyan, G.

    2006-07-01

    Charm production has been studied at a centre-of-mass energy of √(s)=318 GeV in the ZEUS experiment at the electron-proton collider HERA. The data from the years 1998-2000 are used. Photoproduction is identified by reconstructing the D * (2010) meson via the decay chain D *+ →K - π + π + (+c.c.) and by selecting the scattered electron in one of the dedicated electron taggers located 44m and 35m away from the interaction point. The total visible cross section and differential cross sections as a function of p T (D * ), η(D * ) and W were measured in the restricted phase space: -1.0 * ) T (D * )>1.5 GeV, 80 GeV 2 2 in case of the 44m tagger sample, and -1.5 * ) T (D * )>2.0 GeV, 200 GeV 2 2 in case of the 35m tagger sample. Next-to-leading-order QCD predictions were calculated in the same kinematic region as the data. The comparison of the theoretical predictions with the data shows good agreement within the theoretical uncertainties. Total charm cross sections were calculated by extrapolating the visible cross sections in the limited (p T (D * ), η(D * )) phase space to the full kinematic region. The measurements with the 35m tagger overlap with previous untagged data while the 44m tagger data are used to study a new intermediate region 80 GeV 44m <120 GeV between fixed target and and the other HERA measurements. The comparison of the NLO QCD predictions with the data shows good agreement. (orig.)

  8. Linguistic Characteristics of Advertising English

    Institute of Scientific and Technical Information of China (English)

    易高燕

    2010-01-01

    Advertising language takes form under the influence of linguistics,psychology and sociology,etc,and its way of choosing words and building sentences are quite different from normal English.And as a practical language,advertising English has its specific functions,and it has been distinguished from normal English as an independent language,and it has plentiful values.This paper aims to discuss some linguistic characteristics of advertising English.

  9. When "Safe" Means "Dangerous": A Corpus Investigation of Risk Communication in the Media

    Science.gov (United States)

    Tang, Chris; Rundblad, Gabriella

    2017-01-01

    The mass media has an important role in informing the general public about emerging health risks. Content-based studies of risk communication in the media have revealed a tendency to exaggerate risks or simplify science, but linguistic studies in this area are still scarce. This paper outlines a corpus based investigation of media reporting on the…

  10. Corpus callosum atrophy is associated with mental slowing and executive deficits in subjects with age-related white matter hyperintensities: the LADIS Study

    DEFF Research Database (Denmark)

    Jokinen, Hanna; Ryberg, Charlotte; Kalska, Hely

    2007-01-01

    BACKGROUND: Previous research has indicated that corpus callosum atrophy is associated with global cognitive decline in neurodegenerative diseases, but few studies have investigated specific cognitive functions. OBJECTIVE: To investigate the role of regional corpus callosum atrophy in mental speed...... of the total corpus callosum area and its subregions with cognitive performance were analysed using multiple linear regression, controlling for volume of WMH and other confounding factors. RESULTS: Atrophy of the total corpus callosum area was associated with poor performance in tests assessing speed of mental...... processing--namely, trail making A and Stroop test parts I and II. Anterior, but not posterior, corpus callosum atrophy was associated with deficits of attention and executive functions as reflected by the symbol digit modalities and digit cancellation tests, as well as by the subtraction scores in the trail...

  11. Corpus callosum atrophy is associated with mental slowing and executive deficits in subjects with age-related white matter hyperintensities. The LADIS study

    DEFF Research Database (Denmark)

    Jokinen, Hanne; Ryberg, Charlotte; Stegmann, Mikkel Bille

    2007-01-01

    Background: Previous research has indicated that corpus callosum atrophy is associated with global cognitive decline in neurodegenerative diseases, but few studies have investigated specific cognitive functions. Objective: To investigate the role of regional corpus callosum atrophy in mental speed...... of the total corpus callosum area and its subregions with cognitive performance were analysed using multiple linear regression, controlling for volume of WMH and other confounding factors. Results: Atrophy of the total corpus callosum area was associated with poor performance in tests assessing speed of mental...... processing - namely, trail making A and Stroop test parts I and II. Anterior, but not posterior, corpus callosum atrophy was associated with deficits of attention and executive functions as reflected by the symbol digit modalities and digit cancellation tests, as well as by the subtraction scores...

  12. Development of knowledge models by linguistic analysis of lexical relationships in technical documents

    International Nuclear Information System (INIS)

    Seguela, Patrick

    2001-01-01

    This research thesis addresses the problem of knowledge acquisition and structuring from technical texts, and the use of this knowledge in the development of models. The author presents the Cameleon method which aims at extracting binary lexical relationships from technical texts by identifying linguistic markers. The relevance of this method is assessed in the case of four different corpuses: a written technical corpus, an oral technical corpus, a corpus of texts of instructions, and a corpus of academic texts. The author reports the development of a model of representation of knowledge of a specific field by using lexical relationships. The method is then applied to develop a model used in document search within a knowledge management system [fr

  13. Topic Repetition in Conversations on Different Days as a Sign of Dementia.

    Science.gov (United States)

    Shinkawa, Kaoru; Yamada, Yasunori

    2018-01-01

    Detecting early signs of dementia in everyday situations becomes more and more important in a rapidly aging society. Language dysfunctions are recognized as the prominent signs of dementia. Previous computational studies characterized these language dysfunctions by using acoustic and linguistic features for detecting dementia. However, they mainly investigated language dysfunctions collected from patients during neuropsychological tests. Language dysfunctions observed during regular conversations in everyday situations received little attention. One of the dysfunctions associated with dementia which is frequently observed in regular conversations is the repetition of a topic on different days. In this study, we propose a feature to characterize topic repetition in conversations on different days. We used conversational data obtained from a daily monitoring service of eight elderly people, two of whom had dementia. Through the analysis of topic extraction with latent Dirichlet allocation, we found that the frequency of topic repetition was significantly higher in people with dementia than in the control group. The results suggest that our proposed feature for identifying topic repetition in regular conversations on different days might be used for detecting dementia.

  14. Sex Differences in Music: A Female Advantage at Recognizing Familiar Melodies

    Science.gov (United States)

    Miles, Scott A.; Miranda, Robbin A.; Ullman, Michael T.

    2016-01-01

    Although sex differences have been observed in various cognitive domains, there has been little work examining sex differences in the cognition of music. We tested the prediction that women would be better than men at recognizing familiar melodies, since memories of specific melodies are likely to be learned (at least in part) by declarative memory, which shows female advantages. Participants were 24 men and 24 women, with half musicians and half non-musicians in each group. The two groups were matched on age, education, and various measures of musical training. Participants were presented with well-known and novel melodies, and were asked to indicate their recognition of familiar melodies as rapidly as possible. The women were significantly faster than the men in responding, with a large effect size. The female advantage held across musicians and non-musicians, and across melodies with and without commonly associated lyrics, as evidenced by an absence of interactions between sex and these factors. Additionally, the results did not seem to be explained by sex differences in response biases, or in basic motor processes as tested in a control task. Though caution is warranted given that this is the first study to examine sex differences in familiar melody recognition, the results are consistent with the hypothesis motivating our prediction, namely that declarative memory underlies knowledge about music (particularly about familiar melodies), and that the female advantage at declarative memory may thus lead to female advantages in music cognition (particularly at familiar melody recognition). Additionally, the findings argue against the view that female advantages at tasks involving verbal (or verbalizable) material are due solely to a sex difference specific to the verbal domain. Further, the results may help explain previously reported cognitive commonalities between music and language: since declarative memory also underlies language, such commonalities may be

  15. Sex Differences in Music: A Female Advantage at Recognizing Familiar Melodies.

    Science.gov (United States)

    Miles, Scott A; Miranda, Robbin A; Ullman, Michael T

    2016-01-01

    Although sex differences have been observed in various cognitive domains, there has been little work examining sex differences in the cognition of music. We tested the prediction that women would be better than men at recognizing familiar melodies, since memories of specific melodies are likely to be learned (at least in part) by declarative memory, which shows female advantages. Participants were 24 men and 24 women, with half musicians and half non-musicians in each group. The two groups were matched on age, education, and various measures of musical training. Participants were presented with well-known and novel melodies, and were asked to indicate their recognition of familiar melodies as rapidly as possible. The women were significantly faster than the men in responding, with a large effect size. The female advantage held across musicians and non-musicians, and across melodies with and without commonly associated lyrics, as evidenced by an absence of interactions between sex and these factors. Additionally, the results did not seem to be explained by sex differences in response biases, or in basic motor processes as tested in a control task. Though caution is warranted given that this is the first study to examine sex differences in familiar melody recognition, the results are consistent with the hypothesis motivating our prediction, namely that declarative memory underlies knowledge about music (particularly about familiar melodies), and that the female advantage at declarative memory may thus lead to female advantages in music cognition (particularly at familiar melody recognition). Additionally, the findings argue against the view that female advantages at tasks involving verbal (or verbalizable) material are due solely to a sex difference specific to the verbal domain. Further, the results may help explain previously reported cognitive commonalities between music and language: since declarative memory also underlies language, such commonalities may be

  16. Phenomenology is not Phenomenalism. Is there such a thing as phenomenology of sport?

    Directory of Open Access Journals (Sweden)

    Jan Halák

    2014-06-01

    Full Text Available Background: The application of the philosophical mode of investigation called "phenomenology" in the context of sport. Objective: The goal is to show how and why the phenomenological method is very often misused in sport-related research. Methods: Interpretation of the key texts, explanation of their meaning. Results: The confrontation of concrete sport-related texts with the original meaning of the key phenomenological notions shows mainly three types of misuse - the confusion of phenomenology with immediacy, with an epistemologically subjectivist stance (phenomenalism, and with empirical research oriented towards objects in the world. Conclusions: Many of the discussed authors try to take over the epistemological validity of phenomenology for their research, which itself is not phenomenological, and it seems that this is because they are lacking such a methodological foundation. We believe that an authentically phenomenological analysis of sport is possible, but it must respect the basic distinctions that differentiate phenomenology from other styles of thinking.

  17. Preventive Effect of Boiogito on Metabolic Disorders in the TSOD Mouse, a Model of Spontaneous Obese Type II Diabetes Mellitus

    Directory of Open Access Journals (Sweden)

    Tsutomu Shimada

    2011-01-01

    Full Text Available “Boiogito” is a Kampo preparation which has been used since ancient times in patients with obesity of the “asthenic constitution” type, so-called “watery obesity”, and its effect has been recognized clinically. In this study, we investigated the anti-obesity effect of Boiogito in the TSOD (Tsumura Suzuki Obese Diabetes mouse, a model of spontaneous obese type II diabetes mellitus. Boiogito showed a significant anti-obesity effect in TSOD mice by suppressing body weight gain in a dosage-dependent manner. In addition, Boiogito showed significant ameliorative effects on features of metabolic syndrome such as hyperinsulinemia, fasting hyperglycemia and abnormal lipid metabolism. Regarding lipid accumulation in TSOD mice, Boiogito showed a significant suppressive effect on accumulation of subcutaneous fat, but the effect on the visceral fat accumulation that constitutes the basis of metabolic syndrome was weak, and the suppressive effect on insulin resistance was also weak. Furthermore, Boiogito did not alleviate the abnormal glucose tolerance, the hypertension or the peripheral neuropathy characteristically developed in the TSOD mice. In contrast, in the TSNO (Tsumura Suzuki Non-Obesity mice used as controls, Boiogito suppressed body weight gain and accumulation of subcutaneous and visceral fat. The above results suggested that Boiogito is effective as an anti-obesity drug against obesity of the “asthenic constitution” type in which subcutaneous fat accumulates, but cannot be expected to exert a preventive effect against various symptoms of metabolic syndrome that are based on visceral fat accumulation.

  18. The Interlanguage Interference on the Difficulties of Building Question Sentences by the Second Year Students of MA As Soorkaty Salatiga

    Directory of Open Access Journals (Sweden)

    Sari Famularsih

    2014-06-01

    Full Text Available This study is aimed to elaborate the interlanguage interference of the difficulties which are faced by 40 second year students of MA As Soorkaty Salatiga in academic year of 2004-2005 in building question sentences and the sources of difficulties. The research is conducted by such kind of test. The total number of test are 20 items, consist of 5 yes-no questions, 5 W-H questions, and 10 tag questions which is done in a written essay form. Based on the analysis, the result shows some conclusions. Firstly, the researcher finds the types of difficulties problems are faced by the students in building question mastery, such as: (1 building yes-no question sentences in translation the word “apakah”, that is related with the use of to be. (2 Building interrogative word question sentences in translation the words “apa”, “kapan”, “milik siapa”, etc. That are related with the use of “what”, “when”, “whose”, etc. (3 Completing the tag question sentences with the proper auxiliary based on tense of the sentences. The most difficulties in building question sentences are especially in yes-no question form by using the word “apakah”. In the term of interlanguage, the students’ errors are mostly included in expressing meaning by the words and grammar of the first language. The source of difficulties which have been analyzed are mostly caused by inter lingual and intra lingual transfer.

  19. Discourse, Paragraph, and Sentence Structure in Selected Philippine Languages. Final Report. Volume II, Sentence Structure.

    Science.gov (United States)

    Longacre, Robert E.

    Volume II of "Discourse, Paragraph, and Sentence Structure in Selected Philippine Languages" begins with an explanation of certain assumptions and postulates regarding sentence structure. A detailed treatment of systems of sentence structure and the parameters of such systems follows. Data in the various indigenous languages are…

  20. Polymorphisms in the F8 gene and MHC-II variants as risk factors for the development of inhibitory anti-factor VIII antibodies during the treatment of hemophilia a: a computational assessment.

    Directory of Open Access Journals (Sweden)

    Gouri Shankar Pandey

    Full Text Available The development of neutralizing anti-drug-antibodies to the Factor VIII protein-therapeutic is currently the most significant impediment to the effective management of hemophilia A. Common non-synonymous single nucleotide polymorphisms (ns-SNPs in the F8 gene occur as six haplotypes in the human population (denoted H1 to H6 of which H3 and H4 have been associated with an increased risk of developing anti-drug antibodies. There is evidence that CD4+ T-cell response is essential for the development of anti-drug antibodies and such a response requires the presentation of the peptides by the MHC-class-II (MHC-II molecules of the patient. We measured the binding and half-life of peptide-MHC-II complexes using synthetic peptides from regions of the Factor VIII protein where ns-SNPs occur and showed that these wild type peptides form stable complexes with six common MHC-II alleles, representing 46.5% of the North American population. Next, we compared the affinities computed by NetMHCIIpan, a neural network-based algorithm for MHC-II peptide binding prediction, to the experimentally measured values and concluded that these are in good agreement (area under the ROC-curve of 0.778 to 0.972 for the six MHC-II variants. Using a computational binding predictor, we were able to expand our analysis to (a include all wild type peptides spanning each polymorphic position; and (b consider more MHC-II variants, thus allowing for a better estimation of the risk for clinical manifestation of anti-drug antibodies in the entire population (or a specific sub-population. Analysis of these computational data confirmed that peptides which have the wild type sequence at positions where the polymorphisms associated with haplotypes H3, H4 and H5 occur bind MHC-II proteins significantly more than a negative control. Taken together, the experimental and computational results suggest that wild type peptides from polymorphic regions of FVIII constitute potential T-cell epitopes

  1. Deep-learning top taggers or the end of QCD?

    Energy Technology Data Exchange (ETDEWEB)

    Kasieczka, Gregor [Institute for Particle Physics, ETH Zürich,Otto-Stern-Weg 5, Zürich (Switzerland); Plehn, Tilman [Institut für Theoretische Physik, Universität Heidelberg,Philosophenweg 16, Heidelberg (Germany); Russell, Michael [School of Physics and Astronomy, University of Glasgow,Glasgow G12 8QQ, Glasgow (United Kingdom); Schell, Torben [Institut für Theoretische Physik, Universität Heidelberg,Philosophenweg 16, Heidelberg (Germany)

    2017-05-02

    Machine learning based on convolutional neural networks can be used to study jet images from the LHC. Top tagging in fat jets offers a well-defined framework to establish our DeepTop approach and compare its performance to QCD-based top taggers. We first optimize a network architecture to identify top quarks in Monte Carlo simulations of the Standard Model production channel. Using standard fat jets we then compare its performance to a multivariate QCD-based top tagger. We find that both approaches lead to comparable performance, establishing convolutional networks as a promising new approach for multivariate hypothesis-based top tagging.

  2. Deep-learning top taggers or the end of QCD?

    International Nuclear Information System (INIS)

    Kasieczka, Gregor; Plehn, Tilman; Russell, Michael; Schell, Torben

    2017-01-01

    Machine learning based on convolutional neural networks can be used to study jet images from the LHC. Top tagging in fat jets offers a well-defined framework to establish our DeepTop approach and compare its performance to QCD-based top taggers. We first optimize a network architecture to identify top quarks in Monte Carlo simulations of the Standard Model production channel. Using standard fat jets we then compare its performance to a multivariate QCD-based top tagger. We find that both approaches lead to comparable performance, establishing convolutional networks as a promising new approach for multivariate hypothesis-based top tagging.

  3. Preoperative embolization of a giant neurofibroma of the chest in a patient with neurofibromatosis type II: A case report

    Energy Technology Data Exchange (ETDEWEB)

    Bae, Suk Hyun [Dept. of Radiology, Inje University College of Medicine, Ilsan Paik Hospital, Goyang (Korea, Republic of); Shin, Jong Soo [Dept. of Radiology, Kyung Hee University Hospital at Gangdong, School of Medicine, Kyung Hee University, Seoul (Korea, Republic of)

    2017-01-15

    Giant plexiform neurofibromas, which are rare in patients with neurofibromatosis type II (NFII), are difficult to manage surgically, as they are extensively infiltrative and highly vascularized. Preoperative embolization is performed to reduce intraoperative blood loss and operative time, increase resectability of lesions, and improve visualization of the operative field during surgery of hypervascular tumors such as renal cell carcinoma and intracranial meningioma. Preoperative intravascular embolization of a giant chest wall neurofibroma has not been reported in the English literature. We report successful treatment of a giant chest wall neurofibroma in a 45-year-old male with NFII by preoperative intravascular embolization followed by surgical resection.

  4. LRSYS, PASCAL LR(1) Parser Generator System

    International Nuclear Information System (INIS)

    O'Hair, K.

    1991-01-01

    Description of program or function: LRSYS is a complete LR(1) parser generator system written entirely in a portable subset of Pascal. The system, LRSYS, includes a grammar analyzer program (LR) which reads a context-free (BNF) grammar as input and produces LR(1) parsing tables as output, a lexical analyzer generator (LEX) which reads regular expressions created by the REG process as input and produces lexical tables as output, and various parser skeletons that get merged with the tables to produce complete parsers (SMAKE). Current parser skeletons include Pascal, FORTRAN 77, and C. In addition, the CRAY1, DEC VAX11 version contains LRLTRAN and CFT- FORTRAN 77 skeletons. Other language skeletons can easily be added to the system. LRSYS is based on the LR program (NESC Abstract 822)

  5. The ability of IgY to recognize surface proteins of Streptococcus mutans

    Directory of Open Access Journals (Sweden)

    Basri A. Gani

    2009-12-01

    Full Text Available Background: Streptococcus mutans are gram positive bacteria classified into viridians group, and have a role in pathogenesis of dental caries. It’s adhesion to the tooth surface is mediated by cell surface proteins, which interact with specific receptor located in tooth pellicle. Glucan binding protein, Glukosyltransferase, and antigen I/II are basic proteins of S. mutans, which have a role in initiating the interaction. A previous study showed that chicken’s IgY can interfere the interaction. Purpose: The objective of this study was to assess the ability of IgY in recognizing the surface molecule of Streptococcus mutans expressed by various serotypes (c, d, e, f and a strain derived from IPB, Bogor. Method: Western blot was used as a method to determine such capability. Result: The result showed that IgY has a potency to recognize antigen I/II, but not the other proteins on the cell surface of all bacteria tested. Conclusion: The ability of IgY to bind the surface protein, antigen I/II, indicates that this avian antibody could be used as a candidate for anti-adhesion in preventing dental caries.

  6. Type II restriction endonucleases—a historical perspective and more

    Science.gov (United States)

    Pingoud, Alfred; Wilson, Geoffrey G.; Wende, Wolfgang

    2014-01-01

    This article continues the series of Surveys and Summaries on restriction endonucleases (REases) begun this year in Nucleic Acids Research. Here we discuss ‘Type II’ REases, the kind used for DNA analysis and cloning. We focus on their biochemistry: what they are, what they do, and how they do it. Type II REases are produced by prokaryotes to combat bacteriophages. With extreme accuracy, each recognizes a particular sequence in double-stranded DNA and cleaves at a fixed position within or nearby. The discoveries of these enzymes in the 1970s, and of the uses to which they could be put, have since impacted every corner of the life sciences. They became the enabling tools of molecular biology, genetics and biotechnology, and made analysis at the most fundamental levels routine. Hundreds of different REases have been discovered and are available commercially. Their genes have been cloned, sequenced and overexpressed. Most have been characterized to some extent, but few have been studied in depth. Here, we describe the original discoveries in this field, and the properties of the first Type II REases investigated. We discuss the mechanisms of sequence recognition and catalysis, and the varied oligomeric modes in which Type II REases act. We describe the surprising heterogeneity revealed by comparisons of their sequences and structures. PMID:24878924

  7. HALF A CENTURY OF GENERATIVE LINGUISTICS – WHAT HAS THE PARADIGM GIVEN TO SOCIAL SCIENCE?

    Directory of Open Access Journals (Sweden)

    Mihailo Antović

    2007-12-01

    Full Text Available This paper aims to join marking the fiftieth anniversary of generativelinguistics by focusing on some principal contributions the field has given to science in general. In the first part of the paper, I discuss the issue of 'modern linguistics', as it is widely taught in local universities, and examine the importance of the generative school in this notion of modernity. In the second part, I analyze the fundamental conceptions and epistemological framework of this school in the study of language, pinpointing fifteen legacies it seems to have already left to human thought, as follows: breakup withbehaviourism; reinstitution of the hypothetico-deductive method; elaboration of the mind-body issue; revival of the thesis that the only true reality is that of the human mind; reintroduction and extension of the term 'cognitive'; participation in rapidterminological changes in the social sciences; return to the problem of language universals with a strong focus on the genetic origins of language faculty; extended usage of the term 'grammar'; contribution to the deletion of clear boundaries between the natural and social sciences, especially in psychology; rise of reductionism in formalsciences; return to the once forgotten Gestalt principles of perception; radical breakup between lexical and sentence semantics; neo-Darwinism; rise of neuroscience; impetus to the foundation of new fields, often multidisciplinary ones. Reconciliation of cognitiveand generative linguistics in the future is anticipated.

  8. A GRAMMATICAL ADJUSTMENT ANALYSIS OF STATISTICAL MACHINE TRANSLATION METHOD USED BY GOOGLE TRANSLATE COMPARED TO HUMAN TRANSLATION IN TRANSLATING ENGLISH TEXT TO INDONESIAN

    Directory of Open Access Journals (Sweden)

    Eko Pujianto

    2017-04-01

    Full Text Available Google translate is a program which provides fast, free and effortless translating service. This service uses a unique method to translate. The system is called ―Statistical Machine Translation‖, the newest method in automatic translation. Machine translation (MT is an area of many kinds of different subjects of study and technique from linguistics, computers science, artificial intelligent (AI, translation theory, and statistics. SMT works by using statistical methods and mathematics to process the training data. The training data is corpus-based. It is a compilation of sentences and words of the languages (SL and TL from translation done by human. By using this method, Google let their machine discovers the rules for themselves. They do this by analyzing millions of documents that have already been translated by human translators and then generate the result based on the corpus/training data. However, questions arise when the results of the automatic translation prove to be unreliable in some extent. This paper questions the dependability of Google translate in comparison with grammatical adjustment that naturally characterizes human translators' specific advantage. The attempt is manifested through the analysis of the TL of some texts translated by the SMT. It is expected that by using the sample of TL produced by SMT we can learn the potential flaws of the translation. If such exists, the partial of more substantial undependability of SMT may open more windows to the debates of whether this service may suffice the users‘ need.

  9. Recognizing famous voices: influence of stimulus duration and different types of retrieval cues.

    Science.gov (United States)

    Schweinberger, S R; Herholz, A; Sommer, W

    1997-04-01

    The current investigation measured the effects of increasing stimulus duration on listeners' ability to recognize famous voices. In addition, the investigation studied the influence of different types of cues on the naming of voices that could not be named before. Participants were presented with samples of famous and unfamiliar voices and were asked to decide whether or not the samples were spoken by a famous person. The duration of each sample increased in seven steps from 0.25 s up to a maximum of 2 s. Voice recognition improvements with stimulus duration were with a growth function. Gains were most rapid within the first second and less pronounced thereafter. When participants were unable to name a famous voice, they were cued with either a second voice sample, the occupation, or the initials of the celebrity. Initials were most effective in eliciting the name only when semantic information about the speaker had been accessed prior to cue presentation. Paralleling previous research on face naming, this may indicate that voice naming is contingent on previous activation of person-specific semantic information.

  10. Big data is not a monolith

    CERN Document Server

    Ekbia, Hamid R; Mattioli, Michael

    2016-01-01

    Big data is ubiquitous but heterogeneous. Big data can be used to tally clicks and traffic on web pages, find patterns in stock trades, track consumer preferences, identify linguistic correlations in large corpuses of texts. This book examines big data not as an undifferentiated whole but contextually, investigating the varied challenges posed by big data for health, science, law, commerce, and politics. Taken together, the chapters reveal a complex set of problems, practices, and policies. The advent of big data methodologies has challenged the theory-driven approach to scientific knowledge in favor of a data-driven one. Social media platforms and self-tracking tools change the way we see ourselves and others. The collection of data by corporations and government threatens privacy while promoting transparency. Meanwhile, politicians, policy makers, and ethicists are ill-prepared to deal with big data's ramifications. The contributors look at big data's effect on individuals as it exerts social control throu...

  11. Social Issues in Applied Linguistics: Linguistic Diversity in the Classroom and Beyond. Is it Wrong or Just Different? Indigenous Spanish in Mexico

    OpenAIRE

    Pellicer, Dora

    2010-01-01

    Varieties of L2 language use are frequently rejected and criticized in the absence of linguistic criteria to sustain such attitudes. In Mexico, indigenous varieties of Spanish, the second language (L2) of diverse populations, has been stigmatized as uneducated Spanish. A majority of elementary school teachers interviewed, who are Spanish first language (L1) speakers, maintain that particular variations in accent and pronunciation as well as some grammatical variations are characteristic of in...

  12. Impact of Background Noise and Sentence Complexity on Processing Demands during Sentence Comprehension.

    Science.gov (United States)

    Wendt, Dorothea; Dau, Torsten; Hjortkjær, Jens

    2016-01-01

    Speech comprehension in adverse listening conditions can be effortful even when speech is fully intelligible. Acoustical distortions typically make speech comprehension more effortful, but effort also depends on linguistic aspects of the speech signal, such as its syntactic complexity. In the present study, pupil dilations, and subjective effort ratings were recorded in 20 normal-hearing participants while performing a sentence comprehension task. The sentences were either syntactically simple (subject-first sentence structure) or complex (object-first sentence structure) and were presented in two levels of background noise both corresponding to high intelligibility. A digit span and a reading span test were used to assess individual differences in the participants' working memory capacity (WMC). The results showed that the subjectively rated effort was mostly affected by the noise level and less by syntactic complexity. Conversely, pupil dilations increased with syntactic complexity but only showed a small effect of the noise level. Participants with higher WMC showed increased pupil responses in the higher-level noise condition but rated sentence comprehension as being less effortful compared to participants with lower WMC. Overall, the results demonstrate that pupil dilations and subjectively rated effort represent different aspects of effort. Furthermore, the results indicate that effort can vary in situations with high speech intelligibility.

  13. Impact of background noise and sentence complexity on processing demands during sentence comprehension

    Directory of Open Access Journals (Sweden)

    Dorothea eWendt

    2016-03-01

    Full Text Available Speech comprehension in adverse listening conditions can be effortful even when speech is fully intelligible. Acoustical distortions typically make speech comprehension more effortful, but effort also depends on linguistic aspects of the speech signal, such as its syntactic complexity. In the present study, pupil dilations and subjective effort ratings were recorded in 20 normal-hearing participants while performing a sentence comprehension task. The sentences were either syntactically simple (subject-first sentence structure or complex (object-first sentence structure and were presented in two levels of background noise both corresponding to high intelligibility. A digit span and a reading span test were used to assess individual differences in the participants' working memory capacity. The results showed that the subjectively rated effort was mostly affected by the noise level and less by syntactic complexity. Conversely, pupil dilations increased with syntactic complexity but only showed a small effect of the noise level. Participants with higher working memory capacity showed increased pupil responses in the higher-level noise condition but rated sentence comprehension as being less effortful compared to participants with lower working memory capacity. Overall, the results demonstrate that pupil dilations and subjectively rated effort represent different aspects of effort. Furthermore, the results indicate that effort can vary in situations with high speech intelligibility.

  14. Angiotensin Converting Enzyme Inhibitor Has a Protective Effect on Decompression Sickness in Rats

    Directory of Open Access Journals (Sweden)

    Aleksandra Mazur

    2018-03-01

    Full Text Available Introduction: Commercial divers, high altitude pilots, and astronauts are exposed to some inherent risk of decompression sickness (DCS, though the mechanisms that trigger are still unclear. It has been previously showed that diving may induce increased levels of serum angiotensin converting enzyme. The renin angiotensin aldosterone system (RAAS is one of the most important regulators of blood pressure and fluid volume. The purpose of the present study was to control the influence of angiotensin II on the appearance of DCS.Methods: Sprague Dawley rats have been pre-treated with inhibitor of angiotensin II receptor type 1 (losartan; 10 mg/kg, angiotensin-converting enzyme (ACE inhibitor (enalapril; 10 mg/kg, and calcium-entry blocker (nifedipine; 20 mg/kg. The experimental groups were treated for 4 weeks before exposure to hyperbaric pressure while controls were not treated. Seventy-five rats were subjected to a simulated dive at 1000 kPa absolute pressure for 45 min before starting decompression. Clinical assessment took place over a period of 60 min after surfacing. Blood samples were collected for measurements of TBARS, interleukin 6 (IL-6, angiotensin II (ANG II and ACE.Results: The diving protocol induced 60% DCS in non-treated animals. This ratio was significantly decreased after treatment with enalapril, but not other vasoactive drugs. Enalapril did not change ANG II or ACE concentration, while losartant decreased post dive level of ACE but not ANG II. None of the treatment modified the effect of diving on TBARS and IL-6 values.Conclusion: Results suggests that the rennin angiotensin system is involved in a process of triggering DCS but this has to be further investigated. However, a vasorelaxation mediated process, which potentially could increase the load of inert gas during hyperbaric exposure, and antioxidant properties were excluded by our results.

  15. Reproducing American Sign Language Sentences: Cognitive Scaffolding in Working Memory

    Directory of Open Access Journals (Sweden)

    Ted eSupalla

    2014-08-01

    Full Text Available The American Sign Language Sentence Reproduction Test (ASL-SRT requires the precise reproduction of a series of ASL sentences increasing in complexity and length. Error analyses of such tasks provides insight into working memory and scaffolding processes. Data was collected from three groups expected to differ in fluency: deaf children, deaf adults and hearing adults, all users of ASL. Quantitative (correct/incorrect recall and qualitative error analyses were performed. Percent correct on the reproduction task supports its sensitivity to fluency as test performance clearly differed across the three groups studied. A linguistic analysis of errors further documented differing strategies and bias across groups. Subjects’ recall projected the affordance and constraints of deep linguistic representations to differing degrees, with subjects resorting to alternate processing strategies in the absence of linguistic knowledge. A qualitative error analysis allows us to capture generalizations about the relationship between error pattern and the cognitive scaffolding, which governs the sentence reproduction process. Highly fluent signers and less-fluent signers share common chokepoints on particular words in sentences. However, they diverge in heuristic strategy. Fluent signers, when they make an error, tend to preserve semantic details while altering morpho-syntactic domains. They produce syntactically correct sentences with equivalent meaning to the to-be-reproduced one, but these are not verbatim reproductions of the original sentence. In contrast, less-fluent signers tend to use a more linear strategy, preserving lexical status and word ordering while omitting local inflections, and occasionally resorting to visuo-motoric imitation. Thus, whereas fluent signers readily use top-down scaffolding in their working memory, less fluent signers fail to do so. Implications for current models of working memory across spoken and signed modalities are

  16. Mining the Geophysical Research Abstracts Corpus: Mapping the impact of Free and Open Source Software on the EGU Divisions

    Science.gov (United States)

    Löwe, Peter; Klump, Jens; Robertson, Jesse

    2015-04-01

    Text mining is commonly employed as a tool in data science to investigate and chart emergent information from corpora of research abstracts, such as the Geophysical Research Abstracts (GRA) published by Copernicus. In this context current standards, such as persistent identifiers like DOI and ORCID, allow us to trace, cite and map links between journal publications, the underlying research data and scientific software. This network can be expressed as a directed graph which enables us to chart networks of cooperation and innovation, thematic foci and the locations of research communities in time and space. However, this approach of data science, focusing on the research process in a self-referential manner, rather than the topical work, is still in a developing stage. Scientific work presented at the EGU General Assembly is often the first step towards new approaches and innovative ideas to the geospatial community. It represents a rich, deep and heterogeneous source of geoscientific thought. This corpus is a significant data source for data science, which has not been analysed on this scale previously. In this work, the corpus of the Geophysical Research Abstracts is used for the first time as a data base for analyses of topical text mining. For this, we used a sturdy and customizable software framework, based on the work of Schmitt et al. [1]. For the analysis we used the High Performance Computing infrastructure of the German Research Centre for Geosciences GFZ in Potsdam, Germany. Here, we report on the first results from the analysis of the continuous spreading the of use of Free and Open Source Software Tools (FOSS) within the EGU communities, mapping the general increase of FOSS-themed GRA articles in the last decade and the developing spatial patterns of involved parties and FOSS topics. References: [1] Schmitt, L. M., Christianson, K.T, Gupta R..: Linguistic Computing with UNIX Tools, in Kao, A., Poteet S.R. (Eds.): Natural Language processing and Text

  17. In vivo bone strain in the mandibular corpus of Sapajus during a range of oral food processing behaviors.

    Science.gov (United States)

    Ross, Callum F; Iriarte-Diaz, Jose; Reed, David A; Stewart, Thomas A; Taylor, Andrea B

    2016-09-01

    It has been hypothesized that mandibular corpus morphology of primates is related to the material properties of the foods that they chew. However, chewing foods with different material properties is accompanied by low levels of variation in mandibular strain patterns in macaques. We hypothesized that if variation in primate mandible form reflects adaptations to feeding on foods with different material and geometric properties, then this variation will be driven primarily by differences in oral food processing behavior rather than differences in chewing per se. To test this hypothesis, we recorded in vivo bone strain data from the lateral and medial surfaces of the mandibular corpus during complete feeding sequences in three adult male Sapajus as they fed on foods with a range of sizes and material properties. We assessed whether variation in mandibular corpus strain regimes is associated with variation in feeding behaviors and/or chewing on different foods, and we quantified the relative variation in mandibular corpus strain regimes associated with chewing on foods of different material properties versus a range of oral food processing behaviors (incisor, premolar, and molar biting; pulling on incisors; mastication). Feeding behavior had a significant effect on mandibular corpus strain regimes, as did chewing side and the cycle number in a feeding sequence. However, food type had weaker effects and usually only through interaction effects with chewing side and/or cycle type. Strain regimes varied most across different chew sides, then across different behaviors, and lastly between mastication cycles on different foods. Strain magnitudes associated with premolar, molar, and incisor biting were larger than those recorded during mastication. These data suggest that intra- and inter-specific variation in mandible morphology is a trade-off between performance requirements of different oral food processing behaviors and of variation in chewing side, with direct effects

  18. INTEGRATING CORPUS-BASED RESOURCES AND NATURAL LANGUAGE PROCESSING TOOLS INTO CALL

    Directory of Open Access Journals (Sweden)

    Pascual Cantos Gomez

    2002-06-01

    Full Text Available This paper ainis at presenting a survey of computational linguistic tools presently available but whose potential has been neither fully considered not exploited to its full in modern CALL. It starts with a discussion on the rationale of DDL to language learning, presenting typical DDL-activities. DDL-software and potential extensions of non-typical DDL-software (electronic dictionaries and electronic dictionary facilities to DDL . An extended section is devoted to describe NLP-technology and how it can be integrated into CALL, within already existing software or as stand alone resources. A range of NLP-tools is presentcd (MT programs, taggers, lemn~atizersp, arsers and speech technologies with special emphasis on tagged concordancing. The paper finishes with a number of reflections and ideas on how language technologies can be used efficiently within the language learning context and how extensive exploration and integration of these technologies might change and extend both modern CAI,I, and the present language learning paradigiii..

  19. Deep-learning Top Taggers or The End of QCD?

    CERN Multimedia

    CERN. Geneva

    2017-01-01

    https://arxiv.org/abs/1701.08784 Machine learning based on convolutional neural networks can be used to study jet images from the LHC. Top tagging in fat jets offers a well-defined framework to establish our DeepTop approach and compare its performance to QCD-based top taggers. We first optimize a network architecture to identify top quarks in Monte Carlo simulations of the Standard Model production channel. Using standard fat jets we then compare its performance to a multivariate QCD-based top tagger. We find that both approaches lead to comparable performance, establishing convolutional networks as a promising new approach for multivariate hypothesis-based top tagging.

  20. Making sense of (exceptional) causal relations. A cross-cultural and cross-linguistic study.

    Science.gov (United States)

    Le Guen, Olivier; Samland, Jana; Friedrich, Thomas; Hanus, Daniel; Brown, Penelope

    2015-01-01

    In order to make sense of the world, humans tend to see causation almost everywhere. Although most causal relations may seem straightforward, they are not always construed in the same way cross-culturally. In this study, we investigate concepts of "chance," "coincidence," or "randomness" that refer to assumed relations between intention, action, and outcome in situations, and we ask how people from different cultures make sense of such non-law-like connections. Based on a framework proposed by Alicke (2000), we administered a task that aims to be a neutral tool for investigating causal construals cross-culturally and cross-linguistically. Members of four different cultural groups, rural Mayan Yucatec and Tseltal speakers from Mexico and urban students from Mexico and Germany, were presented with a set of scenarios involving various types of causal and non-causal relations and were asked to explain the described events. Three links varied as to whether they were present or not in the scenarios: Intention-to-Action, Action-to-Outcome, and Intention-to-Outcome. Our results show that causality is recognized in all four cultural groups. However, how causality and especially non-law-like relations are interpreted depends on the type of links, the cultural background and the language used. In all three groups, Action-to-Outcome is the decisive link for recognizing causality. Despite the fact that the two Mayan groups share similar cultural backgrounds, they display different ideologies regarding concepts of non-law-like relations. The data suggests that the concept of "chance" is not universal, but seems to be an explanation that only some cultural groups draw on to make sense of specific situations. Of particular importance is the existence of linguistic concepts in each language that trigger ideas of causality in the responses from each cultural group.

  1. Making sense of (exceptional) causal relations. A cross-cultural and cross-linguistic study

    Science.gov (United States)

    Le Guen, Olivier; Samland, Jana; Friedrich, Thomas; Hanus, Daniel; Brown, Penelope

    2015-01-01

    In order to make sense of the world, humans tend to see causation almost everywhere. Although most causal relations may seem straightforward, they are not always construed in the same way cross-culturally. In this study, we investigate concepts of “chance,” “coincidence,” or “randomness” that refer to assumed relations between intention, action, and outcome in situations, and we ask how people from different cultures make sense of such non-law-like connections. Based on a framework proposed by Alicke (2000), we administered a task that aims to be a neutral tool for investigating causal construals cross-culturally and cross-linguistically. Members of four different cultural groups, rural Mayan Yucatec and Tseltal speakers from Mexico and urban students from Mexico and Germany, were presented with a set of scenarios involving various types of causal and non-causal relations and were asked to explain the described events. Three links varied as to whether they were present or not in the scenarios: Intention-to-Action, Action-to-Outcome, and Intention-to-Outcome. Our results show that causality is recognized in all four cultural groups. However, how causality and especially non-law-like relations are interpreted depends on the type of links, the cultural background and the language used. In all three groups, Action-to-Outcome is the decisive link for recognizing causality. Despite the fact that the two Mayan groups share similar cultural backgrounds, they display different ideologies regarding concepts of non-law-like relations. The data suggests that the concept of “chance” is not universal, but seems to be an explanation that only some cultural groups draw on to make sense of specific situations. Of particular importance is the existence of linguistic concepts in each language that trigger ideas of causality in the responses from each cultural group. PMID:26579028

  2. A Fuzzy Linguistic Methodology to Deal With Unbalanced Linguistic Term Sets

    OpenAIRE

    Herrera, F.; Herrera-Viedma, Enrique; Martinez, L.

    2008-01-01

    Many real problems dealing with qualitative aspects use linguistic approaches to assess such aspects. In most of these problems, a uniform and symmetrical distribution of the linguistic term sets for linguistic modeling is assumed. However, there exist problems whose assessments need to be represented by means of unbalanced linguistic term sets, i.e., using term sets that are not uniformly and symmetrically distributed. The use of linguistic variables implies processes of computing with words...

  3. Semantics, contrastive linguistics and parallel corpora

    Directory of Open Access Journals (Sweden)

    Violetta Koseska

    2014-09-01

    Full Text Available Semantics, contrastive linguistics and parallel corpora In view of the ambiguity of the term “semantics”, the author shows the differences between the traditional lexical semantics and the contemporary semantics in the light of various semantic schools. She examines semantics differently in connection with contrastive studies where the description must necessary go from the meaning towards the linguistic form, whereas in traditional contrastive studies the description proceeded from the form towards the meaning. This requirement regarding theoretical contrastive studies necessitates construction of a semantic interlanguage, rather than only singling out universal semantic categories expressed with various language means. Such studies can be strongly supported by parallel corpora. However, in order to make them useful for linguists in manual and computer translations, as well as in the development of dictionaries, including online ones, we need not only formal, often automatic, annotation of texts, but also semantic annotation - which is unfortunately manual. In the article we focus on semantic annotation concerning time, aspect and quantification of names and predicates in the whole semantic structure of the sentence on the example of the “Polish-Bulgarian-Russian parallel corpus”.

  4. A Game-based Corpus for Analysing the Interplay between Game Context and Player Experience

    DEFF Research Database (Denmark)

    Shaker, Noor; Yannakakis, Georgios N.; Asteriadis, Stylianos

    2011-01-01

    present dierent types of information that have been extracted from game context, player preferences and perception of the game, as well as user features, automatically extracted from video recordings.We run a number of initial experiments to analyse players' behavior while playing video games as a case......Recognizing players' aective state while playing video games has been the focus of many recent research studies. In this paper we describe the process that has been followed to build a corpus based on game events and recorded video sessions from human players while playing Super Mario Bros. We...

  5. Then and now: A reconsideration of the first corpus of scientific English

    Directory of Open Access Journals (Sweden)

    John M. Swales

    2004-10-01

    Full Text Available The subtitle of Huddleston (1971 reads A syntactic study based on an analysis of scientific texts; this volume thus represents the first carefully designed and substantial corpus of scientific English. In this paper I re-examine a selection of his findings based on the science and engineering half of Hyland's corpus of 240 research articles. Features selected were variation in the passivization of individual transitive verbs, the paucity of instances of V + V-ing structures like "He continued working", and the meaning of the modal must in research prose. In all three cases, Huddleston's findings were largely confirmed in a database constructed about 35 years later, thus suggesting that English research writing in the sciences is, at least in grammatical terms, fundamentally stable. In the closing section, I contrast this linguistic stability with the rapid technological development of corpus linguistics. I instance a recent co-taught experimental course in which international senior doctoral students from the health and social sciences were able, with relatively little training and guidance, to construct paired corpora of their own research writings and of published articles from their own specialities and then conduct precisely the kinds of analysis that only a highly professional linguist could, with considerable more labour, conduct nearly forty years ago.

  6. Focal F0 peak shape and sentence mode in Swedish

    DEFF Research Database (Denmark)

    Ambrazaitis, Gilbert; Buanzur, Tuarik C.; Niebuhr, Oliver

    2015-01-01

    Shape characteristics of rising-falling accentual F0 peaks of Stockholm Swedish Accent I words in narrow focus are studied in a corpus of 287 read sentences. The corpus includes statements and three types of polar questions. Results reveal a clear effect of sentence mode on the shape of the accen...

  7. A Linguistic Multigranular Sensory Evaluation Model for Olive Oil

    Directory of Open Access Journals (Sweden)

    Luis Martinez

    2008-06-01

    Full Text Available Evaluation is a process that analyzes elements in order to achieve different objectives such as quality inspection, marketing and other fields in industrial companies. This paper focuses on sensory evaluation where the evaluated items are assessed by a panel of experts according to the knowledge acquired via human senses. In these evaluation processes the information provided by the experts implies uncertainty, vagueness and imprecision. The use of the Fuzzy Linguistic Approach 32 has provided successful results modelling such a type of information. In sensory evaluation it may happen that the panel of experts have more or less degree knowledge of about the evaluated items or indicators. So, it seems suitable that each expert could express their preferences in different linguistic term sets based on their own knowledge. In this paper, we present a sensory evaluation model that manages multigranular linguistic evaluation framework based on a decision analysis scheme. This model will be applied to the sensory evaluation process of Olive Oil.

  8. CUILESS2016: a clinical corpus applying compositional normalization of text mentions.

    Science.gov (United States)

    Osborne, John D; Neu, Matthew B; Danila, Maria I; Solorio, Thamar; Bethard, Steven J

    2018-01-10

    Traditionally text mention normalization corpora have normalized concepts to single ontology identifiers ("pre-coordinated concepts"). Less frequently, normalization corpora have used concepts with multiple identifiers ("post-coordinated concepts") but the additional identifiers have been restricted to a defined set of relationships to the core concept. This approach limits the ability of the normalization process to express semantic meaning. We generated a freely available corpus using post-coordinated concepts without a defined set of relationships that we term "compositional concepts" to evaluate their use in clinical text. We annotated 5397 disorder mentions from the ShARe corpus to SNOMED CT that were previously normalized as "CUI-less" in the "SemEval-2015 Task 14" shared task because they lacked a pre-coordinated mapping. Unlike the previous normalization method, we do not restrict concept mappings to a particular set of the Unified Medical Language System (UMLS) semantic types and allow normalization to occur to multiple UMLS Concept Unique Identifiers (CUIs). We computed annotator agreement and assessed semantic coverage with this method. We generated the largest clinical text normalization corpus to date with mappings to multiple identifiers and made it freely available. All but 8 of the 5397 disorder mentions were normalized using this methodology. Annotator agreement ranged from 52.4% using the strictest metric (exact matching) to 78.2% using a hierarchical agreement that measures the overlap of shared ancestral nodes. Our results provide evidence that compositional concepts can increase semantic coverage in clinical text. To our knowledge we provide the first freely available corpus of compositional concept annotation in clinical text.

  9. Asking or Telling--Real-time Processing of Prosodically Distinguished Questions and Statements.

    Science.gov (United States)

    Heeren, Willemijn F L; Bibyk, Sarah A; Gunlogson, Christine; Tanenhaus, Michael K

    2015-12-01

    We introduce a targeted language game approach using the visual world, eye-movement paradigm to assess when and how certain intonational contours affect the interpretation of utterances. We created a computer-based card game in which elliptical utterances such as "Got a candy" occurred with a nuclear contour most consistent with a yes-no question (H* H-H%) or a statement (L* L-L%). In Experiment I we explored how such contours are integrated online. In Experiment 2 we studied the expectations listeners have for how intonational contours signal intentions: do these reflect linguistic categories or rapid adaptation to the paradigm? Prosody had an immediate effect on interpretation, as indexed by the pattern and timing of fixations. Moreover, the association between different contours and intentions was quite robust in the absence of clear syntactic cues to sentence type, and was not due to rapid adaptation. Prosody had immediate effects on interpretation even though there was a construction-based bias to interpret "got a" as a question. Taken together, we believe this paradigm will provide further insights into how intonational contours and their phonetic realization interact with other cues to sentence type in online comprehension.

  10. Fast self paced listening times in syntactic comprehension is aphasia -- implications for deficits

    Directory of Open Access Journals (Sweden)

    Jennifer Michaud

    2015-04-01

    (the DV against corrected self paced listening times in the corresponding words in the baseline sentences (the IV in correct responses. We call the residuals of these regressions “relative corrected listening times.” Relative corrected listening times are based on listening times at points at which task-related operations are similar, and therefore factor out these effects. They also factor out differences in general speed of processing and motor functioning, which determine the intercepts of the regressions. The relative corrected listening times for each participant therefore reflect the time taken by each pwa or control participant to perform the parsing and interpretation operations needed in the experimental sentences, compared to the time taken by the other controls or pwa. We performed these regressions separately in each of the four groups of sentences. We analyzed pwa and controls separately. We analyzed groups and not individuals, because performing these regressions on an individual basis would lead to a mean residual for an experimental sentence type compared to a baseline sentence type that approximates zero in each individual (this is a property of linear regression. We analyzed controls and pwa separately to produce separate estimates of basic speed of processing, decision-making and other processes in controls and pwa. We calculated the normal range of relative corrected listening times for each sentence type group by applying the formula in Crawford and Howell (1998 to the results of the regressions in the controls. We then determined whether the relative corrected listening time for each group of sentences in each pwa was within the normal range of residuals. We focus on a finding that has not previously been reported, which is that, in some pwa, some relative corrected listening times (i.e., residuals of the regression of critical words in experimental sentences against corresponding words in baseline sentences were lower than those seen in

  11. Different types of anastomotic methods: a review of literature

    Directory of Open Access Journals (Sweden)

    Shadi Mooloughi

    2015-09-01

    Full Text Available Constructing successful anastomosis is an important concept in gastrointestinal tract surgeries, which can be affected by various factors such as preoperative bowel condition, intra- and postoperative complications, bleeding and the device characteristics. Suturing, stapling and compression anastomosis are different techniques. Despite the invention of compression anastomosis, which goes back almost two centuries, this method has not obtained the popularity of the suturing and stapling anastomosis and further studies are required. Designing methods and devices with no drawbacks might reduce the complications associated with anastomosis as the alternative to suturing and stapling anastomoses. Several materials can be used as reinforcement materials, which can improve the consequences of the stapled anastomosis. In addition to reinforcement materials, other forms of supports have been proposed, which might be capable of reducing the postoperative complications of anastomosis. In this study, we briefly review various types of anastomotic techniques and associated complications in different types of gastrointestinal surgeries.

  12. Interhemispheric functional disconnection because of abnormal corpus callosum integrity in bipolar disorder type II.

    Science.gov (United States)

    Yasuno, Fumihiko; Kudo, Takashi; Matsuoka, Kiwamu; Yamamoto, Akihide; Takahashi, Masato; Nakagawara, Jyoji; Nagatsuka, Kazuyuki; Iida, Hidehiro; Kishimoto, Toshifumi

    2016-11-01

    A significantly lower fractional anisotropy (FA) value has been shown in anterior parts of the corpus callosum in patients with bipolar disorder. We investigated the association between abnormal corpus callosum integrity and interhemispheric functional connectivity (IFC) in patients with bipolar disorder. We examined the association between FA values in the corpus callosum (CC-FA) and the IFC between homotopic regions in the anterior cortical structures of bipolar disorder ( n =16) and major depressive disorder ( n =22) patients with depressed or euthymic states. We found a positive correlation between the CC-FA and IFC values between homotopic regions of the ventral prefrontal cortex and insula cortex, and significantly lower IFC between these regions in bipolar disorder patients. The abnormal corpus callosum integrity in bipolar disorder patients is relevant to the IFC between homotopic regions, possibly disturbing the exchange of emotional information between the cerebral hemispheres resulting in emotional dysregulation. None. © The Royal College of Psychiatrists 2016. This is an open access article distributed under the terms of the Creative Commons Non-Commercial, No Derivatives (CC BY-NC-ND) license.

  13. Triplets with growth failure, microcephaly, mental retardation, nail hypoplasia and corpus callosum agenesis: is it a variant of Coffin-Siris or a new syndrome?

    Science.gov (United States)

    Kirel, B; Kural, N; Yakut, A; Adapinar, B

    2000-01-01

    We report eight-year-old triplet girls whose clinical features included microcephaly, severe mental retardation, hypoplasia of distal phalanges of both fifth and second fingers and nail hypoplasia on second fingers, dysmorphic facial features, and partial corpus callosum agenesis. During infancy, a Pavlik harness was used for congenital hip dislocation, and they had difficulty in feeding. One had been operated for patent ductus arteriosus. To our knowledge, this rare combination has not been previously reported in triplets whose clinical features closely resemble those of Coffin-Siris syndrome. The other diagnostic possibilities are also reviewed.

  14. Audiovisual sentence recognition not predicted by susceptibility to the McGurk effect.

    Science.gov (United States)

    Van Engen, Kristin J; Xie, Zilong; Chandrasekaran, Bharath

    2017-02-01

    In noisy situations, visual information plays a critical role in the success of speech communication: listeners are better able to understand speech when they can see the speaker. Visual influence on auditory speech perception is also observed in the McGurk effect, in which discrepant visual information alters listeners' auditory perception of a spoken syllable. When hearing /ba/ while seeing a person saying /ga/, for example, listeners may report hearing /da/. Because these two phenomena have been assumed to arise from a common integration mechanism, the McGurk effect has often been used as a measure of audiovisual integration in speech perception. In this study, we test whether this assumed relationship exists within individual listeners. We measured participants' susceptibility to the McGurk illusion as well as their ability to identify sentences in noise across a range of signal-to-noise ratios in audio-only and audiovisual modalities. Our results do not show a relationship between listeners' McGurk susceptibility and their ability to use visual cues to understand spoken sentences in noise, suggesting that McGurk susceptibility may not be a valid measure of audiovisual integration in everyday speech processing.

  15. A Corpus-based Study of EFL Learners’ Errors in IELTS Essay Writing

    Directory of Open Access Journals (Sweden)

    Hoda Divsar

    2017-03-01

    Full Text Available The present study analyzed different types of errors in the EFL learners’ IELTS essays. In order to determine the major types of errors, a corpus of 70 IELTS examinees’ writings were collected, and their errors were extracted and categorized qualitatively. Errors were categorized based on a researcher-developed error-coding scheme into 13 aspects. Based on the descriptive statistical analyses, the frequency of each error type was calculated and the commonest errors committed by the EFL learners in IELTS essays were identified. The results indicated that the two most frequent errors that IELTS candidates committed were related to word choice and verb forms. Based on the research results, pedagogical implications highlight analyzing EFL learners’ writing errors as a useful basis for instructional purposes including creating pedagogical teaching materials that are in line with learners’ linguistic strengths and weaknesses.

  16. Ruptured Aortic Aneurysm From Late Type II Endoleak Treated by Transarterial Embolization

    International Nuclear Information System (INIS)

    Gunasekaran, Senthil; Funaki, Brian; Lorenz, Jonathan

    2013-01-01

    Endoleak is the most common complication after endovascular aneurysm repair. The most common type of endoleak, a type II endoleak, typically follows a benign course and is only treated when associated with increasing aneurysm size. In this case report, we describe a ruptured abdominal aortic aneurysm due to a late, type II endoleak occurring 10 years after endovascular aneurysm repair that was successfully treated by transarterial embolization.

  17. Dictionary Writing System (DWS + Corpus Query Package (CQP: The Case of TshwaneLex

    Directory of Open Access Journals (Sweden)

    Gilles-Maurice de Schryver

    2011-10-01

    Full Text Available

    Abstract: In this article the integrated corpus query functionality of the dictionary compilation software TshwaneLex is analysed. Attention is given to the handling of both raw corpus data and annotated corpus data. With regard to the latter it is shown how, with a minimum of human effort, machine learning techniques can be employed to obtain part-of-speech tagged corpora that can be used for lexicographic purposes. All points are illustrated with data drawn from English and Northern Sotho. The tools and techniques themselves, however, are language-independent, and as such the encouraging outcomes of this study are far-reaching.

    Keywords: LEXICOGRAPHY, DICTIONARY, SOFTWARE, DICTIONARY WRITING SYS-TEM (DWS, CORPUS QUERY PACKAGE (CQP, TSHWANELEX, CORPUS, CORPUS ANNO-TATION, PART-OF-SPEECH TAGGER (POS-TAGGER, MACHINE LEARNING, NORTHERN SOTHO (SESOTHO SA LEBOA

    Samenvatting: Woordenboekaanmaaksysteem + corpusanalysepakket: een studie van TshwaneLex. In dit artikel wordt het geïntegreerde corpusanalysepakket van het woordenboekaanmaaksysteem TshwaneLex geanalyseerd. Aandacht gaat zowel naar het verwer-ken van onbewerkte corpusdata als naar geannoteerde corpusdata. Wat het laatste betreft wordt aangetoond hoe, met een minimum aan intellectuele arbeid, automatische leertechnieken met suc-ces kunnen worden ingezet om corpora voor lexicografische doeleinden aan te maken waarin de woordklassen expliciet worden vermeld. Alle stappen van de redenering worden geïllustreerd met gegevens uit het Engels en Noord-Sotho. De instrumenten en technieken zelf zijn echter allemaal taalonafhankelijk, waardoor de veelbelovende resultaten van deze studie verreikend zijn.

    Sleutelwoorden: LEXICOGRAFIE, WOORDENBOEK, SOFTWARE, WOORDENBOEK-AANMAAKSYSTEEM, CORPUSANALYSEPAKKET, TSHWANELEX, CORPUS, CORPUSANNO-TATIE, WOORDKLASSETAGGER, AUTOMATISCHE LEERTECHNIEKEN, NOORD-SOTHO

  18. COALA--A Computational System for Interlanguage Analysis.

    Science.gov (United States)

    Pienemann, Manfred

    1992-01-01

    Describes a linguistic analysis computational system that responds to highly complex queries about morphosyntactic and semantic structures contained in large sets of language acquisition data by identifying, displaying, and analyzing sentences that meet the defined linguistic criteria. (30 references) (Author/CB)

  19. On different approaches to syntactic analysis into bi-lexical dependencies: An empirical comparison of direct, PCFG-based, and HPSG-based parsers

    Directory of Open Access Journals (Sweden)

    Angelina Ivanova

    2016-04-01

    Full Text Available We compare three different approaches to parsing into syntactic, bi- lexical dependencies for English: a ‘direct’ data-driven dependency parser, a statistical phrase structure parser, and a hybrid, ‘deep’ grammar-driven parser. The analyses from the latter two are post- converted to bi-lexical dependencies. Through this ‘reduction’ of all three approaches to syntactic dependency parsers, we determine empirically what performance can be obtained for a common set of de- pendency types for English, across a broad variety of domains. In doing so, we observe what trade-offs apply along three dimensions, accuracy, efficiency, and resilience to domain variation. Our results suggest that the hand-built grammar in one of our parsers helps in both accuracy and cross-domain parsing performance, but these accuracy gains do not necessarily translate to improvements in the downstream task of negation resolution.

  20. Inhibition of type I NKT cells by retinoids or following sulfatide-mediated activation of type II NKT cells attenuates alcoholic liver disease

    Science.gov (United States)

    Maricic, Igor; Sheng, Huiming; Marrero, Idania; Seki, Ehikiro; Kisseleva, Tatiana; Chaturvedi, Som; Molle, Natasha; Mathews, K. Stephanie; Gao, Bin; Kumar, Vipin

    2015-01-01

    Innate immune mechanisms leading to liver injury following chronic alcohol ingestion are poorly understood. Natural killer T (NKT) cells, enriched in the liver and comprised of at least two distinct subsets, type I and type II, recognize different lipid antigens presented by CD1d molecules. We have investigated whether differential activation of NKT cell subsets orchestrates inflammatory events leading to alcoholic liver disease (ALD). We found that following chronic plus binge feeding of Lieber-DeCarli liquid diet in male C57BL/6 mice, type I but not type II NKT cells are activated leading to recruitment of inflammatory Gr-1highCD11b+ cells into liver. A central finding is that liver injury following alcohol feeding is dependent upon type I NKT cells. Thus liver injury is significantly inhibited in Jα18−/− mice deficient in type I NKT cells as well as following their inactivation by sulfatide-mediated activation of type II NKT cells. Furthermore we have identified a novel pathway involving all-trans retinoic acid (ATRA) and its receptor RARγ signaling that inhibits type I NKT cells and consequently ALD. A semi-quantitative PCR analysis of hepatic gene expression of some of the key proinflammatory molecules shared in human disease indicated that their upregulation in ALD is dependent upon type I NKT cells. Conclusion Type I but not type II NKT cells become activated following alcohol feeding. Type I NKT cells-induced inflammation and neutrophil recruitment results in liver tissue damage while type II NKT cells protect from injury in ALD. Inhibition of type I NKT cells by retinoids or by sulfatide prevents ALD. Since the CD1d pathway is highly conserved between mice and humans, NKT cell subsets might be targeted for potential therapeutic intervention in ALD. PMID:25477000

  1. Clinical problems of the patients with subarachnoid hemorrhage which had not been diagnosed until they suffered ischemic neurological deficits and/or rebleeding

    International Nuclear Information System (INIS)

    Fukui, Issei; Ishikawa, Tatsuya; Moroi, Junta

    2011-01-01

    A small part of the patients with subarachnoid hemorrhage (SAH) are not properly diagnosed until they suffer ischemic neurological deficits and/or rebleeding during vasospasm. We therefore investigated the clinical profile of such patients. We retrospectively analyzed 581 patients with aneurysmal SAH experienced in our institute between 2001 and 2009. Patient's characteristics, presence and severity of headache before final diagnosis, imaging investigations they received, their World Federation of Neurological Surgeons (WFNS) grades at admission, the location of aneurysm, treatment, and outcome at discharge were investigated. Five patients were not correctly diagnosed until they presented neurological deficits due to vasospasm or experienced simultaneous rebleeding. Their mean age was 69.4, and all were female. Although all patients had a bad headache, they did not undergo any imaging examinations. Immediate coil embolization was performed for 2 patients, 1 died due to rupturing during embolization procedure. The other 3 were treated by open clipping surgery (2 delayed and 1 immediate). Four patients had some disability as a sequel, and their outcomes were significantly worse compared with 53 patients with SAH in WFNS Grade II. The patients with SAH who had not been properly diagnosed in the acute stage had a poor outcome. We should be very careful when we see patients complaining of severe headache. (author)

  2. What is the role of community capabilities for maternal health? An exploration of community capabilities as determinants to institutional deliveries in Bangladesh, India, and Uganda

    Directory of Open Access Journals (Sweden)

    Ligia Paina

    2016-11-01

    Full Text Available Abstract Background While community capabilities are recognized as important factors in developing resilient health systems and communities, appropriate metrics for these have not yet been developed. Furthermore, the role of community capabilities on access to maternal health services has been underexplored. In this paper, we summarize the development of a community capability score based on the Future Health System (FHS project’s experience in Bangladesh, India, and Uganda, and, examine the role of community capabilities as determinants of institutional delivery in these three contexts. Methods We developed a community capability score using a pooled dataset containing cross-sectional household survey data from Bangladesh, India, and Uganda. Our main outcome of interest was whether the woman delivered in an institution. Our predictor variables included the community capability score, as well as a series of previously identified determinants of maternal health. We calculate both population-averaged effects (using GEE logistic regression, as well as sub-national level effects (using a mixed effects model. Results Our final sample for analysis included 2775 women, of which 1238 were from Bangladesh, 1199 from India, and 338 from Uganda. We found that individual-level determinants of institutional deliveries, such as maternal education, parity, and ante-natal care access were significant in our analysis and had a strong impact on a woman’s odds of delivering in an institution. We also found that, in addition to individual-level determinants, greater community capability was significantly associated with higher odds of institutional delivery. For every additional capability, the odds of institutional delivery would increase by up to almost 6 %. Conclusion Individual-level characteristics are strong determinants of whether a woman delivered in an institution. However, we found that community capability also plays an important role, and should be

  3. COGNITIVE METAPHOR IN MODERN LINGUISTICS

    Directory of Open Access Journals (Sweden)

    Antonina KARTASHOVA

    2010-11-01

    Full Text Available The article outlines the basic notions connected with cognitive metaphor which has lately undergone a thorough examination. The contribution made by linguists resulted in the rise of cognitive linguistics. This science regards metaphor not as a linguistic phenomenon but as a mental one that establishes connection between language and mind in the form of understanding new notions in terms of notions and categories known due to the previously gained experience. The interaction of new and previous experience can generate three main types of metaphors: structural metaphors which imply the structuring of target domain in terms of source domain, ontological metaphors which view abstract notions as concrete objects with clear outlines and orientational metaphors which represent the ways to fix the experience of spatial orientation. The classification of metaphors complemented with examples is presented below along with some controversial cases of determining the type of metaphor.

  4. A grammar of newspaper editorial language: The complex sentence ...

    African Journals Online (AJOL)

    This study sets out to examine what linguistic choices are made at the level of the sentence in selected English Language editorials in a particular newspaper in Ghana – the Daily Graphic. Data for the study consists of 338 selected sentences from 22 editorials of the Daily Graphic published in January 2008. We have ...

  5. Retrieval Interference in Syntactic Processing: The Case of Reflexive Binding in English.

    Science.gov (United States)

    Patil, Umesh; Vasishth, Shravan; Lewis, Richard L

    2016-01-01

    It has been proposed that in online sentence comprehension the dependency between a reflexive pronoun such as himself/herself and its antecedent is resolved using exclusively syntactic constraints. Under this strictly syntactic search account, Principle A of the binding theory-which requires that the antecedent c-command the reflexive within the same clause that the reflexive occurs in-constrains the parser's search for an antecedent. The parser thus ignores candidate antecedents that might match agreement features of the reflexive (e.g., gender) but are ineligible as potential antecedents because they are in structurally illicit positions. An alternative possibility accords no special status to structural constraints: in addition to using Principle A, the parser also uses non-structural cues such as gender to access the antecedent. According to cue-based retrieval theories of memory (e.g., Lewis and Vasishth, 2005), the use of non-structural cues should result in increased retrieval times and occasional errors when candidates partially match the cues, even if the candidates are in structurally illicit positions. In this paper, we first show how the retrieval processes that underlie the reflexive binding are naturally realized in the Lewis and Vasishth (2005) model. We present the predictions of the model under the assumption that both structural and non-structural cues are used during retrieval, and provide a critical analysis of previous empirical studies that failed to find evidence for the use of non-structural cues, suggesting that these failures may be Type II errors. We use this analysis and the results of further modeling to motivate a new empirical design that we use in an eye tracking study. The results of this study confirm the key predictions of the model concerning the use of non-structural cues, and are inconsistent with the strictly syntactic search account. These results present a challenge for theories advocating the infallibility of the human

  6. Hominid mandibular corpus shape variation and its utility for recognizing species diversity within fossil Homo.

    Science.gov (United States)

    Lague, Michael R; Collard, Nicole J; Richmond, Brian G; Wood, Bernard A

    2008-12-01

    Mandibular corpora are well represented in the hominin fossil record, yet few studies have rigorously assessed the utility of mandibular corpus morphology for species recognition, particularly with respect to the linear dimensions that are most commonly available. In this study, we explored the extent to which commonly preserved mandibular corpus morphology can be used to: (i) discriminate among extant hominid taxa and (ii) support species designations among fossil specimens assigned to the genus Homo. In the first part of the study, discriminant analysis was used to test for significant differences in mandibular corpus shape at different taxonomic levels (genus, species and subspecies) among extant hominid taxa (i.e. Homo, Pan, Gorilla, Pongo). In the second part of the study, we examined shape variation among fossil mandibles assigned to Homo (including H. habilis sensu stricto, H. rudolfensis, early African H. erectus/H. ergaster, late African H. erectus, Asian H. erectus, H. heidelbergensis, H. neanderthalensis and H. sapiens). A novel randomization procedure designed for small samples (and using group 'distinctness values') was used to determine whether shape variation among the fossils is consistent with conventional taxonomy (or alternatively, whether a priori taxonomic groupings are completely random with respect to mandibular morphology). The randomization of 'distinctness values' was also used on the extant samples to assess the ability of the test to recognize known taxa. The discriminant analysis results demonstrated that, even for a relatively modest set of traditional mandibular corpus measurements, we can detect significant differences among extant hominids at the genus and species levels, and, in some cases, also at the subspecies level. Although the randomization of 'distinctness values' test is more conservative than discriminant analysis (based on comparisons with extant specimens), we were able to detect at least four distinct groups among the

  7. Hominid mandibular corpus shape variation and its utility for recognizing species diversity within fossil Homo

    Science.gov (United States)

    Lague, Michael R; Collard, Nicole J; Richmond, Brian G; Wood, Bernard A

    2008-01-01

    Mandibular corpora are well represented in the hominin fossil record, yet few studies have rigorously assessed the utility of mandibular corpus morphology for species recognition, particularly with respect to the linear dimensions that are most commonly available. In this study, we explored the extent to which commonly preserved mandibular corpus morphology can be used to: (i) discriminate among extant hominid taxa and (ii) support species designations among fossil specimens assigned to the genus Homo. In the first part of the study, discriminant analysis was used to test for significant differences in mandibular corpus shape at different taxonomic levels (genus, species and subspecies) among extant hominid taxa (i.e. Homo, Pan, Gorilla, Pongo). In the second part of the study, we examined shape variation among fossil mandibles assigned to Homo(including H. habilis sensu stricto, H. rudolfensis, early African H. erectus/H. ergaster, late African H. erectus, Asian H. erectus, H. heidelbergensis, H. neanderthalensis and H. sapiens). A novel randomization procedure designed for small samples (and using group ‘distinctness values’) was used to determine whether shape variation among the fossils is consistent with conventional taxonomy (or alternatively, whether a priori taxonomic groupings are completely random with respect to mandibular morphology). The randomization of ‘distinctness values’ was also used on the extant samples to assess the ability of the test to recognize known taxa. The discriminant analysis results demonstrated that, even for a relatively modest set of traditional mandibular corpus measurements, we can detect significant differences among extant hominids at the genus and species levels, and, in some cases, also at the subspecies level. Although the randomization of ‘distinctness values’ test is more conservative than discriminant analysis (based on comparisons with extant specimens), we were able to detect at least four distinct groups

  8. Levelling-out and register variation in the translations of experienced and inexperienced translators: a corpus-based study

    Directory of Open Access Journals (Sweden)

    Karien Redelinghuys

    2016-07-01

    Full Text Available Explicitation, simplification, normalisation and levelling-out, the four features of translation proposed by Baker (1996, have attracted considerable attention in translation studies. Although the first three have been studied extensively, levelling-out has been the subject of less empirical investigation. Furthermore, there are no studies to date that have investigated the extent to which levelling-out occurs in translations by experienced translators and inexperienced translators. In this study, levelling-out is operationalised in terms of register. It is hypothesised that less register variation will be apparent in translations by inexperienced translators and, in keeping with the features of translation hypothesis, it is predicted that select linguistic features will demonstrate less register variation in translations than in non-translations. A custom-built corpus was compiled to test these hypotheses. While some light is shed on how translation expertise contributes to register sensitivity and the distribution of certain features across different registers, little evidence could be found for levelling-out as register variation is evident in the translation corpora.

  9. Computing with Words in Decision support Systems: An overview on Models and Applications

    Directory of Open Access Journals (Sweden)

    Luis Martinez

    2010-10-01

    Full Text Available Decision making is inherent to mankind, as human beings daily face situations in which they should choose among different alternatives by means of reasoning and mental processes. Many of these decision problems are under uncertain environments with vague and imprecise information. This type of information is usually modelled by linguistic information because of the common use of language by the experts involved in the given decision situations, originating linguistic decision making. The use of linguistic information in decision making demands processes of Computing with Words to solve the related decision problems. Different methodologies and approaches have been proposed to accomplish such processes in an accurate and interpretable way. The good performance of linguistic computing dealing with uncertainty has caused a spread use of it in different types of decision based applications. This paper overviews the more significant and extended linguistic computing models due to its key role in linguistic decision making and a wide range of the most recent applications of linguistic decision support models.

  10. On the concept of a linguistic variable

    International Nuclear Information System (INIS)

    Kerre, E.

    1996-01-01

    The concept of a linguistic variable plays a crucial role in the representation of imprecise knowledge in information sciences. A variable is called linguistic as soon as its values are linguistic terms rather than numerical ones. The power of daily communication and common sense reasoning lies in the use of such linguistic values. Even when exact numerical values are available, experts tend to transform these values into linguistic ones. A physician will usually translate a numerical measurement of a blood pressure into linguistic specifications such as normal, very high, too low... Zadeh has argued that the set of values for a linguistic variable assumes a more-or-less fixed structure. Starting from an atomic value and its antonym all remaining values are constructed using logical connectives on the one hand and linguistic hedges on the other hand. In this paper we will describe how to represent the value set of a linguistic variable in general and of linguistic hedges in particular

  11. Time-dependent magnetization of a type-II superconductor numerically calculated by using the flux-creep equation

    International Nuclear Information System (INIS)

    Lee, J. H.; Park, I. S.; Ahmad, D.; Kim, D.; Kim, Y. C.; Ko, R. K.; Jeong, D. Y.

    2012-01-01

    The macroscopic magnetic behaviors of a type-II superconductor, such as the field- or the temperature-dependent magnetization, have been described by using critical state models. However, because the models are time-independent, the magnetic relaxation in a type-II superconductor cannot be described by them, and the time dependence of the magnetization can affect the field or the temperature-dependent magnetization curve described by the models. In order to avoid the time independence of critical state models, we try the numerical calculation used by Qin et al., who mainly calculated the temperature dependence of the ac susceptibility χ(T). Their calculation showed that the frequency-dependent χ(T) could be obtained by using the flux-creep equation. We calculated the field-dependent magnetization and magnetic relaxation by using a numerical method. The calculated field-dependent magnetization M(H) curves shows the shapes of a typical type-II superconductor. The calculated magnetic relaxation do not show a logarithmic decay of the magnetization, but the addition of a surface barrier to the relaxation calculation caused a clear logarithmic decay of the magnetization, producing a crossover at a mid-time. This means that the logarithmic magnetic relaxation is caused by not only flux creep but also a combination of flux creep and a surface barrier.

  12. Analysis of the type II-A CRISPR-Cas system of Streptococcus agalactiae reveals distinctive features according to genetic lineages

    Science.gov (United States)

    Lier, Clément; Baticle, Elodie; Horvath, Philippe; Haguenoer, Eve; Valentin, Anne-Sophie; Glaser, Philippe; Mereghetti, Laurent; Lanotte, Philippe

    2015-01-01

    CRISPR-Cas systems (clustered regularly interspaced short palindromic repeats/CRISPR-associated proteins) are found in 90% of archaea and about 40% of bacteria. In this original system, CRISPR arrays comprise short, almost unique sequences called spacers that are interspersed with conserved palindromic repeats. These systems play a role in adaptive immunity and participate to fight non-self DNA such as integrative and conjugative elements, plasmids, and phages. In Streptococcus agalactiae, a bacterium implicated in colonization and infections in humans since the 1960s, two CRISPR-Cas systems have been described. A type II-A system, characterized by proteins Cas9, Cas1, Cas2, and Csn2, is ubiquitous, and a type I–C system, with the Cas8c signature protein, is present in about 20% of the isolates. Unlike type I–C, which appears to be non-functional, type II-A appears fully functional. Here we studied type II-A CRISPR-cas loci from 126 human isolates of S. agalactiae belonging to different clonal complexes that represent the diversity of the species and that have been implicated in colonization or infection. The CRISPR-cas locus was analyzed both at spacer and repeat levels. Major distinctive features were identified according to the phylogenetic lineages previously defined by multilocus sequence typing, especially for the sequence type (ST) 17, which is considered hypervirulent. Among other idiosyncrasies, ST-17 shows a significantly lower number of spacers in comparison with other lineages. This characteristic could reflect the peculiar virulence or colonization specificities of this lineage. PMID:26124774

  13. How Hearing Impairment Affects Sentence Comprehension: Using Eye Fixations to Investigate the Duration of Speech Processing

    DEFF Research Database (Denmark)

    Wendt, Dorothea; Kollmeier, Birger; Brand, Thomas

    2015-01-01

    ; this measure uses eye fixations recorded while the participant listens to a sentence. Eye fixations toward a target picture (which matches the aurally presented sentence) were measured in the presence of a competitor picture. Based on the recorded eye fixations, the single target detection amplitude, which...... reflects the tendency of the participant to fixate the target picture, was used as a metric to estimate the duration of sentence processing. The single target detection amplitude was calculated for sentence structures with different levels of linguistic complexity and for different listening conditions......: in quiet and in two different noise conditions. Participants with hearing impairment spent more time processing sentences, even at high levels of speech intelligibility. In addition, the relationship between the proposed online measure and listener-specific factors, such as hearing aid use and cognitive...

  14. A case of total agenesis of the corpus callosum

    International Nuclear Information System (INIS)

    Sakamoto, Masanobu; Takeda, Katsuhiko; Bandou, Mitsuaki; Murayama, Shigeo; Sakuta, Manabu

    1985-01-01

    We have reported a case of agenesis of the corpus callosum, in which NMR-CT revealed a complete defect of it, and have examined the localization of the speech center of this patient. The patient is a right-handed 26-year-old man who has complained of headache on the parietal region. His neurological examination revealed only a mild mental difficulty (IQ 77). X-ray CT showed the lateral ventricles to be separated widely and the posterior horns dilated, which were compatible with the agenesis of the corpus callosum. Further, NMR-CT has revealed a total agenesis of the corpus callosum. NMR-CT seems to be highly useful for the detection of the degree of the callosal defect. We have carried out the intracarotid amobarbital injection (Wada's test) for the determination of the lateralization of cerebral speech dominance. It had been reported by some authors that when it comes to the cerebral speech dominance, acallosal patients had no difference between each hemisphere. However, our results have demonstrated a left sided dominance. (author)

  15. Book Review: Backhaus, Peter (2007: Linguistic Landscapes: A Comparative Study of Urban Multilingualism in Tokyo. Clevedon: Multilingual Matters; 158 Pages ISBN 9781853599460

    Directory of Open Access Journals (Sweden)

    Omar Alomoush

    2015-12-01

    future.    As pointed out throughout, Backhaus presents a congruent methodological approach, which has added new dimensions to the existing field of linguistic landscape. More specifically, Backhaus identifies three guiding research questions: Linguistic Landscape by whom? Linguistic landscape for whom?  Linguistic landscape quo vadis?. At the same time, his definition of the unit of analysis as described above contributed greatly to linguistic landscape research. Although Backhaus relies on former studies, Backhaus has created analytical categories neglected by previous studies, especially linguistic idiosyncrasies, and uses his own terminology, particularly ‘part writing’ with its main types adopted from the field of musicology: homophonic, mixed, polyphonic, monophonic signs. The same notions with the exception of monophonic signs have been implemented by Reh (2004, but the terminological designations are different. I wonder why Backhaus uses the term ‘polyphonic signs’, which might be replaced by code mixing or switching in that it may be mainly subdivided into intra-sentential code-switches and inter-sentential code-switches. This work also counts on the observations made by Scollon and Scollon (2003, especially those on code prominence and layering. For example, code preference as an analytical category in Backhaus’ quantitative study relies on placement and size in case that there is a conflict, font size outweighs order.  As far as my current project is concerned, the relevance of this work comes from the methodological considerations provided, which will help to expand upon Backhaus’ paradigm to apply in the linguistic landscape of urban Jordan. In other words, we will adapt and build upon this methodological framework to devise a coding scheme suitable for the linguistic landscape of Jordanian cities.

  16. A linguistic rule-based approach to extract drug-drug interactions from pharmacological documents.

    Science.gov (United States)

    Segura-Bedmar, Isabel; Martínez, Paloma; de Pablo-Sánchez, César

    2011-03-29

    A drug-drug interaction (DDI) occurs when one drug influences the level or activity of another drug. The increasing volume of the scientific literature overwhelms health care professionals trying to be kept up-to-date with all published studies on DDI. This paper describes a hybrid linguistic approach to DDI extraction that combines shallow parsing and syntactic simplification with pattern matching. Appositions and coordinate structures are interpreted based on shallow syntactic parsing provided by the UMLS MetaMap tool (MMTx). Subsequently, complex and compound sentences are broken down into clauses from which simple sentences are generated by a set of simplification rules. A pharmacist defined a set of domain-specific lexical patterns to capture the most common expressions of DDI in texts. These lexical patterns are matched with the generated sentences in order to extract DDIs. We have performed different experiments to analyze the performance of the different processes. The lexical patterns achieve a reasonable precision (67.30%), but very low recall (14.07%). The inclusion of appositions and coordinate structures helps to improve the recall (25.70%), however, precision is lower (48.69%). The detection of clauses does not improve the performance. Information Extraction (IE) techniques can provide an interesting way of reducing the time spent by health care professionals on reviewing the literature. Nevertheless, no approach has been carried out to extract DDI from texts. To the best of our knowledge, this work proposes the first integral solution for the automatic extraction of DDI from biomedical texts.

  17. OUTCOME OF GARTLAND TYPEII SUPRACONDYLAR FRACTURES OF HUMERUS TREATED BY CONSERVATIVE METHOD

    Directory of Open Access Journals (Sweden)

    Dinesh Mitra

    2015-08-01

    Full Text Available BACKGROUND: The current literatures recommend operative method (closed reduction and pinning for type II supracondylar fractures of humerus. But some surgeons still prefer conservative method for type II supracondylar fractures of humerus. We pr esent results of 14 cases of type II supracondylar fractures treated with CR and AE POP immobilization . The purpose of this study is to evaluate the outcome of conservative treatment in management of type II supracondylar fracture of humerus. MATERIALS AND METHODS: Fourteen children treated by conservative methods (CR & AE POP between January 2013 and December 2014 is included in this study. The mean age group is 6.8 years (3 years - 11 years. The patient follow up is done for a minimum of 10 - 12 weeks. Treatment outcome is based on final clinical and radiological assessments and grading of results was done using Flynn’s criteria. RESULTS: Gartland type II fracture gives 82% excellent results and 28 % good results as per Flynn’s criteria. Of the 14 patien ts only two cases required re manipulation. Surgical intervention was not needed for any of the patients. No patients in this study developed compartment syndrome / cubitus varus deformity. CONCLUSION: Satisfactory results can be obtained with conservative treatment (closed reduction and above elbow POP if proper selection of the patient and careful clinical and radiological follow up is done

  18. ABSTRACT NOUNS IN THE SPEECH OF THE EMGLISHMEN (BASED ON FICTION WORKS AND BRITISH NATIONAL CORPUS

    Directory of Open Access Journals (Sweden)

    Natalia Veniaminovna Khokhlova

    2015-01-01

    Full Text Available The research aimed at studying the use of abstract nouns in the Englishmen’s speech from the standpoint of sociolinguistics. The article introduces a new, sociolinguistic, approach to research of abstract nouns; it is also the first time they are studied in a language corpus. The first stage of the research was based on fiction literary works: abstract nouns were extracted of analysis from the statements of the characters belonging to the opposite social classes. Later, these data was compared with the results of the original corpus research based on the British national corpus: sentences with nouns were selected out of the conversational subcorpus of BNC and were further sorted into abstract, concrete and words denoting people. Then, their frequency and vocabulary was studied with regards to speakers’ age, gender and social standing. The results revealed that abstract words are used more often that concrete ones regardless of the speaker’s social characteristics, however, the size and content of vocabulary is different (it is generally more substantial in the speech of women and representatives of higher social classes. The results of this research can be used in elaborating a course of the English language or in teaching general linguistics, sociolinguistics and country studies. 

  19. Aligning Assessments for COSMA Accreditation

    Science.gov (United States)

    Laird, Curt; Johnson, Dennis A.; Alderman, Heather

    2015-01-01

    Many higher education sport management programs are currently in the process of seeking accreditation from the Commission on Sport Management Accreditation (COSMA). This article provides a best-practice method for aligning student learning outcomes with a sport management program's mission and goals. Formative and summative assessment procedures…

  20. Automated insertion of sequences into a ribosomal RNA alignment: An application of computational linguistics in molecular biology

    Energy Technology Data Exchange (ETDEWEB)

    Taylor, Ronald C. [Case Western Reserve Univ., Cleveland, OH (United States)

    1991-11-01

    This thesis involved the construction of (1) a grammar that incorporates knowledge on base invariancy and secondary structure in a molecule and (2) a parser engine that uses the grammar to position bases into the structural subunits of the molecule. These concepts were combined with a novel pinning technique to form a tool that semi-automates insertion of a new species into the alignment for the 16S rRNA molecule (a component of the ribosome) maintained by Dr. Carl Woese`s group at the University of Illinois at Urbana. The tool was tested on species extracted from the alignment and on a group of entirely new species. The results were very encouraging, and the tool should be substantial aid to the curators of the 16S alignment. The construction of the grammar was itself automated, allowing application of the tool to alignments for other molecules. The logic programming language Prolog was used to construct all programs involved. The computational linguistics approach used here was found to be a useful way to attach the problem of insertion into an alignment.

  1. Automated insertion of sequences into a ribosomal RNA alignment: An application of computational linguistics in molecular biology

    Energy Technology Data Exchange (ETDEWEB)

    Taylor, R.C.

    1991-11-01

    This thesis involved the construction of (1) a grammar that incorporates knowledge on base invariancy and secondary structure in a molecule and (2) a parser engine that uses the grammar to position bases into the structural subunits of the molecule. These concepts were combined with a novel pinning technique to form a tool that semi-automates insertion of a new species into the alignment for the 16S rRNA molecule (a component of the ribosome) maintained by Dr. Carl Woese's group at the University of Illinois at Urbana. The tool was tested on species extracted from the alignment and on a group of entirely new species. The results were very encouraging, and the tool should be substantial aid to the curators of the 16S alignment. The construction of the grammar was itself automated, allowing application of the tool to alignments for other molecules. The logic programming language Prolog was used to construct all programs involved. The computational linguistics approach used here was found to be a useful way to attach the problem of insertion into an alignment.

  2. Blind quantum computation protocol in which Alice only makes measurements

    Science.gov (United States)

    Morimae, Tomoyuki; Fujii, Keisuke

    2013-05-01

    Blind quantum computation is a new secure quantum computing protocol which enables Alice (who does not have sufficient quantum technology) to delegate her quantum computation to Bob (who has a full-fledged quantum computer) in such a way that Bob cannot learn anything about Alice's input, output, and algorithm. In previous protocols, Alice needs to have a device which generates quantum states, such as single-photon states. Here we propose another type of blind computing protocol where Alice does only measurements, such as the polarization measurements with a threshold detector. In several experimental setups, such as optical systems, the measurement of a state is much easier than the generation of a single-qubit state. Therefore our protocols ease Alice's burden. Furthermore, the security of our protocol is based on the no-signaling principle, which is more fundamental than quantum physics. Finally, our protocols are device independent in the sense that Alice does not need to trust her measurement device in order to guarantee the security.

  3. Sentence‐Chain Based Seq2seq Model for Corpus Expansion

    Directory of Open Access Journals (Sweden)

    Euisok Chung

    2017-08-01

    Full Text Available This study focuses on a method for sequential data augmentation in order to alleviate data sparseness problems. Specifically, we present corpus expansion techniques for enhancing the coverage of a language model. Recent recurrent neural network studies show that a seq2seq model can be applied for addressing language generation issues; it has the ability to generate new sentences from given input sentences. We present a method of corpus expansion using a sentence‐chain based seq2seq model. For training the seq2seq model, sentence chains are used as triples. The first two sentences in a triple are used for the encoder of the seq2seq model, while the last sentence becomes a target sequence for the decoder. Using only internal resources, evaluation results show an improvement of approximately 7.6% relative perplexity over a baseline language model of Korean text. Additionally, from a comparison with a previous study, the sentence chain approach reduces the size of the training data by 38.4% while generating 1.4‐times the number of n‐grams with superior performance for English text.

  4. Formal linguistics as a cue to demographic history.

    Science.gov (United States)

    Longobardi, Giuseppe; Ceolin, Andrea; Ecay, Aaron; Ghirotto, Silvia; Guardiano, Cristina; Irimia, Monica-Alexandrina; Michelioudakis, Dimitris; Radkevich, Nina; Pettener, Davide; Luiselli, Donata; Barbujani, Guido

    2016-06-20

    Beyond its theoretical success, the development of molecular genetics has brought about the possibility of extraordinary progress in the study of classification and in the inference of the evolutionary history of many species and populations. A major step forward was represented by the availability of extremely large sets of molecular data suited to quantitative and computational treatments. In this paper, we argue that even in cognitive sciences, purely theoretical progress in a discipline such as linguistics may have analogous impact. Thus, exactly on the model of molecular biology, we propose to unify two traditionally unrelated lines of linguistic investigation: 1) the formal study of syntactic variation (parameter theory) in the biolinguistic program; 2) the reconstruction of relatedness among languages (phylogenetic taxonomy). The results of our linguistic analysis have thus been plotted against data from population genetics and the correlations have turned out to be largely significant: given a non-trivial set of languages/populations, the description of their variation provided by the comparison of systematic parametric analysis and molecular anthropology informatively recapitulates their history and relationships. As a result, we can claim that the reality of some parametric model of the language faculty and language acquisition/transmission (more broadly of generative grammar) receives strong and original support from its historical heuristic power. Then, on these grounds, we can begin testing Darwin's prediction that, when properly generated, the trees of human populations and of their languages should eventually turn out to be significantly parallel.

  5. The Genome of a Tortoise Herpesvirus (Testudinid Herpesvirus 3) Has a Novel Structure and Contains a Large Region That Is Not Required for Replication In Vitro or Virulence In Vivo

    Science.gov (United States)

    Gandar, Frédéric; Wilkie, Gavin S.; Gatherer, Derek; Kerr, Karen; Marlier, Didier; Diez, Marianne; Marschang, Rachel E.; Mast, Jan; Dewals, Benjamin G.

    2015-01-01

    ABSTRACT Testudinid herpesvirus 3 (TeHV-3) is the causative agent of a lethal disease affecting several tortoise species. The threat that this virus poses to endangered animals is focusing efforts on characterizing its properties, in order to enable the development of prophylactic methods. We have sequenced the genomes of the two most studied TeHV-3 strains (1976 and 4295). TeHV-3 strain 1976 has a novel genome structure and is most closely related to a turtle herpesvirus, thus supporting its classification into genus Scutavirus, subfamily Alphaherpesvirinae, family Herpesviridae. The sequence of strain 1976 also revealed viral counterparts of cellular interleukin-10 and semaphorin, which have not been described previously in members of subfamily Alphaherpesvirinae. TeHV-3 strain 4295 is a mixture of three forms (m1, m2, and M), in which, in comparison to strain 1976, the genomes exhibit large, partially overlapping deletions of 12.5 to 22.4 kb. Viral subclones representing these forms were isolated by limiting dilution assays, and each replicated in cell culture comparably to strain 1976. With the goal of testing the potential of the three forms as attenuated vaccine candidates, strain 4295 was inoculated intranasally into Hermann's tortoises (Testudo hermanni). All inoculated subjects died, and PCR analyses demonstrated the ability of the m2 and M forms to spread and invade the brain. In contrast, the m1 form was detected in none of the organs tested, suggesting its potential as the basis of an attenuated vaccine candidate. Our findings represent a major step toward characterizing TeHV-3 and developing prophylactic methods against it. IMPORTANCE Testudinid herpesvirus 3 (TeHV-3) causes a lethal disease in tortoises, several species of which are endangered. We have characterized the viral genome and used this information to take steps toward developing an attenuated vaccine. We have sequenced the genomes of two strains (1976 and 4295), compared their growth in

  6. Is There Such a Thing As a "Perfect" Parent?

    Science.gov (United States)

    ... socially, emotionally, intellectually and physically. A child's maturity level may be different for the various qualities he is developing, including social skills, athletic abilities and learning capabilities. He might be strong in math but weak in writing (or vice versa), or ...

  7. Academic writing in a corpus of 4th grade science notebooks: An analysis of student language use and adult expectations of the genres of school science

    Science.gov (United States)

    Esquinca, Alberto

    This is a study of language use in the context of an inquiry-based science curriculum in which conceptual understanding ratings are used split texts into groups of "successful" and "unsuccessful" texts. "Successful" texts could include known features of science language. 420 texts generated by students in 14 classrooms from three school districts, culled from a prior study on the effectiveness of science notebooks to assess understanding, in addition to the aforementioned ratings are the data sources. In science notebooks, students write in the process of learning (here, a unit on electricity). The analytical framework is systemic functional linguistics (Halliday and Matthiessen, 2004; Eggins, 2004), specifically the concepts of genre, register and nominalization. Genre classification involves an analysis of the purpose and register features in the text (Schleppegrell, 2004). The use of features of the scientific academic register, namely the use relational processes and nominalization (Halliday and Martin, 1993), requires transitivity analysis and noun analysis. Transitivity analysis, consisting of the identification of the process type, is conducted on 4737 ranking clauses. A manual count of each noun used in the corpus allows for a typology of nouns. Four school science genres, procedures, procedural recounts reports and explanations, are found. Most texts (85.4%) are factual, and 14.1% are classified as explanations, the analytical genre. Logistic regression analysis indicates that there is no significant probability that the texts classified as explanation are placed in the group of "successful" texts. In addition, material process clauses predominate in the corpus, followed by relational process clauses. Results of a logistic regression analysis indicate that there is a significant probability (Chi square = 15.23, p placed in the group of "successful" texts. In addition, 59.5% of 6511 nouns are references to physical materials, followed by references to

  8. Europäisch eingestellt – Valenzforschung mit Parallelkorpora

    Directory of Open Access Journals (Sweden)

    Kamber, Alain

    2009-01-01

    Full Text Available The aim of this research is to demonstrate with a case study the significance of corpus linguistics within the field of verb valency and bilingual lexicography. Specifically, we will introduce a corpus-based process that determines context-sensitive translations of polysemous word forms. Three steps are considered here in detail. First, text evidences of the verb einstellen in the monolingual Deutsches Referenzkorpus (DeReKo will be examined with a collocation analysis. With help of the analytical instrument COSMAS II, the collocation profiles will then be summarized into a typology (senses and subsenses, valency structures and typical collocations. In a further step, the determined senses can be attributed to the corresponding translations of the word form einstellen in other languages (English, French and Italian by means of the multilingual parallel corpus Europarl (Open Source Parallel Corpus OPUS. Finally, the results will be compared to the codifications of commonly used bilingual dictionaries.

  9. A Linguistic Analysis of Suicide-Related Twitter Posts.

    Science.gov (United States)

    O'Dea, Bridianne; Larsen, Mark E; Batterham, Philip J; Calear, Alison L; Christensen, Helen

    2017-09-01

    Suicide is a leading cause of death worldwide. Identifying those at risk and delivering timely interventions is challenging. Social media site Twitter is used to express suicidality. Automated linguistic analysis of suicide-related posts may help to differentiate those who require support or intervention from those who do not. This study aims to characterize the linguistic profiles of suicide-related Twitter posts. Using a dataset of suicide-related Twitter posts previously coded for suicide risk by experts, Linguistic Inquiry and Word Count (LIWC) and regression analyses were conducted to determine differences in linguistic profiles. When compared with matched non-suicide-related Twitter posts, strongly concerning suicide-related posts were characterized by a higher word count, increased use of first-person pronouns, and more references to death. When compared with safe-to-ignore suicide-related posts, strongly concerning suicide-related posts were characterized by increased use of first-person pronouns, greater anger, and increased focus on the present. Other differences were found. The predictive validity of the identified features needs further testing before these results can be used for interventional purposes. This study demonstrates that strongly concerning suicide-related Twitter posts have unique linguistic profiles. The examination of Twitter data for the presence of such features may help to validate online risk assessments and determine those in need of further support or intervention.

  10. Charm photoproduction using electron taggers with the zeus detector at HERA

    Energy Technology Data Exchange (ETDEWEB)

    Aghuzumtsyan, G.

    2006-07-15

    Charm production has been studied at a centre-of-mass energy of {radical}(s)=318 GeV in the ZEUS experiment at the electron-proton collider HERA. The data from the years 1998-2000 are used. Photoproduction is identified by reconstructing the D{sup *}(2010) meson via the decay chain D{sup *+}{yields}K{sup -}{pi}{sup +}{pi}{sup +} (+c.c.) and by selecting the scattered electron in one of the dedicated electron taggers located 44m and 35m away from the interaction point. The total visible cross section and differential cross sections as a function of p{sub T}(D{sup *}), {eta}(D{sup *}) and W were measured in the restricted phase space: -1.0<{eta}D({sup *})<1.5, p{sub T}(D{sup *})>1.5 GeV, 80 GeVin case of the 44m tagger sample, and -1.52.0 GeV, 200 GeVin case of the 35m tagger sample. Next-to-leading-order QCD predictions were calculated in the same kinematic region as the data. The comparison of the theoretical predictions with the data shows good agreement within the theoretical uncertainties. Total charm cross sections were calculated by extrapolating the visible cross sections in the limited (p{sub T}(D{sup *}), {eta}(D{sup *})) phase space to the full kinematic region. The measurements with the 35m tagger overlap with previous untagged data while the 44m tagger data are used to study a new intermediate region 80 GeVthe other HERA measurements. The comparison of the NLO QCD predictions with the data shows good agreement. (orig.)

  11. Individual differences in executive control relate to metaphor processing: an eye movement study of sentence reading.

    Science.gov (United States)

    Columbus, Georgie; Sheikh, Naveed A; Côté-Lecaldare, Marilena; Häuser, Katja; Baum, Shari R; Titone, Debra

    2014-01-01

    Metaphors are common elements of language that allow us to creatively stretch the limits of word meaning. However, metaphors vary in their degree of novelty, which determines whether people must create new meanings on-line or retrieve previously known metaphorical meanings from memory. Such variations affect the degree to which general cognitive capacities such as executive control are required for successful comprehension. We investigated whether individual differences in executive control relate to metaphor processing using eye movement measures of reading. Thirty-nine participants read sentences including metaphors or idioms, another form of figurative language that is more likely to rely on meaning retrieval. They also completed the AX-CPT, a domain-general executive control task. In Experiment 1, we examined sentences containing metaphorical or literal uses of verbs, presented with or without prior context. In Experiment 2, we examined sentences containing idioms or literal phrases for the same participants to determine whether the link to executive control was qualitatively similar or different to Experiment 1. When metaphors were low familiar, all people read verbs used as metaphors more slowly than verbs used literally (this difference was smaller for high familiar metaphors). Executive control capacity modulated this pattern in that high executive control readers spent more time reading verbs when a prior context forced a particular interpretation (metaphorical or literal), and they had faster total metaphor reading times when there was a prior context. Interestingly, executive control did not relate to idiom processing for the same readers. Here, all readers had faster total reading times for high familiar idioms than literal phrases. Thus, executive control relates to metaphor but not idiom processing for these readers, and for the particular metaphor and idiom reading manipulations presented.

  12. Individual Differences in Executive Control Relate to Metaphor Processing: An Eye Movement Study of Sentence Reading

    Directory of Open Access Journals (Sweden)

    Georgie eColumbus

    2015-01-01

    Full Text Available Metaphors are common elements of language that allow us to creatively stretch the limits of word meaning. However, metaphors vary in their degree of novelty, which determines whether people must create new meanings on-line or retrieve previously known metaphorical meanings from memory. Such variations affect the degree to which general cognitive capacities such as executive control are required for successful comprehension.We investigated whether individual differences in executive control relate to metaphor processing using eye movement measures of reading. Thirty-nine participants read sentences including metaphors or idioms, another form of figurative language that is more likely to rely on meaning retrieval. They also completed the AX-CPT, a domain-general executive control task. In Experiment 1, we examined sentences containing metaphorical or literal uses of verbs, presented with or without prior context. In Experiment 2, we examined sentences containing idioms or literal phrases for the same participants to determine whether the link to executive control was qualitatively similar or different to Experiment 1.When metaphors were low familiar, all people read verbs used as metaphors more slowly than verbs used literally (this difference was smaller for high familiar metaphors. Executive control capacity modulated this pattern in that high executive control readers spent more time reading verbs when a prior context forced a particular interpretation (metaphorical or literal, and they had faster total metaphor reading times when there was a prior context. Interestingly, executive control did not relate to idiom processing for the same readers. Here, all readers had faster total reading times for high familiar idioms than literal phrases. Thus, executive control relates to metaphor but not idiom processing for these readers, and for the particular metaphor and idiom reading manipulations presented.

  13. Does a summative portfolio foster the development of capabilities such as reflective practice and understanding ethics? An evaluation from two medical schools.

    Science.gov (United States)

    O'Sullivan, Anthony J; Howe, Amanda C; Miles, Susan; Harris, Peter; Hughes, Chris S; Jones, Philip; Scicluna, Helen; Leinster, Sam J

    2012-01-01

    Portfolios need to be evaluated to determine whether they encourage students to develop in capabilities such as reflective practice and ethical judgment. The aims of this study were (i) to determine whether preparing a portfolio helps promote students' development in a range of capabilities including understanding ethical and legal principles, reflective practice and effective communication, and (ii) to determine to what extent the format of the portfolio affected the outcome by comparing the experiences of students at two different medical schools. A questionnaire was designed to evaluate undergraduate medical students' experiences of completing a portfolio at two medical schools. A total of 526 (45% response rate) students answered the on-line questionnaire. Students from both medical schools gave the highest ranking for the portfolio as a trigger for reflective practice. 63% of students agreed their portfolio helped them develop reflective practice skills (p portfolios helped them understand ethical and legal principles whereas 29% disagreed (p portfolio helped them to develop effective communication. Students perceive portfolio preparation as an effective learning tool for the development of capabilities such as understanding ethical and legal principles and reflective practice, whereas other capabilities such as effective communication require complementary techniques and other modes of assessment.

  14. ImageParser: a tool for finite element generation from three-dimensional medical images

    Directory of Open Access Journals (Sweden)

    Yamada T

    2004-10-01

    Full Text Available Abstract Background The finite element method (FEM is a powerful mathematical tool to simulate and visualize the mechanical deformation of tissues and organs during medical examinations or interventions. It is yet a challenge to build up an FEM mesh directly from a volumetric image partially because the regions (or structures of interest (ROIs may be irregular and fuzzy. Methods A software package, ImageParser, is developed to generate an FEM mesh from 3-D tomographic medical images. This software uses a semi-automatic method to detect ROIs from the context of image including neighboring tissues and organs, completes segmentation of different tissues, and meshes the organ into elements. Results The ImageParser is shown to build up an FEM model for simulating the mechanical responses of the breast based on 3-D CT images. The breast is compressed by two plate paddles under an overall displacement as large as 20% of the initial distance between the paddles. The strain and tangential Young's modulus distributions are specified for the biomechanical analysis of breast tissues. Conclusion The ImageParser can successfully exact the geometry of ROIs from a complex medical image and generate the FEM mesh with customer-defined segmentation information.

  15. A Corpus-based Study on the Use of Contractions by EFL Learners in Argumentative Essays

    Directory of Open Access Journals (Sweden)

    M. Pınar Babanoğlu

    2017-01-01

    Full Text Available Contraction forms in English are mostly occur in speech and informal writing and they are generally avoided in formal writing types such as academic prose, business reports and journal articles, therefore, most teachers discourage their use in academic essays (Biber, Johansonn, Leech, Conrad and Finegan 1999. Contractions in English have two types; negative contractions (isn’t, haven’t, doesn’t and verb contractions (I’m, they’ve, that’s. This corpus based study attempts to investigate contraction usage in learner and native English speaker essays. Major goal is to examine whether learners consider essay writing rules in respect of contractions which are accepted inappropriate for academic prose style. Five corpora, three learner and two native English, were utilized in order to analyze verb and not-contraction forms. Frequency calculations of contraction forms in each corpus compared via log-likelihood measurement for statistical significance.  Results revealed that learners use considerably more contraction forms, especially negative ones, than native English students in their argumentative essays.

  16. When Language Switching has No Apparent Cost: Lexical Access in Sentence Context

    Science.gov (United States)

    Gullifer, Jason W.; Kroll, Judith F.; Dussias, Paola E.

    2013-01-01

    We report two experiments that investigate the effects of sentence context on bilingual lexical access in Spanish and English. Highly proficient Spanish-English bilinguals read sentences in Spanish and English that included a marked word to be named. The word was either a cognate with similar orthography and/or phonology in the two languages, or a matched non-cognate control. Sentences appeared in one language alone (i.e., Spanish or English) and target words were not predictable on the basis of the preceding semantic context. In Experiment 1, we mixed the language of the sentence within a block such that sentences appeared in an alternating run in Spanish or in English. These conditions partly resemble normally occurring inter-sentential code-switching. In these mixed-language sequences, cognates were named faster than non-cognates in both languages. There were no effects of switching the language of the sentence. In Experiment 2, with Spanish-English bilinguals matched closely to those who participated in the first experiment, we blocked the language of the sentences to encourage language-specific processes. The results were virtually identical to those of the mixed-language experiment. In both cases, target cognates were named faster than non-cognates, and the magnitude of the effect did not change according to the broader context. Taken together, the results support the predictions of the Bilingual Interactive Activation + Model (Dijkstra and van Heuven, 2002) in demonstrating that bilingual lexical access is language non-selective even under conditions in which language-specific cues should enable selective processing. They also demonstrate that, in contrast to lexical switching from one language to the other, inter-sentential code-switching of the sort in which bilinguals frequently engage, imposes no significant costs to lexical processing. PMID:23750141

  17. When language switching has no apparent cost: Lexical access in sentence context

    Directory of Open Access Journals (Sweden)

    Jason W. Gullifer

    2013-05-01

    Full Text Available We report two experiments that investigate the effects of sentence context on bilingual lexical access in Spanish and English. Highly proficient Spanish-English bilinguals read sentences in Spanish and English that included a marked word to be named. The word was either a cognate with similar orthography and/or phonology in the two languages, or a matched non-cognate control. Sentences appeared in one language alone (i.e., Spanish or English and target words were not predictable on the basis of the preceding semantic context. In Experiment 1, we mixed the language of the sentence within a block such that sentences appeared in an alternating run in Spanish or in English. These conditions partly resemble normally occurring inter-sentential code-switching. In these mixed language sequences, cognates were named faster than non-cognates in both languages. There were no effects of switching the language of the sentence. In Experiment 2, with Spanish-English bilinguals matched closely to those who participated in the first experiment, we blocked the language of the sentences to encourage language-specific processes. The results were virtually identical to those of the mixed language experiment. In both cases, target cognates were named faster than non-cognates, and the magnitude of the effect did not change according to the broader context. Taken together, the results support the predictions of the Bilingual Interactive Activation + Model (Dijkstra & Van Heuven, 2002 in demonstrating that bilingual lexical access is language nonselective even under conditions in which language-specific cues should enable selective processing. They also demonstrate that, in contrast to lexical switching from one language to the other, inter-sentential code-switching of the sort in which bilinguals frequently engage, imposes no significant costs to lexical processing.

  18. Upgrading of TREAT experimental capabilities

    International Nuclear Information System (INIS)

    Dickerman, C.E.; Rose, D.; Bhattacharyya, S.K.

    1982-01-01

    The TREAT facility at the Argonne National Laboratory site in the Idaho National Engineering Laboratory is being upgraded to provide capabilities for fast-reactor-safety transient experiments not possible at any other experimental facility. Principal TREAT Upgrade (TU) goal is provision for 37-pin size experiments on energetics of core-disruptive accidents (CDA) in fast breeder reactor cores with moderate sodium void coefficients. this goal requires a significant enhancement of the capabilities of the TREAT facility, specifically including reactor control, hardened neutron spectrum incident on the test sample, and enlarged building. The upgraded facility will retain the capability for small-size experiments of the types currently being performed in TREAT. Reactor building and crane upgrading have been completed. TU schedules call for the components of the upgraded reactor system to be finished in 1984, including upgraded TREAT fuel and control system, and expanded coverage by the hodoscope fuel-motion diagnostics system

  19. Linguistic Competence Profiles in English as a Foreign Language in Mexican University Students.

    Science.gov (United States)

    Vivaldo-Lima, Javier; Gonzalez-Robles, Rosa O.; Castillo-Morales, Alberto

    Linguistic competence in English (LCE) as a foreign language has been acknowledged as an important determinant of academic success in higher education. The purpose of this study is to determine the LCE profiles of Mexican freshman students as well as the main factors associated with differences in linguistic competence between proficient and poor…

  20. Treating the Capability for Suicide: A Vital and Understudied Frontier in Suicide Prevention.

    Science.gov (United States)

    Anestis, Michael D; Law, Keyne C; Jin, Hyejin; Houtsma, Claire; Khazem, Lauren R; Assavedo, Brittney L

    2017-10-01

    Current efforts at suicide prevention center largely on reducing suicidal desire among individuals hospitalized for suicidality or being treated for related psychopathology. Such efforts have yielded evidence-based treatments, and yet the national suicide rate has continued to climb. We propose that this disconnect is heavily influenced by an unmet need to consider population-level interventions aimed at reducing the capability for suicide. Drawing on lessons learned from other public health phenomena that have seen drastic declines in frequency in recent decades (HIV, lung cancer, motor vehicle accidents), we propose that current suicidality treatment efforts trail current suicidality theories in their lack of focus on the extent to which individuals thinking about suicide are capable of transitioning from ideation to attempt. We summarize extant evidence for specific capability-centered approaches (e.g., means safety) and propose other options for improving our ability to address this largely overlooked variable. We also note that population-level approaches in this regard would represent an important opportunity to decrease risk in individuals who either lack access to evidence-based care or underreport suicidal ideation, as a reduced capability for suicide would theoretically diminish the potency of suicidal desire and, in this sense, lower the odds of a transition from ideation to attempt. © 2016 The American Association of Suicidology.

  1. Using text mining techniques to extract phenotypic information from the PhenoCHF corpus.

    Science.gov (United States)

    Alnazzawi, Noha; Thompson, Paul; Batista-Navarro, Riza; Ananiadou, Sophia

    2015-01-01

    Phenotypic information locked away in unstructured narrative text presents significant barriers to information accessibility, both for clinical practitioners and for computerised applications used for clinical research purposes. Text mining (TM) techniques have previously been applied successfully to extract different types of information from text in the biomedical domain. They have the potential to be extended to allow the extraction of information relating to phenotypes from free text. To stimulate the development of TM systems that are able to extract phenotypic information from text, we have created a new corpus (PhenoCHF) that is annotated by domain experts with several types of phenotypic information relating to congestive heart failure. To ensure that systems developed using the corpus are robust to multiple text types, it integrates text from heterogeneous sources, i.e., electronic health records (EHRs) and scientific articles from the literature. We have developed several different phenotype extraction methods to demonstrate the utility of the corpus, and tested these methods on a further corpus, i.e., ShARe/CLEF 2013. Evaluation of our automated methods showed that PhenoCHF can facilitate the training of reliable phenotype extraction systems, which are robust to variations in text type. These results have been reinforced by evaluating our trained systems on the ShARe/CLEF corpus, which contains clinical records of various types. Like other studies within the biomedical domain, we found that solutions based on conditional random fields produced the best results, when coupled with a rich feature set. PhenoCHF is the first annotated corpus aimed at encoding detailed phenotypic information. The unique heterogeneous composition of the corpus has been shown to be advantageous in the training of systems that can accurately extract phenotypic information from a range of different text types. Although the scope of our annotation is currently limited to a single

  2. Grammar Is a System That Characterizes Talk in Interaction.

    Science.gov (United States)

    Ginzburg, Jonathan; Poesio, Massimo

    2016-01-01

    Much of contemporary mainstream formal grammar theory is unable to provide analyses for language as it occurs in actual spoken interaction. Its analyses are developed for a cleaned up version of language which omits the disfluencies, non-sentential utterances, gestures, and many other phenomena that are ubiquitous in spoken language. Using evidence from linguistics, conversation analysis, multimodal communication, psychology, language acquisition, and neuroscience, we show these aspects of language use are rule governed in much the same way as phenomena captured by conventional grammars. Furthermore, we argue that over the past few years some of the tools required to provide a precise characterizations of such phenomena have begun to emerge in theoretical and computational linguistics; hence, there is no reason for treating them as "second class citizens" other than pre-theoretical assumptions about what should fall under the purview of grammar. Finally, we suggest that grammar formalisms covering such phenomena would provide a better foundation not just for linguistic analysis of face-to-face interaction, but also for sister disciplines, such as research on spoken dialogue systems and/or psychological work on language acquisition.

  3. How Age, Linguistic Status, and the Nature of the Auditory Scene Alter the Manner in Which Listening Comprehension Is Achieved in Multitalker Conversations.

    Science.gov (United States)

    Avivi-Reich, Meital; Jakubczyk, Agnes; Daneman, Meredyth; Schneider, Bruce A

    2015-10-01

    We investigated how age and linguistic status affected listeners' ability to follow and comprehend 3-talker conversations, and the extent to which individual differences in language proficiency predict speech comprehension under difficult listening conditions. Younger and older L1s as well as young L2s listened to 3-talker conversations, with or without spatial separation between talkers, in either quiet or against moderate or high 12-talker babble background, and were asked to answer questions regarding their contents. After compensating for individual differences in speech recognition, no significant differences in conversation comprehension were found among the groups. As expected, conversation comprehension decreased as babble level increased. Individual differences in reading comprehension skill contributed positively to performance in younger EL1s and in young EL2s to a lesser degree but not in older EL1s. Vocabulary knowledge was significantly and positively related to performance only at the intermediate babble level. The results indicate that the manner in which spoken language comprehension is achieved is modulated by the listeners' age and linguistic status.

  4. MORPHOMETRIC ANALYSIS OF CORPUS CALLOSUM- A STUDY IN CADAVER AND MRI

    Directory of Open Access Journals (Sweden)

    Ambili Puthanveetil

    2017-07-01

    Full Text Available BACKGROUND The Corpus Callosum (CC can best be seen in the mid-sagittal section of brain both in cadaver and MRI. The morphometric measurements of the same will be of use in neurosurgical procedures. Sexual dimorphism and the age-related changes in its measurements remained controversial. Till date, no studies have been done on corpus callosum in Kerala. MATERIALS AND METHODS Measurements of CC has been taken and studied in detail in 24 formalin fixed brains from the Department of Anatomy and 48 MR images from the Department of Radiology. The changes according to age and sex were analysed. RESULTS The mean length of CC in the cadaver was 7.24 cm, which was 3.38 cm posterior to frontal pole and 5.73 cm anterior to occipital pole. In MR images, the mean length was 7.10 in males and 6.76 in females. The difference we got was not statistically significant. The length increased with age. Thickness of genu and body decreased as the age advances, but the splenial thickness was found to be increasing with age. There was significant correlation between the thicknesses of various parts of CC. CONCLUSION The values were almost similar to those in the previous studies. Morphometrically, a significant gender difference was not identified in the present study. There were changes according to age both in males and females.

  5. COGITA network has constructed a glossary of diagnostic reasoning terms.

    Science.gov (United States)

    Barais, Marie; Hauswaldt, Johannes; Dinant, Geert-Jan; van de Wiel, Margje; Stolper, C F Erik; Van Royen, Paul

    2017-12-01

    The role of gut feelings in diagnostic reasoning is recognized by most GPs throughout Europe, and probably throughout the world. Studies on this topic have emerged from different countries but there is the risk that authors will use different terms for similar concepts. The European Expert Group on Cognitive and Interactive Processes in Diagnosis and Management in General Practice, COGITA for short, was founded in 2008 to conduct cross-border research in the area of non-analytical diagnostic reasoning. Academic GPs, PhD students, psychologists, linguists and students meet once a year to share their experiences, exchange results and initiate new studies on the topic. A milestone in their research is this publication of a short glossary of diagnostic reasoning terms relating to the gut feelings research topic. It was constructed by the COGITA group members following a literature review, which aimed to define salient terms used in their publications. They described the terms, cross-reviewed the wording and reached consensus within the group. Two sections were created: (1) a diagnostic reasoning section that describes concepts such as analytical and non-analytical reasoning, clinical mind lines, and intuition, and (2) a research methods section describing concepts such as linguistic validity and saturation. The glossary, including relevant literature, has been published on the website http://www.gutfeelingsingeneralpractice.eu . In the future, the glossary will be modified if necessary and completed by members of the COGITA group. [Box: see text].

  6. Proper Names and Named Entities Recognition in the Automatic Text Processing. Review of the book: Nouvel, D., Ehrmann, M., & Rosset, S. (2016. Named Entities for Computational Linguistics. London; Hoboken: ISTE Ltd; John Wiley & Sons, Inc., 2016.

    Directory of Open Access Journals (Sweden)

    Daria M. Golikova

    2018-03-01

    Full Text Available The reviewed book by Damien Nouvel, Maud Ehrmann, and Sophie Rosset Named Entities for Computational Linguistics deals with automatic processing of texts, written in a natural language, and with named entities recognition, aimed at extracting most important information in these texts. The notion of named entities here extends to the entire set of linguistic units referring to an object. The researchers minutely consider the concept of named entities, juxtaposing this category to that of proper names and comparing their definitions, and describe all the stages of creation and implementation of automatic text annotation algorithms, as well as different ways of evaluating their performance quality. Proper names, in this context, are seen as a particular instance of named entities, one of the typical sources of reference to real objects to be electronically recognized in the text. The book provides a detailed overview and analysis of previous studies in the same field, based mainly on the English language data. It presents instruments and resources required to create and implement the algorithms in question, these may include typologies, knowledge or databases, and various types of corpora. Theoretical considerations, proposed by the authors, are supported by a significant number of exemplary cases, with algorithms operation principles presented in charts. The reviewed book gives quite a comprehensive picture of modern computational linguistic studies focused on named entities recognition and indicates some problems which are unresolved as yet.

  7. Pragmatics as a Component vs. Pragmatics as a Perspective of Linguistics

    DEFF Research Database (Denmark)

    Haberland, Hartmut

    2010-01-01

    The discussion of pragmatics as a component or a perspective of linguistics was started in the 1970’es in Haberland and Mey (1977). I am not going to claim that “Continental Pragmatics” (as Stephen Levinson called the perspective school in 1982) has the correct approach and that some other (should...

  8. Previous Exposure to the Fish Parasite Anisakis as a Potential Risk Factor for Gastric or Colon Adenocarcinoma.

    Science.gov (United States)

    Garcia-Perez, Juan Carlos; Rodríguez-Perez, Rosa; Ballestero, Araceli; Zuloaga, Jaime; Fernandez-Puntero, Belen; Arias-Díaz, Javier; Caballero, María Luisa

    2015-10-01

    Anisakiasis is a global disease caused by consumption of raw or lightly cooked fish contaminated with L3 Anisakis spp. larvae. High rates of parasitization of fish worldwide make Anisakis a serious health hazard. In fact, anisakiasis is a growing disease in countries such as Spain, Italy, and Japan, where consumption of raw/marinated fish is high. Some parasitic infections have been recognized as a causative factor for human cancer. Suggested mechanisms include chronic inflammation elicited by the parasite, and a possible tumorigenic effect from certain parasitic secretions. Anisakis can produce persistent local inflammation and granuloma, and larvae have been incidentally found in gastrointestinal (GI) tumors. Our aim was to discover possible differences in the prevalence of unnoticed or asymptomatic previous Anisakis infection in GI cancer patients compared with healthy individuals. Serum levels of specific antibodies against Anisakis antigens were used as a reliable marker of previous contact with their larvae. Ninety-four participants without a previous history of Anisakis infection were prospectively allocated into 1 of 2 groups: 47 patients with GI cancer and 47 controls. Specific IgE, IgA1, and IgG1 against the Anisakis recombinant antigens Ani s 1, Ani s 5, Ani s 9, and Ani s 10 were determined by an ELISA assay. The ratio of positivity to sIgA1, rAni s 1, or rAni s 5 was significantly higher in the cancer patients than in the controls (38.30% vs 6.38%, P cancer showed a higher proportion of positive results for sIgA1 to rAni s 1 (P cancer patients were shown to be positive for sIgA1 to both rAni s 1 (P cancer.

  9. A Corpus Luteum Is Not a Prerequisite for the Expression of Progesterone Induced Blocking Factor by T-Lymphocytes a Week After Implantation

    OpenAIRE

    Check, Jerome H.; Szekeres-Bartho, Julia; Nazari, Parvin; Katz, Youval; Check, Matthew L.

    2001-01-01

    Purpose: To determine if production of the immunomodulatory protein, progesterone induced blocking factor (PIBF), requires merely progesterone or whether other factors made by the corpus luteum are required.

  10. Effective use of latent semantic indexing and computational linguistics in biological and biomedical applications.

    Science.gov (United States)

    Chen, Hongyu; Martin, Bronwen; Daimon, Caitlin M; Maudsley, Stuart

    2013-01-01

    Text mining is rapidly becoming an essential technique for the annotation and analysis of large biological data sets. Biomedical literature currently increases at a rate of several thousand papers per week, making automated information retrieval methods the only feasible method of managing this expanding corpus. With the increasing prevalence of open-access journals and constant growth of publicly-available repositories of biomedical literature, literature mining has become much more effective with respect to the extraction of biomedically-relevant data. In recent years, text mining of popular databases such as MEDLINE has evolved from basic term-searches to more sophisticated natural language processing techniques, indexing and retrieval methods, structural analysis and integration of literature with associated metadata. In this review, we will focus on Latent Semantic Indexing (LSI), a computational linguistics technique increasingly used for a variety of biological purposes. It is noted for its ability to consistently outperform benchmark Boolean text searches and co-occurrence models at information retrieval and its power to extract indirect relationships within a data set. LSI has been used successfully to formulate new hypotheses, generate novel connections from existing data, and validate empirical data.

  11. Linguistic Levels of Translation: A Generic Exploration of Translation Difficulties in Literary Textual Corpus

    Directory of Open Access Journals (Sweden)

    Magda Madkour

    2016-11-01

    Full Text Available This case study research was based on a generic exploration of the translation problems that graduate students face in literary translation. Literary translation is fundamental to translation programs at higher education due to the upsurge that has occurred in publishing classical and modern literary works from various cultures. However, literary texts have special characteristics that make the process of transferring them from one language into another a daunting task. Translating literary texts is difficult even for professional translators because misinterpreting the messages of the source texts can lead to distorting the aesthetic aspects of the literary work. Students need to learn various linguistic levels of literary translation as well as strategies and methods of translation. Learning the linguistics levels of translation necessitates providing adequate training that is based on enhancing students’ cognitive abilities. Cognitive-based translation training helps students learn the procedures of solving the problems of translating sound and literary devices. Cognitive approaches are relevant to the translation process since cognition implies mental activities that students can use to understand and synthesize the literary text, and reconstruct it creatively. Therefore, the current study aimed at examining the relationship between cognitive teaching methodologies and students’ performance in literary translation. To examine this relationship, qualitative and quantitative data was collected from graduate students at the College of Languages and Translation at Imam Mohammed bin Saud Islamic University (IMAMU University, Riyadh, Saudi Arabia. In addition, corpus data was gathered from authentic literary texts including, novels, short stories, and poetry, to investigate the effect of linguistic analysis and cognitive strategies on the quality of literary translation. Quantitative data was analyzed using the Statistical Package for the

  12. Carbon black nanoparticles induce type II epithelial cells to release chemotaxins for alveolar macrophages

    Directory of Open Access Journals (Sweden)

    Donaldson Ken

    2005-12-01

    Full Text Available Abstract Background Alveolar macrophages are a key cell in dealing with particles deposited in the lungs and in determining the subsequent response to that particle exposure. Nanoparticles are considered a potential threat to the lungs and the mechanism of pulmonary response to nanoparticles is currently under intense scrutiny. The type II alveolar epithelial cell has previously been shown to release chemoattractants which can recruit alveolar macrophages to sites of particle deposition. The aim of this study was to assess the responses of a type II epithelial cell line (L-2 to both fine and nanoparticle exposure in terms of secretion of chemotactic substances capable of inducing macrophage migration. Results Exposure of type II cells to carbon black nanoparticles resulted in significant release of macrophage chemoattractant compared to the negative control and to other dusts tested (fine carbon black and TiO2 and nanoparticle TiO2 as measured by macrophage migration towards type II cell conditioned medium. SDS-PAGE analysis of the conditioned medium from particle treated type II cells revealed that a higher number of protein bands were present in the conditioned medium obtained from type II cells treated with nanoparticle carbon black compared to other dusts tested. Size-fractionation of the chemotaxin-rich supernatant determined that the chemoattractants released from the epithelial cells were between 5 and 30 kDa in size. Conclusion The highly toxic nature and reactive surface chemistry of the carbon black nanoparticles has very likely induced the type II cell line to release pro-inflammatory mediators that can potentially induce migration of macrophages. This could aid in the rapid recruitment of inflammatory cells to sites of particle deposition and the subsequent removal of the particles by phagocytic cells such as macrophages and neutrophils. Future studies in this area could focus on the exact identity of the substance(s released by the

  13. BelleII@home: Integrate volunteer computing resources into DIRAC in a secure way

    Science.gov (United States)

    Wu, Wenjing; Hara, Takanori; Miyake, Hideki; Ueda, Ikuo; Kan, Wenxiao; Urquijo, Phillip

    2017-10-01

    The exploitation of volunteer computing resources has become a popular practice in the HEP computing community as the huge amount of potential computing power it provides. In the recent HEP experiments, the grid middleware has been used to organize the services and the resources, however it relies heavily on the X.509 authentication, which is contradictory to the untrusted feature of volunteer computing resources, therefore one big challenge to utilize the volunteer computing resources is how to integrate them into the grid middleware in a secure way. The DIRAC interware which is commonly used as the major component of the grid computing infrastructure for several HEP experiments proposes an even bigger challenge to this paradox as its pilot is more closely coupled with operations requiring the X.509 authentication compared to the implementations of pilot in its peer grid interware. The Belle II experiment is a B-factory experiment at KEK, and it uses DIRAC for its distributed computing. In the project of BelleII@home, in order to integrate the volunteer computing resources into the Belle II distributed computing platform in a secure way, we adopted a new approach which detaches the payload running from the Belle II DIRAC pilot which is a customized pilot pulling and processing jobs from the Belle II distributed computing platform, so that the payload can run on volunteer computers without requiring any X.509 authentication. In this approach we developed a gateway service running on a trusted server which handles all the operations requiring the X.509 authentication. So far, we have developed and deployed the prototype of BelleII@home, and tested its full workflow which proves the feasibility of this approach. This approach can also be applied on HPC systems whose work nodes do not have outbound connectivity to interact with the DIRAC system in general.

  14. Quantifying cross-linguistic influence with a computational model : A study of case-marking comprehension

    NARCIS (Netherlands)

    Matusevych, Yevgen; Alishahi, Afra; Backus, Albert

    2017-01-01

    Cross-linguistic influence (CLI) is one of the key phenomena in bilingual and second language learning. We propose a method for quantifying CLI in the use of linguistic constructions with the help of a computational model, which acquires constructions in two languages from bilingual input. We focus

  15. A note on statistical methods in comparative linguistics

    NARCIS (Netherlands)

    Cowan, H.K.J.

    1959-01-01

    It is desirable to distinguish between lexicostatistics as a means of proving relationships between languages or linguistic groups not previously known to be related, and glottochronology as a means of measuring the time depths of separations between languages or linguistic groups already known to

  16. Corpus Callosum Size is Linked to Dichotic Deafness and Hemisphericity, Not Sex or Handedness

    Science.gov (United States)

    Morton, Bruce E.; Rafto, Stein E.

    2006-01-01

    Individuals differ in the number of corpus callosum (CC) nerve fibers interconnecting their cerebral hemispheres by about threefold. Early reports suggested that males had smaller CCs than females. This was often interpreted to support the concept that the male brain is more "lateralized" or "specialized," thus accounting for presumed male…

  17. From Shared Contexts to Syntactic Categories: The Role of Distributional Information in Learning Linguistic Form-Classes

    Science.gov (United States)

    Reeder, Patricia A.; Newport, Elissa L.; Aslin, Richard N.

    2013-01-01

    A fundamental component of language acquisition involves organizing words into grammatical categories. Previous literature has suggested a number of ways in which this categorization task might be accomplished. Here we ask whether the patterning of the words in a corpus of linguistic input ("distributional information") is sufficient, along with a…

  18. Test beam results for an upgraded forward tagger of the L3 experiment at LEP II

    International Nuclear Information System (INIS)

    Chemarin, M.; Depasse, P.; Fay, J.; Felcini, M.; Fredj, L.; Ille, B.; Nessi-Tedaldi, F.; Susinno, G.F.

    1994-01-01

    We have tested new scintillator modules with silicon photodiode readout for the upgraded Active Lead Rings (ALR) of the L3 detector at LEP II. Results are presented from data recorded in muon and electron test beams with particular emphasis on the light production and collection as a function of the particle impact position on the scintillator modules. The results from the beam test data will be used for the design of the readout and trigger electronics in conjunction with the required ALR performance as an electron tagger and beam background monitor at LEP II. ((orig.))

  19. Linguistic Context Versus Semantic Competition in Word Recognition by Younger and Older Adults With Cochlear Implants.

    Science.gov (United States)

    Amichetti, Nicole M; Atagi, Eriko; Kong, Ying-Yee; Wingfield, Arthur

    The increasing numbers of older adults now receiving cochlear implants raises the question of how the novel signal produced by cochlear implants may interact with cognitive aging in the recognition of words heard spoken within a linguistic context. The objective of this study was to pit the facilitative effects of a constraining linguistic context against a potential age-sensitive negative effect of response competition on effectiveness of word recognition. Younger (n = 8; mean age = 22.5 years) and older (n = 8; mean age = 67.5 years) adult implant recipients heard 20 target words as the final words in sentences that manipulated the target word's probability of occurrence within the sentence context. Data from published norms were also used to measure response entropy, calculated as the total number of different responses and the probability distribution of the responses suggested by the sentence context. Sentence-final words were presented to participants using a word-onset gating paradigm, in which a target word was presented with increasing amounts of its onset duration in 50 msec increments until the word was correctly identified. Results showed that for both younger and older adult implant users, the amount of word-onset information needed for correct recognition of sentence-final words was inversely proportional to their likelihood of occurrence within the sentence context, with older adults gaining differential advantage from the contextual constraints offered by a sentence context. On the negative side, older adults' word recognition was differentially hampered by high response entropy, with this effect being driven primarily by the number of competing responses that might also fit the sentence context. Consistent with previous research with normal-hearing younger and older adults, the present results showed older adult implant users' recognition of spoken words to be highly sensitive to linguistic context. This sensitivity, however, also resulted in a

  20. Metáforas e Linguística de Corpus: metodologia de análise aplicada a um gênero de negócios Metaphors and Corpus Linguistics: a method for finding metaphors in a business genre

    Directory of Open Access Journals (Sweden)

    Tony Berber Sardinha

    2011-01-01

    Full Text Available O presente trabalho visa a relatar o desenvolvimento de uma metodologia de identificação de metáforas em corpora eletrônicos. Como exemplo, foi tomado o gênero teleconferências de apresentação de resultados financeiros. A metodologia é do tipo "bottom-up" / "corpus-driven" e se baseia na identificação de palavras com frequência marcante (palavras-chave e de seus padrões de co-ocorrência, seguido do cálculo de similaridade semântica entre essas palavras. Com isso, chega-se a um conjunto de palavras que são então interpretadas em seu co-texto, por meio de concordâncias.This paper aims at reporting the development of a method for metaphor identification in computer corpora. The method was tested on a particular corpus, namely of investment conference calls, and comprises procedures that work from the bottom up, and rely on marked frequency, collocation and semantic similarity as signalling devices for metaphor. As such, the method is an example of corpus-driven research into metaphor. The application of these procedures yields a number of metaphor candidates, which are then checked manually through concordances.

  1. THE ARABIC ORIGINS OF ENGLISH AND INDO-EUROPEAN "URBAN TERMS": A RADICAL LINGUISTIC THEORY APPROACH

    Directory of Open Access Journals (Sweden)

    Zaidan Ali Jassem

    2015-06-01

    Full Text Available This paper traces the Arabic origins of English, German, French, Latin, Greek, and Sanskrit "urban terms" from a radical linguistic (or lexical root theory perspective. The data comprises 130 such terms like abide, building, city, construction, courtyard, hotel, house, live, mansion, mason, palace, metropolis, residence, road, rural, sedentary, sojourn, stay, structure, tent, town, urban, villa, village, zoo, and so on. The results clearly show that all such words have true Arabic cognates with the same or similar forms and meanings, whose differences are due to natural and plausible causes and different routes of linguistic change. Moreover, the results support the adequacy of the radical linguistic theory according to which, unlike the Comparative Method and/or Family Tree Model, Arabic, English, German, French, Latin, Greek, and Sanskrit are dialects of the same language or family, renamed Eurabian or Urban family, with Arabic being their origin all for sharing the whole cognates with them and for its huge phonetic, morphological, grammatical, and lexical variety and wealth. Also, they indicate that there is a radical language from which all human languages stemmed and which has been preserved almost intact in Arabic as the most conservative and productive language, without which it is impossible to interpret its linguistic richness and versatility on all levels.

  2. Grammar-Based Specification and Parsing of Binary File Formats

    Directory of Open Access Journals (Sweden)

    William Underwood

    2012-03-01

    Full Text Available The capability to validate and view or play binary file formats, as well as to convert binary file formats to standard or current file formats, is critically important to the preservation of digital data and records. This paper describes the extension of context-free grammars from strings to binary files. Binary files are arrays of data types, such as long and short integers, floating-point numbers and pointers, as well as characters. The concept of an attribute grammar is extended to these context-free array grammars. This attribute grammar has been used to define a number of chunk-based and directory-based binary file formats. A parser generator has been used with some of these grammars to generate syntax checkers (recognizers for validating binary file formats. Among the potential benefits of an attribute grammar-based approach to specification and parsing of binary file formats is that attribute grammars not only support format validation, but support generation of error messages during validation of format, validation of semantic constraints, attribute value extraction (characterization, generation of viewers or players for file formats, and conversion to current or standard file formats. The significance of these results is that with these extensions to core computer science concepts, traditional parser/compiler technologies can potentially be used as a part of a general, cost effective curation strategy for binary file formats.

  3. ‘Shift’ ‘n ‘control’: The computer as a third interactant in Spanish-language

    Science.gov (United States)

    Goble, Ryan; Vickers, Caroline H

    2015-01-01

    The purpose of this paper is to examine the role of the computer in medical consultations in which English- Spanish-bilingual medical providers interact with Spanish-monolingual patients. Following previous studies that have revealed that the presence of the computer in consultations detracts from direct provider– patient communication, we pay specific attention to how the use of the computer in Spanish-language medical consultations can complement or adversely affect the co-construction of the patient’s health narrative. The data for the present study consist of 36 Spanish-language medical consultations in Southern California. Applying a conversation analytical approach to the health narratives in the corpus, we argue that the computer is essentially a third interactant to which medical providers orient through lowered volume, minimal responses, bureaucratic side talk, and, most importantly, code-switching to English – all of which strip the patients of control over the co-construction of their health narrative with their medical provider. Because the patient does not have access to the computational task and the language, we posit that this exacerbates the already existing adverse effects that the computer has on provider–patient interaction.

  4. Lexicon and teaching: an analysis of the use of less frequent prepositions in a Spanish learners corpus

    Directory of Open Access Journals (Sweden)

    Jéssyca Camargo Cruz

    2017-08-01

    Full Text Available This article aims at presenting a quantitative research and the analysis of the use of prepositions that are less frequent (underused in a corpus of learners of Spanish as a foreign language. We have observed the use of contra, hacia, enfrente de, excepto and tras through Corpus Linguistics by contrasting this lexical set and a supplementary corpus, composed by normative and descriptive Spanish grammar and by an online reference corpus of Spanish (CREA. Therefore, we present analyses made on a corpus constituted by 276 writings (85.729 words, gathered from two groups of freshman Language/Letras students, from 2011 to 2013. The data were collected with the aid of the WordSmith Tools (version 6 software and its tools, WordList and Concord enabled us to extract the frequency list of the prepositions in the corpus of study, as well as to observe and analyse their respective uses based on the lines of concordance.

  5. A New Decision-Making Method for Stock Portfolio Selection Based on Computing with Linguistic Assessment

    Directory of Open Access Journals (Sweden)

    Chen-Tung Chen

    2009-01-01

    Full Text Available The purpose of stock portfolio selection is how to allocate the capital to a large number of stocks in order to bring a most profitable return for investors. In most of past literatures, experts considered the portfolio of selection problem only based on past crisp or quantitative data. However, many qualitative and quantitative factors will influence the stock portfolio selection in real investment situation. It is very important for experts or decision-makers to use their experience or knowledge to predict the performance of each stock and make a stock portfolio. Because of the knowledge, experience, and background of each expert are different and vague, different types of 2-tuple linguistic variable are suitable used to express experts' opinions for the performance evaluation of each stock with respect to criteria. According to the linguistic evaluations of experts, the linguistic TOPSIS and linguistic ELECTRE methods are combined to present a new decision-making method for dealing with stock selection problems in this paper. Once the investment set has been determined, the risk preferences of investor are considered to calculate the investment ratio of each stock in the investment set. Finally, an example is implemented to demonstrate the practicability of the proposed method.

  6. Effects of syntactic structure in the memory of concrete and abstract Chinese sentences.

    Science.gov (United States)

    Ho, C S; Chen, H C

    1993-09-01

    Smith (1981) found that concrete English sentences were better recognized than abstract sentences and that this concreteness effect was potent only when the concrete sentence was also affirmative but the effect switched to an opposite end when the concrete sentence was negative. These results were partially replicated in Experiment 1 by using materials from a very different language (i.e., Chinese): concrete-affirmative sentences were better remembered than concrete-negative and abstract sentences, but no reliable difference was found between the latter two types. In Experiment 2, the task was modified by using a visual presentation instead of an oral one as in Experiment 1. Both concrete-affirmative and concrete-negative sentences were better memorized then abstract ones in Experiment 2. The findings in the two experiments are explained by a combination of the dual-coding model and Marschark's (1985) item-specific and relational processing. The differential effects of experience with different language systems on processing verbal materials in memory are also discussed.

  7. "Papa Said That One Day I Would Understand": Examining Child Agency and Character Development in "Roll of Thunder, Hear My Cry" Using Critical Corpus Linguistics

    Science.gov (United States)

    Hardstaff, Sarah

    2015-01-01

    This paper considers the issue of child agency in Mildred D. Taylor's 1976 novel "Roll of Thunder, Hear My Cry" using a critical corpus linguistics framework based on Halliday's systemic functional linguistics. The novel has long received praise for its portrayal of child agency in a hostile racist society as well as its depiction of a…

  8. Eponymous Adjectives in the Russian Language: Corpus-Based Approach to Studying One Word-Formation Type

    Directory of Open Access Journals (Sweden)

    Boris Yu. Norman

    2017-11-01

    Full Text Available The article discusses morphological, semantic and pragmatic characteristics of Russian adjectives derived from surnames (pushkinsky, chekhovsky, leninsky, and others. The authors analyze types of suffixal morphs used to form adjectives from Russian and foreign surnames; characterize intralingual obstacles to adjectives formation and explain extralingual factors affecting the usage of surname derivatives. Popularity of a politician or a public fi gure, such as an actor or sportsman, directly provides for a wider circulation not only of their surnames, but also of their surname derivatives. The corpus-based approach to studying adjectives derived from surnames of political leaders and writers reveals specific sets of linguistic units called semantic clusters. These clusters are based on categorial meanings of adjectives, and provided that the collocating nouns belong to the same ideographic class. Adjectives derived from surnames do not belong to one specific grammar sub-class; on the contrary, they combine characteristics of qualitative, relative, and possessive adjectives. Precedent names determine connotations of such adjectives, although the semantic structure of a derivative does not necessarily copy that of the source word. Eponymous words are a convenient means for accommodating complex information whereas understanding their meaning requires certain background knowledge and builds on the principles of compositional semantics.

  9. Do not resonate with actions: sentence polarity modulates cortico-spinal excitability during action-related sentence reading.

    Directory of Open Access Journals (Sweden)

    Marco Tullio Liuzza

    Full Text Available BACKGROUND: Theories of embodied language suggest that the motor system is differentially called into action when processing motor-related versus abstract content words or sentences. It has been recently shown that processing negative polarity action-related sentences modulates neural activity of premotor and motor cortices. METHODS AND FINDINGS: We sought to determine whether reading negative polarity sentences brought about differential modulation of cortico-spinal motor excitability depending on processing hand-action related or abstract sentences. Facilitatory paired-pulses Transcranial Magnetic Stimulation (pp-TMS was applied to the primary motor representation of the right-hand and the recorded amplitude of induced motor-evoked potentials (MEP was used to index M1 activity during passive reading of either hand-action related or abstract content sentences presented in both negative and affirmative polarity. Results showed that the cortico-spinal excitability was affected by sentence polarity only in the hand-action related condition. Indeed, in keeping with previous TMS studies, reading positive polarity, hand action-related sentences suppressed cortico-spinal reactivity. This effect was absent when reading hand action-related negative polarity sentences. Moreover, no modulation of cortico-spinal reactivity was associated with either negative or positive polarity abstract sentences. CONCLUSIONS: Our results indicate that grammatical cues prompting motor negation reduce the cortico-spinal suppression associated with affirmative action sentences reading and thus suggest that motor simulative processes underlying the embodiment may involve even syntactic features of language.

  10. An efficient implementation of the head-corner parser

    NARCIS (Netherlands)

    vanNoord, G

    This paper describes an efficient and robust implementation of a bidirectional, head-driven parser for constraint-based grammars. This parser is developed for the OVIS system: a Dutch spoken dialogue system in which information about public transport can be obtained by telephone. After a review of

  11. Midline corpus callosum is a neuroanatomical focus of fetal alcohol damage.

    Science.gov (United States)

    Bookstein, Fred L; Sampson, Paul D; Connor, Paul D; Streissguth, Ann P

    2002-06-15

    Prenatal exposure to high levels of alcohol often induces birth defects that combine morphological stigmata with neurological or neuropsychological deficits. But it has proved problematic to diagnose these syndromes in adolescents and adults, in whom the morphological signs are absent or attenuated, the behavioral deficits nonspecific, and the exposure history often difficult to reconstruct. Localizing the associated brain abnormalities might circumvent most of these difficulties. To this end, three-dimensional (3D) locations were recorded for 67 homologous points on or near the corpus callosum in magnetic resonance (MR) brain images from 60 adolescents and adults who were normal, 60 diagnosed with fetal alcohol syndrome, and 60 diagnosed with fetal alcohol effects. We combined the standard statistical approach to this type of geometric data, Procrustes analysis, with a multivariate strategy focusing on differences in variability. In this data set, the shape of the corpus callosum and its vicinity proves systematically much more variable in the alcohol-affected brains than in those of the normal subjects. From this excess variability follows a promising classification rule, having both high sensitivity (100 out of 117) and high specificity (49 out of 60) in this sample. The discrimination uses four landmark points and two summary scores of callosal outline shape. The information from the corpus callosum and vicinity, as viewed in MR brain images of full-grown subjects, may serve as a permanent record of the prenatal effects of alcohol, even in patients who are first suspected of these syndromes relatively late in life or who lack the facial signs of prenatal alcohol damage. The statistical pattern underlying the callosal diagnosis also leads to speculations on mechanisms of the prenatal damage. Copyright 2002 Wiley-Liss, Inc.

  12. Recurrence Quantification Analysis of Sentence-Level Speech Kinematics.

    Science.gov (United States)

    Jackson, Eric S; Tiede, Mark; Riley, Michael A; Whalen, D H

    2016-12-01

    Current approaches to assessing sentence-level speech variability rely on measures that quantify variability across utterances and use normalization procedures that alter raw trajectory data. The current work tests the feasibility of a less restrictive nonlinear approach-recurrence quantification analysis (RQA)-via a procedural example and subsequent analysis of kinematic data. To test the feasibility of RQA, lip aperture (i.e., the Euclidean distance between lip-tracking sensors) was recorded for 21 typically developing adult speakers during production of a simple utterance. The utterance was produced in isolation and in carrier structures differing just in length or in length and complexity. Four RQA indices were calculated: percent recurrence (%REC), percent determinism (%DET), stability (MAXLINE), and stationarity (TREND). Percent determinism (%DET) decreased only for the most linguistically complex sentence; MAXLINE decreased as a function of linguistic complexity but increased for the longer-only sentence; TREND decreased as a function of both length and linguistic complexity. This research note demonstrates the feasibility of using RQA as a tool to compare speech variability across speakers and groups. RQA offers promise as a technique to assess effects of potential stressors (e.g., linguistic or cognitive factors) on the speech production system.

  13. Slavic Phraseology: A View Through Corpora

    Directory of Open Access Journals (Sweden)

    Zakharov Victor

    2017-12-01

    Full Text Available The study of word collocability is one of the main tasks of linguistics. The combinatory ability of language units, collocability, is one of the linguistic syntagmatic laws. This phenomenon is the main object of the phraseology and lexicography. The article deals with set phrases of different types in Russian, Czech and Slovak from the point of view of their quantitative evaluation. Corpus linguistics understand set phrases as statistically determined unities. This approach is the basic point of different automatic ways to extract idioms and collocations. The paper describes experiments which show how text corpora and corpus methods and tools can be used to expand the entries in existing dictionaries and how set phrases could be evaluated quantitatively. It is shown and maintained that corpus linguistics methods and tools allow to create dictionaries of new type which have to include a larger amount of set phrases and collocations than before.

  14. Identification of type II and type III pyoverdine receptors from Pseudomonas aeruginosa.

    Science.gov (United States)

    de Chial, Magaly; Ghysels, Bart; Beatson, Scott A; Geoffroy, Valérie; Meyer, Jean Marie; Pattery, Theresa; Baysse, Christine; Chablain, Patrice; Parsons, Yasmin N; Winstanley, Craig; Cordwell, Stuart J; Cornelis, Pierre

    2003-04-01

    Pseudomonas aeruginosa produces, under conditions of iron limitation, a high-affinity siderophore, pyoverdine (PVD), which is recognized at the level of the outer membrane by a specific TonB-dependent receptor, FpvA. So far, for P. aeruginosa, three different PVDs, differing in their peptide chain, have been described (types I-III), but only the FpvA receptor for type I is known. Two PVD-producing P. aeruginosa strains, one type II and one type III, were mutagenized by a mini-TnphoA3 transposon. In each case, one mutant unable to grow in the presence of the strong iron chelator ethylenediaminedihydroxyphenylacetic acid (EDDHA) and the cognate PVD was selected. The first mutant, which had an insertion in the pvdE gene, upstream of fpvA, was unable to take up type II PVD and showed resistance to pyocin S3, which is known to use type II FpvA as receptor. The second mutant was unable to take up type III PVD and had the transposon insertion in fpvA. Cosmid libraries of the respective type II and type III PVD wild-type strains were constructed and screened for clones restoring the capacity to grow in the presence of PVD. From the respective complementing genomic fragments, type II and type III fpvA sequences were determined. When in trans, type II and type III fpvA restored PVD production, uptake, growth in the presence of EDDHA and, in the case of type II fpvA, pyocin S3 sensitivity. Complementation of fpvA mutants obtained by allelic exchange was achieved by the presence of cognate fpvA in trans. All three receptors posses an N-terminal extension of about 70 amino acids, similar to FecA of Escherichia coli, but only FpvAI has a TAT export sequence at its N-terminal end.

  15. COGNITIVE LINGUISTIC AND ESP: USING METAPHOR AS A USEFUL DEVICE FOR TEACHING L2 BUSINESS ENGLISH LEARNERS

    Directory of Open Access Journals (Sweden)

    Ida Hendriyani

    2017-04-01

    Full Text Available Cognitive Linguistics is language essentially and inherently symbolic in nature. It means that all linguistic expressions such as; words, phrases and sentences, utterances, signs or writting are symbols of concept in a speaker’s mind. Metaphor is a figure of speech in which a word or phrase is applied to an object or action that it does not literally denote in order to imply a resemblance. Many people have encountered them in literature and poetry, and most of us know that even when we talk casually, we may take advantage of such figurative language, perhaps especially when there are some evasive, hard-to-grasp emotions or thought that we want to communicate. We can find several metaphors in newspapers, literature or scientific articles, etc. Metaphor is a central issue in ESP teaching. The purposes of this paper are (1 to increase the students’ knowledge of technical and semi-technical business vocabulary. (2 to improve business reading fluency and (3 to enrich vocabulary in business texts or readings. Metaphor in a specific English language program can provide students with a useful device to raise awareness of key concepts, models and issues and to improve their reading in business. The term of Business English refers to a wide range of ESL (English for Specific Learning courses characterized by having sense of purpose, specific content and general. This frequent presence of metaphor in economic and business texts are for second language learners, to improve language learners’ specialized reading of business texts and to help them produce native-like discourse when they translate business texts.

  16. Inflammatory fibroid polyp of the ileum with the appearance of a Borrmann type II lesion, caused by colostomy irrigation: report of a case.

    Science.gov (United States)

    Ojima, Y; Okajima, M; Asahara, T; Arita, M; Kobayashi, R; Nakahara, M; Masaoka, Y; Toyota, K; Fujitaka, T; Kawahori, K; Shimamoto, F; Dohi, K

    1997-01-01

    Inflammatory fibroid polyps (IFPs) are rarely found in the gastrointestinal tract. The majority of IFPs are sessile-pedunculated or pedunculated polypoid lesions, whereas a polyp presenting like a Borrmann type II lesion is extremely unusual. This report describes the case of a 74-year-old man with a history of intussusception, in whom a preoperative diagnosis of a cecal tumor of the ileocecal valve was made. A laparotomy subsequently revealed a lesion similar to a Borrmann type II tumor located 15 cm above the ileocecal valve, but not at the valve. The lesion was diagnosed as an IFP which had been caused by repeated colostomy irrigation. The aim of the present report is to draw attention to this entity, which should be included in the differential diagnosis of intussusception and small bowel obstruction.

  17. Musical metaphors: evidence for a spatial grounding of non-literal sentences describing auditory events.

    Science.gov (United States)

    Wolter, Sibylla; Dudschig, Carolin; de la Vega, Irmgard; Kaup, Barbara

    2015-03-01

    This study investigated whether the spatial terms high and low, when used in sentence contexts implying a non-literal interpretation, trigger similar spatial associations as would have been expected from the literal meaning of the words. In three experiments, participants read sentences describing either a high or a low auditory event (e.g., The soprano sings a high aria vs. The pianist plays a low note). In all Experiments, participants were asked to judge (yes/no) whether the sentences were meaningful by means of up/down (Experiments 1 and 2) or left/right (Experiment 3) key press responses. Contrary to previous studies reporting that metaphorical language understanding differs from literal language understanding with regard to simulation effects, the results show compatibility effects between sentence implied pitch height and response location. The results are in line with grounded models of language comprehension proposing that sensory motor experiences are being elicited when processing literal as well as non-literal sentences. Copyright © 2014 Elsevier B.V. All rights reserved.

  18. ESTIMATING MORAL DEVELOPMENT IN A SAMPLE OF ALTERNATIVE FORMULAS OF PRISON SENTENCED BENEFICIARIES FROM WEST VENEZUELA

    Directory of Open Access Journals (Sweden)

    CARLOS ENRIQUE ZERPA

    2006-05-01

    Full Text Available The level of moral development in a sampled of convicts from West Venezuela is presented. The studywas carried out by means of Defining Issues Test (DIT evaluation. The sample includes 50 persons between19 and 55 years old (M = 34,33 y DT = 8,76, out of them 39 were males (average age 33,74,DT=8,24 an 11 were females (average age 35,91, DT=10,67. All of them were beneficiaries of alternativesformulas of their respective sentences such as social work, parole or conditional suspension ofsentences and following the corresponding process or rehabilitation out of prisons. This sample registereda moral principles index of 19,67 and a non-relevant answers index of 12,07 as well as fourthmoral developmental stage of 41,3 which are unusually high. The results suggest that there is not astatistically significant difference in the pattern of moral development depending on sex, age, or institutionfor sentenced fulfillment. However, the unusual high pattern of answers and the inverse correspondence between the duration of sentences and the resulting level of moral principles suggest that thesample behaves in a way, which is different from what is expected in no-out-law samples. Findings andimplications are discused.

  19. Recognition of lysophosphatidylcholine by type II NKT cells and protection from an inflammatory liver disease.

    Science.gov (United States)

    Maricic, Igor; Girardi, Enrico; Zajonc, Dirk M; Kumar, Vipin

    2014-11-01

    Lipids presented by the MHC class I-like molecule, CD1d, are recognized by NK T (NKT) cells, which can be broadly categorized into two subsets. The well-characterized type I NKT cells express a semi-invariant TCR and can recognize both α- and β-linked glycolipids, whereas type II NKT cells are less well studied, express a relatively diverse TCR repertoire, and recognize β-linked lipids. Recent structural studies have shown a distinct mode of recognition of a self-glycolipid sulfatide bound to CD1d by a type II NKT TCR. To further characterize Ag recognition by these cells, we have used the structural data and screened other small molecules able to bind to CD1d and activate type II NKT cells. Using plate-bound CD1d and APC-based Ag presentation assay, we found that phospholipids such as lysophosphatidylcholine (LPC) can stimulate the sulfatide-reactive type II NKT hybridoma Hy19.3 in a CD1d-dependent manner. Using plasmon resonance studies, we found that this type II NKT TCR binds with CD1d-bound LPC with micromolar affinities similar to that for sulfatide. Furthermore, LPC-mediated activation of type II NKT cells leads to anergy induction in type I NKT cells and affords protection from Con A-induced hepatitis. These data indicate that, in addition to self-glycolipids, self-lysophospholipids are also recognized by type II NKT cells. Because lysophospholipids are involved during inflammation, our findings have implications for not only understanding activation of type II NKT cells in physiological settings, but also for the development of immune intervention in inflammatory diseases. Copyright © 2014 by The American Association of Immunologists, Inc.

  20. Dimensioni cognitivo-semantiche, sintattiche e pragmatiche dei verbi in un corpus di inglese ‘lingua franca’ in contesti multiculturali di immigrazione

    Directory of Open Access Journals (Sweden)

    Laura Centonze

    2016-01-01

    Full Text Available Abstract – English has been increasingly adopted as the Lingua Franca (ELF by people with different L1s (cf. Crystal 2003; McArthur 2003; Seidlhofer 2004 and diverse lingua-cultural backgrounds (Cogo et al. 2011. Previous research into its syntax (Mauranen and Ranta 2009 has undoubtedly shed light on some recurrent features of intercultural communication in immigration and asylum contexts (e.g. negation, question formulation, if-clauses, but little attention has been paid to the correlation between personal pronouns, semantic verb types and tense within the communicative process and, more specifically, the report of any traumatic event (Tuval-Mashiach et al. 2004 to which immigrants have been exposed over time and which constitutes what is generally referred to as Post-Traumatic Stress Disorder (PTSD; Friedman et al. 2007; Liotti and Farina 2011. This paper is based on the analysis of a corpus of transcripts published on the internet by the Minnesota Historical Society. The site, Becoming Minnesotan, collects excerpts from oral interviews conducted between 1967 and 2011 with recent immigrants to Minnesota usually fleeing civil war and strife, and their American-born children. By adopting the quantitative research methodology typical of corpus linguistics (Biber et al. 1999; Meyer 2002; Sinclair 1991, we searched for and extracted all the instances of semantic verbs types as well as personal pronouns and mapped their frequencies by means of Wordsmith Tools 6 (Scott 2012. Drawing on Halliday’s (1994 and Scheibman’s (2002 semantic verb type taxonomy, we can observe the frequency of certain semantic verb categories and tenses, and make some general considerations which seem to back up the ‘trance-like level of public re-enactment’ (Guido 2008, p. 105 that the interviewees experience during interviews: their physical and interior ‘displacement’ thus appears to be iconically represented by means of a tense usage being

  1. A rational inference approach to group and individual-level sentence comprehension performance in aphasia.

    Science.gov (United States)

    Warren, Tessa; Dickey, Michael Walsh; Liburd, Teljer L

    2017-07-01

    The rational inference, or noisy channel, account of language comprehension predicts that comprehenders are sensitive to the probabilities of different interpretations for a given sentence and adapt as these probabilities change (Gibson, Bergen & Piantadosi, 2013). This account provides an important new perspective on aphasic sentence comprehension: aphasia may increase the likelihood of sentence distortion, leading people with aphasia (PWA) to rely more on the prior probability of an interpretation and less on the form or structure of the sentence (Gibson, Sandberg, Fedorenko, Bergen & Kiran, 2015). We report the results of a sentence-picture matching experiment that tested the predictions of the rational inference account and other current models of aphasic sentence comprehension across a variety of sentence structures. Consistent with the rational inference account, PWA showed similar sensitivity to the probability of particular kinds of form distortions as age-matched controls, yet overall their interpretations relied more on prior probability and less on sentence form. As predicted by rational inference, but not by other models of sentence comprehension in aphasia, PWA's interpretations were more faithful to the form for active and passive sentences than for direct object and prepositional object sentences. However contra rational inference, there was no evidence that individual PWA's severity of syntactic or semantic impairment predicted their sensitivity to form versus the prior probability of a sentence, as cued by semantics. These findings confirm and extend previous findings that suggest the rational inference account holds promise for explaining aphasic and neurotypical comprehension, but they also raise new challenges for the account. Copyright © 2017 Elsevier Ltd. All rights reserved.

  2. Breaking the Resource Bottleneck for Multilingual Parsing

    National Research Council Canada - National Science Library

    Hwa, Rebecca; Resnik, Philip; Weinberg, Amy

    2005-01-01

    ...-quality English resources. We present a large-scale experiment showing that Chinese dependency trees can be induced by using an English parser, a word alignment package, and a large corpus of sentence-aligned bilingual text...

  3. A cognitive linguistic exploration of metaphors within the WATER ...

    African Journals Online (AJOL)

    A cognitive linguistic exploration of metaphors within the WATER frame in Swami Vivekananda's Complete Works : A corpus-driven study in light of conceptual metaphor theory. ... Which terms to search for specifically was determined after a manual reading of a sample from the Complete Works. The data were then tagged ...

  4. Structural similarities between brain and linguistic data provide evidence of semantic relations in the brain.

    Directory of Open Access Journals (Sweden)

    Colleen E Crangle

    Full Text Available This paper presents a new method of analysis by which structural similarities between brain data and linguistic data can be assessed at the semantic level. It shows how to measure the strength of these structural similarities and so determine the relatively better fit of the brain data with one semantic model over another. The first model is derived from WordNet, a lexical database of English compiled by language experts. The second is given by the corpus-based statistical technique of latent semantic analysis (LSA, which detects relations between words that are latent or hidden in text. The brain data are drawn from experiments in which statements about the geography of Europe were presented auditorily to participants who were asked to determine their truth or falsity while electroencephalographic (EEG recordings were made. The theoretical framework for the analysis of the brain and semantic data derives from axiomatizations of theories such as the theory of differences in utility preference. Using brain-data samples from individual trials time-locked to the presentation of each word, ordinal relations of similarity differences are computed for the brain data and for the linguistic data. In each case those relations that are invariant with respect to the brain and linguistic data, and are correlated with sufficient statistical strength, amount to structural similarities between the brain and linguistic data. Results show that many more statistically significant structural similarities can be found between the brain data and the WordNet-derived data than the LSA-derived data. The work reported here is placed within the context of other recent studies of semantics and the brain. The main contribution of this paper is the new method it presents for the study of semantics and the brain and the focus it permits on networks of relations detected in brain data and represented by a semantic model.

  5. Analysing Culture and Interculture in Saudi EFL Textbooks: A Corpus Linguistic Approach

    Science.gov (United States)

    Almujaiwel, Sultan

    2018-01-01

    This paper combines corpus processing tools to investigate the cultural elements of Saudi education of English as a foreign language (EFL). The latest Saudi EFL textbooks (2016 onwards) are available in researchable PDF formats. This helps process them through corpus search software tools. The method adopted is based on analysing 20 cultural…

  6. [Nutrition and exercise of schoolchildren. Results of a cross-sectional study at the Lessing Gymnasium (secondary) school in Mannheim, Germany].

    Science.gov (United States)

    Neumann, H; Neumann, P

    2007-12-01

    During the last few years there have been many articles in various newspapers about the unhealthy nutrition of children. It was the aim of this study to find out about the state of nutrition and exercise among the children at our school, a Gymnasium (secondary or grammar school) in Mannheim, Germany. We distributed questionnaires to 729 fellow pupils at the school to get some data/information about their exercise, nutrition and leisure activities. We developed a point-based system to record the quality of the food eaten; the time spent on sports, playing musical instruments; looking at television (TV) etc. or using the computer; and other leisure activities (e.g. reading, meeting friends, playing games). 84% of completed questionnaires were returned (98% of those sent to pupils in the lower forms). Food served at our school was reasonably good, but could have been improved by having a proper dining room or cafeteria. The situation regarding exercise tended to get worse in the younger pupils, because the school day has been lengthened. We found gender-related differences between school marks and other parameters. There was a positive relationship between good marks and a low [normal?] body mass index (BMI) and a negative effect on nutrition/exercise of hours spent watching TV or on the computer (PC). We also found a strong correlation in girls between school marks and their leisure. The number of points for wholesome meals, how much time the pupils spent on sports and music led to better school marks. Pupils of the lower grades exercised much less than older ones. Body weight index and their TV/computer time had similar effects in girls and boys. A low BMI correlated with better school marks, while high TV/PC time worsened them At our school relatively few pupils were overweight. Good nutrition and good amounts of exercise correlated positively with good marks. The amount of exercise taken by the pupils decreased as they advanced to the higher grades.

  7. The corpus-driven revolution in Polish Sign Language: the interview with Dr. Paweł Rutkowski

    Directory of Open Access Journals (Sweden)

    Iztok Kosem

    2018-02-01

    Full Text Available Dr. Paweł Rutkowski is head of the Section for Sign Linguistics at the University of Warsaw. He is a general linguist and a specialist in the field of syntax of natural languages, carrying out research on Polish Sign Language (polski język migowy — PJM. He has been awarded a number of prizes, grants and scholarships by such institutions as the Foundation for Polish Science, Polish Ministry of Science and Higher Education, National Science Centre, Poland, Polish–U.S. Fulbright Commission, Kosciuszko Foundation and DAAD. Dr. Rutkowski leads the team developing the Corpus of Polish Sign Language and the Corpus-based Dictionary of Polish Sign Language, the first dictionary of this language prepared in compliance with modern lexicographical standards. The dictionary is an open-access publication, available freely at the following address: http://www.slownikpjm.uw.edu.pl/en/. This interview took place at eLex 2017, a biennial conference on electronic lexicography, where Dr. Rutkowski was awarded the Adam Kilgarriff Prize and gave a keynote address entitled Sign language as a challenge to electronic lexicography: The Corpus-based Dictionary of Polish Sign Language and beyond. The interview was conducted by Dr. Victoria Nyst from Leiden University, Faculty of Humanities, and Dr. Iztok Kosem from the University of Ljubljana, Faculty of Arts.

  8. 38 CFR 38.618 - Findings concerning commission of a capital crime where a person has not been convicted due to...

    Science.gov (United States)

    2010-07-01

    ... commission of a capital crime where a person has not been convicted due to death or flight to avoid... of a capital crime where a person has not been convicted due to death or flight to avoid prosecution... committed a Federal or State capital crime of which he or she was not convicted due to death or flight to...

  9. Sentence Learning in Children and Adults: The Production of Forms and Transforms.

    Science.gov (United States)

    Ehri, Linnea C.

    This investigation was intended to study the effects of some linguistic variables on child and adult memories for sentences when recall was prompted by nouns embedded in the sentences. Its purpose was to examine for developmental differences in sentence processing systems expected by psycholinguistic theory and research. A group of 64 subjects,…

  10. Thai Language Sentence Similarity Computation Based on Syntactic Structure and Semantic Vector

    Science.gov (United States)

    Wang, Hongbin; Feng, Yinhan; Cheng, Liang

    2018-03-01

    Sentence similarity computation plays an increasingly important role in text mining, Web page retrieval, machine translation, speech recognition and question answering systems. Thai language as a kind of resources scarce language, it is not like Chinese language with HowNet and CiLin resources. So the Thai sentence similarity research faces some challenges. In order to solve this problem of the Thai language sentence similarity computation. This paper proposes a novel method to compute the similarity of Thai language sentence based on syntactic structure and semantic vector. This method firstly uses the Part-of-Speech (POS) dependency to calculate two sentences syntactic structure similarity, and then through the word vector to calculate two sentences semantic similarity. Finally, we combine the two methods to calculate two Thai language sentences similarity. The proposed method not only considers semantic, but also considers the sentence syntactic structure. The experiment result shows that this method in Thai language sentence similarity computation is feasible.

  11. A tentative map of discourse studies and their place in Linguistics A tentative map of discourse studies and their place in Linguistics

    Directory of Open Access Journals (Sweden)

    Michael Hoey

    2008-04-01

    Full Text Available In this paper I want to look at a range of approaches to discourse analysis with a view not to describing any of them in detail but to seeing how they relate to each other in the discipline. Surveys are a necessary evil at best; they inevitably offend by simplification and omission and ometimes by distortion as well. Despite my best endeavours, this survey is unlikely to be any exception. By way of compensation the latter part of the paper attempts a tentative characterisation of the place of discourse studies within linguistics in general and seeks to suggest why there are differences in focus between such studies; if this part of the paper offends, it will at least offend by commission. Discourse analysis has become over the past few years one of the growth areas of modern linguistics. As a result there is now far more being written every year than any person could possibly read. In estimates prepared for an inter-University working party on a computerised bibliography of ESP and Discourse Analysis, Hoey & de Escorcia reported that in 1981 an estimated 5,000 papers and 200 books were published on discourse. A consequence of this growth has been a parallel growth in terminological distinctions, reflected most schismatically perhaps in the two labels given to the sub-discipline(s we study - discourse analysis and textlinguistics. In this paper I want to look at a range of approaches to discourse analysis with a view not to describing any of them in detail but to seeing how they relate to each other in the discipline. Surveys are a necessary evil at best; they inevitably offend by simplification and omission and ometimes by distortion as well. Despite my best endeavours, this survey is unlikely to be any exception. By way of compensation the latter part of the paper attempts a tentative characterisation of the place of discourse studies within linguistics in general and seeks to suggest why there are differences in focus between such studies

  12. Decomposing a Chunk into Its Elements and Reorganizing Them As a New Chunk: The Two Different Sub-processes Underlying Insightful Chunk Decomposition

    Directory of Open Access Journals (Sweden)

    Xiaofei Wu

    2017-11-01

    Full Text Available Familiar chunks can be processed highly efficiently, and this automatic process can prohibit the problem solver from developing novel and original ways to creatively solve difficult problems. For this reason, the role of the reverse process, chunk decomposition (CD, the process by which familiar patterns are broken down into their component elements in order to be regrouped in another meaningful manner, has been generally recognized as part of the creative process. However, previous studies on this issue have mainly focused on the decomposition process of CD (the D-process, while the reorganization process of CD has been greatly neglected or has not been distinctively identified in previous work. In this paper, we argue that the R-process could be equally as important as the D-process for CD. Even if a problem solver manages to decompose a familiar chunk into its elements, he or she still may not solve the problem if these elements are not successfully organized in a new and meaningful manner. To investigate whether the cognitive mechanism of the R-process is different from that of the D-process, we designed an experiment for detecting the effects of chunk tightness, which is regarded as the key factor in CD and which can be experimentally manipulated by the radical-level (loose and stroke-level (tight Chinese character CD tasks in the D-process, the R-process, and the more purified organization task (the O-process task that does not involve the decomposition process. Our results showed that the stroke-level (tight task was more difficult than the radical-level (loose task for the D-process. However, for the R-process, the stroke- and radical-level tasks showed no differences in performance. Moreover, for the more purified reorganization task, the O-process task, the radical-level organization and reorganization could be even more difficult than the stroke-level organization and reorganization. This result demonstrated that the cognitive

  13. US News Media Portrayal of Islam and Muslims: A Corpus-Assisted Critical Discourse Analysis

    Science.gov (United States)

    Samaie, Mahmoud; Malmir, Bahareh

    2017-01-01

    This article exploits the synergy of critical discourse studies and Corpus Linguistics to study the pervasive representation of Islam and Muslims in an approximate 670,000-word corpus of US news media stories published between 2001 and 2015. Following collocation and concordance analysis of the most frequent topics or categories which revolve…

  14. The ADEME has been cautious by not publishing its '100 per cent renewable in 2050' scenario - Towards a 100 per cent renewable energy mix in 2050 - Final report

    International Nuclear Information System (INIS)

    Tamain, Bernard; Nifenecker, Herve; Fournie, Laurent; Chiche, Alice; Faure, Nathalie; Bardet, Regis; Alais, Jean-Christophe; Girard, Robin; Bossavy, Arthur; Le Gars, Loic; Biau, Jean-Baptiste; Piqueras, Ugo; Peyrusse, Colombe

    2015-01-01

    In a recent study which has not been published but could have been read by the authors, the ADEME tries to establish that, in 2050, it will be possible, in the sector of electricity production, to do without fossil and nuclear energies, and indicates that such a solution based on 100 per cent of renewable energies is economically viable. In this paper, the authors show that the renewable costs indicated by the ADEME in its '100 per cent renewable' scenario are significantly underestimated. They more precisely discuss the costs of roof-mounted or ground-based photovoltaic energy, ground-based and offshore wind energy, and of energy storage. They discuss issues related to grids, to intermittency management which they consider as an underestimated issue (constraints, imports, electric consumption, storage issues are addressed). The text of the ADEME study is also provided. This study aims at identifying the constraints under which it is possible to provide a 100 per cent renewable electricity, at determining the optimal energy mixes associated with different predictions of technological evolutions, consumption, and so on, at identifying the geographical distribution of the different renewable production means, and at determining the economic impacts of a 100 per cent renewable mix

  15. Cross-linguistic scope ambiguity: When two systems meet

    Directory of Open Access Journals (Sweden)

    Gregory Scontras

    2017-04-01

    Full Text Available Accurately recognizing and resolving ambiguity is a hallmark of linguistic ability. English is a language with scope ambiguities in doubly-quantified sentences like 'A shark ate every pirate'; this sentence can either describe a scenario with a single shark eating all of the pirates, or a scenario with many sharks—a potentially-different one eating each pirate. In Mandarin Chinese, the corresponding sentence is unambiguous, as it can only describe the single-shark scenario. We present experimental evidence to this effect, comparing native speakers of English with native speakers of Mandarin in their interpretations of doubly-quantified sentences. Having demonstrated the difference between these two languages in their ability for inverse scope interpretations, we then probe the robustness of the grammar of scope by extending our experiments to English-dominant adult heritage speakers of Mandarin. Like native speakers of Mandarin, heritage Mandarin speakers lack inverse scope in Mandarin. Crucially, these speakers also lack inverse scope in English, their dominant language in adulthood. We interpret these results as evidence for the pressure to simplify the grammar of scope, decreasing ambiguity when possible. In other words, when two systems meet—as in the case of heritage speakers—the simpler system prevails. This article is part of the special collection: Quantifier Scope

  16. Conceptualizing innovation capabilities: A contingency perspective

    Directory of Open Access Journals (Sweden)

    Tor Helge Aas

    2017-01-01

    Full Text Available Empirical research has confirmed that a positive relationship exists between the implementation of innovation activities and the future performance of organizations. Firms utilize resources and capabilities to develop innovations in the form of new products, services or processes. Some firms prove to be better at reproducing innovation success than others, and the capacity to do so is referred to as innovation capability. However, the term innovation capability is ambiguously treated in extant literature. There are several different definitions of the concept and the distinction between innovation capabilities and other types of capabilities, such as dynamic capabilities, is neither explicitly stated, nor is the relationship between the concept and other resource- and capability-based concepts within strategy theory established. Although innovation is increasingly identified as crucial for a firm’s sustainable competitiveness in contemporary volatile and complex markets, the strategy-innovation link is underdeveloped in extant research. To overcome this challenge this paper raises the following research question: What type of innovation capabilities are required to innovate successfully? Due to the status of the extant research, we chose a conceptual research design to answer our research question and the paper contributes with a conceptual framework to discuss what innovation capabilities firms need to reproduce innovation success. Based on careful examination of current literature on innovation capability specifically, and the strategy-innovation link in general, we suggest that innovation capability must be viewed along two dimensions – innovation novelty and market characteristics. This framework enables the identification of four different contexts for innovation capabilities in a two-bytwo matrix. We discuss the types of innovation capabilities necessary within the four different contexts. This novel framework contributes to the

  17. ContextD: an algorithm to identify contextual properties of medical terms in a Dutch clinical corpus.

    Science.gov (United States)

    Afzal, Zubair; Pons, Ewoud; Kang, Ning; Sturkenboom, Miriam C J M; Schuemie, Martijn J; Kors, Jan A

    2014-11-29

    In order to extract meaningful information from electronic medical records, such as signs and symptoms, diagnoses, and treatments, it is important to take into account the contextual properties of the identified information: negation, temporality, and experiencer. Most work on automatic identification of these contextual properties has been done on English clinical text. This study presents ContextD, an adaptation of the English ConText algorithm to the Dutch language, and a Dutch clinical corpus. We created a Dutch clinical corpus containing four types of anonymized clinical documents: entries from general practitioners, specialists' letters, radiology reports, and discharge letters. Using a Dutch list of medical terms extracted from the Unified Medical Language System, we identified medical terms in the corpus with exact matching. The identified terms were annotated for negation, temporality, and experiencer properties. To adapt the ConText algorithm, we translated English trigger terms to Dutch and added several general and document specific enhancements, such as negation rules for general practitioners' entries and a regular expression based temporality module. The ContextD algorithm utilized 41 unique triggers to identify the contextual properties in the clinical corpus. For the negation property, the algorithm obtained an F-score from 87% to 93% for the different document types. For the experiencer property, the F-score was 99% to 100%. For the historical and hypothetical values of the temporality property, F-scores ranged from 26% to 54% and from 13% to 44%, respectively. The ContextD showed good performance in identifying negation and experiencer property values across all Dutch clinical document types. Accurate identification of the temporality property proved to be difficult and requires further work. The anonymized and annotated Dutch clinical corpus can serve as a useful resource for further algorithm development.

  18. Mannheim Curves in Nonflat 3-Dimensional Space Forms

    Directory of Open Access Journals (Sweden)

    Wenjing Zhao

    2015-01-01

    Full Text Available We consider the Mannheim curves in nonflat 3-dimensional space forms (Riemannian or Lorentzian and we give the concept of Mannheim curves. In addition, we investigate the properties of nonnull Mannheim curves and their partner curves. We come to the conclusion that a necessary and sufficient condition is that a linear relationship with constant coefficients will exist between the curvature and the torsion of the given original curves. In the case of null curve, we reveal that there are no null Mannheim curves in the 3-dimensional de Sitter space.

  19. Audiovisual sentence repetition as a clinical criterion for auditory development in Persian-language children with hearing loss.

    Science.gov (United States)

    Oryadi-Zanjani, Mohammad Majid; Vahab, Maryam; Rahimi, Zahra; Mayahi, Anis

    2017-02-01

    It is important for clinician such as speech-language pathologists and audiologists to develop more efficient procedures to assess the development of auditory, speech and language skills in children using hearing aid and/or cochlear implant compared to their peers with normal hearing. So, the aim of study was the comparison of the performance of 5-to-7-year-old Persian-language children with and without hearing loss in visual-only, auditory-only, and audiovisual presentation of sentence repetition task. The research was administered as a cross-sectional study. The sample size was 92 Persian 5-7 year old children including: 60 with normal hearing and 32 with hearing loss. The children with hearing loss were recruited from Soroush rehabilitation center for Persian-language children with hearing loss in Shiraz, Iran, through consecutive sampling method. All the children had unilateral cochlear implant or bilateral hearing aid. The assessment tool was the Sentence Repetition Test. The study included three computer-based experiments including visual-only, auditory-only, and audiovisual. The scores were compared within and among the three groups through statistical tests in α = 0.05. The score of sentence repetition task between V-only, A-only, and AV presentation was significantly different in the three groups; in other words, the highest to lowest scores belonged respectively to audiovisual, auditory-only, and visual-only format in the children with normal hearing (P audiovisual sentence repetition scores in all the 5-to-7-year-old children (r = 0.179, n = 92, P = 0.088), but audiovisual sentence repetition scores were found to be strongly correlated with auditory-only scores in all the 5-to-7-year-old children (r = 0.943, n = 92, P = 0.000). According to the study's findings, audiovisual integration occurs in the 5-to-7-year-old Persian children using hearing aid or cochlear implant during sentence repetition similar to their peers with normal hearing

  20. The ModelCC Model-Driven Parser Generator

    Directory of Open Access Journals (Sweden)

    Fernando Berzal

    2015-01-01

    Full Text Available Syntax-directed translation tools require the specification of a language by means of a formal grammar. This grammar must conform to the specific requirements of the parser generator to be used. This grammar is then annotated with semantic actions for the resulting system to perform its desired function. In this paper, we introduce ModelCC, a model-based parser generator that decouples language specification from language processing, avoiding some of the problems caused by grammar-driven parser generators. ModelCC receives a conceptual model as input, along with constraints that annotate it. It is then able to create a parser for the desired textual syntax and the generated parser fully automates the instantiation of the language conceptual model. ModelCC also includes a reference resolution mechanism so that ModelCC is able to instantiate abstract syntax graphs, rather than mere abstract syntax trees.

  1. Intrathoracic drainage of a perforated prepyloric gastric ulcer with a type II paraoesophageal hernia

    Directory of Open Access Journals (Sweden)

    Zonneveld Bas JGL

    2008-12-01

    Full Text Available Abstract Background With an incidence of less than 5%, type II paraesophageal hernias are one of the less common types of hiatal hernias. We report a case of a perforated prepyloric gastric ulcer which, due to a type II hiatus hernia, drained into the mediastinum. Case presentation A 61-year old Caucasian man presented with acute abdominal pain. On a conventional x-ray of the chest a large mediastinal air-fluid collection and free intra-abdominal air was seen. Additional computed tomography revealed a large intra-thoracic air-fluid collection with a type II paraesophageal hernia. An emergency upper midline laparotomy was performed and a perforated pre-pyloric gastric ulcer was treated with an omental patch repair. The patient fully recovered after 10 days and continues to do well. Conclusion Type II paraesophageal hernia is an uncommon diagnosis. The main risk is gastric volvulus and possible gastric torsion. Intrathoracic perforation of gastric ulcers due to a type II hiatus hernia is extremely rare and can be a diagnostic and treatment challenge.

  2. Type II Modic Changes May not Always Represent Fat Degeneration: A Study Using MR Fat Suppression Sequence.

    Science.gov (United States)

    Feng, Zhiyun; Liu, Yuanhao; Wei, Wei; Hu, Shengping; Wang, Yue

    2016-08-15

    A radiological study of type II Modic changes (MCs). The aim of this study was to determine the characteristics of type II MCs on fat suppression (FS) magnetic resonance (MR) images and its association with radiological disc degeneration. Type II MCs are common endplate signal changes on MR images. On the basis of limited histological samples, type II MCs are thought to be stable fat degeneration. FS technique on MR, which can quantify fat content, may be an alternative to explore the pathology of MCs. To date, however, the characteristics of type II MCs on FS sequence have not been studied. Lumbar MR images conducted in a single hospital during a defined period were reviewed to include those with type II MCs and FS images. On FS images, signal status of type II MCs was visually classified as suppressed or not-suppressed. Signal intensity of vertebral regions with and without MCs was measured quantitatively on T2-weighted (T2W) and FS images to calculate fat content index and validate the visual classification. Using image analysis program Osirix, MCs size and adjacent disc degeneration were measured quantitatively. Paired t-tests and logistic regressions were used to determine the associations studied. Sixty-four lumbar MRIs were included and 150 endplates with type II MCs were studied. Although signal of 37 (24.7%) type II MCs was suppressed on FS images, that of 113 (75.3%) was not suppressed. The discs adjacent to type II MCs had lower signal intensity (0.13 ± 0.003 vs. 0.14 ± 0.004, P Type II MCs that were not suppressed on FS image were associated with greater age [odds ratio (OR) = 1.11, P type II MCs was not suppressed on FS MR images, suggesting that there are ongoing complicated pathologies. Type II MCs may not merely represent fat replacement. 3.

  3. Using Extra-Linguistic Material for Mandarin-French Verbal Constructions Comparison

    OpenAIRE

    Magistry, Pierre; Prevot, Laurent; Cheung, Hintat; Shiao, C.Y; Desalle, Yann; Gaume, Bruno

    2009-01-01

    International audience; Systematic cross-linguistic studies of verbs syntactic-semantic behaviors for ty-pologically distant languages such as Mandarin Chinese and French are difficult to conduct.Such studies are nevertheless necessary due to the crucial role that verbal constructions playin the mental lexicon. This paper addresses the problem by combining psycho-linguisticsand computational methods. Psycho-linguistics provides us with a bilingual corpus that fea-tures verbal construction ass...

  4. Establishment and evaluation of a stable cattle type II alveolar epithelial cell line.

    Directory of Open Access Journals (Sweden)

    Feng Su

    Full Text Available Macrophages and dendritic cells are recognized as key players in the defense against mycobacterial infection. Recent research has confirmed that alveolar epithelial cells (AECs also play important roles against mycobacterium infections. Thus, establishing a stable cattle AEC line for future endogenous immune research on bacterial invasion is necessary. In the present study, we first purified and immortalized type II AECs (AEC II cells by transfecting them with a plasmid containing the human telomerase reverse trancriptase gene. We then tested whether or not the immortalized cells retained the basic physiological properties of primary AECs by reverse-transcription polymerase chain reaction and Western blot. Finally, we tested the secretion capacity of immortalized AEC II cells upon stimulation by bacterial invasion. The cattle type II alveolar epithelial cell line (HTERT-AEC II that we established retained lung epithelial cell characteristics: the cells were positive for surfactants A and B, and they secreted tumor necrosis factor-α and interleukin-6 in response to bacterial invasion. Thus, the cell line we established is a potential tool for research on the relationship between AECs and Mycobacterium tuberculosis.

  5. Woman in Either/Or, I & II: A Computer Analysis

    Directory of Open Access Journals (Sweden)

    Alastair McKinnon

    2013-11-01

    Full Text Available The author analyzes Kierkegaard's pronouncing on the woman and the feminine in the text Either/Or, specially in parts I and II. For this, key words such as “woman”, “feminine”, “virginity”, “bride”, “wife”, among many others, were selected and Michael J. Greenacre's SimCA 2.0 comparative analysis program was employed in order to show both the common and different perspectives of the aesthete A and the aesthete B, as well as to offer a series of discursive dimensions that are present in the text, which gives a set of valuable conclusions for the understanding of this Kierkegaardian work and the opinion of the danish philosopher on the woman and what is proper to her. The results of the analysis are shown at the end of the paper with tables which compare A's and B's vocabulary and show their frequency; other figures exhibit with graphs the behaviour of the key words against the most important discursive dimensions.

  6. SEISRISK II; a computer program for seismic hazard estimation

    Science.gov (United States)

    Bender, Bernice; Perkins, D.M.

    1982-01-01

    The computer program SEISRISK II calculates probabilistic ground motion values for use in seismic hazard mapping. SEISRISK II employs a model that allows earthquakes to occur as points within source zones and as finite-length ruptures along faults. It assumes that earthquake occurrences have a Poisson distribution, that occurrence rates remain constant during the time period considered, that ground motion resulting from an earthquake is a known function of magnitude and distance, that seismically homogeneous source zones are defined, that fault locations are known, that fault rupture lengths depend on magnitude, and that earthquake rates as a function of magnitude are specified for each source. SEISRISK II calculates for each site on a grid of sites the level of ground motion that has a specified probability of being exceeded during a given time period. The program was designed to process a large (essentially unlimited) number of sites and sources efficiently and has been used to produce regional and national maps of seismic hazard.}t is a substantial revision of an earlier program SEISRISK I, which has never been documented. SEISRISK II runs considerably [aster and gives more accurate results than the earlier program and in addition includes rupture length and acceleration variability which were not contained in the original version. We describe the model and how it is implemented in the computer program and provide a flowchart and listing of the code.

  7. Evidence that Mediator is essential for Pol II transcription, but is not a required component of the preinitiation complex in vivo.

    Science.gov (United States)

    Petrenko, Natalia; Jin, Yi; Wong, Koon Ho; Struhl, Kevin

    2017-07-12

    The Mediator complex has been described as a general transcription factor, but it is unclear if it is essential for Pol II transcription and/or is a required component of the preinitiation complex (PIC) in vivo. Here, we show that depletion of individual subunits, even those essential for cell growth, causes a general but only modest decrease in transcription. In contrast, simultaneous depletion of all Mediator modules causes a drastic decrease in transcription. Depletion of head or middle subunits, but not tail subunits, causes a downstream shift in the Pol II occupancy profile, suggesting that Mediator at the core promoter inhibits promoter escape. Interestingly, a functional PIC and Pol II transcription can occur when Mediator is not detected at core promoters. These results provide strong evidence that Mediator is essential for Pol II transcription and stimulates PIC formation, but it is not a required component of the PIC in vivo.

  8. Linguistic and perceptual-motor contributions to the kinematic properties of the braille reading finger.

    Science.gov (United States)

    Hughes, Barry; Van Gemmert, Arend W A; Stelmach, George E

    2011-08-01

    Recordings of the dominant finger during the reading of braille sentences by experienced readers reveal that the velocity of the finger changes frequently during the traverse of a line of text. These changes, not previously reported, involve a multitude of accelerations and decelerations, as well as reversals of direction. We investigated the origin of these velocity intermittencies (as well as movement reversals) by asking readers to twice read out-loud or silently sentences comprising high- or low-frequency words which combined to make grammatical sentences that were either meaningful or nonmeaningful. In a control condition we asked braille readers to smoothly scan lines of braille comprised of meaningless cell combinations. Word frequency and re-reading each contribute to the kinematics of finger movements, but neither sentence meaning nor the mode of reading do so. The velocity intermittencies were so pervasive that they are not easily attributable either to linguistic processing, text familiarity, mode of reading, or to sensory-motor interactions with the textured patterns of braille, but seem integral to all braille finger movements except reversals. While language-related processing can affect the finger movements, the effects are superimposed on a highly intermittent velocity profile whose origin appears to lie in the motor control of slow movements. Copyright © 2010 Elsevier B.V. All rights reserved.

  9. Variability and Intelligibility of Clarified Speech to Different Listener Groups

    Science.gov (United States)

    Silber, Ronnie F.

    Two studies examined the modifications that adult speakers make in speech to disadvantaged listeners. Previous research that has focused on speech to the deaf individuals and to young children has shown that adults clarify speech when addressing these two populations. Acoustic measurements suggest that the signal undergoes similar changes for both populations. Perceptual tests corroborate these results for the deaf population, but are nonsystematic in developmental studies. The differences in the findings for these populations and the nonsystematic results in the developmental literature may be due to methodological factors. The present experiments addressed these methodological questions. Studies of speech to hearing impaired listeners have used read, nonsense, sentences, for which speakers received explicit clarification instructions and feedback, while in the child literature, excerpts of real-time conversations were used. Therefore, linguistic samples were not precisely matched. In this study, experiments used various linguistic materials. Experiment 1 used a children's story; experiment 2, nonsense sentences. Four mothers read both types of material in four ways: (1) in "normal" adult speech, (2) in "babytalk," (3) under the clarification instructions used in the "hearing impaired studies" (instructed clear speech) and (4) in (spontaneous) clear speech without instruction. No extra practice or feedback was given. Sentences were presented to 40 normal hearing college students with and without simultaneous masking noise. Results were separately tabulated for content and function words, and analyzed using standard statistical tests. The major finding in the study was individual variation in speaker intelligibility. "Real world" speakers vary in their baseline intelligibility. The four speakers also showed unique patterns of intelligibility as a function of each independent variable. Results were as follows. Nonsense sentences were less intelligible than story

  10. Using Edit Distance to Analyse Errors in a Natural Language to Logic Translation Corpus

    Science.gov (United States)

    Barker-Plummer, Dave; Dale, Robert; Cox, Richard; Romanczuk, Alex

    2012-01-01

    We have assembled a large corpus of student submissions to an automatic grading system, where the subject matter involves the translation of natural language sentences into propositional logic. Of the 2.3 million translation instances in the corpus, 286,000 (approximately 12%) are categorized as being in error. We want to understand the nature of…

  11. A Computer String-Grammar of English.

    Science.gov (United States)

    Sager, Naomi

    This volume is the fourth in a series of detailed reports on a working computer program for the syntactic analysis of English sentences into their component strings. The report (1) records the considerations involved in various decisions among alternative grammatical formulations and presents the word-subclasses, the linguistic strings, etc., for…

  12. Dependency-dependent interference: NPI interference, agreement attraction, and global pragmatic inferences

    Directory of Open Access Journals (Sweden)

    Ming eXiang

    2013-10-01

    Full Text Available Previous psycholinguistics studies have shown that when forming a long distance dependency in online processing, the parser sometimes accepts a sentence even though the required grammatical constraints are only partially met. A mechanistic account of how such errors arise sheds light on both the underlying linguistic representations involved and the processing mechanisms that put such representations together. In the current study, we contrast the NPI (negative polarity items interference effect, as shown by the acceptance of an ungrammatical sentence like The bills that democratic senators have voted for will ever become law, with the well-known phenomenon of agreement attraction (The key to the cabinets are…. On the surface, these two types of errors look alike and thereby can be explained as being driven by the same source: similarity based memory interference. However, we argue that the linguistic representations involved in NPI licensing are substantially different from those of subject-verb agreement, and therefore the interference effects in each domain potentially arise from distinct sources. In particular, we show that NPI interference at least partially arises from pragmatic inferences. In a self-paced reading study with an acceptability judgment task, we showed NPI interference was modulated by participants’ general pragmatic communicative skills, as quantified by the Autism-Spectrum Quotient (Baron-Cohen 2001, especially in offline tasks. Participants with more autistic traits were actually less prone to the NPI interference effect than those with fewer autistic traits. This result contrasted with agreement attraction conditions, which were not influenced by individual pragmatic skill differences. We also show that different NPI licensors have distinct interference profiles. We discuss two kinds of interference effects for NPI licensing: memory-retrieval based and pragmatically triggered.

  13. The linguistic roots of Natural Pedagogy

    Directory of Open Access Journals (Sweden)

    Otávio eMattos

    2015-09-01

    Full Text Available Natural pedagogy is a human-specific capacity that allows us to acquire cultural information from communication even before the emergence of the first words, encompassing three core elements: (i a sensitivity to ostensive signals like eye contact that indicate to infants that they are being addressed through communication, (ii a subsequent referential expectation (satisfied by the use of declarative gestures and (iii a biased interpretation of ostensive-referential communication as conveying relevant information about the referent's kind (Csibra & Gergely, 2011, 2009, 2006. Remarkably, the link between natural pedagogy and another human-specific capacity, namely language, has rarely been investigated in detail. We here argue that children’s production and comprehension of declarative gestures around 10 months of age are in fact expressions of an evolving faculty of language. Through both declarative gestures and ostensive signals, infants can assign the roles of 3rd , 2nd and 1st person, building the ‘deictic space’ that grounds both natural pedagogy and language use. Secondly, we argue that the emergence of two kinds of linguistic structures (i.e. proto-determiner phrases and proto-sentences in the one-word period sheds light on the different kinds of information that children can acquire or convey at different stages of development (namely, generic knowledge about kinds and knowledge about particular events/actions/state of affairs, respectively. Furthermore, the development of nominal and temporal reference in speech allows children to cognize information in terms of spatial and temporal relations. In this way, natural pedagogy transpires as an inherent aspect of our faculty of language, rather than as an independent adaptation that pre-dates language in evolution or development (Csibra & Gergely, 2006. This hypothesis is further testable through predictions it makes on the different linguistic profiles of toddlers with developmental

  14. The linguistic roots of natural pedagogy.

    Science.gov (United States)

    Mattos, Otávio; Hinzen, Wolfram

    2015-01-01

    Natural pedagogy is a human-specific capacity that allows us to acquire cultural information from communication even before the emergence of the first words, encompassing three core elements: (i) a sensitivity to ostensive signals like eye contact that indicate to infants that they are being addressed through communication, (ii) a subsequent referential expectation (satisfied by the use of declarative gestures) and (iii) a biased interpretation of ostensive-referential communication as conveying relevant information about the referent's kind (Csibra and Gergely, 2006, 2009, 2011). Remarkably, the link between natural pedagogy and another human-specific capacity, namely language, has rarely been investigated in detail. We here argue that children's production and comprehension of declarative gestures around 10 months of age are in fact expressions of an evolving faculty of language. Through both declarative gestures and ostensive signals, infants can assign the roles of third, second, and first person, building the 'deictic space' that grounds both natural pedagogy and language use. Secondly, we argue that the emergence of two kinds of linguistic structures (i.e., proto-determiner phrases and proto-sentences) in the one-word period sheds light on the different kinds of information that children can acquire or convey at different stages of development (namely, generic knowledge about kinds and knowledge about particular events/actions/state of affairs, respectively). Furthermore, the development of nominal and temporal reference in speech allows children to cognize information in terms of spatial and temporal relations. In this way, natural pedagogy transpires as an inherent aspect of our faculty of language, rather than as an independent adaptation that pre-dates language in evolution or development (Csibra and Gergely, 2006). This hypothesis is further testable through predictions it makes on the different linguistic profiles of toddlers with developmental

  15. Predictors of post-sentence mental health service use in a population cohort of first-time adult offenders in Western Australia.

    Science.gov (United States)

    Sodhi-Berry, Nita; Knuiman, Matthew; Preen, David B; Alan, Janine; Morgan, Vera A

    2015-12-01

    Little is known about whether or how offenders use mental health services after sentence completion. This study aimed to determine the likelihood of such service use by adult (18-44 years) first-time offenders up to 5 years after sentence completion and possible predictor variables. Pre-sentence and post-sentence mental health service use was obtained from whole-population linked administrative data on 23,661 adult offenders. Cox proportional hazard models were used to determine which socio-demographic, offending and pre-sentence health service variables were associated with such post-sentence service use. The estimated 5-year probability of any post-sentence mental health service use was 12% for offenders who had not previously used such services, but still only 42% for those who had. For the latter, best predictors of post-sentence use were past psychiatric diagnosis and history of self-harm; history of self-harm also predicted post-sentence use among new mental health services users and so also did past physical illness. Indigenous offenders had a greater likelihood of service use for any mental disorder or for substance use disorders than non-Indigenous offenders, irrespective of pre-sentence use. Among those with pre-sentence service contact, imprisoned offenders were less likely to use mental health services after sentence than those under community penalties; in its absence, socio-economic disadvantage and geographic accessibility were associated with greater likelihood of post-sentence use. Our findings highlight the discontinuity of mental healthcare for most sentenced offenders, but especially prisoners, and suggest a need for better management strategies for these vulnerable groups with mental disorders. Copyright © 2014 John Wiley & Sons, Ltd.

  16. Un corpus DIY pour l’étude du roumain en diachronie. Stratégies de constitution et stratégies de recherche

    Directory of Open Access Journals (Sweden)

    Ana Zisman

    2017-12-01

    Full Text Available The present paper aims to provide an overview of some of the advantages of creating and working with a DIY corpus, i.e. a corpus compiled by the linguist, as groundwork for a PhD thesis. Collected in order to investigate the grammatical and pragmatical behavior in historical Romanian of some so-called parenthetical verbs: a zice/ a spune ‘to say’, a crede ‘to think’, a şti ‘to know’, within 5 types of texts from the 16th/17th to the 20th centuries, this DIY corpus represents a necessary alternative as a database of Romania texts. Although its creation demanded some additional steps (e.g. the selection of the texts, which is determined by various diachronical factors, such a corpus proves to be relevant for investigating parenthetical verbs in literary, historical and law texts, as well as in formal and informal letters. In order to do so, the paradigm of the afore-mentioned verbs has to be systematized in relation to a precise word frequency per text type.

  17. A shared neural substrate for mentalizing and the affective component of sentence comprehension.

    Directory of Open Access Journals (Sweden)

    Pierre-Yves Hervé

    Full Text Available Using event-related fMRI in a sample of 42 healthy participants, we compared the cerebral activity maps obtained when classifying spoken sentences based on the mental content of the main character (belief, deception or empathy or on the emotional tonality of the sentence (happiness, anger or sadness. To control for the effects of different syntactic constructions (such as embedded clauses in belief sentences, we subtracted from each map the BOLD activations obtained during plausibility judgments on structurally matching sentences, devoid of emotions or ToM. The obtained theory of mind (ToM and emotional speech comprehension networks overlapped in the bilateral temporo-parietal junction, posterior cingulate cortex, right anterior temporal lobe, dorsomedial prefrontal cortex and in the left inferior frontal sulcus. These regions form a ToM network, which contributes to the emotional component of spoken sentence comprehension. Compared with the ToM task, in which the sentences were enounced on a neutral tone, the emotional sentence classification task, in which the sentences were play-acted, was associated with a greater activity in the bilateral superior temporal sulcus, in line with the presence of emotional prosody. Besides, the ventromedial prefrontal cortex was more active during emotional than ToM sentence processing. This region may link mental state representations with verbal and prosodic emotional cues. Compared with emotional sentence classification, ToM was associated with greater activity in the caudate nucleus, paracingulate cortex, and superior frontal and parietal regions, in line with behavioral data showing that ToM sentence comprehension was a more demanding task.

  18. A Computational Evaluation of Sentence Processing Deficits in Aphasia

    Science.gov (United States)

    Patil, Umesh; Hanne, Sandra; Burchert, Frank; De Bleser, Ria; Vasishth, Shravan

    2016-01-01

    Individuals with agrammatic Broca's aphasia experience difficulty when processing reversible non-canonical sentences. Different accounts have been proposed to explain this phenomenon. The Trace Deletion account (Grodzinsky, 1995, 2000, 2006) attributes this deficit to an impairment in syntactic representations, whereas others (e.g., Caplan,…

  19. Non-linguistic Conditions for Causativization as a Linguistic Attractor.

    Science.gov (United States)

    Nichols, Johanna

    2017-01-01

    An attractor, in complex systems theory, is any state that is more easily or more often entered or acquired than departed or lost; attractor states therefore accumulate more members than non-attractors, other things being equal. In the context of language evolution, linguistic attractors include sounds, forms, and grammatical structures that are prone to be selected when sociolinguistics and language contact make it possible for speakers to choose between competing forms. The reasons why an element is an attractor are linguistic (auditory salience, ease of processing, paradigm structure, etc.), but the factors that make selection possible and propagate selected items through the speech community are non-linguistic. This paper uses the consonants in personal pronouns to show what makes for an attractor and how selection and diffusion work, then presents a survey of several language families and areas showing that the derivational morphology of pairs of verbs like fear and frighten , or Turkish korkmak 'fear, be afraid' and korkutmak 'frighten, scare', or Finnish istua 'sit' and istutta 'seat (someone)', or Spanish sentarse 'sit down' and sentar 'seat (someone)' is susceptible to selection. Specifically, the Turkish and Finnish pattern, where 'seat' is derived from 'sit' by addition of a suffix-is an attractor and a favored target of selection. This selection occurs chiefly in sociolinguistic contexts of what is defined here as linguistic symbiosis, where languages mingle in speech, which in turn is favored by certain demographic, sociocultural, and environmental factors here termed frontier conditions. Evidence is surveyed from northern Eurasia, the Caucasus, North and Central America, and the Pacific and from both modern and ancient languages to raise the hypothesis that frontier conditions and symbiosis favor causativization.

  20. Learning to perceive and recognize a second language: the L2LP model revised.

    Science.gov (United States)

    van Leussen, Jan-Willem; Escudero, Paola

    2015-01-01

    We present a test of a revised version of the Second Language Linguistic Perception (L2LP) model, a computational model of the acquisition of second language (L2) speech perception and recognition. The model draws on phonetic, phonological, and psycholinguistic constructs to explain a number of L2 learning scenarios. However, a recent computational implementation failed to validate a theoretical proposal for a learning scenario where the L2 has less phonemic categories than the native language (L1) along a given acoustic continuum. According to the L2LP, learners faced with this learning scenario must not only shift their old L1 phoneme boundaries but also reduce the number of categories employed in perception. Our proposed revision to L2LP successfully accounts for this updating in the number of perceptual categories as a process driven by the meaning of lexical items, rather than by the learners' awareness of the number and type of phonemes that are relevant in their new language, as the previous version of L2LP assumed. Results of our simulations show that meaning-driven learning correctly predicts the developmental path of L2 phoneme perception seen in empirical studies. Additionally, and to contribute to a long-standing debate in psycholinguistics, we test two versions of the model, with the stages of phonemic perception and lexical recognition being either sequential or interactive. Both versions succeed in learning to recognize minimal pairs in the new L2, but make diverging predictions on learners' resulting phonological representations. In sum, the proposed revision to the L2LP model contributes to our understanding of L2 acquisition, with implications for speech processing in general.

  1. A frequency-based linguistic approach to protein decoding and design: Simple concepts, diverse applications, and the SCS Package

    Science.gov (United States)

    Motomura, Kenta; Nakamura, Morikazu; Otaki, Joji M.

    2013-01-01

    Protein structure and function information is coded in amino acid sequences. However, the relationship between primary sequences and three-dimensional structures and functions remains enigmatic. Our approach to this fundamental biochemistry problem is based on the frequencies of short constituent sequences (SCSs) or words. A protein amino acid sequence is considered analogous to an English sentence, where SCSs are equivalent to words. Availability scores, which are defined as real SCS frequencies in the non-redundant amino acid database relative to their probabilistically expected frequencies, demonstrate the biological usage bias of SCSs. As a result, this frequency-based linguistic approach is expected to have diverse applications, such as secondary structure specifications by structure-specific SCSs and immunological adjuvants with rare or non-existent SCSs. Linguistic similarities (e.g., wide ranges of scale-free distributions) and dissimilarities (e.g., behaviors of low-rank samples) between proteins and the natural English language have been revealed in the rank-frequency relationships of SCSs or words. We have developed a web server, the SCS Package, which contains five applications for analyzing protein sequences based on the linguistic concept. These tools have the potential to assist researchers in deciphering structurally and functionally important protein sites, species-specific sequences, and functional relationships between SCSs. The SCS Package also provides researchers with a tool to construct amino acid sequences de novo based on the idiomatic usage of SCSs. PMID:24688703

  2. A FREQUENCY-BASED LINGUISTIC APPROACH TO PROTEIN DECODING AND DESIGN: SIMPLE CONCEPTS, DIVERSE APPLICATIONS, AND THE SCS PACKAGE

    Directory of Open Access Journals (Sweden)

    Kenta Motomura

    2013-02-01

    Full Text Available Protein structure and function information is coded in amino acid sequences. However, the relationship between primary sequences and three-dimensional structures and functions remains enigmatic. Our approach to this fundamental biochemistry problem is based on the frequencies of short constituent sequences (SCSs or words. A protein amino acid sequence is considered analogous to an English sentence, where SCSs are equivalent to words. Availability scores, which are defined as real SCS frequencies in the non-redundant amino acid database relative to their probabilistically expected frequencies, demonstrate the biological usage bias of SCSs. As a result, this frequency-based linguistic approach is expected to have diverse applications, such as secondary structure specifications by structure-specific SCSs and immunological adjuvants with rare or non-existent SCSs. Linguistic similarities (e.g., wide ranges of scale-free distributions and dissimilarities (e.g., behaviors of low-rank samples between proteins and the natural English language have been revealed in the rank-frequency relationships of SCSs or words. We have developed a web server, the SCS Package, which contains five applications for analyzing protein sequences based on the linguistic concept. These tools have the potential to assist researchers in deciphering structurally and functionally important protein sites, species-specific sequences, and functional relationships between SCSs. The SCS Package also provides researchers with a tool to construct amino acid sequences de novo based on the idiomatic usage of SCSs.

  3. Automatic recognition of touch gestures in the corpus of social touch

    NARCIS (Netherlands)

    Jung, Merel Madeleine; Poel, Mannes; Poppe, Ronald Walter; Heylen, Dirk K.J.

    For an artifact such as a robot or a virtual agent to respond appropriately to human social touch behavior, it should be able to automatically detect and recognize touch. This paper describes the data collection of CoST: Corpus of Social Touch, a data set containing 7805 captures of 14 different

  4. The Linguistic Landscape as a Learning Space for Contextual Language Learning

    Science.gov (United States)

    Aladjem, Ruthi; Jou, Bibiana

    2016-01-01

    One of the challenges of teaching and learning a foreign language is that students are not being sufficiently exposed to the target language. However, it is quite common to find linguistic and cultural exponents of different foreign languages in authentic contexts (termed the "Linguistic landscape"). Using the Linguistic landscape as a…

  5. Type II NKT Cells in Inflammation, Autoimmunity, Microbial Immunity, and Cancer.

    Science.gov (United States)

    Marrero, Idania; Ware, Randle; Kumar, Vipin

    2015-01-01

    Natural killer T cells (NKT) recognize self and microbial lipid antigens presented by non-polymorphic CD1d molecules. Two major NKT cell subsets, type I and II, express different types of antigen receptors (TCR) with distinct mode of CD1d/lipid recognition. Though type II NKT cells are less frequent in mice and difficult to study, they are predominant in human. One of the major subsets of type II NKT cells reactive to the self-glycolipid sulfatide is the best characterized and has been shown to induce a dominant immune regulatory mechanism that controls inflammation in autoimmunity and in anti-cancer immunity. Recently, type II NKT cells reactive to other self-glycolipids and phospholipids have been identified suggesting both promiscuous and specific TCR recognition in microbial immunity as well. Since the CD1d pathway is highly conserved, a detailed understanding of the biology and function of type II NKT cells as well as their interplay with type I NKT cells or other innate and adaptive T cells will have major implications for potential novel interventions in inflammatory and autoimmune diseases, microbial immunity, and cancer.

  6. Syntactic computation in the human brain: the degree of merger as a key factor.

    Directory of Open Access Journals (Sweden)

    Shinri Ohta

    Full Text Available Our goal of this study is to characterize the functions of language areas in most precise terms. Previous neuroimaging studies have reported that more complex sentences elicit larger activations in the left inferior frontal gyrus (L. F3op/F3t, although the most critical factor still remains to be identified. We hypothesize that pseudowords with grammatical particles and morphosyntactic information alone impose a construction of syntactic structures, just like normal sentences, and that "the Degree of Merger" (DoM in recursively merged sentences parametrically modulates neural activations. Using jabberwocky sentences with distinct constructions, we fitted various parametric models of syntactic, other linguistic, and nonlinguistic factors to activations measured with functional magnetic resonance imaging. We demonstrated that the models of DoM and "DoM+number of Search (searching syntactic features" were the best to explain activations in the L. F3op/F3t and supramarginal gyrus (L. SMG, respectively. We further introduced letter strings, which had neither lexical associations nor grammatical particles, but retained both matching orders and symbol orders of sentences. By directly contrasting jabberwocky sentences with letter strings, localized activations in L. F3op/F3t and L. SMG were indeed independent of matching orders and symbol orders. Moreover, by using dynamic causal modeling, we found that the model with a inhibitory modulatory effect for the bottom-up connectivity from L. SMG to L. F3op/F3t was the best one. For this best model, the top-down connection from L. F3op/F3t to L. SMG was significantly positive. By using diffusion-tensor imaging, we confirmed that the left dorsal pathway of the superior longitudinal and arcuate fasciculi consistently connected these regions. Lastly, we established that nonlinguistic order-related and error-related factors significantly activated the right (R. lateral premotor cortex and R. F3op/F3t

  7. Structures, Not Strings: Linguistics as Part of the Cognitive Sciences.

    Science.gov (United States)

    Everaert, Martin B H; Huybregts, Marinus A C; Chomsky, Noam; Berwick, Robert C; Bolhuis, Johan J

    2015-12-01

    There are many questions one can ask about human language: its distinctive properties, neural representation, characteristic uses including use in communicative contexts, variation, growth in the individual, and origin. Every such inquiry is guided by some concept of what 'language' is. Sharpening the core question--what is language?--and paying close attention to the basic property of the language faculty and its biological foundations makes it clear how linguistics is firmly positioned within the cognitive sciences. Here we will show how recent developments in generative grammar, taking language as a computational cognitive mechanism seriously, allow us to address issues left unexplained in the increasingly popular surface-oriented approaches to language. Copyright © 2015 Elsevier Ltd. All rights reserved.

  8. Sonority as variation: A study about the conceptualization of physical notions in university students

    International Nuclear Information System (INIS)

    Escudero, Consuelo; Jaime, Eduardo A

    2007-01-01

    Results of researches over conceptions and specific competencies of university students as regards acoustic waves and their conceptualization are put forward in this paper. The starting point is a theoretical scheme previously done that allows the linking and interconnection of theorical contributions related with the cognitive psychology, the developmental psychology, problems solving, the linguistic and symbolical representation of concepts and their relation with the didactics. The corpus is made up mainly by answers to written works which have allowed analyzing implicit conceptions of students, especially those ignored or misunderstood by them. This is a qualitative research, in which data are grouped in categories that are not provided before the theoretical framework. Conclusions show the potentiality of the theoretical framework to interpret processes of meaning building of the level of sonority as variation, and for the design and improvement of instructional proposals tending to achieve a critical meaningful learning

  9. Sonority as variation: A study about the conceptualization of physical notions in university students

    Energy Technology Data Exchange (ETDEWEB)

    Escudero, Consuelo; Jaime, Eduardo A [Departamento de Fisica, Facultad de Ingenieria - UNSJ (Argentina); Departamento de BiologIa, Facultad de Ciencias Exactas, Fisicas y Naturales - UNSJ (Argentina)

    2007-11-15

    Results of researches over conceptions and specific competencies of university students as regards acoustic waves and their conceptualization are put forward in this paper. The starting point is a theoretical scheme previously done that allows the linking and interconnection of theorical contributions related with the cognitive psychology, the developmental psychology, problems solving, the linguistic and symbolical representation of concepts and their relation with the didactics. The corpus is made up mainly by answers to written works which have allowed analyzing implicit conceptions of students, especially those ignored or misunderstood by them. This is a qualitative research, in which data are grouped in categories that are not provided before the theoretical framework. Conclusions show the potentiality of the theoretical framework to interpret processes of meaning building of the level of sonority as variation, and for the design and improvement of instructional proposals tending to achieve a critical meaningful learning.

  10. The effectiveness of linguistic plays on the grammatical skills of hearing-impaired children with hearing aids

    Directory of Open Access Journals (Sweden)

    Sahar Mohammad Esmaeilzadeh

    2014-12-01

    Full Text Available Background and Aim: Grammatical skills development of hearing-impaired children depends on using appropriate educational rehabilitation programs. This study aims to investigate the effectiveness of linguistic plays on the grammatical skills in hearing-impaired children with hearing aids.Methods: Ten hearing-impaired children with hearing aids, aged between 5 and 7, were randomly assigned to two groups (5 children in each group. Each treatment group received 12 sessions on linguistic plays. The grammatical skills of these children were evaluated via the TOLD-P: 3 (Persian version; in addition, their level of intelligence was assessed by the Raven test.Results: The difference between the scores of both control and treatment groups revealed a statistically significant difference in grammatical skills (t=7.61, p=0.001 and three subskills of the children who participated in the linguistic plays. These subskills include syntactic understanding (t=3.16, p=0.013, sentence imitation (t=1.71, p=0.006, and morphological completion (t=6.55, p=0.001. In other words, the findings suggest that linguistic plays have a significant impact on the improvement of the aforementioned skills in hearing-impaired children.Conclusion: Results suggest that it would be beneficial to include linguistic plays as part of routine rehabilitation programs as a means of improving the grammatical difficulties of children. After partaking in linguistic plays, children significantly improved their ability to comprehend the meaning of sentences and also to recognize, understand, and use common Persian morphological forms.

  11. [Development of a System to Use Patient's Information Which is Required at the Radiological Department].

    Science.gov (United States)

    Satoh, Akihiro

    2016-04-01

    The purpose of this study is to develop a new system to get and share some data of a patient which are required for a radiological examination not using an electronic medical chart or a radiological information system (RIS), and also to demonstrate that this system is operated on cloud technology. I used Java Enterprise Edition (Java EE) as a programing language and MySQL as a server software, and I used two laptops as hardware for client computer and server computer. For cloud computing, I hired a server of Google App Engine for Java (GAE). As a result, I could get some data of the patient required at his/her examination instantly using this system. This system also helps to improve the efficiency of examination. For example, it has been useful when I want to decide radiographic condition or to create CT images such as multi-planar reconstruction (MPR) or volume rendering (VR). When it comes to cloud computing, the GAE was used experimentally due to some legal restrictions. From the above points it is clear that this system has played an important role in radiological examinations, but there has been still few things which I have to resolve for cloud computing.

  12. Identification of DRG-1 As a Melanoma-Associated Antigen Recognized by CD4+ Th1 Cells.

    Directory of Open Access Journals (Sweden)

    Yukiko Kiniwa

    Full Text Available Immunotherapy has emerged as a promising strategy for the treatment of metastatic melanoma. Clinical studies have demonstrated the feasibility of cancer immunotherapy using tumor antigens recognized by CD8(+ T cells. However, the overall immune responses induced by these antigens are too weak and transient to induce tumor regression in the majority of patients who received immunization. A growing body of evidence suggests that CD4(+ T helper (Th cells play an important role in antitumor immunity. Therefore, the identification of MHC class II-restricted tumor antigens capable of stimulating CD4(+ T cells may provide opportunities for developing effective cancer vaccines. To this end, we describe the identification of developmentally regulated GTP-binding protein 1 (DRG-1 as a melanoma-associated antigen recognized by HLA-DR11-restricted CD4(+ Th1 cells. Epitope mapping analysis showed that the DRG1248-268 epitope of DRG-1 was required for T cell recognition. Reverse transcription-polymerase chain reaction revealed that DRG-1 was highly expressed in melanoma cell lines but not in normal tissues. DRG-1 knockdown by lentiviral-based shRNA suppressed melanoma cell proliferation and soft agar colony formation. Taken together, these data suggest that DRG-1 plays an important role in melanoma cell growth and transformation, indicating that DRG1 may represent a novel target for CD4(+ T cell-mediated immunotherapy in melanoma.

  13. Identification of DRG-1 As a Melanoma-Associated Antigen Recognized by CD4+ Th1 Cells.

    Science.gov (United States)

    Kiniwa, Yukiko; Li, Jiang; Wang, Mingjun; Sun, Chuang; Lee, Jeffrey E; Wang, Rong-Fu; Wang, Helen Y

    2015-01-01

    Immunotherapy has emerged as a promising strategy for the treatment of metastatic melanoma. Clinical studies have demonstrated the feasibility of cancer immunotherapy using tumor antigens recognized by CD8(+) T cells. However, the overall immune responses induced by these antigens are too weak and transient to induce tumor regression in the majority of patients who received immunization. A growing body of evidence suggests that CD4(+) T helper (Th) cells play an important role in antitumor immunity. Therefore, the identification of MHC class II-restricted tumor antigens capable of stimulating CD4(+) T cells may provide opportunities for developing effective cancer vaccines. To this end, we describe the identification of developmentally regulated GTP-binding protein 1 (DRG-1) as a melanoma-associated antigen recognized by HLA-DR11-restricted CD4(+) Th1 cells. Epitope mapping analysis showed that the DRG1248-268 epitope of DRG-1 was required for T cell recognition. Reverse transcription-polymerase chain reaction revealed that DRG-1 was highly expressed in melanoma cell lines but not in normal tissues. DRG-1 knockdown by lentiviral-based shRNA suppressed melanoma cell proliferation and soft agar colony formation. Taken together, these data suggest that DRG-1 plays an important role in melanoma cell growth and transformation, indicating that DRG1 may represent a novel target for CD4(+) T cell-mediated immunotherapy in melanoma.

  14. Structure and meaning in English a guide for teachers

    CERN Document Server

    Kennedy, Graeme

    2014-01-01

    Structure and Meaning in English is designed to help teachers of English develop an understanding of those aspects of English which are especially relevant for learners who speak other languages. Using corpus research, Graeme Kennedy cuts to the heart of what is important in the teaching of English. The book provides pedagogically- relevant information about English at the levels of sounds, words, sentences and texts. It draws attention to those linguistic items and processes which research has shown are typically hard for learners and which lead to errors.Each chapter contains:a description o

  15. Use of "Google Scholar" in Corpus-Driven EAP Research

    Science.gov (United States)

    Brezina, Vaclav

    2012-01-01

    This primarily methodological article makes a proposition for linguistic exploration of textual resources available through the "Google Scholar" search engine. These resources ("Google Scholar virtual corpus") are significantly larger than any existing corpus of academic writing. "Google Scholar", however, was not designed for linguistic searches…

  16. Is there a role for the lexis-grammar interface in interlanguage complexity research?

    OpenAIRE

    Paquot, Magali; Colloquium on cross-linguistic aspects of complexity in second language research

    2014-01-01

    A major contribution of recent research in theoretical linguistics, corpus linguistics and psycholinguistics has been to provide convergent evidence that lexis and grammar are closely intertwined (Sinclair, 1991; Stefanowitsch & Gries, 2003; Goldberg, 2006, Ellis & Cadierno, 2009; Römer, 2009). It has also been convincingly demonstrated that language is essentially made up of word combinations that constitute single choices and that words acquire meanings from their context (Sinclair, 1991; B...

  17. A 38 million words Dutch text corpus and its users | Kruyt | Lexikos

    African Journals Online (AJOL)

    In August 1996, the 38 Million Words Corpus was available for consultation by the international research community. The present paper reports on the characteristics of this corpus (design, text classification, linguistic annotation) and on its use, both in dictionary projects and in linguistic research. In spite of limitations with ...

  18. Complex exon-intron marking by histone modifications is not determined solely by nucleosome distribution.

    Directory of Open Access Journals (Sweden)

    Pawandeep Dhami

    2010-08-01

    Full Text Available It has recently been shown that nucleosome distribution, histone modifications and RNA polymerase II (Pol II occupancy show preferential association with exons ("exon-intron marking", linking chromatin structure and function to co-transcriptional splicing in a variety of eukaryotes. Previous ChIP-sequencing studies suggested that these marking patterns reflect the nucleosomal landscape. By analyzing ChIP-chip datasets across the human genome in three cell types, we have found that this marking system is far more complex than previously observed. We show here that a range of histone modifications and Pol II are preferentially associated with exons. However, there is noticeable cell-type specificity in the degree of exon marking by histone modifications and, surprisingly, this is also reflected in some histone modifications patterns showing biases towards introns. Exon-intron marking is laid down in the absence of transcription on silent genes, with some marking biases changing or becoming reversed for genes expressed at different levels. Furthermore, the relationship of this marking system with splicing is not simple, with only some histone modifications reflecting exon usage/inclusion, while others mirror patterns of exon exclusion. By examining nucleosomal distributions in all three cell types, we demonstrate that these histone modification patterns cannot solely be accounted for by differences in nucleosome levels between exons and introns. In addition, because of inherent differences between ChIP-chip array and ChIP-sequencing approaches, these platforms report different nucleosome distribution patterns across the human genome. Our findings confound existing views and point to active cellular mechanisms which dynamically regulate histone modification levels and account for exon-intron marking. We believe that these histone modification patterns provide links between chromatin accessibility, Pol II movement and co-transcriptional splicing.

  19. Practical, general parser combinators

    NARCIS (Netherlands)

    A. Izmaylova (Anastasia); A. Afroozeh (Ali); T. van der Storm (Tijs)

    2016-01-01

    textabstractParser combinators are a popular approach to parsing where contextfree grammars are represented as executable code. However, conventional parser combinators do not support left recursion, and can have worst-case exponential runtime. These limitations hinder the expressivity and

  20. Photometric properties of type II supernovae

    Energy Technology Data Exchange (ETDEWEB)

    Barbon, R [Osservatorio Astrofisico, Asiago (Italy); Trieste Univ. (Italy). Instituto di Matematica); Ciatti, F; Rosino, L [Osservatorio Astrofisico, Asiago (Italy); Pavia Univ. (Italy))

    1979-02-01

    An analysis of the available photometric observations for type II supernovae is presented. The possibility of drawing average curves by the fitting method, as previously done for type I supernovae, is indicated. Two basic shapes have been put into evidence, the first one (2/3 of the objects) is characterized by the presence of a plateau at intermediate phase, the second one by an almost linear decline. Average curves have been also built for the intrinsic color indices. Peculiar cases are discussed, including the unusual objects of types III-IV. The mean absolute magnitude at maximum for type II supernovae has been determined about Msub(B) = -16.45 (sigma=0.78), as a calibration for their use as distance indicators. The distribution in different morphological types and luminosity classes of the parent galaxies is briefly discussed.

  1. Phonological Working Memory limitations and Agrammatism: Is there a causal relationship between the two?

    Directory of Open Access Journals (Sweden)

    Maria Varkanitsa

    2014-04-01

    These results are inconsistent with theories assuming a single WM capacity that deals with all types of linguistic information. Rather, this study provides evidence that syntactic parsing relies on different resources from what is typically measured in verbal WM tasks. The four patients with Broca’s aphasia that participated in this study presented with similar phonological WM limitations with only quantitative differences in some tasks, despite the fact that only two of them also had sentence comprehension difficulties. The results also suggest that the agrammatic patients suffer from a specific WM deficit, linked to processing syntactic dependencies, which is not tapped by phonological WM tests.

  2. Production of graphic symbol sentences by individuals with aphasia: efficacy of a computer-based augmentative and alternative communication intervention.

    Science.gov (United States)

    Koul, Rajinder; Corwin, Melinda; Hayes, Summer

    2005-01-01

    The study employed a single-subject multiple baseline design to examine the ability of 9 individuals with severe Broca's aphasia or global aphasia to produce graphic symbol sentences of varying syntactical complexity using a software program that turns a computer into a speech output communication device. The sentences ranged in complexity from simple two-word phrases to those with morphological inflections, transformations, and relative clauses. Overall, results indicated that individuals with aphasia are able to access, manipulate, and combine graphic symbols to produce phrases and sentences of varying degrees of syntactical complexity. The findings are discussed in terms of the clinical and public policy implications.

  3. Divergent approaches to corpus processing: the need for ...

    African Journals Online (AJOL)

    With a good corpus, data can be provided giving an authoritative body of linguistic evidence which can support generalisations and against which hypotheses can be tested. As this proves the invaluable status of a corpus, the article assesses the processing of the Shona corpus and discusses how some aspects of the ...

  4. Multiple Regions of a Cortical Network Commonly Encode the Meaning of Words in Multiple Grammatical Positions of Read Sentences.

    Science.gov (United States)

    Anderson, Andrew James; Lalor, Edmund C; Lin, Feng; Binder, Jeffrey R; Fernandino, Leonardo; Humphries, Colin J; Conant, Lisa L; Raizada, Rajeev D S; Grimm, Scott; Wang, Xixi

    2018-05-16

    Deciphering how sentence meaning is represented in the brain remains a major challenge to science. Semantically related neural activity has recently been shown to arise concurrently in distributed brain regions as successive words in a sentence are read. However, what semantic content is represented by different regions, what is common across them, and how this relates to words in different grammatical positions of sentences is weakly understood. To address these questions, we apply a semantic model of word meaning to interpret brain activation patterns elicited in sentence reading. The model is based on human ratings of 65 sensory/motor/emotional and cognitive features of experience with words (and their referents). Through a process of mapping functional Magnetic Resonance Imaging activation back into model space we test: which brain regions semantically encode content words in different grammatical positions (e.g., subject/verb/object); and what semantic features are encoded by different regions. In left temporal, inferior parietal, and inferior/superior frontal regions we detect the semantic encoding of words in all grammatical positions tested and reveal multiple common components of semantic representation. This suggests that sentence comprehension involves a common core representation of multiple words' meaning being encoded in a network of regions distributed across the brain.

  5. Predicting Patterns of Grammatical Complexity across Language Exam Task Types and Proficiency Levels

    Science.gov (United States)

    Biber, Douglas; Gray, Bethany; Staples, Shelley

    2016-01-01

    In the present article, we explore the extent to which previous research on register variation can be used to predict spoken/written task-type variation as well as differences across score levels in the context of a major standardized language exam (TOEFL iBT). Specifically, we carry out two sets of linguistic analyses based on a large corpus of…

  6. Rapid mineralisation of the herbicide isoproturon in soil from a previously treated Danish agricultural field.

    Science.gov (United States)

    Sørensen, Sebastian R; Aamand, Jens

    2003-10-01

    Mineralisation of the phenylurea herbicide isoproturon (3-(4-isopropylphenyl)-1,1-dimethylurea) and two of its known metabolites, 3-(4-isopropylphenyl)-1-methylurea (monodesmethyl-isoproturon) and 4-isopropylaniline, was studied in Danish agricultural soils with or without previous exposure to isoproturon. A potential for rapid mineralisation of isoproturon and the two metabolites was present in soils sampled from three plots within an agricultural field previously treated regularly with the herbicide, with 34-45%, 51-58% and 33-36% of the added [phenyl-U-14C]isoproturon, [phenyl-U-14C]monodesmethyl-isoproturon and [phenyl-U-14C]4-isopropylaniline metabolised to [14C]carbon dioxide within 30 days at 20 degrees C. In contrast, such extensive mineralisation of these three compounds was not observed within this period in soils sampled from two other agricultural fields without previous treatment with isoproturon. The mineralisation patterns indicated growth-linked metabolism of the three compounds in the previously exposed soils, and doubling times for [14C]carbon dioxide production ranged from 1.6 to 3.2, 1.0 to 2.1 and 1.3 to 1.7 days for isoproturon, monodesmethyl-isoproturon and 4-isopropylaniline, respectively. The ability to mineralise [phenyl-U-14C]isoproturon to [14C]carbon dioxide was successfully sub-cultured to a fresh mineral medium which provided isoproturon as sole source of carbon and nitrogen. One of the soils sampled from an agricultural field not previously treated with isoproturon showed accelerated mineralisation of [phenyl-U-14C]4-isopropylaniline toward the end of the experiment, with a doubling time for [14C]carbon dioxide production of 7.4days. This study indicates that the occurrence of rapid mineralisation of the phenyl ring of isoproturon to carbon dioxide is related to previous exposure to the herbicide, which suggests that microbial adaptation upon repeated isoproturon use may occur within agricultural fields.

  7. JaSlo: Integration of a Japanese-Slovene Bilingual Dictionary with a Corpus Search System

    Directory of Open Access Journals (Sweden)

    HMELJAK SANGAWA, Kristina

    2012-12-01

    Full Text Available The paper presents a set of integrated on-line language resources targeted at Japanese language learners, primarily those whose mother tongue is Slovene. The resources consist of the on-line Japanese-Slovene learners’ dictionary jaSlo and two corpora, a 1 million word Japanese-Slovene parallel corpus and a 300 million word corpus of web pages, where each word and sentence is marked by its difficulty level; this corpus is furthermore available as a set of five distinct corpora, each one containing sentences of the particular level. The corpora are available for exploration through NoSketch Engine, the open source version of the commercial state-of-the-art corpus analysis software Sketch Engine. The dictionary is available for Web searching, and dictionary entries have direct links to examples from the corpora, thus offering a wider picture of a possible translations in concrete contextualised examples, and b monolingual Japanese usage examples of different difficulty levels to support language learning.-----Članek predstavlja japonsko-slovenski slovar jaSlo, spletni slovar za slovensko govoreče učence japonščine, in vključitev primerov iz dveh korpusov s pomočjo odprto-kodnega korpusnega iskalnika NoSketch Engine. Korpusa sta jaSlo (milijon besed, vzporedni korpus japonskih in slovenskih besedil, ki je bil zgrajen za ta namen in vsebuje večinoma literarna, spletna in akademska besedila, ter JpWaC-L (300 milijonov besed, korpus spletnih besedil, razdeljenih v povedi, ki so rangirane po težavnostnih stopnjah. S pregledno povezavo korpusnih primerov in slovarskih iztočnic v dvojezičnem slovarju za učence japonščine kot tujega jezika, ponuja sistem uporabnikom prijazen dostop k slovarskim podatkom, tj. reprezentativnim prevodnim ustreznicam, in korpusnim podatkom, ki ponujajo a širšo sliko možnih prevodnih ustreznic v konkretnih primerih s sobesedilom in b enojezične primere rabe japonskih besed v povedih različnih te

  8. GENPLOT: A formula-based Pascal program for data manipulation and plotting

    Science.gov (United States)

    Kramer, Matthew J.

    Geochemical processes involving alteration, differentiation, fractionation, or migration of elements may be elucidated by a number of discrimination or variation diagrams (e.g., AFM, Harker, Pearce, and many others). The construction of these diagrams involves arithmetic combination of selective elements (involving major, minor, or trace elements). GENPLOT utilizes a formula-based algorithm (an expression parser) which enables the program to manipulate multiparameter databases and plot XY, ternary, tetrahedron, and REE type plots without needing to change either the source code or rearranging databases. Formulae may be any quadratic expression whose variables are the column headings of the data matrix. A full-screen editor with limited equations and arithmetic functions (spreadsheet) has been incorporated into the program to aid data entry and editing. Data are stored as ASCII files to facilitate interchange of data between other programs and computers. GENPLOT was developed in Turbo Pascal for the IBM and compatible computers but also is available in Apple Pascal for the Apple Ile and Ill. Because the source code is too extensive to list here (about 5200 lines of Pascal code), the expression parsing routine, which is central to GENPLOT's flexibility is incorporated into a smaller demonstration program named SOLVE. The following paper includes a discussion on how the expression parser works and a detailed description of GENPLOT's capabilities.

  9. Efficacy of pulsed dye laser treatment for common warts is not influenced by the causative HPV type: a prospective study.

    Science.gov (United States)

    Fichman, Yoseph; Levi, Assi; Hodak, Emmilia; Halachmi, Shlomit; Mazor, Sigal; Wolf, Dana; Caplan, Orit; Lapidoth, Moshe

    2018-05-01

    Verruca vulgaris (VV) is a prevalent skin condition caused by various subtypes of human papilloma virus (HPV). The most common causes of non-genital lesions are HPV types 2 and 4, and to a lesser extent types 1, 3, 26, 29, and 57. Although numerous therapeutic modalities exist, none is universally effective or without adverse events (AE). Pulsed dye laser (PDL) is a favorable option due to its observed efficacy and relatively low AE rate. However, it is not known which verrucae are most likely to respond to PDL, or whether the causative viral subtype influences this response. The objective of this prospective blinded study was to assess whether the HPV subtype was predictive of response to PDL. For that matter, 26 verrucae from 26 immunocompetent patients were biopsied prior to treatment by PDL. HPV coding sequences were isolated and genotyped using PCR analysis. Patients were treated by PDL (595 nm wavelength, 5 mm spot size, 1.5 ms pulse duration, 12 J/cm 2 fluence) once a month for up to 6 months, and clinical response was assessed. Binary logistic regression analysis and linear logistic regression analysis were used in order to evaluate statistical significance. Different types of HPV were identified in 22 of 26 tissue samples. Response to treatment did not correlate with HPV type, age, or gender. As no association between HPV type and response to PDL therapy could be established, it is therefore equally effective for all HPV types and remains a favorable treatment option for all VV.

  10. Additive effect of rikkunshito, an herbal medicine, on chemotherapy-induced nausea, vomiting, and anorexia in uterine cervical or corpus cancer patients treated with cisplatin and paclitaxel: results of a randomized phase II study (JORTC KMP-02).

    Science.gov (United States)

    Ohnishi, Shunsuke; Watari, Hidemichi; Kanno, Maki; Ohba, Yoko; Takeuchi, Satoshi; Miyaji, Tempei; Oyamada, Shunsuke; Nomura, Eiji; Kato, Hidenori; Sugiyama, Toru; Asaka, Masahiro; Sakuragi, Noriaki; Yamaguchi, Takuhiro; Uezono, Yasuhito; Iwase, Satoru

    2017-09-01

    Rikkunshito, an herbal medicine, is widely prescribed in Japan for the treatment of anorexia and functional dyspepsia, and has been reported to recover reductions in food intake caused by cisplatin. We investigated whether rikkunshito could improve chemotherapy-induced nausea and vomiting (CINV) and anorexia in patients treated with cisplatin. Patients with uterine cervical or corpus cancer who were to receive cisplatin (50 mg/m² day 1) and paclitaxel (135 mg/m² day 0) as first-line chemotherapy were randomly assigned to the rikkunshito group receiving oral administration on days 0-13 with standard antiemetics, or the control group receiving antiemetics only. The primary endpoint was the rate of complete control (CC: no emesis, no rescue medication, and no significant nausea) in the overall phase (0-120 hours). Two-tailed panorexia. Copyright © 2017. Asian Society of Gynecologic Oncology, Korean Society of Gynecologic Oncology

  11. Real-time parallel processing of grammatical structure in the fronto-striatal system: a recurrent network simulation study using reservoir computing.

    Science.gov (United States)

    Hinaut, Xavier; Dominey, Peter Ford

    2013-01-01

    Sentence processing takes place in real-time. Previous words in the sentence can influence the processing of the current word in the timescale of hundreds of milliseconds. Recent neurophysiological studies in humans suggest that the fronto-striatal system (frontal cortex, and striatum--the major input locus of the basal ganglia) plays a crucial role in this process. The current research provides a possible explanation of how certain aspects of this real-time processing can occur, based on the dynamics of recurrent cortical networks, and plasticity in the cortico-striatal system. We simulate prefrontal area BA47 as a recurrent network that receives on-line input about word categories during sentence processing, with plastic connections between cortex and striatum. We exploit the homology between the cortico-striatal system and reservoir computing, where recurrent frontal cortical networks are the reservoir, and plastic cortico-striatal synapses are the readout. The system is trained on sentence-meaning pairs, where meaning is coded as activation in the striatum corresponding to the roles that different nouns and verbs play in the sentences. The model learns an extended set of grammatical constructions, and demonstrates the ability to generalize to novel constructions. It demonstrates how early in the sentence, a parallel set of predictions are made concerning the meaning, which are then confirmed or updated as the processing of the input sentence proceeds. It demonstrates how on-line responses to words are influenced by previous words in the sentence, and by previous sentences in the discourse, providing new insight into the neurophysiology of the P600 ERP scalp response to grammatical complexity. This demonstrates that a recurrent neural network can decode grammatical structure from sentences in real-time in order to generate a predictive representation of the meaning of the sentences. This can provide insight into the underlying mechanisms of human cortico

  12. The Dubbing of Gay-themed TV Series in Italy: Corpus-based Evidence of Manipulation and Censorship

    Directory of Open Access Journals (Sweden)

    Annalisa Sandrelli

    2016-02-01

    Full Text Available The manipulation and censorship of taboo and sensitive references in AVT has attracted the interest of several scholars over the last few years, but systematic studies of the impact of such practices on different genres in different countries are still needed. The paper analyses the Italian dubbing of three gay-themed TV series, Queer as Folk, Queer as Folk (US remake, and The L Word. Gay-themed series are less mainstream on Italian TV channels than in the UK and the USA, and gayspeak as a linguistic phenomenon is not well-known to Italian audiences. Consequently, the Italian dubbed dialogues are the product of different degrees of manipulation and even censorship. The paper finds empirical evidence of such alterations in data from the DubTalk corpus and attempts a tentative evaluation of the impact of such practices on the Italian versions of the above-mentioned series.

  13. Non-linguistic Conditions for Causativization as a Linguistic Attractor

    Directory of Open Access Journals (Sweden)

    Johanna Nichols

    2018-01-01

    Full Text Available An attractor, in complex systems theory, is any state that is more easily or more often entered or acquired than departed or lost; attractor states therefore accumulate more members than non-attractors, other things being equal. In the context of language evolution, linguistic attractors include sounds, forms, and grammatical structures that are prone to be selected when sociolinguistics and language contact make it possible for speakers to choose between competing forms. The reasons why an element is an attractor are linguistic (auditory salience, ease of processing, paradigm structure, etc., but the factors that make selection possible and propagate selected items through the speech community are non-linguistic. This paper uses the consonants in personal pronouns to show what makes for an attractor and how selection and diffusion work, then presents a survey of several language families and areas showing that the derivational morphology of pairs of verbs like fear and frighten, or Turkish korkmak ‘fear, be afraid’ and korkutmak ‘frighten, scare’, or Finnish istua ‘sit’ and istutta ‘seat (someone’, or Spanish sentarse ‘sit down’ and sentar ‘seat (someone’ is susceptible to selection. Specifically, the Turkish and Finnish pattern, where ‘seat’ is derived from ‘sit’ by addition of a suffix—is an attractor and a favored target of selection. This selection occurs chiefly in sociolinguistic contexts of what is defined here as linguistic symbiosis, where languages mingle in speech, which in turn is favored by certain demographic, sociocultural, and environmental factors here termed frontier conditions. Evidence is surveyed from northern Eurasia, the Caucasus, North and Central America, and the Pacific and from both modern and ancient languages to raise the hypothesis that frontier conditions and symbiosis favor causativization.

  14. Types of Sentences in EFL Students' Paragraph Assignments: A Quantitative Study on Teaching and Learning Writing at Higher Education Level

    Directory of Open Access Journals (Sweden)

    Syayid Sandi Sukandi

    2017-08-01

    Full Text Available This research investigates Indonesian EFL students writing four types of English sentences in their paragraph writing assignments that were posted online in Writing 1 course of English Education at STKIP PGRI Sumatera Barat. The analysed types of sentences are Simple Sentence (code: S.S., Compound Sentence (code: C.S.1, Complex Sentence (code: C.S.2, and Compound-Complex Sentence (code: C.C.S. The percentage of each type of sentences that appears in the students’ writings within each five genres represents the students’ syntactical composition. Moreover, this research focuses on quantitatively analysing the above five types of sentences that appeared in students’ assignments in each type of following genres: argumentative, descriptive, process, cause-effect, and comparison-contrast. Data are taken from 10% samples of all population. The finding shows that writing Simple Sentence in paragraphs is a common type of sentence that is used by the students. It indicates that the guiding process to teaching students about writing paragraphs with varied sentence types is important for further development of teaching process of writing.

  15. Relations between Formal Linguistic Insecurity and the Perception of Linguistic Insecurity: A Quantitative Study in an Educational Environment at the Valencian Community (Spain)

    Science.gov (United States)

    Baldaqui Escandell, Josep M.

    2011-01-01

    What is the relationship between the awareness of linguistic prestige and the security or insecurity in the use of minoritised languages? Is formal linguistic insecurity (as initially described by Labov) the same as the speakers' perception of linguistic insecurity? Which are the variables related to the various types of linguistic insecurity in…

  16. Learning a generative probabilistic grammar of experience: a process-level model of language acquisition.

    Science.gov (United States)

    Kolodny, Oren; Lotem, Arnon; Edelman, Shimon

    2015-03-01

    We introduce a set of biologically and computationally motivated design choices for modeling the learning of language, or of other types of sequential, hierarchically structured experience and behavior, and describe an implemented system that conforms to these choices and is capable of unsupervised learning from raw natural-language corpora. Given a stream of linguistic input, our model incrementally learns a grammar that captures its statistical patterns, which can then be used to parse or generate new data. The grammar constructed in this manner takes the form of a directed weighted graph, whose nodes are recursively (hierarchically) defined patterns over the elements of the input stream. We evaluated the model in seventeen experiments, grouped into five studies, which examined, respectively, (a) the generative ability of grammar learned from a corpus of natural language, (b) the characteristics of the learned representation, (c) sequence segmentation and chunking, (d) artificial grammar learning, and (e) certain types of structure dependence. The model's performance largely vindicates our design choices, suggesting that progress in modeling language acquisition can be made on a broad front-ranging from issues of generativity to the replication of human experimental findings-by bringing biological and computational considerations, as well as lessons from prior efforts, to bear on the modeling approach. Copyright © 2014 Cognitive Science Society, Inc.

  17. A Risk Assessment System with Automatic Extraction of Event Types

    Science.gov (United States)

    Capet, Philippe; Delavallade, Thomas; Nakamura, Takuya; Sandor, Agnes; Tarsitano, Cedric; Voyatzi, Stavroula

    In this article we describe the joint effort of experts in linguistics, information extraction and risk assessment to integrate EventSpotter, an automatic event extraction engine, into ADAC, an automated early warning system. By detecting as early as possible weak signals of emerging risks ADAC provides a dynamic synthetic picture of situations involving risk. The ADAC system calculates risk on the basis of fuzzy logic rules operated on a template graph whose leaves are event types. EventSpotter is based on a general purpose natural language dependency parser, XIP, enhanced with domain-specific lexical resources (Lexicon-Grammar). Its role is to automatically feed the leaves with input data.

  18. A preparação de material terminológico em língua inglesa por meio de ferramentas linguístico-computacionais Preparation of terminological material in english by means of computational linguistic tools

    Directory of Open Access Journals (Sweden)

    Eduardo Batista da Silva

    2011-06-01

    Full Text Available O objetivo desse estudo é demonstrar, por meio de análise quantitativa e qualitativa, a eficácia de ferramentas linguístico-computacionais na seleção de terminologia para a produção de material terminológico. Serão apresentadas duas ferramentas linguístico-computacionais (WordSmith Tools e VocabProfile e, também, sugestões para que o ensino de termos ofereça resultados práticos. A fundamentação teórico-metodológica recorreu a Barros (2004; Berber Sardinha (2000; 2005; Biderman (2001; Cabré (2007; Cobb (2007; Nation, (2003 e Sinclair (2004. O corpus da pesquisa foi constituído exclusivamente de material escrito na língua inglesa em diversas áreas de especialidade. Os procedimentos de preparação de material terminológico são exemplificados a partir de uma das áreas de especialidades utilizadas nos corpora de pesquisa, as Redes Neurais Artificiais. Os resultados obtidos indicam que a utilização do Wordsmith Tools juntamente com o VocabProfile pode fornecer dados importantes para a pesquisa linguistica.This paper aims to demonstrate by means of quantitative and qualitative analyses the effectiveness of the linguistic computational tools in selecting terminology for the production of terminological material. Two linguistic computational tools will be introduced (WordSmith Tools e VocabProfile and also suggestions so as the teaching of terms may offer practical results. The theoretical-methodological approach relies on Barros (2004; Berber Sardinha (2000; 2005; Biderman (2001; Cabré (2007; Cobb (2007; Nation (2003 and Sinclair (2004. The research corpus was made solely of written material in English in several specialty languages. The procedures regarding terminological material preparation are exemplified with one of the specialty fields used in the research corpus, the Artificial Neural Networks. The obtained results indicate that the use of Wordsmith Tools in conjunction with VocabProfile might provide useful data for

  19. Laparoscopy in the treatment of cancer of the corpus uteri in patients with metabolic syndrome. A case report

    Directory of Open Access Journals (Sweden)

    A. I. Berishvili

    2015-01-01

    Full Text Available Today, the common term metabolic syndrome encompasses visceral (abdominal obesity, glucose intolerance, type 2 diabetes mellitus (DM, hypertension, and dyslipidemia. In Europe, the rate of obesity mong the women ranges from 6 to 36 %. In the USA, 65 % of the adult population is overweight and 30 % is obese. High estrogen production in adipose tissue in patients with obesity has been established to increase the risk of cancer of the corpus uteri (CCU by 4 times as compared to that in normal weight female patients. Furthermore, obese patients are at increased risk for DM that is also a risk factor for CCU. A rise in the number of obese patients leads to the population redistribution of gynecological cancer. The increasing number of patients with gynecological cancer is overweight. This patient group has an increased risk from surgery and anesthesia, a higher incidence of postoperative complications, and delayed recovery. Laparotomy increases the duration of hospital stay and the rate of wound complications. The frequency of urologic injuries in obese patients varies between 2 and 4 %. In this patient category, the rate of damage to the large vessels or bowels has also statistically significant differences. The literature highlights a number of complications specific to patients with obesity: trocar damage to anterior abdominal wall vessels, particularly to the inferior epigastric artery, urinary bladder, as well as trocar site hernia. The authors describe their experience in treating CCU patients with metabolic syndrome at the Department of Surgery for Female Reproductive System Tumors, N. N. Blokhin Russian Cancer Research Center. They demonstrate that laparoscopic extirpation of the uterus and its appendages can be performed in patients with third-degree obesity (weighing 174 kg in Stage I CCU. It should be noted that this weight was previously regarded as an absolute contraindication to surgical treatment. Today, the active

  20. A Danish open-set speech corpus for competing-speech studies

    DEFF Research Database (Denmark)

    Nielsen, Jens Bo; Dau, Torsten; Neher, Tobias

    2014-01-01

    Studies investigating speech-on-speech masking effects commonly use closed-set speech materials such as the coordinate response measure [Bolia et al. (2000). J. Acoust. Soc. Am. 107, 1065-1066]. However, these studies typically result in very low (i.e., negative) speech recognition thresholds (SRTs......) when the competing speech signals are spatially separated. To achieve higher SRTs that correspond more closely to natural communication situations, an open-set, low-context, multi-talker speech corpus was developed. Three sets of 268 unique Danish sentences were created, and each set was recorded...... with one of three professional female talkers. The intelligibility of each sentence in the presence of speech-shaped noise was measured. For each talker, 200 approximately equally intelligible sentences were then selected and systematically distributed into 10 test lists. Test list homogeneity was assessed...

  1. A semi-supervised learning framework for biomedical event extraction based on hidden topics.

    Science.gov (United States)

    Zhou, Deyu; Zhong, Dayou

    2015-05-01

    Scientists have devoted decades of efforts to understanding the interaction between proteins or RNA production. The information might empower the current knowledge on drug reactions or the development of certain diseases. Nevertheless, due to the lack of explicit structure, literature in life science, one of the most important sources of this information, prevents computer-based systems from accessing. Therefore, biomedical event extraction, automatically acquiring knowledge of molecular events in research articles, has attracted community-wide efforts recently. Most approaches are based on statistical models, requiring large-scale annotated corpora to precisely estimate models' parameters. However, it is usually difficult to obtain in practice. Therefore, employing un-annotated data based on semi-supervised learning for biomedical event extraction is a feasible solution and attracts more interests. In this paper, a semi-supervised learning framework based on hidden topics for biomedical event extraction is presented. In this framework, sentences in the un-annotated corpus are elaborately and automatically assigned with event annotations based on their distances to these sentences in the annotated corpus. More specifically, not only the structures of the sentences, but also the hidden topics embedded in the sentences are used for describing the distance. The sentences and newly assigned event annotations, together with the annotated corpus, are employed for training. Experiments were conducted on the multi-level event extraction corpus, a golden standard corpus. Experimental results show that more than 2.2% improvement on F-score on biomedical event extraction is achieved by the proposed framework when compared to the state-of-the-art approach. The results suggest that by incorporating un-annotated data, the proposed framework indeed improves the performance of the state-of-the-art event extraction system and the similarity between sentences might be precisely

  2. Development of patient-centric linguistically tailored psychoeducational messages to support nutrition and medication self-management in type 2 diabetes: a feasibility study

    Directory of Open Access Journals (Sweden)

    Ellis RJB

    2014-10-01

    Full Text Available Rebecca J Bartlett Ellis,1 Ulla Connor,2 James Marshall21Indiana University School of Nursing, 2Indiana University School of Liberal Arts, International Center for Intercultural Communication, Indiana University, Indianapolis, IN, USA Purpose: This study evaluated the feasibility of developing linguistically tailored educational messages designed to match the linguistic styles of patients segmented into types with the Descriptor™, and to determine patient preferences for tailored or standard messages based on their segments. Patients and methods: Twenty patients with type 2 diabetes (T2DM were recruited from a diabetes health clinic. Participants were segmented using the Descriptor™, a language-based questionnaire, to identify patient types based on their control orientation (internal/external, agency (high/low, and affect (positive/negative, which are well studied constructs related to T2DM self-management. Two of the seven self-care behaviors described by the American Association of Diabetes Educators (healthy eating and taking medication were used to develop standard messages and then linguistically tailored using features of the six different construct segment types of the Descriptor™. A subset of seven participants each provided feedback on their preference for standard or linguistically tailored messages; 12 comparisons between standard and tailored messages were made. Results: Overall, the tailored messages were preferred to the standard messages. When the messages were matched to specific construct segment types, the tailored messages were preferred over the standard messages, although this was not statistically significant. Conclusion: Linguistically tailoring messages based on construct segments is feasible. Furthermore, tailored messages were more often preferred over standard messages. This study provides some preliminary evidence for tailoring messages based on the linguistic features of control orientation, agency, and affect

  3. Damage to Broca’s area OR the anterior temporal lobe is implicated in stroke-induced agrammatic comprehension: it depends on the task

    Directory of Open Access Journals (Sweden)

    Corianne Rogalsky

    2015-04-01

    Full Text Available The neurobiology of sentence comprehension remains unresolved. Previous large-scale studies of stroke patients have yielded conflicting results regarding sentence comprehension, implicating inferior frontal, anterior temporal and/or posterior temporal regions (Dronkers et al., 2004; Magnusdottir et al., 2013; Thothathiri et al. 2012. Furthermore, only one large-scale lesion study (Magnusdottir et al. 2013 has examined the neural underpinnings of agrammatic comprehension (i.e. substantially worse performance on sentences with noncanonical word orders compared to canonical word order sentences in English, a hallmark of Broca’s aphasia. This one previous study of noncanonical < canonical sentence performance on a sentence picture-matching task implicated damage to the left anterior temporal lobe (ATL and to a lesser degree Broca’s area damage (i.e. < 10% of significant voxels (Magnusdottir et al. 2013. The present study investigated the neurobiology of agrammatic comprehension with two sentence comprehension tasks in the MARC test battery: a sentence-picture matching task (the SOAP Test: a test of syntactic complexity; Love & Oster, 2002 and a sentence plausibility judgment task. Each task contained active, passive, subject-relative and object-relative sentences. Participants included 91 patients with chronic focal cerebral damage. First, we conducted voxel-based lesion symptom mapping (VLSM; Bates et al. 2003 for each sentence type in each task. Consistent with previous studies (Magnusdottir et al. 2013; Thothathiri et al. 2012, the VLSMs identified a significant association between sentence comprehension impairments and damage to a large left temporal-inferior parietal network for all sentences (peak t values were in posterior temporal and inferior parietal voxels; no areas of frontal lobe damage were significant for any sentence type/task. We then conducted VLSMs to identify areas of damage associated specifically with agrammatic

  4. ASD Is Not DLI: Individuals With Autism and Individuals With Syntactic DLI Show Similar Performance Level in Syntactic Tasks, but Different Error Patterns

    Directory of Open Access Journals (Sweden)

    Nufar Sukenik

    2018-04-01

    Full Text Available Do individuals with autism have a developmental syntactic impairment, DLI (formerly known as SLI? In this study we directly compared the performance of 18 individuals with Autism Spectrum Disorder (ASD aged 9;0–18;0 years with that of 93 individuals with Syntactic-Developmental Language Impairment (SyDLI aged 8;8–14;6 (and with 166 typically-developing children aged 5;2–18;1. We tested them using three syntactic tests assessing the comprehension and production of syntactic structures that are known to be sensitive to syntactic impairment: elicitation of subject and object relative clauses, reading and paraphrasing of object relatives, and repetition of complex syntactic structures including Wh questions, relative clauses, topicalized sentences, sentences with verb movement, sentences with A-movement, and embedded sentences. The results were consistent across the three tasks: the overall rate of correct performance on the syntactic tasks is similar for the children with ASD and those with SyDLI. However, once we look closer, they are very different. The types of errors of the ASD group differ from those of the SyDLI group—the children with ASD provide various types of pragmatically infelicitous responses that are not evinced in the SyDLI or in the age equivalent typically-developing groups. The two groups (ASD and SyDLI also differ in the pattern of performance—the children with SyDLI show a syntactically-principled pattern of impairment, with selective difficulty in specific sentence types (such as sentences derived by movement of the object across the subject, and normal performance on other structures (such as simple sentences. In contrast, the ASD participants showed generalized low performance on the various sentence structures. Syntactic performance was far from consistent within the ASD group. Whereas all ASD participants had errors that can originate in pragmatic/discourse difficulties, seven of them had completely normal syntax

  5. ASD Is Not DLI: Individuals With Autism and Individuals With Syntactic DLI Show Similar Performance Level in Syntactic Tasks, but Different Error Patterns.

    Science.gov (United States)

    Sukenik, Nufar; Friedmann, Naama

    2018-01-01

    Do individuals with autism have a developmental syntactic impairment, DLI (formerly known as SLI)? In this study we directly compared the performance of 18 individuals with Autism Spectrum Disorder (ASD) aged 9;0-18;0 years with that of 93 individuals with Syntactic-Developmental Language Impairment (SyDLI) aged 8;8-14;6 (and with 166 typically-developing children aged 5;2-18;1). We tested them using three syntactic tests assessing the comprehension and production of syntactic structures that are known to be sensitive to syntactic impairment: elicitation of subject and object relative clauses, reading and paraphrasing of object relatives, and repetition of complex syntactic structures including Wh questions, relative clauses, topicalized sentences, sentences with verb movement, sentences with A-movement, and embedded sentences. The results were consistent across the three tasks: the overall rate of correct performance on the syntactic tasks is similar for the children with ASD and those with SyDLI. However, once we look closer, they are very different. The types of errors of the ASD group differ from those of the SyDLI group-the children with ASD provide various types of pragmatically infelicitous responses that are not evinced in the SyDLI or in the age equivalent typically-developing groups. The two groups (ASD and SyDLI) also differ in the pattern of performance-the children with SyDLI show a syntactically-principled pattern of impairment, with selective difficulty in specific sentence types (such as sentences derived by movement of the object across the subject), and normal performance on other structures (such as simple sentences). In contrast, the ASD participants showed generalized low performance on the various sentence structures. Syntactic performance was far from consistent within the ASD group. Whereas all ASD participants had errors that can originate in pragmatic/discourse difficulties, seven of them had completely normal syntax in the structures we

  6. THE FUNCTION OF ALBANIAN AND ENGLISH SENTENCE

    OpenAIRE

    Shkelqim Millaku

    2017-01-01

    A simple sentence consists of a single independent clause. A multiple sentence contains one or more clauses as its immediate constituents. Multiple sentences are either compound or complex. In a compound sentence the immediate constituents are two or more coordinate clause. In a complex sentence one or more of its elements, such as direct object or adverbial, are realized by a subordinate.[1] Simple sentence may be divided into four major syntactic classes, whose use correlates with different...

  7. Assessing the performance under ionising radiation of lead tungstate scintillators for EM calorimetry in the CLAS12 Forward Tagger

    Science.gov (United States)

    Fegan, S.; Auffray, E.; Battaglieri, M.; Buchanan, E.; Caiffi, B.; Celentano, A.; Colaneri, L.; D`Angelo, A.; De Vita, R.; Dormenev, V.; Fanchini, E.; Lanza, L.; Novotny, R. W.; Parodi, F.; Rizzo, A.; Sokhan, D.; Tarasov, I.; Zonta, I.

    2015-07-01

    The well-established technology of electromagnetic calorimetry using Lead Tungstate crystals has recently seen an upheaval, with the closure of one of the most experienced large-scale suppliers of such crystals, the Bogoroditsk Technical Chemical Plant (BTCP), which was instrumental in the development of mass production procedures for PWO-II, the current benchmark for this scintillator. Obtaining alternative supplies of Lead Tungstate crystals matching the demanding specifications of contemporary calorimeter devices now presents a significant challenge to detector research and development programmes. In this paper we describe a programme of assessment carried out for the selection, based upon the performance under irradiation, of Lead Tungstate crystals for use in the Forward Tagger device, part of the CLAS12 detector in Hall B at Jefferson Lab. The crystals tested were acquired from SICCAS, the Shanghai Institute of Ceramics, Chinese Academy of Sciences. The tests performed are intended to maximise the performance of the detector within the practicalities of the crystal manufacturing process. Results of light transmission, before and after gamma ray irradiation, are presented and used to calculate dk, the induced radiation absorption coefficient, at 420 nm, the peak of the Lead Tungstate emission spectrum. Results for the SICCAS crystals are compared with identical measurements carried out on Bogoroditsk samples, which were acquired for the Forward Tagger development program before the closure of the facility. Also presented are a series of tests performed to determine the feasibility of recovering radiation damage to the crystals using illumination from an LED, with such illumination available in the Forward Tagger from a light monitoring system integral to the detector.

  8. Assessing the performance under ionising radiation of lead tungstate scintillators for EM calorimetry in the CLAS12 Forward Tagger

    Energy Technology Data Exchange (ETDEWEB)

    Fegan, S., E-mail: fegan@ge.infn.it [Istituto Nazionale di Fisica Nucleare, Sezione di Genova and Dipartimento di Fisica dell' Universitá, Via Dodecaneso 33, 16146 Genova (Italy); Auffray, E. [CERN, European Organisation for Nuclear Research, Geneva (Switzerland); Battaglieri, M. [Istituto Nazionale di Fisica Nucleare, Sezione di Genova and Dipartimento di Fisica dell' Universitá, Via Dodecaneso 33, 16146 Genova (Italy); Buchanan, E. [University of Glasgow, Glasgow G12 8QQ (United Kingdom); Caiffi, B.; Celentano, A. [Istituto Nazionale di Fisica Nucleare, Sezione di Genova and Dipartimento di Fisica dell' Universitá, Via Dodecaneso 33, 16146 Genova (Italy); Colaneri, L.; D' Angelo, A. [Istituto Nazionale di Fisica Nucleare, Sezione Roma2 Tor Vergata and Università degli studi di Roma Tor Vergata, Via Scientifica 1, 00133 Roma (Italy); De Vita, R. [Istituto Nazionale di Fisica Nucleare, Sezione di Genova and Dipartimento di Fisica dell' Universitá, Via Dodecaneso 33, 16146 Genova (Italy); Dormenev, V. [II. Physikalisches Institut, Universität Gießen, 35392 Gießen (Germany); Fanchini, E. [Istituto Nazionale di Fisica Nucleare, Sezione di Genova and Dipartimento di Fisica dell' Universitá, Via Dodecaneso 33, 16146 Genova (Italy); Lanza, L. [Istituto Nazionale di Fisica Nucleare, Sezione Roma2 Tor Vergata and Università degli studi di Roma Tor Vergata, Via Scientifica 1, 00133 Roma (Italy); Novotny, R.W. [II. Physikalisches Institut, Universität Gießen, 35392 Gießen (Germany); and others

    2015-07-21

    The well-established technology of electromagnetic calorimetry using Lead Tungstate crystals has recently seen an upheaval, with the closure of one of the most experienced large-scale suppliers of such crystals, the Bogoroditsk Technical Chemical Plant (BTCP), which was instrumental in the development of mass production procedures for PWO-II, the current benchmark for this scintillator. Obtaining alternative supplies of Lead Tungstate crystals matching the demanding specifications of contemporary calorimeter devices now presents a significant challenge to detector research and development programmes. In this paper we describe a programme of assessment carried out for the selection, based upon the performance under irradiation, of Lead Tungstate crystals for use in the Forward Tagger device, part of the CLAS12 detector in Hall B at Jefferson Lab. The crystals tested were acquired from SICCAS, the Shanghai Institute of Ceramics, Chinese Academy of Sciences. The tests performed are intended to maximise the performance of the detector within the practicalities of the crystal manufacturing process. Results of light transmission, before and after gamma ray irradiation, are presented and used to calculate dk, the induced radiation absorption coefficient, at 420 nm, the peak of the Lead Tungstate emission spectrum. Results for the SICCAS crystals are compared with identical measurements carried out on Bogoroditsk samples, which were acquired for the Forward Tagger development program before the closure of the facility. Also presented are a series of tests performed to determine the feasibility of recovering radiation damage to the crystals using illumination from an LED, with such illumination available in the Forward Tagger from a light monitoring system integral to the detector.

  9. nab-Paclitaxel in Combination with Carboplatin for a Previously Treated Thymic Carcinoma

    Directory of Open Access Journals (Sweden)

    Go Makimoto

    2014-01-01

    Full Text Available We present the case of a 40-year-old man with previously treated thymic carcinoma, complaining of gradually worsening back pain. Computed tomography scans of the chest showed multiple pleural disseminated nodules with a pleural effusion in the right thorax. The patient was treated with carboplatin on day 1 plus nab-paclitaxel on day 1 and 8 in cycles repeated every 4 weeks. Objective tumor shrinkage was observed after 4 cycles of this regimen. In addition, the elevated serum cytokeratin 19 fragment level decreased, and the patient's back pain was relieved without any analgesics. Although he experienced grade 4 neutropenia and granulocyte colony-stimulating factor (G-CSF injection, the severity of thrombocytopenia and nonhematological toxicities such as reversible neuropathy did not exceed grade 1 during the treatment. To our knowledge, this is the first report to demonstrate the efficacy of combination chemotherapy consisting of carboplatin and nab-paclitaxel against thymic carcinoma. This case report suggests that nab-paclitaxel in combination with carboplatin can be a favorable chemotherapy regimen for advanced thymic carcinoma.

  10. Unraveling the linguistic nature of specific autobiographical memories using a computerized classification algorithm.

    Science.gov (United States)

    Takano, Keisuke; Ueno, Mayumi; Moriya, Jun; Mori, Masaki; Nishiguchi, Yuki; Raes, Filip

    2017-06-01

    In the present study, we explored the linguistic nature of specific memories generated with the Autobiographical Memory Test (AMT) by developing a computerized classifier that distinguishes between specific and nonspecific memories. The AMT is regarded as one of the most important assessment tools to study memory dysfunctions (e.g., difficulty recalling the specific details of memories) in psychopathology. In Study 1, we utilized the Japanese corpus data of 12,400 cue-recalled memories tagged with observer-rated specificity. We extracted linguistic features of particular relevance to memory specificity, such as past tense, negation, and adverbial words and phrases pertaining to time and location. On the basis of these features, a support vector machine (SVM) was trained to classify the memories into specific and nonspecific categories, which achieved an area under the curve (AUC) of .92 in a performance test. In Study 2, the trained SVM was tested in terms of its robustness in classifying novel memories (n = 8,478) that were retrieved in response to cue words that were different from those used in Study 1. The SVM showed an AUC of .89 in classifying the new memories. In Study 3, we extended the binary SVM to a five-class classification of the AMT, which achieved 64%-65% classification accuracy, against the chance level (20%) in the performance tests. Our data suggest that memory specificity can be identified with a relatively small number of words, capturing the universal linguistic features of memory specificity across memories in diverse contents.

  11. Sirenomelia with agenesis of corpus callosum.

    Science.gov (United States)

    Shirani, Shapour; Rekabi, Vahab; Kamalian, Naser

    2006-07-01

    Sirenomelia is a very rare anomaly presented with fusion of the lower limbs. Genitourinary, neural tube, and vertebral anomalies are found in most cases. We report a case of sirenomelia with agenesis of corpus callosum, which has not been reported previously.

  12. Applying Corpus-Based Findings to Form-Focused Instruction: The Case of Reported Speech

    Science.gov (United States)

    Barbieri, Federica; Eckhardt, Suzanne E. B.

    2007-01-01

    Arguing that the introduction of corpus linguistics in teaching materials and the language classroom should be informed by theories and principles of SLA, this paper presents a case study illustrating how corpus-based findings on reported speech can be integrated into a form-focused model of instruction. After overviewing previous work which…

  13. A human cytochrome P-450 is recognized by anti-liver/kidney microsome antibodies in autoimmune chronic hepatitis.

    Science.gov (United States)

    Kiffel, L; Loeper, J; Homberg, J C; Leroux, J P

    1989-02-28

    1- Anti-liver/kidney microsome autoantibodies type 1 (anti-LKM1), observed in some children with chronic active hepatitis, were used to isolate their antigen in human liver microsomes. A protein, called P-LKM1 was thus purified. This protein was recognized by a rabbit antiserum directed against the related human cytochromes P-450 bufI and P-450 bufII. 2- A human liver microsomal protein immunoprecipitated with anti-LKM1 sera was also recognized by anti cytochromes P-450 bufI/II antibodies. 3- Anti-LKM1 antibodies potently inhibited microsomal bufuralol 1'-hydroxylation. These results displayed the possible identity between cytochrome P-450 bufI/II and LKM1 antigen.

  14. Keeleliste elulugude uurimisvõimalusi: Dagmar Normeti mitmekeelne lapsepõlv Eestis. Possibilities of Research on Linguistic Biographies: Dagmar Normet, a Multilingual Childhood in Estonia

    Directory of Open Access Journals (Sweden)

    Anna Verschik

    2012-04-01

    Full Text Available Recently the investigation of linguistic biographies has become popular among linguists for several reasons. Instead of studying formally-oriented, traditional approaches to second language acquisition and language learning, such research focuses on an individual’s conceptualisation of languages, language acquisition and living with and among multiple languages. Linguistic biographies can be either oral or written narratives, elicited by a researcher or produced by individuals. This includes language-learning memoirs as well. As some studies have demonstrated, a closer look at a linguistic history of a particular individual helps to discover new aspects that generally remain unnoticed in formally-oriented studies, such as the speaker’s personal attitudes, emotions attached to his/ her languages, self-expression in different languages, and instances of multilingual speech (for example, cross-linguistic influence, code-switching, etc.. However, a multilingual person’s narratives, either in written or oral form, should be treated with caution. It has been demonstrated in recent studies that grounded theory approach (i.e., coding and establishing emergent categories and content analysis alone cannot present a full picture of a linguistic biography. As Pavlenko (2007 argues, at least three kinds of reality should be considered: subject reality (how the narrator sees his/her life with multiple languages, text reality (that is, how the text of narration is structured, in what order events are presented and life reality (biographical facts. As in fieldwork in general, a researcher should be prepared to face discrepancies between the picture presented by the informant and other types of reality. From a methodological point of view, an informant should be interviewed several times in his/her different languages or, at the very least; a researcher should be familiar with the languages. In this sense, the European tradition of linguistic biographies

  15. Displays of authority in the clinical consultation: a linguistic ethnographic study of the electronic patient record.

    Science.gov (United States)

    Swinglehurst, Deborah

    2014-10-01

    The introduction of computers into general practice settings has profoundly changed the dynamics of the clinical consultation. Previous research exploring the impact of the computer (in what has been termed the 'triadic' consultation) has shown that computer use and communication between doctor and patient are intricately coordinated and inseparable. Swinglehurst et al. have recently been critical of the ongoing tendency within health communication research to focus on 'the computer' as a relatively simple 'black box', or as a material presence in the consultation. By re-focussing on the electronic patient record (EPR) and conceptualising this as a complex collection of silent but consequential voices, they have opened up new and more nuanced possibilities for analysis. This orientation makes visible a tension between the immediate contingencies of the interaction as it unfolds moment-by-moment and the more standardised, institutional demands which are embedded in the EPR ('dilemma of attention'). In this paper I extend this work, presenting an in-depth examination of how participants in the consultation manage this tension. I used linguistic ethnographic methods to study 54 video recorded consultations from a dataset collected between 2007 and 2008 in two UK general practices, combining microanalysis of the consultation with ethnographic attention to the wider organisational and institutional context. My analysis draws on the theoretical work of Erving Goffman and Mikhail Bakhtin, incorporating attention to the 'here and now' of the interaction as well as an appreciation of the 'distributed' nature of the EPR, its role in hosting and circulating new voices, and in mediating participants' talk and social practices. It reveals - in apparently fleeting moments of negotiation and contestation - the extent to which the EPR shapes the dynamic construction, display and circulation of authority in the contemporary consultation. Copyright © 2014 The Author. Published by

  16. 25 CFR 115.422 - As a custodial parent, the legal guardian, the person who BIA has recognized as having control...

    Science.gov (United States)

    2010-04-01

    ... BIA has recognized as having control and custody of the minor, or an emancipated minor, what are your... BIA has recognized as having control and custody of the minor, or an emancipated minor, what are your... legal guardian, the person who BIA has recognized as having control and custody of the minor, or an...

  17. A Linguistic Study of Courtship as a Rule-Bound Social Institution in Pride and Prejudice

    Directory of Open Access Journals (Sweden)

    Akram Pouralifard

    2017-11-01

    Full Text Available The focus of this research is in the area of the relationship between linguistics and the Victorian literature. Such a study is important in order to demonstrate how the masterpieces of Victorian literature possess the potential to be studied according to the principles of linguistics and how the motives behind many characters’ activities can be determined by recourse to linguistics. The findings from this research provide evidence that all human activities follow the same rules which all the human languages are based on and according to which they all function due to their common root in the human mind. The main conclusion drawn from this study is that linguistic principles can constitute a great methodology for determining the real motives behind many activities done by the humans. This paper recommends that linguistic principles can be an excellent methodology by means of which the researchers can study the literary works of other eras in the English literature.

  18. Processing Polarity: How the Ungrammatical Intrudes on the Grammatical

    Science.gov (United States)

    Vasishth, Shravan; Brussow, Sven; Lewis, Richard L.; Drenhaus, Heiner

    2008-01-01

    A central question in online human sentence comprehension is, "How are linguistic relations established between different parts of a sentence?" Previous work has shown that this dependency resolution process can be computationally expensive, but the underlying reasons for this are still unclear. This article argues that dependency…

  19. Impact of SNR, masker type and noise reduction processing on sentence recognition performance and listening effort as indicated by the pupil dilation response

    DEFF Research Database (Denmark)

    Ohlenforst, Barbara; Wendt, Dorothea; Kramer, Sophia E

    2018-01-01

    Recent studies have shown that activating the noise reduction scheme in hearing aids results in a smaller peak pupil dilation (PPD), indicating reduced listening effort, at 50% and 95% correct sentence recognition with a 4-talker masker. The objective of this study was to measure the effect...... of the noise reduction scheme (on or off) on PPD and sentence recognition across a wide range of signal-to-noise ratios (SNRs) from +16 dB to -12 dB and two masker types (4-talker and stationary noise). Relatively low PPDs were observed at very low (-12 dB) and very high (+16 dB to +8 dB) SNRs presumably due...... to 'giving up' and 'easy listening', respectively. The maximum PPD was observed with SNRs at approximately 50% correct sentence recognition. Sentence recognition with both masker types was significantly improved by the noise reduction scheme, which corresponds to the shift in performance from SNR function...

  20. Combustion of a Pb(II)-loaded olive tree pruning used as biosorbent

    Energy Technology Data Exchange (ETDEWEB)

    Ronda, A., E-mail: alirg@ugr.es [Department of Chemical Engineering, University of Granada, 18071 Granada (Spain); Della Zassa, M. [Department of Industrial Engineering, University of Padua, 35131 Padova (Italy); Martín-Lara, M.A.; Calero, M. [Department of Chemical Engineering, University of Granada, 18071 Granada (Spain); Canu, P. [Department of Industrial Engineering, University of Padua, 35131 Padova (Italy)

    2016-05-05

    Highlights: • The fate of Pb during combustion at two scales of investigation was studied. • Results from combustion in a flow reactor and in the thermobalance were consistent. • The Pb contained in the solid remained in the ashes. • The Pb does not interfere in the use of OTP as fuel. • The combustion of Pb(II)-loaded OTP does not cause environmental hazards. - Abstract: The olive tree pruning is a specific agroindustrial waste that can be successfully used as adsorbent, to remove Pb(II) from contaminated wastewater. Its final incineration has been studied in a thermobalance and in a laboratory flow reactor. The study aims at evaluating the fate of Pb during combustion, at two different scales of investigation. The flow reactor can treat samples approximately 10{sup 2} larger than the conventional TGA. A detailed characterization of the raw and Pb(II)-loaded waste, before and after combustion is presented, including analysis of gas and solids products. The Pb(II)-loaded olive tree pruning has been prepared by a previous biosorption step in a lead solution, reaching a concentration of lead of 2.3 wt%. Several characterizations of the ashes and the mass balances proved that after the combustion, all the lead presents in the waste remained in ashes. Combustion in a flow reactor produced results consistent with those obtained in the thermobalance. It is thus confirmed that the combustion of Pb(II)-loaded olive tree pruning is a viable option to use it after the biosorption process. The Pb contained in the solid remained in the ashes, preventing possible environmental hazards.