Language understanding is essential for intelligent information processing. Processing of language itself involves configuration element analysis, syntactic analysis (parsing), and semantic analysis, which are not carried out in isolation. These analyses are described for the Japanese language, and their use in understanding systems is examined. 30 references.
This book discusses the following: Computational Linguistics, Artificial Intelligence, Linguistics, Philosophy, and Cognitive Science and the current state of natural language understanding. Three topics form the focus for discussion; these topics include aspects of grammars, aspects of semantics/pragmatics, and knowledge representation.
Waltz, D. L.; Maran, L. R.; Dorfman, M. H.; Dinitz, R.; Farwell, D.
During this contract period the authors have: (1) continued investigation of events and actions by means of representation schemes called 'event shape diagrams'; (2) written a parsing program which selects appropriate word and sentence meanings by a parallel process known as activation and inhibition; (3) begun investigation of the point of a story or event by modeling the motivations and emotional behaviors of story characters; (4) started work on combining and translating two machine-readable dictionaries into a lexicon and knowledge base which will form an integral part of our natural language understanding programs; (5) made substantial progress toward a general model for the representation of cognitive relations by comparing English scene and event descriptions with similar descriptions in other languages; (6) constructed a general model for the representation of tense and aspect of verbs; (7) made progress toward the design of an integrated robotics system which accepts English requests, and uses visual and tactile inputs in making decisions and learning new tasks.
Distant supervision is a recent trend in information extraction. Distantly-supervised extractors are trained using a corpus of unlabeled text...consists of fill-in-the-blank natural language questions such as “Incan emperor ” or “Cunningham directed Auchtre’s second music video .” These questions...with an unknown knowledge base, simultaneously learning how to semantically parse language and populate the knowledge base. The weakly
how a Concept specializes its subsumer. |C|ANIMAL, |C|PLANT, |C|PERSON, and |C|UNICORN are natural kinds, and so will need a PrimitiveClass. As...build this proof, we must build a proof of p x (p X n) steps. The size of the proofs grows exponentially with the depth of nesting. This is clearly
fundamental to knowledge management problems. In [Wijaya13] presented a novel approach to this ontology alignment problem that employs a very large natural...to them. This report is the result of contracted fundamental research deemed exempt from public affairs security and policy review in accordance...S / ALEKSEY PANASYUK MICHAEL J. WESSING Work Unit Manager Deputy Chief, Information Intelligence Systems & Analysis Division Information
interpretation would not be too bad if one were to believe that a frame "is intended to represent a 'stereotypical situation'" ( , p. 48). We...natural kind-like concepts - some form of definitional structuring is necessary. The internal structure of non-atomic concepts (e.g., proximate genus ...types of beer, bottles of wine, etc.; <x> need not be any sort of 'natural genus.' For example, in Dll the definite pronoun "them" is not meant to I
facilities. BBN is developing a series of increasingly sophisticated natural language understanding systems which will serve as an integrated interface...Haas, A.R. A Syntactic Theory of Belief and Action. Artificial Intelligence. 1986. Forthcoming.  Hinrichs, E. Temporale Anaphora im Englischen
conversational agent with information exchange disabled until the end of the experiment run. The meaning of the indicator in the top-right of the agent... Human Computer Collaboration at the Edge: Enhancing Collective Situation Understanding with Controlled Natural Language Alun Preece∗, William...email: PreeceAD@cardiff.ac.uk †Emerging Technology Services, IBM United Kingdom Ltd, Hursley Park, Winchester, UK ‡US Army Research Laboratory, Human
Massaro, Dominic W
I review 2 seminal research reports published in this journal during its second decade more than a century ago. Given psychology's subdisciplines, they would not normally be reviewed together because one involves reading and the other speech perception. The small amount of interaction between these domains might have limited research and theoretical progress. In fact, the 2 early research reports revealed common processes involved in these 2 forms of language processing. Their illustration of the role of Wundt's apperceptive process in reading and speech perception anticipated descriptions of contemporary theories of pattern recognition, such as the fuzzy logical model of perception. Based on the commonalities between reading and listening, one can question why they have been viewed so differently. It is commonly believed that learning to read requires formal instruction and schooling, whereas spoken language is acquired from birth onward through natural interactions with people who talk. Most researchers and educators believe that spoken language is acquired naturally from birth onward and even prenatally. Learning to read, on the other hand, is not possible until the child has acquired spoken language, reaches school age, and receives formal instruction. If an appropriate form of written text is made available early in a child's life, however, the current hypothesis is that reading will also be learned inductively and emerge naturally, with no significant negative consequences. If this proposal is true, it should soon be possible to create an interactive system, Technology Assisted Reading Acquisition, to allow children to acquire literacy naturally.
Sodiya, Adesina Simon
Natural languages are the latest generation of programming languages, which require processing real human natural expressions. Over the years, several groups of researchers have been trying to develop widely accepted natural languages based on artificial intelligence (AI), but no true natural language has been developed. The goal of this work is to design a natural language preprocessing architecture that identifies and accepts programming instructions or sentences in their natural forms ...
Wächter, Mirko; Ovchinnikova, Ekaterina; Wittenbeck, Valerij
We propose an approach for instructing a robot using natural language to solve complex tasks in a dynamic environment. In this study, we elaborate on a framework that allows a humanoid robot to understand natural language, derive symbolic representations of its sensorimotor experience, generate....... The framework is implemented within the robot development environment ArmarX. We evaluate the framework on the humanoid robot ARMAR-III in the context of two experiments: a demonstration of the real execution of a complex task in the kitchen environment on ARMAR-III and an experiment with untrained users...
Sharp, J.K. [Sandia National Labs., Albuquerque, NM (United States)]
This seminar describes a process and methodology that uses structured natural language to enable the construction of precise information requirements directly from users, experts, and managers. The main focus of this natural language approach is to create the precise information requirements and to do it in such a way that the business and technical experts are fully accountable for the results. These requirements can then be implemented using appropriate tools and technology. This requirement set is also a universal learning tool because it has all of the knowledge that is needed to understand a particular process (e.g., expense vouchers, project management, budget reviews, tax, laws, machine function).
Byers-Heinlein, Krista; Chen, Ke Heng; Xu, Fei
Languages function as independent and distinct conventional systems, and so each language uses different words to label the same objects. This study investigated whether 2-year-old children recognize that speakers of their native language and speakers of a foreign language do not share the same knowledge. Two groups of children unfamiliar with Mandarin were tested: monolingual English-learning children (n=24) and bilingual children learning English and another language (n=24). An English speaker taught children the novel label fep. On English mutual exclusivity trials, the speaker asked for the referent of a novel label (wug) in the presence of the fep and a novel object. Both monolingual and bilingual children disambiguated the reference of the novel word using a mutual exclusivity strategy, choosing the novel object rather than the fep. On similar trials with a Mandarin speaker, children were asked to find the referent of a novel Mandarin label kuò. Monolinguals again chose the novel object rather than the object with the English label fep, even though the Mandarin speaker had no access to conventional English words. Bilinguals did not respond systematically to the Mandarin speaker, suggesting that they had enhanced understanding of the Mandarin speaker's ignorance of English words. The results indicate that monolingual children initially expect words to be conventionally shared across all speakers, native and foreign. Early bilingual experience facilitates children's discovery of the nature of foreign language words. Copyright © 2013 Elsevier Inc. All rights reserved.
easily transformed into a regrettable mistake (don’t cry over spilt milk) if G is not characterized as a fleeting goal and a recovery plan therefore...technical literature is characterized by very dry and literal language. If there is one place where metaphors might not intrude, it must be when people...from the point of view of both evidential support and falsification? I ask it because you didn’t say anything about it. A: Well, I think there’s a lot
Mazlack, L.J.; Paz, N.M.
Newspaper cartoons can graphically display the result of ambiguity in human speech; the result can be unexpected and funny. Likewise, computer analysis of natural language statements also needs to successfully resolve ambiguous situations. Computer techniques already developed use restricted world knowledge in resolving ambiguous language use. This paper illustrates how these techniques can be used in resolving ambiguous situations arising in cartoons. 8 references.
Hirschberg, Julia; Manning, Christopher D
Natural language processing employs computational techniques for the purpose of learning, understanding, and producing human language content. Early computational approaches to language research focused on automating the analysis of the linguistic structure of language and developing basic technologies such as machine translation, speech recognition, and speech synthesis. Today's researchers refine and make use of such tools in real-world applications, creating spoken dialogue systems and speech-to-speech translation engines, mining social media for information about health or finance, and identifying sentiment and emotion toward products and services. We describe successes and challenges in this rapidly advancing area. Copyright © 2015, American Association for the Advancement of Science.
Willems, Roel M; Casasanto, Daniel
Do people use sensori-motor cortices to understand language? Here we review neurocognitive studies of language comprehension in healthy adults and evaluate their possible contributions to theories of language in the brain. We start by sketching the minimal predictions that an embodied theory of language understanding makes for empirical research, and then survey studies that have been offered as evidence for embodied semantic representations. We explore four debated issues: first, does activation of sensori-motor cortices during action language understanding imply that action semantics relies on mirror neurons? Second, what is the evidence that activity in sensori-motor cortices plays a functional role in understanding language? Third, to what extent do responses in perceptual and motor areas depend on the linguistic and extra-linguistic context? And finally, can embodied theories accommodate language about abstract concepts? Based on the available evidence, we conclude that sensori-motor cortices are activated during a variety of language comprehension tasks, for both concrete and abstract language. Yet, this activity depends on the context in which perception and action words are encountered. Although modality-specific cortical activity is not a sine qua non of language processing even for language about perception and action, sensori-motor regions of the brain appear to make functional contributions to the construction of meaning, and should therefore be incorporated into models of the neurocognitive architecture of language.
Laporte, Eric
The connection between language processing and combinatorics on words is natural. Historically, linguists actually played a part in the beginning of the construction of theoretical combinatorics on words. Some of the terms in current use originate from linguistics: word, prefix, suffix, grammar, syntactic monoid... However, interpenetration between the formal world of computer theory and the intuitive world of linguistics is still a love story with ups and downs. We will encounter in this cha...
Wittgenstein has often explored language games that have to do with musical objects of different sizes (phrases, themes, formal sections or entire works). These games can refer to a technical language or to common parlance and correspond to different targets. One of these coincides with the intention to suggest a way of conceiving musical understanding. His model takes the form of the invitation to "hear (something) as (something)": typically, to hear a musical passage as an introduction or as a conclusion or in a certain tonality. However one may ask to what extent or in what terms (literal or metaphorical) these procedures, and usually the intervention of language games, is requested by our common ways of understanding music. This article shows through the use of some examples that aspectual perception inherent to musical understanding does not require language games as a necessary condition (although in many cases the link between them seems very strong), in contradiction with the thesis of an essential linguistic character of music. At a basic level, it seems more appropriate to insist on the notion of a game: to understand music means to enter into the orbit of "music games" which show an autonomous functioning. Language games have, however, an important function when we develop this comprehension in the light of the criteria of judgment that substantiate the manner in which music is incorporated in and operates within specific forms of life.
Teachers' understanding of the communicative language teaching approach: The case of English language teachers in Thohoyandou. ... with CLT theories and practice. Keywords: communicative competence, approach versus method, Grammar translation method, direct method, first additional language, second language ...
Shepherd, Debra Lynne
The regional and cultural closeness of Botswana and South Africa, as well as differences in their political histories and language policy stances, offers a unique opportunity to evaluate the role of language in reading outcomes. This study aims to empirically test the effect of exposure to mother tongue and English instruction on the reading…
Sergio Di Carlo
Over time, definitions and taxonomies of language learning strategies have been critically examined. This article defines and classifies cognitive language learning strategies on a more grounded basis. Language learning is a macro-process for which the general hypotheses of information processing are valid. Cognitive strategies are represented by the pillars underlying the encoding, storage and retrieval of information. In order to understand the processes taking place on these three dimensions, a functional model was elaborated from multiple theoretical contributions and previous models: the Smart Processing Model. This model operates with linguistic inputs as well as with any other kind of information. It helps to illustrate the stages, relations, modules and processes that occur during the flow of information. This theoretical advance is a core element to classify cognitive strategies. Contributions from cognitive neuroscience have also been considered to establish the proposed classification which consists of five categories. Each of these categories has a different predominant function: classification, preparation, association, elaboration and transfer-practice. This better founded taxonomy opens the doors to potential studies that would allow a better understanding of the interdisciplinary complexity of language learning. Pedagogical and methodological implications are also discussed.
Corneli, Joseph; Corneli, Miriam
"Natural Language," whether spoken and attended to by humans, or processed and generated by computers, requires networked structures that reflect creative processes in semantic, syntactic, phonetic, linguistic, social, emotional, and cultural modules. Being able to produce novel and useful behavior following repeated practice gets to the root of both artificial intelligence and human language. This paper investigates the modalities involved in language-like applications that computers -- and ...
Provides a comprehensive, modern reference of practical tools and techniques for implementing natural language processing in computer systems. This title covers classical methods, empirical and statistical techniques, and various applications. It describes how the techniques can be applied to European and Asian languages as well as English
Olesen, Henning Salling; Weber, Kirsten
briefly summarized, emphasizing the role of the researcher's arguments in uncovering the socially unconscious meaning in social interaction. Finally, a look at contemporary epistemological problems. LORENZER's approach to theorizing and researching the subject as an entity......The article is a guided tour to Alfred LORENZER's proposal for an "in-depth hermeneutic" cultural analysis methodology which was launched in an environment with an almost complete split between social sciences and psychology/psychoanalysis. It presents the background in his materialist...... socialization theory, which combines a social reinterpretation of the core insights in classical psychoanalysis (the unconscious, the drives) with a theory of language acquisition. His methodology is based on a transformation of the "scenic understanding" from a clinical to a text interpretation, which seeks...
Waltz, David L
Natural language understanding is central to the goals of artificial intelligence. Any truly intelligent machine must be capable of carrying on a conversation: dialogue, particularly clarification dialogue, is essential if we are to avoid disasters caused by the misunderstanding of the intelligent interactive systems of the future. This book is an interim report on the grand enterprise of devising a machine that can use natural language as fluently as a human. What has really been achieved since this goal was first formulated in Turing's famous test? What obstacles still need to be overcome?
Reese, Richard M
If you are a Java programmer who wants to learn about the fundamental tasks underlying natural language processing, this book is for you. You will be able to identify and use NLP tasks for many common problems, and integrate them in your applications to solve more difficult problems. Readers should be familiar/experienced with Java software development.
Levey, Sandra; Polirstok, Susan
Language Development: Understanding Language Diversity in the Classroom offers comprehensive coverage of the language development process for pre- and in-service teachers while emphasizing the factors that further academic success in the classroom, including literacy skills, phonological awareness, and narrative. With chapters written by respected…
Modeling the entailment relation over sentences is one of the generic problems of natural language understanding. In order to account for this problem, we design a theorem prover for Natural Logic, a logic whose terms resemble natural language expressions. The prover is based on an analytic tableau
Krahmer, Emiel; Theune, Mariet
Natural language generation (NLG) is a subfield of natural language processing (NLP) that is often characterized as the study of automatically converting non-linguistic representations (e.g., from databases or other knowledge sources) into coherent natural language text. In recent years the field
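The conversion described in this abstract, from a non-linguistic representation to text, can be illustrated with a minimal template-based sketch. It is not taken from the cited work: the record fields (`city`, `high_c`, `previous_high_c`) and the function name `describe_temperature` are hypothetical, and real NLG systems involve far more than one template.

```python
def describe_temperature(record):
    """Turn a structured record (hypothetical fields) into one English sentence."""
    change = record["high_c"] - record["previous_high_c"]
    # Content selection: decide how to word the day-to-day change.
    if change > 0:
        trend = f"up {change} degrees from yesterday"
    elif change < 0:
        trend = f"down {-change} degrees from yesterday"
    else:
        trend = "unchanged from yesterday"
    # Surface realization: fill a fixed sentence template.
    return (f"{record['city']} will reach {record['high_c']} degrees Celsius "
            f"today, {trend}.")


sentence = describe_temperature(
    {"city": "Oslo", "high_c": 21, "previous_high_c": 18}
)
print(sentence)
```

A fixed template keeps the mapping from data to wording easy to inspect, at the cost of the variety and fluency that fuller generation systems aim for.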
Olesen, Henning Salling; Weber, Kirsten
is based on a transformation of the "scenic understanding" from a clinical to a text interpretation, which seeks to understand collective unconscious meaning in text, and is presented with an illustration of the interpretation procedure from social research. Then follows a brief systematic account of key...
To identify the neural components that make a brain ready for language, it is important to have well defined linguistic phenotypes, to know precisely what language is. There are two central features to language: the capacity to form signs (words), and the capacity to combine them into complex structures. We must determine how the human brain enables these capacities. A sign is a link between a perceptual form and a conceptual meaning. Acoustic elements and content elements, are already brain-internal in non-human animals, but as categorical systems linked with brain-external elements. Being indexically tied to objects of the world, they cannot freely link to form signs. A crucial property of a language-ready brain is the capacity to process perceptual forms and contents offline, detached from any brain-external phenomena, so their "representations" may be linked into signs. These brain systems appear to have pleiotropic effects on a variety of phenotypic traits and not to be specifically designed for language. Syntax combines signs, so the combination of two signs operates simultaneously on their meaning and form. The operation combining the meanings long antedates its function in language: the primitive mode of predication operative in representing some information about an object. The combination of the forms is enabled by the capacity of the brain to segment vocal and visual information into discrete elements. Discrete temporal units have order and juxtaposition, and vocal units have intonation, length, and stress. These are primitive combinatorial processes. So the prior properties of the physical and conceptual elements of the sign introduce combinatoriality into the linguistic system, and from these primitive combinatorial systems derive concatenation in phonology and combination in morphosyntax. Given the nature of language, a key feature to our understanding of the language-ready brain is to be found in the mechanisms in human brains that enable the unique
Nadkarni, Prakash M; Ohno-Machado, Lucila; Chapman, Wendy W
To provide an overview and tutorial of natural language processing (NLP) and modern NLP-system design. This tutorial targets the medical informatics generalist who has limited acquaintance with the principles behind NLP and/or limited knowledge of the current state of the art. We describe the historical evolution of NLP, and summarize common NLP sub-problems in this extensive field. We then provide a synopsis of selected highlights of medical NLP efforts. After providing a brief description of common machine-learning approaches that are being used for diverse NLP sub-problems, we discuss how modern NLP architectures are designed, with a summary of the Apache Foundation's Unstructured Information Management Architecture. We finally consider possible future directions for NLP, and reflect on the possible impact of IBM Watson on the medical field.
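The idea of an NLP architecture built from chained components that each add annotations to a shared document can be sketched as follows. This is only an illustration of the general pipeline pattern under simplifying assumptions; it does not use or reproduce the UIMA API, and the names `Annotation`, `Document`, `sentence_annotator`, `token_annotator`, and `run_pipeline` are hypothetical.

```python
import re
from dataclasses import dataclass, field


@dataclass
class Annotation:
    kind: str   # e.g. "sentence" or "token"
    start: int  # character offset where the span begins
    end: int    # character offset just past the span
    text: str   # the covered text


@dataclass
class Document:
    text: str
    annotations: list = field(default_factory=list)


def sentence_annotator(doc):
    """Naive sentence splitter: a run of text up to '.', '!' or '?'."""
    for m in re.finditer(r"[^.!?]+[.!?]?", doc.text):
        raw = m.group()
        span = raw.strip()
        if span:
            # Skip leading whitespace so offsets point at the sentence itself.
            start = m.start() + (len(raw) - len(raw.lstrip()))
            doc.annotations.append(
                Annotation("sentence", start, start + len(span), span))


def token_annotator(doc):
    """Naive tokenizer: every run of word characters is a token."""
    for m in re.finditer(r"\w+", doc.text):
        doc.annotations.append(
            Annotation("token", m.start(), m.end(), m.group()))


def run_pipeline(text, annotators):
    """Apply each annotator in order to one shared document."""
    doc = Document(text)
    for annotate in annotators:
        annotate(doc)
    return doc


doc = run_pipeline("Patient reports chest pain. No fever.",
                   [sentence_annotator, token_annotator])
sentences = [a.text for a in doc.annotations if a.kind == "sentence"]
tokens = [a.text for a in doc.annotations if a.kind == "token"]
print(sentences)
print(tokens)
```

Storing character offsets rather than copies of the text alone lets later components, such as a concept or negation detector, relate their output back to the original document.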
Hassani, Kaveh; Lee, Won-Sook
A natural language interface exploits the conceptual simplicity and naturalness of the language to create a high-level user-friendly communication channel between humans and machines. One of the promising applications of such interfaces is generating visual interpretations of semantic content of a given natural language that can be then visualized either as a static scene or a dynamic animation. This survey discusses requirements and challenges of developing such systems and reports 26 graphi...
Natural language processing techniques for automatic test questions generation using discourse connectives. ... Journal of Computer Science and Its Application.
In principle, natural language and knowledge representation are closely related. This paper investigates this by demonstrating how several natural language phenomena, such as definite reference, ambiguity, ellipsis, ill-formed input, figures of speech, and vagueness, require diverse knowledge sources and reasoning. The breadth of kinds of knowledge needed to represent morphology, syntax, semantics, and pragmatics is surveyed. Furthermore, several current issues in knowledge representation, such as logic versus semantic nets, general-purpose versus special-purpose reasoners, adequacy of first-order logic, wait-and-see strategies, and default reasoning, are illustrated in terms of their relation to natural language processing and how natural language impacts the issues.
The performance of deep learning in natural language processing has been spectacular, but the reasons for this success remain unclear because of the inherent complexity of deep learning. This paper provides empirical evidence of its effectiveness and of a limitation of neural networks for language engineering. Precisely, we demonstrate that a neural language model based on long short-term memory (LSTM) effectively reproduces Zipf's law and Heaps' law, two representative statistical properties underlying natural language. We discuss the quality of reproducibility and the emergence of Zipf's law and Heaps' law as training progresses. We also point out that the neural language model has a limitation in reproducing long-range correlation, another statistical property of natural language. This understanding could provide a direction for improving the architectures of neural networks.
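As an illustration of the two statistical properties named in this abstract, the sketch below measures them on a token sequence: Zipf's law concerns how word frequency falls off with frequency rank, and Heaps' law concerns how vocabulary size grows with text length. This is not the paper's evaluation code; the function names and the toy sentence are assumptions, and a meaningful estimate needs a corpus far larger than this example.

```python
import math
from collections import Counter


def rank_frequency(tokens):
    """Word frequencies sorted from most to least frequent (Zipf's law)."""
    return sorted(Counter(tokens).values(), reverse=True)


def vocabulary_growth(tokens):
    """Number of distinct words seen after each token (Heaps' law)."""
    seen = set()
    growth = []
    for tok in tokens:
        seen.add(tok)
        growth.append(len(seen))
    return growth


def loglog_slope(xs, ys):
    """Least-squares slope of log(y) against log(x)."""
    lx = [math.log(x) for x in xs]
    ly = [math.log(y) for y in ys]
    mx = sum(lx) / len(lx)
    my = sum(ly) / len(ly)
    num = sum((a - mx) * (b - my) for a, b in zip(lx, ly))
    den = sum((a - mx) ** 2 for a in lx)
    return num / den


# Toy input; in practice this would be generated or corpus text.
tokens = "the cat sat on the mat and the dog sat on the log".split()

freqs = rank_frequency(tokens)
growth = vocabulary_growth(tokens)

# Zipf: frequency ~ rank^(-alpha), so alpha is minus the log-log slope.
zipf_exponent = -loglog_slope(range(1, len(freqs) + 1), freqs)
# Heaps: vocabulary ~ n^beta, so beta is the log-log slope.
heaps_exponent = loglog_slope(range(1, len(growth) + 1), growth)

print(freqs, growth[-1], zipf_exponent, heaps_exponent)
```

Running the same measurements on text sampled from a language model and on the training corpus gives one simple way to compare the two distributions.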
Mobile Speech and Advanced Natural Language Solutions provides a comprehensive and forward-looking treatment of natural speech in the mobile environment. This fourteen-chapter anthology brings together lead scientists from Apple, Google, IBM, AT&T, Yahoo! Research and other companies, along with academicians, technology developers and market analysts. They analyze the growing markets for mobile speech, new methodological approaches to the study of natural language, empirical research findings on natural language and mobility, and future trends in mobile speech. Mobile Speech opens with a challenge to the industry to broaden the discussion about speech in mobile environments beyond the smartphone, to consider natural language applications across different domains. Among the new natural language methods introduced in this book are Sequence Package Analysis, which locates and extracts valuable opinion-related data buried in online postings; microintonation as a way to make TTS truly human-like; and se...
Hovy, Eduard H
Recognizing that the generation of natural language is a goal-driven process, where many of the goals are pragmatic (i.e., interpersonal and situational) in nature, this book provides an overview of the role of pragmatics in language generation. Each chapter states a problem that arises in generation, develops a pragmatics-based solution, and then describes how the solution is implemented in PAULINE, a language generator that can produce numerous versions of a single underlying message, depending on its setting.
Bouaziz, J; Mashiach, R; Cohen, S; Kedem, A; Baron, A; Zajicek, M; Feldman, I; Seidman, D; Soriano, D
Endometriosis is a disease characterized by the development of endometrial tissue outside the uterus, but its cause remains largely unknown. Numerous genes have been studied and proposed to help explain its pathogenesis. However, the large number of these candidate genes has made functional validation through experimental methodologies nearly impossible. Computational methods could provide a useful alternative for prioritizing those most likely to be susceptibility genes. Using artificial intelligence applied to text mining, this study analyzed the genes involved in the pathogenesis, development, and progression of endometriosis. The data extraction by text mining of the endometriosis-related genes in the PubMed database was based on natural language processing, and the data were filtered to remove false positives. Using data from the text mining and gene network information as input for the web-based tool, 15,207 endometriosis-related genes were ranked according to their score in the database. Characterization of the filtered gene set through gene ontology, pathway, and network analysis provided information about the numerous mechanisms hypothesized to be responsible for the establishment of ectopic endometrial tissue, as well as the migration, implantation, survival, and proliferation of ectopic endometrial cells. Finally, the human genome was scanned through various databases using filtered genes as a seed to determine novel genes that might also be involved in the pathogenesis of endometriosis but which have not yet been characterized. These genes could be promising candidates to serve as useful diagnostic biomarkers and therapeutic targets in the management of endometriosis.
Hoard, James E.
Integrating diverse information sources and application software in a principled and general manner will require a very capable advanced information management (AIM) system. In particular, such a system will need a comprehensive addressing scheme to locate the material in its docuverse. It will also need a natural language processing (NLP) system of great sophistication. It seems that the NLP system must serve three functions. First, it provides a natural language interface (NLI) for the users. Second, it serves as the core component that understands and makes use of the real-world interpretations (RWIs) contained in the docuverse. Third, it enables the reasoning specialists (RSs) to arrive at conclusions that can be transformed into procedures that will satisfy the users' requests. The best candidate for an intelligent agent that can satisfactorily make use of RSs and transform documents (TDs) appears to be an object oriented data base (OODB). OODBs have, apparently, an inherent capacity to use the large numbers of RSs and TDs that will be required by an AIM system and an inherent capacity to use them in an effective way.
Garfield, D A; Rapp, C; Evens, M
The potential benefit of artificial intelligence (AI) technology as a tool of psychiatry has not been well defined. In this essay, the technology of natural language processing and its position with regard to the two main schools of AI is clearly outlined. Past experiments utilizing AI techniques in understanding psychopathology are reviewed. Natural language processing can automate the analysis of transcripts and can be used in modeling theories of language comprehension. In these ways, it can serve as a tool in testing psychological theories of psychopathology and can be used as an effective tool in empirical research on verbal behavior in psychopathology.
Levison, Michael; Lessard, Gregory
Describes the natural language computer program, "Vinci." Explains that using an attribute grammar formalism, Vinci can simulate components of several current linguistic theories. Considers the design of the system and its applications in linguistic modelling and second language acquisition research. Notes Vinci's uses in linguistics…
Sevens, Leen; Vandeghinste, Vincent; Schuurman, Ineke; Van Eynde, Frank
We present a Pictograph-to-Text translation system for people with Intellectual or Developmental Disabilities (IDD). The system translates pictograph messages, consisting of one or more pictographs, into Dutch text using WordNet links and an n-gram language model. We also provide several pictograph input methods assisting the users in selecting the appropriate pictographs.
This dissertation studies how people describe emotions with language and how computers can simulate this descriptive behavior. Although many non-human animals can express their current emotions as social signals, only humans can communicate about emotions symbolically. This symbolic communication of emotion allows us to talk about emotions that we…
The contributions in this volume focus on the Bayesian interpretation of natural languages, which is widely used in areas of artificial intelligence, cognitive science, and computational linguistics. This is the first volume to take up topics in Bayesian Natural Language Interpretation and make proposals based on information theory, probability theory, and related fields. The methodologies offered here extend to the target semantic and pragmatic analyses of computational natural language interpretation. Bayesian approaches to natural language semantics and pragmatics are based on methods from signal processing and the causal Bayesian models pioneered especially by Pearl. In signal processing, the Bayesian method finds the most probable interpretation by finding the one that maximizes the product of the prior probability and the likelihood of the interpretation. It thus stresses the importance of a production model for interpretation as in Grice's contributions to pragmatics or in interpretation by abduction.
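The selection rule described above (pick the interpretation that maximizes prior times likelihood) can be sketched in a few lines. This is only an illustrative toy, not a method taken from the volume: the function name `map_interpretation`, the two candidate readings, and all probabilities are hypothetical.

```python
def map_interpretation(prior, likelihood):
    """Return the interpretation i that maximizes prior[i] * likelihood[i].

    prior:      dict mapping interpretation -> P(interpretation)
    likelihood: dict mapping interpretation -> P(utterance | interpretation)
    Interpretations missing from `likelihood` are treated as having likelihood 0.
    """
    return max(prior, key=lambda i: prior[i] * likelihood.get(i, 0.0))


# Hypothetical toy numbers for an ambiguous word such as "bank".
prior = {"river_bank": 0.3, "financial_bank": 0.7}
likelihood = {"river_bank": 0.8, "financial_bank": 0.1}

best = map_interpretation(prior, likelihood)
print(best)
```

With these made-up numbers the less probable reading a priori wins because the production model (the likelihood) favors it strongly, which is the point the abstract makes about needing a production model for interpretation.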
This technical note provides a brief description of a Java library for Arabic natural language processing (NLP) containing code...for training and applying the Arabic NLP system described in the paper "A Cross-Task Flexible Transition Model for Arabic Tokenization, Affix...and also English) natural language processing (NLP), containing code for training and applying the Arabic NLP system described in Stephen Tratz’s
Berwick, Robert C; Friederici, Angela D; Chomsky, Noam; Bolhuis, Johan J
Language serves as a cornerstone for human cognition, yet much about its evolution remains puzzling. Recent research on this question parallels Darwin's attempt to explain both the unity of all species and their diversity. What has emerged from this research is that the unified nature of human language arises from a shared, species-specific computational ability. This ability has identifiable correlates in the brain and has remained fixed since the origin of language approximately 100 thousand years ago. Although songbirds share with humans a vocal imitation learning ability, with a similar underlying neural organization, language is uniquely human.
Monti, Martin M; Parsons, Lawrence M; Osherson, Daniel N
A central question in cognitive science is whether natural language provides combinatorial operations that are essential to diverse domains of thought. In the study reported here, we addressed this issue by examining the role of linguistic mechanisms in forging the hierarchical structures of algebra. In a 3-T functional MRI experiment, we showed that processing of the syntax-like operations of algebra does not rely on the neural mechanisms of natural language. Our findings indicate that processing the syntax of language elicits the known substrate of linguistic competence, whereas algebraic operations recruit bilateral parietal brain regions previously implicated in the representation of magnitude. This double dissociation argues against the view that language provides the structure of thought across all cognitive domains.
Andreasen, Troels; Styltsvig, Henrik Bulskov; Jensen, Per Anker
We describe a natural logic for computational reasoning with a regimented fragment of natural language. The natural logic comes with intuitive inference rules enabling deductions and with an internal graph representation facilitating conceptual path finding between pairs of terms as an approach t...
Dependency Distance, proposed by Hudson and calculated by Liu [2,3], is an important concept in Dependency Theory. It can be used as a measure of syntactic difficulty, and much research [2,4] has attested to the universality of Dependency Distance across languages. Human languages seem to prefer short dependency distances, which may be explained by the general cognitive constraint of limited working memory. Psychological experiments in English, German, Russian and Chinese support the hypothesis that Dependency Distance minimization (DDM) drives languages to evolve syntactic patterns that reduce memory burden [6-9]. Psychological studies focus on the process and mechanism of syntactic structure selection in speech comprehension. In many speech comprehension experiments, ambiguous structures are important experimental material.
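As a rough illustration of the measure, the sketch below computes a mean dependency distance for one sentence, assuming the common convention that the distance of a dependency is the absolute difference between the linear positions of a word and its head, with the root excluded. The function name, the input encoding, and the example parse are hypothetical and are not taken from the paper.

```python
def mean_dependency_distance(heads):
    """Mean absolute distance between each word and its head.

    heads[i] is the 1-based position of the head of word i+1;
    0 marks the root, which has no head and is skipped.
    """
    distances = [
        abs(position - head)
        for position, head in enumerate(heads, start=1)
        if head != 0
    ]
    if not distances:
        return 0.0
    return sum(distances) / len(distances)


# Hypothetical parse of "The dog chased the cat":
# The -> dog, dog -> chased, chased = root, the -> cat, cat -> chased
heads = [2, 3, 0, 5, 3]
mdd = mean_dependency_distance(heads)
print(mdd)  # 1.25
```

Under this convention, a sentence whose dependents sit next to their heads scores close to 1, and longer-range attachments raise the value.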
Gevarter, W. B.
An overview of artificial intelligence (AI), its core ingredients, and its applications is presented. The knowledge representation, logic, problem solving approaches, languages, and computers pertaining to AI are examined, and the state of the art in AI is reviewed. The use of AI in expert systems, computer vision, natural language processing, speech recognition and understanding, speech synthesis, problem solving, and planning is examined. Basic AI topics, including automation, search-oriented problem solving, knowledge representation, and computational logic, are discussed.
Willems, Roel M; Frank, Stefan L; Nijhof, Annabel D; Hagoort, Peter; van den Bosch, Antal
The notion of prediction is studied in cognitive neuroscience with increasing intensity. We investigated the neural basis of 2 distinct aspects of word prediction, derived from information theory, during story comprehension. We assessed the effect of entropy of next-word probability distributions as well as surprisal. A computational model determined entropy and surprisal for each word in 3 literary stories. Twenty-four healthy participants listened to the same 3 stories while their brain activation was measured using fMRI. Reversed speech fragments were presented as a control condition. Brain areas sensitive to entropy were left ventral premotor cortex, left middle frontal gyrus, right inferior frontal gyrus, left inferior parietal lobule, and left supplementary motor area. Areas sensitive to surprisal were left inferior temporal sulcus ("visual word form area"), bilateral superior temporal gyrus, right amygdala, bilateral anterior temporal poles, and right inferior frontal sulcus. We conclude that prediction during language comprehension can occur at several levels of processing, including at the level of word form. Our study exemplifies the power of combining computational linguistics with cognitive neuroscience, and additionally underlines the feasibility of studying continuous spoken language materials with fMRI.
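For readers unfamiliar with the two information-theoretic quantities, the following sketch shows the standard definitions: entropy of a next-word distribution and surprisal of the word that actually occurs. The toy distribution and function names are hypothetical; the study's own computational model is not reproduced here.

```python
import math


def entropy(distribution):
    """Shannon entropy (in bits) of a next-word probability distribution."""
    return -sum(p * math.log2(p) for p in distribution.values() if p > 0)


def surprisal(distribution, word):
    """Surprisal (in bits) of the word that actually occurred."""
    return -math.log2(distribution[word])


# Hypothetical next-word distribution after some story context.
next_word = {"dog": 0.5, "cat": 0.25, "bird": 0.25}

h = entropy(next_word)            # uncertainty before the word is heard
s = surprisal(next_word, "cat")   # unexpectedness of the word once heard
print(h, s)  # 1.5 2.0
```

Entropy is computed before the word arrives and depends on the whole distribution, whereas surprisal depends only on the probability of the observed word, which is why the two can be dissociated.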
Martin, James H
.... This approach asserts that the interpretation of conventional metaphoric language should proceed through the direct application of specific knowledge about the metaphors in the language. MIDAS...
Research into natural language understanding systems for computers has concentrated on implementing particular grammars and grammatical models of the language concerned. This paper presents a rationale for research into natural language understanding systems based on neurological and psychological principles. Important features of the approach are that it seeks to place the onus of learning the language on the computer, and that it seeks to make use of the vast wealth of relevant psycholinguistic and neurolinguistic theory. 22 references.
Wagner, J C; Rogers, J E; Baud, R H; Scherrer, J R
A number of compositional Medical Concept Representation systems are being developed. Although these provide for a detailed conceptual representation of the underlying information, they have to be translated back to natural language for use by end-users and applications. The GALEN programme has been developing one such representation and we report here on a tool developed to generate natural language phrases from the GALEN conceptual representations. This tool can be adapted to different source modelling schemes and to different destination languages or sublanguages of a domain. It is based on a multilingual approach to natural language generation, realised through a clean separation of the domain model from the linguistic model and their link by well defined structures. Specific knowledge structures and operations have been developed for bridging between the modelling 'style' of the conceptual representation and natural language. Using the example of the scheme developed for modelling surgical operative procedures within the GALEN-IN-USE project, we show how the generator is adapted to such a scheme. The basic characteristics of the surgical procedures scheme are presented together with the basic principles of the generation tool. Using worked examples, we discuss the transformation operations which change the initial source representation into a form which can more directly be translated to a given natural language. In particular, the linguistic knowledge which has to be introduced, such as definitions of concepts and relationships, is described. We explain the overall generator strategy and how particular transformation operations are triggered by language-dependent and conceptual parameters. Results are shown for generated French phrases corresponding to surgical procedures from the urology domain.
R. Bruce Hull; David P. Robertson; Angelina Kendra
This study is intended to serve as an explicit and specific example of the social construction of nature. It is motivated by the need to develop a more sophisticated language for a critical public dialogue about society's relationship with nature. We conducted a case study of environmental discourse in one local population in hopes of better understanding how a...
Rodríguez, J. Tinguaro; Franco, Camilo; Montero, Javier
Evidence from cognitive psychology and linguistics shows that pairs of reference concepts (e.g., good/bad, tall/short, nice/ugly) play a crucial role in the way we use and understand natural languages every day in order to analyze reality and make decisions. Different situations...
Language change is a phenomenon that has fascinated scholars for centuries. As a science, linguistic theory has evolved considerably during the 20th century, but the overall puzzle of language change still remains unsolved...
This volume aims to bridge the gap between language arts teaching and linguistic theory. Part one discusses selected aspects of linguistics that are relevant to language arts teaching: the acquisition and development of language during childhood; the English sound system and its relation to spellings and meanings; traditional, structural, and…
Beckage, Nicole M.; Colunga, Eliana
Language is inherently cognitive and distinctly human. Separating the object of language from the human mind that processes and creates language fails to capture the full language system. Linguistics traditionally has focused on the study of language as a static representation, removed from the human mind. Network analysis has traditionally been focused on the properties and structure that emerge from network representations. Both disciplines could gain from looking at language as a cognitive process. In contrast, psycholinguistic research has focused on the process of language without committing to a representation. However, by considering language networks as approximations of the cognitive system we can take the strength of each of these approaches to study human performance and cognition as related to language. This paper reviews research showcasing the contributions of network science to the study of language. Specifically, we focus on the interplay of cognition and language as captured by a network representation. To this end, we review different types of language network representations before considering the influence of global level network features. We continue by considering human performance in relation to network structure and conclude with theoretical network models that offer potential and testable explanations of cognitive and linguistic phenomena.
The Policy-Based Management Natural Language Parser (PBEM) is a rules-based approach to enterprise management that can be used to automate certain management tasks. This parser simplifies the management of a given endeavor by establishing policies to deal with situations that are likely to occur. Policies are operating rules that can be referred to as a means of maintaining order, security, consistency, or other ways of successfully furthering a goal or mission. PBEM provides a way of managing configuration of network elements, applications, and processes via a set of high-level rules or business policies rather than managing individual elements, thus switching the control to a higher level. This software allows unique management rules (or commands) to be specified and applied to a cross-section of the Global Information Grid (GIG). This software embodies a parser that is capable of recognizing and understanding conversational English. Because all possible dialect variants cannot be anticipated, a unique capability was developed that parses based on conversation intent rather than the exact way the words are used. This software can increase productivity by enabling a user to converse with the system in conversational English to define network policies. PBEM can be used in both manned and unmanned science-gathering programs. Because policy statements can be domain-independent, this software can be applied equally to a wide variety of applications.
The present dissertation reports on research into the nature of Pragmatic Language Impairment (PLI) in children aged 4 to 7 in the Netherlands. First, the possibility of screening for PLI in the general population is examined. Results show that this is indeed possible as well as feasible. Second, an
Many natural language dialogue systems make use of 'canned text' for output generation. This approach may be sufficient for dialogues in restricted domains where system utterances are short and simple and use fixed expressions (e.g., slot filling dialogues in the ticket reservation or travel
van Luin, J.; Nijholt, Antinus; op den Akker, Hendrikus J.A.; Giagourta, V.; Strintzis, M.G.
We describe our work on designing a natural language accessible navigation agent for a virtual reality (VR) environment. The agent is part of an agent framework, which means that it can communicate with other agents. Its navigation task consists of guiding the visitors in the environment and to
Percy-Smith, L; Busch, GW; Sandahl, M
The aim of the study was to identify factors associated with the level of language understanding, the level of receptive and active vocabulary, and to estimate effect-related odds ratios for cochlear implanted children's language level.
Ezen-Can, Aysu; Boyer, Kristy Elizabeth
Within the landscape of educational data, textual natural language is an increasingly vast source of learning-centered interactions. In natural language dialogue, student contributions hold important information about knowledge and goals. Automatically modeling the dialogue act of these student utterances is crucial for scaling natural language…
Heger, A.S.; Koen, B.V.
A natural language interface has been developed for access to information from a data base, simulating a nuclear plant reliability data system (NPRDS), one of the several existing data bases serving the nuclear industry. In the last decade, the importance of information has been demonstrated by the impressive diffusion of data base management systems. The present methods that are employed to access data bases fall into two main categories of menu-driven systems and use of data base manipulation languages. Both of these methods are currently used by NPRDS. These methods have proven to be tedious, however, and require extensive training by the user for effective utilization of the data base. Artificial intelligence techniques have been used in the development of several intelligent front ends for data bases in nonnuclear domains. Lunar is a natural language program for interface to a data base describing moon rock samples brought back by Apollo. Intellect is one of the first data base question-answering systems that was commercially available in the financial area. Ladder is an intelligent data base interface that was developed as a management aid to Navy decision makers. A natural language interface for nuclear data bases that can be used by nonprogrammers with little or no training provides a means for achieving this goal for this industry
Kambayashi, Shaw; Uenaka, Junji
In this report, a natural language analyzer and two different task planning systems are described. In 1988, we introduced a Japanese language analyzer named CS-PARSER for the input interface of the task planning system in the Human Acts Simulation Program (HASP). For the purpose of high-speed analysis, we have modified the dictionary system of the CS-PARSER by rewriting it in the C language. It is found that the new dictionary system is very useful for high-speed analysis and efficient maintenance of the dictionary. For the study of the task planning problem, we have modified a story generating system named Micro TALE-SPIN to generate a story written in Japanese sentences. We have also constructed a planning system with a natural language interface by using the CS-PARSER. Task planning processes and related knowledge bases of these systems are explained. A concept design for a new task planning system will also be discussed, based on evaluations of the above-mentioned systems. (author)
Core, Mark G; Moore, Johanna D
.... Because the subject matter is richer, the range of vocabulary and grammatical structures is larger meaning NLU tools are more likely to encounter out-of-vocabulary words or extra-grammatical utterances...
concepts in the taxonomy (e.g., "if you eat this, you will get sick" or "if you eat this it will satisfy your hunger") . The two most important aspects... eating meat, etc., then in almost all such cases one can imagine (or actually encounter) entities to which the term should apply, but which fail to...Philosophical Essays on Mind and Psychology, Bradford Books, 1978. Frege, G. 1892. " Uber Sinn und Bedeutung". Zeitschr. f. Philosophie und philosoph. Kritik
Vandeventer Faltin, Anne
This paper illustrates the usefulness of natural language processing (NLP) tools for computer assisted language learning (CALL) through the presentation of three NLP tools integrated within a CALL software for French. These tools are (i) a sentence structure viewer; (ii) an error diagnosis system; and (iii) a conjugation tool. The sentence structure viewer helps language learners grasp the structure of a sentence, by providing lexical and grammatical information. This information is derived from a deep syntactic analysis. Two different outputs are presented. The error diagnosis system is composed of a spell checker, a grammar checker, and a coherence checker. The spell checker makes use of alpha-codes, phonological reinterpretation, and some ad hoc rules to provide correction proposals. The grammar checker employs constraint relaxation and phonological reinterpretation as diagnosis techniques. The coherence checker compares the underlying "semantic" structures of a stored answer and of the learners' input to detect semantic discrepancies. The conjugation tool is a resource with enhanced capabilities when put in an electronic format, enabling searches from inflected and ambiguous verb forms.
Natural language processing (NLP) is a subfield of artificial intelligence and computational linguistics. It studies the problems of automated generation and understanding of natural human languages. This paper outlines a framework for using computer and natural language techniques to help learners at various levels learn foreign languages in a computer-based learning environment. We propose some ideas for using the computer as a practical tool for learning a foreign language, where most of the courseware is generated automatically. We then describe how to build computer-based learning tools, discuss their effectiveness, and conclude with some possibilities for using on-line resources.
Kearsey, John; Turner, Sheila
Argues that, although some bilingual pupils may be at a disadvantage in understanding scientific language, there may be some circumstances where being bilingual is an advantage in understanding scientific language. Presents evidence of circumstances where being bilingual was an advantage and circumstances where it was a disadvantage in…
Different generations are constituted depending on social changes, and they are designated sociologically as traditional, baby boomer, X, Y and Z. Many studies have been reported on the understanding of foreign language learning in Generation Y. This study aims to identify the gap in, and contribute to, the research on language learning understanding of…
Cawsey, A J; Webber, B L; Jones, R B
Good communication is vital in health care, both among health care professionals, and between health care professionals and their patients. And well-written documents, describing and/or explaining the information in structured databases may be easier to comprehend, more edifying, and even more convincing than the structured data, even when presented in tabular or graphic form. Documents may be automatically generated from structured data, using techniques from the field of natural language generation. These techniques are concerned with how the content, organization and language used in a document can be dynamically selected, depending on the audience and context. They have been used to generate health education materials, explanations and critiques in decision support systems, and medical reports and progress notes.
The word ''radioactivity'' has something scary about it; it makes us think of something intangible, creeping dangers, the mysterious ticking of Geiger counters, reactor disasters, dirty bombs, nuclear contamination and destruction. True: Whole landscapes were made uninhabitable by accidents involving radioactive material such as Windscale, Sellafield and Chernobyl and others that were kept largely secret from the public. While to some they brought premature death, for the great majority of the world population their effects have so far been insignificant. By contrast, how little known is the fact that natural radioactivity has been around since human beginnings and that the cells of the human body have always been equipped to repair damage from radioactive radiation or other causes provided such damage does not occur too frequently. Elmar Traebert presents the physics underlying radioactivity without resorting to formulas and explains in an easily understandable manner the different types of radiation, their measurement and sources (in medicine, power plants, and weapons technology) and how they should be handled. He describes nuclear power plants and the safety problems they involve, sunburn, radiation therapy, uranium ammunition and uranium mining. Whoever knows about these things can more easily cope with his own fears and maybe allay some of them. He can also see through statements made by different interest groups with regard to radioactive material and duly form his own opinion.
In this article we argue that second language acquisition (SLA) research and theory have a significant role to play in teacher education, especially at the masters level. The danger of overly practical approaches is that they cannot challenge current practice in ways that are both critical and rigorous. However, to engage ...
Philpot, Cindy J.
Recent reform efforts in science education focus on scientific literacy for all citizens. In order to be scientifically literate, an individual must have informed understandings of nature of science (NOS), scientific inquiry, and science content matter. This study specifically focused on Science Olympiad students' understanding of NOS as one piece of scientific literacy. Research consistently shows that science students do not have informed understandings of NOS (Abd-El-Khalick, 2002; Bell, Blair, Crawford, and Lederman, 2002; Kilcrease and Lucy, 2002; Schwartz, Lederman, and Thompson, 2001). However, McGhee-Brown, Martin, Monsaas and Stombler (2003) found that Science Olympiad students had in-depth understandings of science concepts, principles, processes, and techniques. Science Olympiad teams compete nationally and are found in rural, urban, and suburban schools. In an effort to learn from students who are generally considered high achieving students and who enjoy science, as opposed to the typical science student, the purpose of this study was to investigate Science Olympiad students' understandings of NOS and the experiences that formed their understandings. An interpretive, qualitative, case study method was used to address the research questions. The participants were purposefully and conveniently selected from the Science Olympiad team at a suburban high school. Data collection consisted of the Views of Nature of Science -- High School Questionnaire (VNOS-HS) (Schwartz, Lederman, & Thompson, 2001), semi-structured individual interviews, and a focus group. The main findings of this study were similar to much of the previous research in that the participants had informed understandings of the tentative nature of science and the role of inferences in science, but they did not have informed understandings of the role of human imagination and creativity, the empirical nature of science, or theories and laws. High level science classes and participation in
Gevarter, William B.
Computer-based Natural Language Processing (NLP) is the key to enabling humans and their computer-based creations to interact with machines using natural languages (English, Japanese, German, etc.) rather than formal computer languages. NLP is a major research area in the fields of artificial intelligence and computational linguistics. Commercial…
Andreasen, Troels; Bulskov, Henrik; Nilsson, Jørgen Fischer
This paper makes a case for adopting appropriate forms of natural logic as target language for computational reasoning with descriptive natural language. Natural logics are stylized fragments of natural language where reasoning can be conducted directly by natural reasoning rules reflecting intuitive reasoning in natural language. The approach taken in this paper is to extend natural logic stepwise with a view to covering successively larger parts of natural language. We envisage applications for computational querying and reasoning, in particular within the life-sciences.
Manrique Cordeje, M.E.
How does (mis)understanding work in conversation? Problems of understanding occur all the time in our everyday social life. How does miscommunication happen and how do we deal with it? This thesis reports on how sign language users manage to understand each other based on a large Conversational
Richard K. Payne
The question motivating this essay is how tantric Buddhist practitioners in Japan understood language such as to believe that mantra, dhāraṇī, and related forms are efficacious. “Extraordinary language” is introduced as a cover term for these several similar language uses found in tantric Buddhist practices in Japan. The essay proceeds to a critical examination of Anglo-American philosophy of language to determine whether the concepts, categories, and concerns of that field can contribute to the analysis and understanding of extraordinary language. However, that philosophy of language does not contribute to this analysis, as it is constrained by its continuing focus on its founding concepts, dating particularly from the work of Frege. Comparing it to Indic thought regarding language reveals a distinct mismatch, further indicating the limiting character of the philosophy of language. The analysis then turns to examine two other explanations of tantric language use found in religious studies literature: magical language and performative language. These also, however, prove to be unhelpful. While the essay is primarily critical, one candidate for future constructive study is historical pragmatics, as suggested by Ronald Davidson. The central place of extraordinary language indicates that Indic reflections on the nature of language informed tantric Buddhist practice in Japan and are not simply cultural baggage.
This paper presents how to search mathematical formulae written in MathML when given plain words as a query. Since the proposed method allows natural language queries, as in traditional Information Retrieval, for mathematical formula search, users do not need to enter any complicated math symbols or use any formula input tool. For this, formula data is converted into plain text, and features are extracted from the converted text. In our experiments, we achieve an outstanding performance, an MRR of 0.659. In addition, we introduce how to utilize formula classification for formula search. By using class information, we finally achieve an improved performance, an MRR of 0.690.
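The MRR (mean reciprocal rank) figures reported above can be read against the standard definition of the metric: for each query, take the reciprocal of the rank at which the first relevant result appears (0 if it never appears), then average over all queries. The following is a minimal sketch of that standard definition only; the function names and the toy formula identifiers are hypothetical and are not taken from the paper, whose evaluation code is not described in the abstract.

```python
from typing import Hashable, Sequence


def reciprocal_rank(ranked: Sequence[Hashable], relevant: Hashable) -> float:
    """Return 1/rank of the relevant item in a ranked list, or 0.0 if absent."""
    for position, item in enumerate(ranked, start=1):
        if item == relevant:
            return 1.0 / position
    return 0.0


def mean_reciprocal_rank(
    results: Sequence[Sequence[Hashable]],
    relevants: Sequence[Hashable],
) -> float:
    """Average the reciprocal ranks over all queries (one relevant item each)."""
    if len(results) != len(relevants):
        raise ValueError("each query needs exactly one relevant item")
    if not results:
        return 0.0
    total = sum(reciprocal_rank(r, t) for r, t in zip(results, relevants))
    return total / len(results)


# Hypothetical toy data: three queries, each with a ranked list of formula ids.
toy_results = [["f1", "f2"], ["f7", "f8", "f9"], ["f4"]]
toy_relevant = ["f1", "f9", "f5"]  # hit at rank 1, hit at rank 3, miss
toy_mrr = mean_reciprocal_rank(toy_results, toy_relevant)
```

Under this definition, an MRR of 0.659 means that, averaged over queries, the first relevant formula sits somewhere between rank 1 and rank 2.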
Hovy, Dirk; Spruit, Shannon
Research in natural language processing (NLP) used to be mostly performed on anonymous corpora, with the goal of enriching linguistic analysis. Authors were either largely unknown or public figures. As we increasingly use more data from social media, this situation has changed: users are now individually identifiable, and the outcome of NLP experiments and applications can have a direct effect on their lives. This change should spawn a debate about the ethical implications of NLP, but until now, the internal discourse in the field has not followed the technological development. This position paper identifies a number of social implications that NLP research may have, and discusses their ethical significance, as well as ways to address them.
We propose a new application of quantum computing to the field of natural language processing. Ongoing work in this field attempts to incorporate grammatical structure into algorithms that compute meaning. In (Coecke, Sadrzadeh and Clark, 2010), the authors introduce such a model (the CSC model) based on tensor product composition. While this algorithm has many advantages, its implementation is hampered by the large classical computational resources that it requires. In this work we show how computational shortcomings of the CSC approach could be resolved using quantum computation (possibly in addition to existing techniques for dimension reduction). We address the value of quantum RAM (Giovannetti, 2008) for this model and extend an algorithm from Wiebe, Braun and Lloyd (2012) into a quantum algorithm to categorize sentences in CSC. Our new algorithm demonstrates a quadratic speedup over classical methods under certain conditions.
This study explores language ideologies of English at a Korean university where English has been adopted as an official language. This study draws on ethnographic data in order to understand how speakers respond to and experience the institutional language policy. The findings show that language ideologies in this university represent the…
Learning idioms, which is considered an essential part of learning and using language (Sridhar and Karunakaran, 2013), has recently attracted great attention from English learning researchers, particularly the assessment of how well Asian language learners acquire and use idioms in communication (Tran, 2013). Understanding and using them fluently could be viewed as a sign of language proficiency, as they could be an effective way to give students better conditions to enhance their communication skills in the daily context (Beloussova, 2015). Investigating how idiomatic expressions are dealt with and processed in a second language or foreign language is an issue worth examining further since it may give language teachers a better idea of some of the strategies language learners use in order to interpret figurative language. Despite their importance, the learning and use of English idioms by Arab EFL learners have not been investigated extensively, and no research has been conducted on Jordanian students’ idiomatic competency. Thus, the researcher decided to work on these untackled issues in the Jordanian context. Most idiom-based investigations concern the difficulties Jordanian learners of English face when translating them into Arabic (Hussein, Khanji, and Makhzoumi, 2000; Bataineh and Bataineh, 2002; Alrishan and Smadi, 2015). The analysis of the test showed students’ very poor idiomatic competence, particularly a very limited awareness of the most frequently used idioms despite their overwhelming desire to learn them. Data analysis of the questionnaire revealed the strategies students use and the problems they face in understanding and learning idioms.
.... Initiated in 2004 at Defense Research and Development Canada (DRDC), the SACOT knowledge engineering research project is currently investigating, developing and validating innovative natural language processing (NLP...
Schaub, Gayle; Cadena, Cara; Bravender, Patricia; Kierkus, Christopher
To effectively access and use the resources of the academic library and to become information-literate, students must understand the language of information literacy. This study analyzes undergraduate students' understanding of fourteen commonly used information-literacy terms. It was found that some of the terms least understood by students are…
Nataša Pirih Svetina
Intercomprehension is a communication practice where two persons speak their mother tongue and are able to understand each other without being taught the language of their addressee. It is a usual practice between languages that belong to the same linguistic family, for example Slavic, Romance or Germanic languages. In the article, the authors present the notion of intercomprehension as an alternative to communication in English as a lingua franca. That kind of communication was known among Scandinavians, whereas the first teaching method was developed for Romance languages (EuRomCom) at the beginning of the 21st century. Today, more methods exist, including ones for Germanic and Slavic languages. In the article, the authors enumerate some of them and also give a short outline of existing practices.
Mistry, Pramod K; Belmatoug, Nadia; vom Dahl, Stephan; Giugliani, Roberto
Gaucher disease is a rare and extraordinarily heterogeneous inborn error of metabolism that exhibits diverse manifestations, a broad range of age of onset of symptoms, and a wide clinical spectrum of disease severity, from lethal disease during infancy to first age of onset of symptoms in octogenarians. Before the advent of the International Collaborative Gaucher Group (ICGG) Gaucher Registry, the understanding of the natural history and phenotypic range of Gaucher disease was based on isolated case reports and small case series. Limited data hindered understanding of the full spectrum of the disease leading to some early misconceptions about Gaucher disease, notably, that nonneuronopathic (type 1) disease was a disease of adults only. The global scope of the ICGG Gaucher Registry, with its vast body of longitudinal data, has enabled a real appreciation of both the phenotypic spectrum of Gaucher disease and its natural history. This body of evidence represents the foundation for accurate assessment of the response to specific therapies for Gaucher disease and to the development of standard-of-care to monitor disease activity. Here, we outline the key developments in delineating the natural history of this highly complex disease and role of the ICGG Gaucher Registry in this effort. © 2015 Wiley Periodicals, Inc.
Lenti Boero, Daniela
Building a theory on extant species, as Ackermann et al. do, is a useful contribution to the field of language evolution. Here, I add another living model that might be of interest: human language ontogeny in the first year of life. A better knowledge of this phase might help in understanding two more topics among the "several building blocks of a comprehensive theory of the evolution of spoken language" indicated in their conclusion by Ackermann et al., that is, the foundation of the co-evolution of linguistic motor skills with the auditory skills underlying speech perception, and the possible phylogenetic interactions of protospeech production with referential capabilities.
[First paragraph] Christopher Alexander's book, The Timeless Way of Building, is probably the most beautiful book on the notion of quality in observation and design that I have been reading since Robert Pirsig's (1974) Zen and the Art of Motorcycle Maintenance. It was published in 1979, when Alexander was a professor of architecture at the University of California, Berkeley, where I was at that time studying. Although I was aware of some of Alexander's famous articles such as "A city is not a tree" (Alexander, 1965), the book (Alexander, 1979) never quite made it to the top of my reading list. This remained so until recently, when I met a software developer who enthusiastically talked to me about a book he was currently reading, on the importance of understanding design patterns. He was talking about the very book I had failed to read during my Berkeley years and which, as I now discovered, has since become a cult book among computer programmers and information scientists, as well as in other fields of research. I decided it was time to read the book.
Langkopf, B.S.; Mallory, L.H.
A scientific data base, the Tuff Data Base, is being created at Sandia National Laboratories on the Cyber 170/855, using System 2000. It is being developed for use by scientists and engineers investigating the feasibility of locating a high-level radioactive waste repository in tuff (a type of volcanic rock) at Yucca Mountain on and adjacent to the Nevada Test Site. This project, the Nevada Nuclear Waste Storage Investigations (NNWSI) Project, is managed by the Nevada Operations Office of the US Department of Energy. A user-friendly interface, PRIMER, was developed that uses the Self-Contained Facility (SCF) command SUBMIT and System 2000 Natural Language functions and parametric strings that are schema resident. The interface was designed to: (1) allow users, with or without computer experience or keyboard skill, to sporadically access data in the Tuff Data Base; (2) produce retrieval capabilities for the user quickly; and (3) acquaint the users with the data in the Tuff Data Base. This paper gives a brief description of the Tuff Data Base Schema and the interface, PRIMER, which is written in Fortran V. 3 figures
Paul H Thibodeau
Metaphors pervade discussions of social issues like climate change, the economy, and crime. We ask how natural language metaphors shape the way people reason about such social issues. In previous work, we showed that describing crime metaphorically as a beast or a virus led people to generate different solutions to a city's crime problem. In the current series of studies, instead of asking people to generate a solution on their own, we provided them with a selection of possible solutions and asked them to choose the best ones. We found that metaphors influenced people's reasoning even when they had a set of options available to compare and select among. These findings suggest that metaphors can influence not just what solution comes to mind first, but also which solution people think is best, even when given the opportunity to explicitly compare alternatives. Further, we tested whether participants were aware of the metaphor. We found that very few participants thought the metaphor played an important part in their decision. Further, participants who had no explicit memory of the metaphor were just as much affected by the metaphor as participants who were able to remember the metaphorical frame. These findings suggest that metaphors can act covertly in reasoning. Finally, we examined the role of political affiliation on reasoning about crime. The results confirm our previous findings that Republicans are more likely to generate enforcement and punishment solutions for dealing with crime, and are less swayed by metaphor than are Democrats or Independents.
When we think of everyday language use, the first things that come to mind include colloquial conversations, reading and writing e-mails, sending text messages or reading a book. But can we study the brain basis of language as we use it in our daily lives? As a topic of study, the cognitive
593], pages 351-363. International Conference of the IEEE Engineering in Medicine and Biology Society, volume 3, pages 1347-1348, New Orleans, LA...Conference on Machine Translation of Languages and Applied Language Analysis, pages 66-80. Ingrid Zukerman. Koalas are not bears: Gener-... Her
Service-oriented chatbot systems are used to inform users in a conversational manner about a particular service or product on a website. Our research shows that current systems are time-consuming to build and not very accurate or satisfying to users. We find that natural language understanding and natural language generation methods are central to creating an efficient and useful system. In this thesis we investigate current and past methods in this research area and place particular emph...
Hamon, Thierry; Mougin, Fleur; Grabar, Natalia
With the recent and intensive research in the biomedical area, the knowledge accumulated is disseminated through various knowledge bases. Links between these knowledge bases are needed in order to use them jointly. Linked Data, the SPARQL language, and natural language question-answering interfaces provide interesting solutions for querying such knowledge bases. We propose a method for translating natural language questions into SPARQL queries. We use Natural Language Processing tools, semantic resources, and the RDF triples description. The method is designed on 50 questions over 3 biomedical knowledge bases, and evaluated on 27 questions. It achieves 0.78 F-measure on the test set. The method for translating natural language questions into SPARQL queries is implemented as a Perl module available at http://search.cpan.org/ thhamon/RDF-NLP-SPARQLQuery.
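For readers unfamiliar with the F-measure quoted above, the sketch below shows the standard definition: the (optionally weighted) harmonic mean of precision and recall computed from true-positive, false-positive, and false-negative counts. The abstract does not say how answers were matched or which beta was used, so the function name, the default beta of 1, and the example counts are assumptions for illustration only and do not reproduce the paper's evaluation.

```python
def f_measure(true_pos: int, false_pos: int, false_neg: int, beta: float = 1.0) -> float:
    """Standard F-beta score from raw counts; beta=1 gives the usual F1."""
    predicted = true_pos + false_pos   # everything the system returned
    expected = true_pos + false_neg    # everything it should have returned
    precision = true_pos / predicted if predicted else 0.0
    recall = true_pos / expected if expected else 0.0
    if precision == 0.0 and recall == 0.0:
        return 0.0
    beta_sq = beta * beta
    return (1.0 + beta_sq) * precision * recall / (beta_sq * precision + recall)


# Hypothetical counts (not from the paper): 3 correct answers returned,
# 1 wrong answer returned, 3 correct answers missed.
example_f1 = f_measure(true_pos=3, false_pos=1, false_neg=3)
```

With these made-up counts, precision is 0.75 and recall is 0.5, so the harmonic mean is pulled toward the lower of the two values.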
Pandolfe, Jessica M.; Wittke, Kacie; Spaulding, Tammie J.
Purpose: This study examined if adolescents with specific language impairment (SLI) understand driving vocabulary as well as their typically developing (TD) peers. Method: A total of 16 adolescents with SLI and 16 TD comparison adolescents completed a receptive vocabulary task focused on driving terminology derived from statewide driver's manuals.…
Dougherty, Ray C
This book's main goal is to show readers how to use the linguistic theory of Noam Chomsky, called Universal Grammar, to represent English, French, and German on a computer using the Prolog computer language. In so doing, it presents a follow-the-dots approach to natural language processing, linguistic theory, artificial intelligence, and expert systems. The basic idea is to introduce meaningful answers to significant problems involved in representing human language data on a computer. The book offers a hands-on approach to anyone who wishes to gain a perspective on natural language
toward the monolingual English 25 msec value. Miyawaki et al. (1975) investigated the /ra/ - /la/ continuum with English and Japanese speakers...Standard Dictionary In order to evaluate some of the claims of the learning theory of speech recognition, a computer model was developed. The NEXus...discrimination of synthetic vowels. Language and Speech, 1962, 5, 171-189. Funk and Wagnalls New Standard Dictionary of the English Language. New York: Funk and
Pestian, John; Nasrallah, Henry; Matykiewicz, Pawel; Bennett, Aurora; Leenaars, Antoon
Suicide is the second leading cause of death among 25-34 year olds and the third leading cause of death among 15-25 year olds in the United States. In the Emergency Department, where suicidal patients often present, estimating the risk of repeated attempts is generally left to clinical judgment. This paper presents our second attempt to determine the role of computational algorithms in understanding a suicidal patient's thoughts, as represented by suicide notes. We focus on developing methods of natural language processing that distinguish between genuine and elicited suicide notes. We hypothesize that machine learning algorithms can categorize suicide notes as well as mental health professionals and psychiatric physician trainees do. The data used are comprised of suicide notes from 33 suicide completers and matched to 33 elicited notes from healthy control group members. Eleven mental health professionals and 31 psychiatric trainees were asked to decide if a note was genuine or elicited. Their decisions were compared to nine different machine-learning algorithms. The results indicate that trainees accurately classified notes 49% of the time, mental health professionals accurately classified notes 63% of the time, and the best machine learning algorithm accurately classified the notes 78% of the time. This is an important step in developing an evidence-based predictor of repeated suicide attempts because it shows that natural language processing can aid in distinguishing between classes of suicidal notes.
Spoken language understanding (SLU) is an emerging field in between speech and language processing, investigating human/machine and human/human communication by leveraging technologies from signal processing, pattern recognition, machine learning and artificial intelligence. SLU systems are designed to extract the meaning from speech utterances, and its applications are vast, from voice search in mobile devices to meeting summarization, attracting interest from both commercial and academic sectors. Both human/machine and human/human communications can benefit from the application of SLU, usin
Dominick, Wayne D. (Editor); Liu, I-Hsiung
The currently developed user language interfaces of information systems are generally intended for serious users. These interfaces commonly ignore potentially the largest user group, i.e., casual users. This project discusses the concepts and implementations of a natural query language system that satisfies the nature and information needs of casual users by allowing them to communicate with the system in the form of their native (natural) language. In addition, a framework for the development of such an interface is also introduced for the MADAM (Multics Approach to Data Access and Management) system at the University of Southwestern Louisiana.
Barker-Plummer, Dave; Dale, Robert; Cox, Richard; Romanczuk, Alex
We have assembled a large corpus of student submissions to an automatic grading system, where the subject matter involves the translation of natural language sentences into propositional logic. Of the 2.3 million translation instances in the corpus, 286,000 (approximately 12%) are categorized as being in error. We want to understand the nature of…
Wong, Wing-Kwong; Yin, Sheng-Kai; Yang, Chang-Zhe
This paper presents a tool for drawing dynamic geometric figures by understanding the texts of geometry problems. With the tool, teachers and students can construct dynamic geometric figures on a web page by inputting a geometry problem in natural language. First we need to build the knowledge base for understanding geometry problems. With the…
Bender, Emily M
Many NLP tasks have at their core a subtask of extracting the dependencies (who did what to whom) from natural language sentences. This task can be understood as the inverse of the problem solved in different ways by diverse human languages, namely, how to indicate the relationship between different parts of a sentence. Understanding how languages solve the problem can be extremely useful in both feature design and error analysis in the application of machine learning to NLP. Likewise, understanding cross-linguistic variation can be important for the design of MT systems and other multilingual a
Where Humans Meet Machines: Innovative Solutions for Knotty Natural-Language Problems brings humans and machines closer together by showing how linguistic complexities that confound the speech systems of today can be handled effectively by sophisticated natural-language technology. Some of the most vexing natural-language problems that are addressed in this book entail recognizing and processing idiomatic expressions, understanding metaphors, matching an anaphor correctly with its antecedent, performing word-sense disambiguation, and handling out-of-vocabulary words and phrases. This fourteen-chapter anthology consists of contributions from industry scientists and from academicians working at major universities in North America and Europe. They include researchers who have played a central role in DARPA-funded programs and developers who craft real-world solutions for corporations. These contributing authors analyze the role of natural language technology in the global marketplace; they explore the need f...
Development of information technologies is growing steadily. With the latest developments in software technologies and the application of artificial intelligence and machine learning methods that embed intelligence in computers, the expectation is that in the near future computers will be able to solve problems themselves, as people do. Artificial intelligence emulates human behavior on computers. Rather than executing instructions one by one, as they are programmed, machine learning employs prior experience/data that is used in the process of the system’s training. In this state-of-the-art paper, common methods in AI, such as machine learning, pattern recognition and natural language processing (NLP), are discussed. A standard architecture of an NLP processing system and the level that is needed for understanding NLP are also given. Lastly, statistical NLP processing and multi-word expressions are described.
May 26, 2018 ... resent, and store information in a natural-language-independent format. UNL is .... account semantic information available in words of the problem ...... Sentiment Analysis (SA) plays a vital role in decision making process.
Recent mathematical and algorithmic results in the field of finite-state technology, as well as the increase in computing power, have constructed the base for a new approach in natural language processing. However, the task of creating an appropriate model that would describe the phenomena of natural language is still to be achieved. In this paper I present some notions related to the finite-state modelling of syntax and morphology.
Institute for the Study of Violent Groups NATO North Atlantic Treaty Organization NLP Natural Language Processing PCorpus Permanent Corpus PDF...approaches, we apply Natural Language Processing (NLP) tools to a unique database of text documents collected by Whiteside (2014). His collection...from Arabic to English. Compared to other terrorism databases, Whiteside’s collection methodology limits the scope of the database and avoids coding
Arabic is a Semitic language spoken by more than 330 million people as a native language, in an area extending from the Arabian/Persian Gulf in the East to the Atlantic Ocean in the West. Moreover, it is the language in which 1.4 billion Muslims around the world perform their daily prayers. Over the last few years, Arabic natural language processing (ANLP) has gained increasing importance, and several state-of-the-art systems have been developed for a wide range of applications.
Scott, Jessica C; Henderson, Annette M E
Object labels are valuable communicative tools because their meanings are shared among the members of a particular linguistic community. The current research was conducted to investigate whether 13-month-old infants appreciate that object labels should not be generalized across individuals who have been shown to speak different languages. Using a visual habituation paradigm, Experiment 1 tested whether infants would generalize a new object label that was taught to them by a speaker of a foreign language to a speaker from the infant's own linguistic group. The results suggest that infants do not expect 2 individuals who have been shown to speak different languages to use the same label to refer to the same object. The results of Experiment 2 reveal that infants do not generalize a new object label that was taught to them by a speaker of their native language to an individual who had been shown to speak a foreign language. These findings offer the first evidence that by the end of the 1st year of life, infants are sensitive to the fact that the conventional nature of language is constrained by the language that a person has been shown to speak.
Rogalsky, Corianne; Raphel, Kristin; Tomkovicz, Vivian; O'Grady, Lucinda; Damasio, Hanna; Bellugi, Ursula; Hickok, Gregory
The neural basis of action understanding is a hotly debated issue. The mirror neuron account holds that motor simulation in fronto-parietal circuits is critical to action understanding including speech comprehension, while others emphasize the ventral stream in the temporal lobe. Evidence from speech strongly supports the ventral stream account, but on the other hand, evidence from manual gesture comprehension (e.g., in limb apraxia) has led to contradictory findings. Here we present a lesion analysis of sign language comprehension. Sign language is an excellent model for studying mirror system function in that it bridges the gap between the visual-manual system in which mirror neurons are best characterized and language systems which have represented a theoretical target of mirror neuron research. Twenty-one lifelong deaf signers with focal cortical lesions performed two tasks: one involving the comprehension of individual signs and the other involving comprehension of signed sentences (commands). Participants' lesions, as indicated on MRI or CT scans, were mapped onto a template brain to explore the relationship between lesion location and sign comprehension measures. Single sign comprehension was not significantly affected by left hemisphere damage. Sentence sign comprehension impairments were associated with left temporal-parietal damage. We found that damage to mirror system related regions in the left frontal lobe was not associated with deficits on either of these comprehension tasks. We conclude that the mirror system is not critically involved in action understanding.
Using contemporary science, the paper builds on Wittgenstein’s views of human language. Rather than ascribing reality to inscription-like entities, it links embodiment with distributed cognition. The verbal or (quasi) technological aspect of language is traced to not action, but human specific interactivity. This species-specific form of sense-making sustains, among other things, using texts, making/construing phonetic gestures and thinking. Human action is thus grounded in appraisals or sense-saturated coordination. To illustrate interactivity at work, the paper focuses on a case study. Over 11 s, a crime scene investigator infers that she is probably dealing with an inside job: she uses not words, but intelligent gaze. This connects professional expertise to circumstances and the feeling of thinking. It is suggested that, as for other species, human appraisal is based in synergies. However, since...
Holmer, Emil; Heimann, Mikael; Rudner, Mary
Imitation and language processing are closely connected. According to the Ease of Language Understanding (ELU) model (Rönnberg et al., 2013) pre-existing mental representation of lexical items facilitates language understanding. Thus, imitation of manual gestures is likely to be enhanced by experience of sign language. We tested this by eliciting imitation of manual gestures from deaf and hard-of-hearing (DHH) signing and hearing non-signing children at a similar level of language and cognitive development. We predicted that the DHH signing children would be better at imitating gestures lexicalized in their own sign language (Swedish Sign Language, SSL) than unfamiliar British Sign Language (BSL) signs, and that both groups would be better at imitating lexical signs (SSL and BSL) than non-signs. We also predicted that the hearing non-signing children would perform worse than DHH signing children with all types of gestures the first time (T1) we elicited imitation, but that the performance gap between groups would be reduced when imitation was elicited a second time (T2). Finally, we predicted that imitation performance on both occasions would be associated with linguistic skills, especially in the manual modality. A split-plot repeated measures ANOVA demonstrated that DHH signers imitated manual gestures with greater precision than non-signing children when imitation was elicited the second but not the first time. Manual gestures were easier to imitate for both groups when they were lexicalized than when they were not; but there was no difference in performance between familiar and unfamiliar gestures. For both groups, language skills at T1 predicted imitation at T2. Specifically, for DHH children, word reading skills, comprehension and phonological awareness of sign language predicted imitation at T2. For the hearing participants, language comprehension predicted imitation at T2, even after the effects of working memory capacity and motor skills were taken into
Poletiek, Fenna H; Fitz, Hartmut; Bocanegra, Bruno R
Rey et al. (2012) present data from a study with baboons that they interpret in support of the idea that center-embedded structures in human language have their origin in low-level memory mechanisms and associative learning. Critically, the authors claim that the baboons showed a behavioral preference that is consistent with center-embedded sequences over other types of sequences. We argue that the baboons' response patterns suggest that two mechanisms are involved: first, they can be trained to associate a particular response with a particular stimulus, and, second, when faced with two conditioned stimuli in a row, they respond to the most recent one first, copying behavior they had been rewarded for during training. Although Rey et al.'s (2012) experiment shows that the baboons' behavior is driven by low-level mechanisms, it is not clear how the reported animal behavior bears on the phenomenon of center-embedded structures in human syntax. Hence, (1) natural language syntax may indeed have been shaped by low-level mechanisms, and (2) the baboons' behavior is driven by low-level stimulus-response learning, as Rey et al. propose. But is the second evidence for the first? We will discuss in what ways this study can and cannot give evidential value for explaining the origin of center-embedded recursion in human grammar. More generally, their study provokes an interesting reflection on the use of animal studies in order to understand features of the human linguistic system. Copyright © 2015 Elsevier B.V. All rights reserved.
Gevarter, W. B.
Computer-based Natural Language Processing (NLP) is the key to enabling humans and their computer-based creations to interact with machines in natural language (like English, Japanese, German, etc., in contrast to formal computer languages). The doors that such an achievement can open have made this a major research area in Artificial Intelligence and Computational Linguistics. Commercial natural language interfaces to computers have recently entered the market, and the future looks bright for other applications as well. This report reviews the basic approaches to such systems, the techniques utilized, applications, the state of the art of the technology, issues and research requirements, the major participants and, finally, future trends and expectations. It is anticipated that this report will prove useful to engineering and research managers, potential users, and others who will be affected by this field as it unfolds.
Olive, Joseph P; McCary, John
This comprehensive handbook, written by leading experts in the field, details the groundbreaking research conducted under the Global Autonomous Language Exploitation (GALE) program of the Defense Advanced Research Projects Agency (DARPA), while placing it in the context of previous research in the fields of natural language and signal processing, artificial intelligence and machine translation. The most fundamental contrast between GALE and its predecessor programs was its holistic integration of previously separate or sequential processes. In earlier language research pro
N. M. Boychenko
Purpose. In order to consistently distinguish between violence, which is always primarily a destructive force, and the civilized use of force that involves constructive, creative goals, one should explore the main possible philosophical approaches to understanding the nature of violence and try to give it a systematic outline. Methodology. This study uses a systematic approach to identify the internal relationship between different forms of violence and, accordingly, the counteraction against violence. The author also uses axiology to identify the values that are the basis for distinguishing violence from its prototypes, for the distinction between violence and coercion, and for distinguishing different types of coercion. Originality. This article presents significant clarifications on the classification of types of violence; in particular, it is clearly established that certain types of violence cannot have ethical relevance, since they belong to the sphere of biology (expansion, aggression) or social anthropology (cultural, institutional coercion). Violence proper, or violence in the narrow sense, implies the existence of will, consciousness and destructive purpose. Accordingly, counteraction against violence should include the formation of a certain non-violent type of will, non-violent culture and creative, constructive goals. This requires both personal effort and institutional support and the availability of appropriate moral traditions. Ethical theory is intended to clarify and systematize these efforts. In this sense, ethics is the core of practical philosophy. To the extent that the influence of ethics on changes in human culture and sociality in the counterfactual regime is increasing, one should also speak of the anthropological significance of ethics. Conclusions. From the socio-philosophical point of view, it is necessary to specify exactly which social institutions and in which constellation generate violence. The ethical aspect of
Hiemstra, Djoerd; de Jong, Franciska M.G.
Traditionally, natural language processing techniques for information retrieval have always been studied outside the framework of formal models of information retrieval. In this article, we introduce a new formal model of information retrieval based on the application of statistical language models.
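As an illustration of what ranking with statistical language models can look like, the following is a minimal sketch of query-likelihood scoring with linear interpolation smoothing. The abstract does not specify the model's details, so the smoothing scheme, the weight `lam`, the toy documents, and all function names are assumptions made for this example only.

```python
import math
from collections import Counter

# Toy collection (hypothetical data, for illustration only).
docs = {
    "d1": "statistical language models for information retrieval".split(),
    "d2": "natural language interfaces to databases".split(),
}
collection_counts = Counter(t for d in docs.values() for t in d)
collection_len = sum(collection_counts.values())


def score(query_terms, doc_terms, lam=0.5):
    """Log-probability of the query under a smoothed unigram model of the document.

    Assumption: P(t | d) is a linear mix of the document model and the
    collection model, weighted by lam.
    """
    doc_counts = Counter(doc_terms)
    doc_len = len(doc_terms)
    total = 0.0
    for term in query_terms:
        p_doc = doc_counts[term] / doc_len if doc_len else 0.0
        p_col = collection_counts[term] / collection_len
        p = lam * p_doc + (1 - lam) * p_col
        if p == 0.0:  # term unseen in the whole collection
            return float("-inf")
        total += math.log(p)
    return total


def rank(query):
    """Return document ids ordered from most to least likely to generate the query."""
    q = query.lower().split()
    return sorted(docs, key=lambda d: score(q, docs[d]), reverse=True)
```

Calling `rank("statistical retrieval")` places `d1` first, because both query terms occur in that document while `d2` can only explain them through the collection model.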
Heinrich, Stefan; Wermter, Stefan
For the complex human brain that enables us to communicate in natural language, we have gathered a good understanding of principles underlying language acquisition and processing, knowledge about sociocultural conditions, and insights into activity patterns in the brain. However, we are not yet able to understand the behavioural and mechanistic characteristics of natural language and how mechanisms in the brain allow us to acquire and process language. In bridging the insights from behavioural psychology and neuroscience, the goal of this paper is to contribute a computational understanding of appropriate characteristics that favour language acquisition. Accordingly, we provide concepts and refinements in cognitive modelling regarding principles and mechanisms in the brain and propose a neurocognitively plausible model for embodied language acquisition from real-world interaction of a humanoid robot with its environment. In particular, the architecture consists of a continuous time recurrent neural network, where parts have different leakage characteristics and thus operate on multiple timescales for every modality and the association of the higher level nodes of all modalities into cell assemblies. The model is capable of learning language production grounded in both temporal dynamic somatosensation and vision, and features hierarchical concept abstraction, concept decomposition, multi-modal integration, and self-organisation of latent representations.
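The idea that units with different leakage operate on different timescales can be sketched with a discretised leaky-integrator update. This is a generic textbook-style formulation, not the authors' architecture; the update rule, the `tanh` activation, and the names `ctrnn_step` and `tau` are assumptions for illustration.

```python
import math


def ctrnn_step(u, inputs, weights, tau):
    """One discrete step of a leaky-integrator recurrent network.

    u       : current internal states, one per unit
    inputs  : external input to each unit
    weights : weights[i][j] is the connection from unit j to unit i
    tau     : time constant per unit; a larger tau means slower change
    """
    y = [math.tanh(v) for v in u]  # unit activations
    new_u = []
    for i in range(len(u)):
        drive = sum(weights[i][j] * y[j] for j in range(len(u))) + inputs[i]
        # Leaky integration: keep (1 - 1/tau) of the old state, add 1/tau of the drive.
        new_u.append((1 - 1 / tau[i]) * u[i] + (1 / tau[i]) * drive)
    return new_u


# Two unconnected units receiving the same constant input:
# a fast one (tau = 2) and a slow one (tau = 50).
state = [0.0, 0.0]
no_recurrence = [[0.0, 0.0], [0.0, 0.0]]
for _ in range(10):
    state = ctrnn_step(state, [1.0, 1.0], no_recurrence, tau=[2.0, 50.0])
```

After ten steps the fast unit has almost reached the input value while the slow unit has moved only a fraction of the way, which is the sense in which one network can hold both quickly and slowly varying representations.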
Widemann, David P. [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States); Wang, Eric X. [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States); Thiagarajan, Jayaraman J. [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)
We present a novel Recoverable Order-Preserving Embedding (ROPE) of natural language. ROPE maps natural language passages from sparse concatenated one-hot representations to distributed vector representations of predetermined fixed length. We use Euclidean distance to return search results that are both grammatically and semantically similar. ROPE is based on a series of random projections of distributed word embeddings. We show that our technique typically forms a dictionary with sufficient incoherence such that sparse recovery of the original text is possible. We then show how our embedding allows for efficient and meaningful natural search and retrieval on Microsoft’s COCO dataset and the IMDB Movie Review dataset.
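A rough sketch of the general idea (a fixed-length, order-sensitive passage vector obtained by randomly projecting concatenated word vectors, searched by Euclidean distance) is given below. It is not the ROPE algorithm itself, whose details the abstract does not give; the dimensions, the random toy word vectors, and the names `word_vec`, `embed`, and `nearest` are all assumptions.

```python
import math
import random

DIM, MAX_LEN, OUT = 8, 6, 16  # word-vector size, max words per passage, output size
rng = random.Random(0)        # fixed seed so the sketch is repeatable

# One random Gaussian projection from the concatenated space to OUT dimensions.
PROJ = [[rng.gauss(0, 1) / math.sqrt(OUT) for _ in range(DIM * MAX_LEN)]
        for _ in range(OUT)]

_word_vecs = {}


def word_vec(word):
    """Stand-in for a pretrained word embedding: a random vector per word."""
    if word not in _word_vecs:
        _word_vecs[word] = [rng.gauss(0, 1) for _ in range(DIM)]
    return _word_vecs[word]


def embed(passage):
    """Concatenate word vectors in order, zero-pad, then project to OUT dimensions."""
    words = passage.lower().split()[:MAX_LEN]
    flat = [x for w in words for x in word_vec(w)]
    flat += [0.0] * (DIM * MAX_LEN - len(flat))
    return [sum(r * x for r, x in zip(row, flat)) for row in PROJ]


def nearest(query, passages):
    """Return the passage whose embedding is closest to the query in Euclidean distance."""
    q = embed(query)
    return min(passages, key=lambda p: math.dist(q, embed(p)))
```

Because the word vectors are concatenated before projection, "a dog bites a man" and "a man bites a dog" receive different embeddings, which is the order-preserving property this sketch aims to show. Recovering the original text from the embedding, as the abstract describes, is not attempted here.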
How human language arose is a mystery in the evolution of Homo sapiens. Miyagawa, Berwick, & Okanoya (Frontiers 2013) put forward a proposal, which we will call the Integration Hypothesis of human language evolution, which holds that human language is composed of two components, E for expressive, and L for lexical. Each component has an antecedent in nature: E as found, for example, in birdsong, and L in, for example, the alarm calls of monkeys. E and L integrated uniquely in humans to give rise to language. A challenge to the Integration Hypothesis is that while these non-human systems are finite-state in nature, human language is known to require characterization by a non-finite state grammar. Our claim is that E and L, taken separately, are finite-state; when a grammatical process crosses the boundary between E and L, it gives rise to the non-finite state character of human language. We provide empirical evidence for the Integration Hypothesis by showing that certain processes found in contemporary languages that have been characterized as non-finite state in nature can in fact be shown to be finite-state. We also speculate on how human language actually arose in evolution through the lens of the Integration Hypothesis.
Névéol, Aurélie; Dalianis, Hercules; Velupillai, Sumithra; Savova, Guergana; Zweigenbaum, Pierre
Natural language processing applied to clinical text or aimed at a clinical outcome has been thriving in recent years. This paper offers the first broad overview of clinical Natural Language Processing (NLP) for languages other than English. Recent studies are summarized to offer insights and outline opportunities in this area. We envision three groups of intended readers: (1) NLP researchers leveraging experience gained in other languages, (2) NLP researchers faced with establishing clinical text processing in a language other than English, and (3) clinical informatics researchers and practitioners looking for resources in their languages in order to apply NLP techniques and tools to clinical practice and/or investigation. We review work in clinical NLP in languages other than English. We classify these studies into three groups: (i) studies describing the development of new NLP systems or components de novo, (ii) studies describing the adaptation of NLP architectures developed for English to another language, and (iii) studies focusing on a particular clinical application. We show the advantages and drawbacks of each method, and highlight the appropriate application context. Finally, we identify major challenges and opportunities that will affect the impact of NLP on clinical practice and public health studies in a context that encompasses English as well as other languages.
Antonio Gisolfi; Enrico Fischetti
The aim of this paper is to show that with a subset of a natural language, simple systems running on PCs can be developed that can nevertheless be an effective tool for interfacing purposes in the building of an Intelligent Tutoring System (ITS). After presenting the special characteristics of the Smalltalk/V language, which provides an appropriate environment for the development of an interface, the overall architecture of the interface module is discussed. We then show how sentences are par...
Researchers, motivated by the need to improve the efficiency of natural language processing tools to handle web-scale data, have recently arrived at models that remarkably match the expected features of human language processing under the Now-or-Never bottleneck framework. This provides additional support for said framework and highlights the research potential in the interaction between applied computational linguistics and cognitive science.
This paper introduces natural language expressions and experts' subjectivity to system reliability analysis. To this end, it defines a subjective measure of reliability and presents a method of system reliability analysis using that measure. The subjective measure of reliability corresponds to natural language expressions of reliability estimation and is represented by a fuzzy set defined on [0,1]. The presented method deals with the dependence among subsystems and employs parametrized operations on subjective measures of reliability, which can reflect an expert's subjectivity towards the analyzed system. The analysis results are also expressed in linguistic terms. Finally, this paper gives an example of system reliability analysis by the presented method.
Learning to rank refers to machine learning techniques for training a model in a ranking task. Learning to rank is useful for many applications in information retrieval, natural language processing, and data mining. Intensive studies have been conducted on its problems recently, and significant progress has been made. This lecture gives an introduction to the area including the fundamental problems, major approaches, theories, applications, and future work. The author begins by showing that various ranking problems in information retrieval and natural language processing can be formalized as tw
Second language acquisition is affected by individual learner factors such as age, learning style, aptitude, motivation, and personality. This research concerns the English language acquisition of a fourth-year child through nature and nurture. The child acquired her second language at home and in a course in Jakarta. Her parents enrolled her so that she would be able to speak English well as a target language in the future. The purpose of this paper is to examine individual learner differences, especially in using English as a second language. This study is library research; the data were collected, recorded, transcribed, and analyzed descriptively. The results show that the child is able to communicate well and to construct simple sentences, complex sentences, statements, and phrase questions, and to explain something when her teacher asks her at school. She communicates in well-formed simple or compound sentences (two or three clauses), although she does not yet consistently use the past tense and sometimes omits the bound morpheme -s in the third person singular; she can, however, use turn-taking in her utterances. Second language acquisition is a very long process for the child. The family and teacher should participate and assist the child; the study indicates that a child can learn the first and the second language at the same time.
Anne E. Thessen
A computer can handle the volume but cannot make sense of the language. This paper reviews and discusses the use of natural language processing (NLP) and machine-learning algorithms to extract information from systematic literature. NLP algorithms have been used for decades, but require special development for application in the biological realm due to the special nature of the language. Many tools exist for biological information extraction (cellular processes, taxonomic names, and morphological characters), but none have been applied life-wide and most still require testing and development. Progress has been made in developing algorithms for automated annotation of taxonomic text, identification of taxonomic names in text, and extraction of morphological character information from taxonomic descriptions. This manuscript will briefly discuss the key steps in applying information extraction tools to enhance biodiversity science.
de Boer, Bart; Gontier, N; VanBendegem, JP; Aerts, D
This paper describes the uses of computer models in studying the evolution of language. Language is a complex dynamic system that can be studied at the level of the individual and at the level of the population. Much of the dynamics of language evolution and language change occur because of the
Michael, Joel; Rovick, Allen; Glass, Michael; Zhou, Yujian; Evens, Martha
CIRCSIM-Tutor is a computer tutor designed to carry out a natural language dialogue with a medical student. Its domain is the baroreceptor reflex, the part of the cardiovascular system that is responsible for maintaining a constant blood pressure. CIRCSIM-Tutor's interaction with students is modeled after the tutoring behavior of two experienced…
Doszkocs, Tamas E.
The National Library of Medicine's Current Information Transfer in English public access online catalog offers unique subject search capabilities--natural-language query input, automatic medical subject headings display, closest match search strategy, ranked document output, dynamic end user feedback for search refinement. References, description…
Szymczak, Bartlomiej Antoni
tried to establish a domain independent “ontological semantics” for relevant fragments of natural language. The purpose of this research is to develop methods and systems for taking advantage of formal ontologies for the purpose of extracting the meaning contents of texts. This functionality...
Dadlez, Eva M.
Describes a natural language searching strategy for retrieving current material which has bearing on George Orwell's "1984," and identifies four main themes (technology, authoritarianism, press and psychological/linguistic implications of surveillance, political oppression) which have emerged from cross-database searches of the "Big…
It is argued that pessimistic assessments of the adequacy of artificial neural networks (ANNs) for natural language processing (NLP) on the grounds that they have a finite state architecture are unjustified, and that their adequacy in this regard is an empirical issue. First, arguments that counter standard objections to finite state NLP on the…
van der Sluis, Ielka; Hielkema, F.; Mellish, C.; Doherty, G.
In this paper we look at what may be learned from a comparative study examining non-technical users with a background in social science browsing and querying metadata. Four query tasks were carried out with a natural language interface and with an interface that uses a web paradigm with hyperlinks.
Pon-Barry, Heather Roberta
The field of spoken language processing is concerned with creating computer programs that can understand human speech and produce human-like speech. Regarding the problem of understanding human speech, there is currently growing interest in moving beyond speech recognition (the task of transcribing the words in an audio stream) and towards machine listening: interpreting the full spectrum of information in an audio stream. One part of machine listening, the problem that this thesis focuses on, ...
Kilic, Kerem; Sungur, Semra; Cakiroglu, Jale; Tekkaya, Ceren
The purpose of this study was to investigate the 9th-grade students' understandings of the nature of scientific knowledge. The study also aimed to investigate the differences in students' understanding of the nature of scientific knowledge by gender, and school types. A total of 575 ninth grade students from four different school types (General…
Köseoglu, Pinar; Köksal, Mustafa Serdar
The purpose of this study was to investigate epistemological predictors of nature of science understandings of 281 prospective biology teachers surveyed using the Epistemological Beliefs Scale Regarding Science and the Nature of Science Scale. The findings on multiple linear regression showed that understandings about definition of science and…
Taumoepeau, Mele; Ruffman, Ted
This study assessed the relation between mother mental state language and child desire language and emotion understanding in 15- to 24-month-olds. At both time points, mothers described pictures to their infants and mother talk was coded for mental and nonmental state language. Children were administered 2 emotion understanding tasks and their mental…
Scott, Jessica; Hinton, Christina
The rise of globalisation makes language competencies more valuable, both at individual and societal levels. This book examines the links between globalisation and the way we teach and learn languages. It begins by asking why some individuals are more successful than others at learning non-native languages, and why some education systems, or countries, are more successful than others at teaching languages. The book comprises chapters by different authors on the subject of language learning. There are chapters on the role of motivation; the way that languages, cultures and identities are interc
Fang, Yuxing; Chen, Quanjing; Lingnau, Angelika; Han, Zaizhu; Bi, Yanchao
The observation of other people's actions recruits a network of areas including the inferior frontal gyrus (IFG), the inferior parietal lobule (IPL), and posterior middle temporal gyrus (pMTG). These regions have been shown to be activated through both visual and auditory inputs. Intriguingly, previous studies found no engagement of IFG and IPL for deaf participants during non-linguistic action observation, leading to the proposal that auditory experience or sign language usage might shape the functionality of these areas. To understand which variables induce plastic changes in areas recruited during the processing of other people's actions, we examined the effects of tasks (action understanding and passive viewing) and effectors (arm actions vs. leg actions), as well as sign language experience in a group of 12 congenitally deaf signers and 13 hearing participants. In Experiment 1, we found a stronger activation during an action recognition task in comparison to a low-level visual control task in IFG, IPL and pMTG in both deaf signers and hearing individuals, but no effect of auditory or sign language experience. In Experiment 2, we replicated the results of the first experiment using a passive viewing task. Together, our results provide robust evidence demonstrating that the response obtained in IFG, IPL, and pMTG during action recognition and passive viewing is not affected by auditory or sign language experience, adding further support for the supra-modal nature of these regions.
Nikora, Allen P.
This viewgraph presentation reviews the rationale of the program to transform natural language specifications into formal notation, specifically, to automate generation of Linear Temporal Logic (LTL) correctness properties from natural language temporal specifications. There are several reasons for this approach: (1) model-based techniques are becoming more widely accepted; (2) analytical verification techniques (e.g., model checking, theorem proving) are significantly more effective at detecting certain types of specification design errors (e.g., race conditions, deadlock) than manual inspection; (3) many requirements are still written in natural language, since specification languages and their associated tools have a high learning curve and increased schedule and budget pressure on projects reduces training opportunities for engineers; and (4) formulation of correctness properties for system models can be a difficult problem. This has relevance to NASA in that it would simplify development of formal correctness properties, lead to more widespread use of model-based specification and design techniques, assist in earlier identification of defects, and reduce residual defect content for space mission software systems. The presentation also discusses potential applications, accomplishments and/or technological transfer potential, and the next steps.
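To make the natural-language-to-LTL step concrete, here is a small pattern-based sketch. It is not the tool described in the presentation: the four sentence templates, the way phrases are turned into proposition names, and the function names `atom` and `to_ltl` are assumptions chosen for illustration. The operators follow common LTL notation (G for "globally", F for "eventually", U for "until").

```python
import re


def atom(phrase):
    """Turn a phrase into a proposition name, e.g. 'the pump starts' -> 'the_pump_starts'."""
    return re.sub(r"\W+", "_", phrase.strip().lower()).strip("_")


# Each entry pairs a sentence template with a builder for the LTL formula.
# Templates are tried in order; the first match wins.
PATTERNS = [
    (re.compile(r"^whenever (.+?), eventually (.+)$", re.I),
     lambda m: f"G ({atom(m[1])} -> F {atom(m[2])})"),
    (re.compile(r"^(.+?) until (.+)$", re.I),
     lambda m: f"({atom(m[1])} U {atom(m[2])})"),
    (re.compile(r"^always (.+)$", re.I),
     lambda m: f"G {atom(m[1])}"),
    (re.compile(r"^never (.+)$", re.I),
     lambda m: f"G !{atom(m[1])}"),
]


def to_ltl(sentence):
    """Return an LTL formula for a sentence that fits a known template, else None."""
    s = sentence.strip().rstrip(".")
    for pattern, build in PATTERNS:
        m = pattern.match(s)
        if m:
            return build(m)
    return None
```

For example, `to_ltl("Whenever the valve opens, eventually the pump starts.")` yields `G (the_valve_opens -> F the_pump_starts)`. Sentences outside the templates return `None`, which in practice is where a real system would need much richer linguistic analysis.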
Komata, Masaoki; Oosawa, Yasuo; Ujita, Hiroshi
A natural language retrieval program NATLANG is developed to assist in the retrieval of information from event-and-cause descriptions in Licensee Event Reports (LER). The characteristics of NATLANG are (1) the use of base forms of words to retrieve related forms altered by the addition of prefixes or suffixes or changes in inflection, (2) direct access and short time retrieval with an alphabet pointer, (3) effective determination of the items and entries for a Hitachi event classification in a two step retrieval scheme, and (4) Japanese character output with the PL-1 language. NATLANG output reduces the effort needed to re-classify licensee events in the Hitachi event classification. (author)
An intelligent interface that enables efficient interaction between users and databases is a need of database applications. Databases must be intelligent enough to make access faster. However, not every user is familiar with Structured Query Language (SQL) queries; users may not be aware of the structure of the database and would therefore have to learn SQL. Non-expert users thus need a system for interacting with relational databases in their natural language, such as English. For this, a Database Management System (DBMS) must be able to understand Natural Language (NL). In this research, an intelligent interface is developed using a semantic matching technique that translates a natural language query to SQL using a set of production rules and a data dictionary. The data dictionary consists of semantic sets for relations and attributes. A series of steps (lower-case conversion, tokenization, part-of-speech tagging, and database element and SQL element extraction) is used to convert a Natural Language Query (NLQ) to an SQL query. The transformed query is executed and the results are returned to the user. An intelligent interface is thus needed by database applications to enhance interaction between user and DBMS.
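The pipeline described above can be sketched in a few lines. This is a toy reconstruction under stated assumptions, not the authors' system: the data dictionary contents, the single production rule for comparisons, and the names `lookup` and `to_sql` are hypothetical, and the part-of-speech tagging step is omitted.

```python
import re

# Hypothetical data dictionary: semantic sets (synonyms) for relations and attributes.
DATA_DICTIONARY = {
    "relations": {"employee": {"employee", "employees", "staff", "workers"}},
    "attributes": {
        "name": {"name", "names"},
        "salary": {"salary", "salaries", "pay"},
        "city": {"city", "town"},
    },
}
COMPARATORS = {"above": ">", "over": ">", "below": "<", "under": "<"}


def lookup(token, kind):
    """Map a token to the canonical relation or attribute it belongs to, if any."""
    for canonical, synonyms in DATA_DICTIONARY[kind].items():
        if token in synonyms:
            return canonical
    return None


def to_sql(question):
    """Translate a simple English question into a SELECT statement, or None."""
    # Steps 1 and 2: lower-case conversion and tokenization.
    tokens = re.findall(r"[a-z]+|\d+", question.lower())
    # Step 3: database element extraction (which relation is being asked about).
    table = next((lookup(t, "relations") for t in tokens if lookup(t, "relations")), None)
    if table is None:
        return None
    columns, condition = [], None
    for i, tok in enumerate(tokens):
        attr = lookup(tok, "attributes")
        if attr is None:
            continue
        # Production rule: <attribute> <comparator> <number> becomes a WHERE clause.
        if i + 2 < len(tokens) and tokens[i + 1] in COMPARATORS and tokens[i + 2].isdigit():
            condition = f"{attr} {COMPARATORS[tokens[i + 1]]} {tokens[i + 2]}"
        elif attr not in columns:
            columns.append(attr)
    # Step 4: SQL element assembly.
    sql = f"SELECT {', '.join(columns) or '*'} FROM {table}"
    if condition:
        sql += f" WHERE {condition}"
    return sql
```

With this dictionary, `to_sql("Show the names of employees with salary above 50000")` produces `SELECT name FROM employee WHERE salary > 50000`. Table and column names come only from the dictionary and values are restricted to digits, so nothing from the user's wording is copied into the query verbatim.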
This paper shows how fieldwork data can be managed using the program Toolbox together with the Natural Language Toolkit (NLTK) for the Python programming language. It provides background information about Toolbox and describes how it can be downloaded and installed. The basic functionality of the program for lexicons and texts is described, and its strengths and weaknesses are reviewed. Its underlying data format is briefly discussed, and Toolbox processing capabilities of NLTK are introduced, showing ways in which it can be used to extend the functionality of Toolbox. This is illustrated with a few simple scripts that demonstrate basic data management tasks relevant to language documentation, such as printing out the contents of a lexicon as HTML.
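A script of the kind mentioned, printing a lexicon as HTML, might look roughly like the sketch below. It deliberately uses only the Python standard library rather than NLTK's own Toolbox support, so it should be read as an approximation of the task, not as the paper's code. It assumes the backslash-marker text format with `\lx` (lexeme) starting each record and `\ps` and `\ge` holding part of speech and English gloss; the sample entries are invented.

```python
import html

# Invented sample lexicon in backslash-marker format (hypothetical entries).
SAMPLE = r"""
\lx tala
\ps N
\ge star

\lx miru
\ps V
\ge see & watch
"""


def parse_records(text, record_marker="lx"):
    """Split marker-formatted text into records, each a list of (marker, value) pairs."""
    records = []
    for line in text.splitlines():
        if not line.startswith("\\"):
            continue  # this sketch ignores blank and continuation lines
        marker, _, value = line[1:].partition(" ")
        if marker == record_marker:
            records.append([])  # a new record begins at each lexeme line
        if records:
            records[-1].append((marker, value.strip()))
    return records


def to_html(records):
    """Render lexeme, part of speech, and gloss as rows of an HTML table."""
    rows = []
    for rec in records:
        fields = dict(rec)
        cells = (fields.get(m, "") for m in ("lx", "ps", "ge"))
        rows.append("<tr>" + "".join(f"<td>{html.escape(c)}</td>" for c in cells) + "</tr>")
    return "<table>\n" + "\n".join(rows) + "\n</table>"


lexicon = parse_records(SAMPLE)
page = to_html(lexicon)
```

Values are passed through `html.escape`, so a gloss containing `&` or `<` does not break the generated page.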
This dissertation focuses on developing and evaluating hybrid approaches for analyzing free-form text in the medical domain. This research draws on natural language processing (NLP) techniques that are used to parse and extract concepts based on a controlled vocabulary. Once important concepts are extracted, additional machine learning algorithms,…
Nastassja A. Lewinski
Literature in the field of nanotechnology is exponentially increasing with more and more engineered nanomaterials being created, characterized, and tested for performance and safety. With the deluge of published data, there is a need for natural language processing approaches to semi-automate the cataloguing of engineered nanomaterials and their associated physico-chemical properties, performance, exposure scenarios, and biological effects. In this paper, we review the different informatics methods that have been applied to patent mining, nanomaterial/device characterization, nanomedicine, and environmental risk assessment. Nine natural language processing (NLP)-based tools were identified: NanoPort, NanoMapper, TechPerceptor, a Text Mining Framework, a Nanodevice Analyzer, a Clinical Trial Document Classifier, Nanotoxicity Searcher, NanoSifter, and NEIMiner. We conclude with recommendations for sharing NLP-related tools through online repositories to broaden participation in nanoinformatics.
It is estimated that each year many people, most of whom are teenagers and young adults, die by suicide worldwide. Suicide receives special attention, with many countries developing national strategies for prevention. Since more medical information is available in text, preventing the growing trend of suicide in communities requires analyzing various textual resources, such as patient records, information on the web, or questionnaires. For this purpose, this study systematically reviews recent studies on the use of natural language processing techniques in relation to the health of people who have died by suicide or are at risk. After an electronic search of the PubMed and ScienceDirect databases and screening of the articles by two reviewers, 21 articles matched the inclusion criteria. This study revealed that, if a suitable data set is available, natural language processing techniques are well suited for various types of suicide-related research.
Exploiting Lexical Regularities in Designing Natural Language Systems. Massachusetts Institute of Technology, Artificial Intelligence Laboratory, 545 Technology Square, Cambridge, MA 02139...This paper presents the lexical component of the START Question Answering system developed at the MIT Artificial
studies: the Time-Triggered Ethernet (TTEthernet) communication platform used in space, and FAA-Isolette infant incubators used in NICU. We...in space, and FAA-Isolette infant incubators used in Neonatal Intensive Care Units (NICUs). We systematically evaluated various aspects of ARSENAL...effect, we present the ARSENAL methodology. ARSENAL uses state-of-the-art advances in natural language processing (NLP) and formal methods (FM) to
Friedman, Carol; Hripcsak, George; Shagina, Lyuda; Liu, Hongfang
Objective: To design a document model that provides reliable and efficient access to clinical information in patient reports for a broad range of clinical applications, and to implement an automated method using natural language processing that maps textual reports to a form consistent with the model. Methods: A document model that encodes structured clinical information in patient reports while retaining the original contents was designed using the extensible markup language (XML), and a document type definition (DTD) was created. An existing natural language processor (NLP) was modified to generate output consistent with the model. Two hundred reports were processed using the modified NLP system, and the XML output that was generated was validated using an XML validating parser. Results: The modified NLP system successfully processed all 200 reports. The output of one report was invalid, and 199 reports were valid XML forms consistent with the DTD. Conclusions: Natural language processing can be used to automatically create an enriched document that contains a structured component whose elements are linked to portions of the original textual report. This integrated document model provides a representation where documents containing specific information can be accurately and efficiently retrieved by querying the structured components. If manual review of the documents is desired, the salient information in the original reports can also be identified and highlighted. Using an XML model of tagging provides an additional benefit in that software tools that manipulate XML documents are readily available. PMID:9925230
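The notion of an enriched document, where structured elements point back into the original report text, can be illustrated with a short sketch. The element and attribute names (`report`, `text`, `structured`, `finding`, `concept`, `certainty`, `start`, `end`) and the sample findings are hypothetical and do not reproduce the DTD described in the paper. The standard-library parser used here checks well-formedness only, not validity against a DTD.

```python
import xml.etree.ElementTree as ET


def build_document(report_text, findings):
    """Wrap a report and its extracted findings in one XML document.

    Each finding records the character span of the phrase it came from,
    so the structured part stays linked to the original text.
    """
    root = ET.Element("report")
    ET.SubElement(root, "text").text = report_text
    structured = ET.SubElement(root, "structured")
    for f in findings:
        start = report_text.index(f["phrase"])  # raises ValueError if the phrase is absent
        ET.SubElement(structured, "finding", {
            "concept": f["concept"],
            "certainty": f["certainty"],
            "start": str(start),
            "end": str(start + len(f["phrase"])),
        })
    return ET.tostring(root, encoding="unicode")


# Hypothetical output of an NLP system for one short report.
report = "Chest film shows no evidence of pneumonia. Mild cardiomegaly is present."
findings = [
    {"phrase": "pneumonia", "concept": "pneumonia", "certainty": "negated"},
    {"phrase": "Mild cardiomegaly", "concept": "cardiomegaly", "certainty": "asserted"},
]
xml_doc = build_document(report, findings)

# Re-parse to confirm well-formedness and recover the highlighted spans.
parsed = ET.fromstring(xml_doc)
spans = [parsed.find("text").text[int(f.get("start")):int(f.get("end"))]
         for f in parsed.iter("finding")]
```

Querying then works on the structured part (for example, all findings with `certainty="asserted"`), while the stored offsets allow the matching phrases to be highlighted in the original report for manual review.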
Hunter, James; Freer, Yvonne; Gatt, Albert; Reiter, Ehud; Sripada, Somayajulu; Sykes, Cindy; Westwater, Dave
The BT-Nurse system uses data-to-text technology to automatically generate a natural language nursing shift summary in a neonatal intensive care unit (NICU). The summary is based solely on data held in an electronic patient record system; no additional data entry is required. BT-Nurse was tested for two months in the Royal Infirmary of Edinburgh NICU. Nurses were asked to rate the understandability, accuracy, and helpfulness of the computer-generated summaries; they were also asked for free-text comments about the summaries. The nurses found the majority of the summaries to be understandable, accurate, and helpful (p [...] generated summaries. In conclusion, natural language NICU shift summaries can be automatically generated from an electronic patient record, but our proof-of-concept software needs considerable additional development work before it can be deployed.
Kiran, Swathi; Iakupova, Regina
The goal of this study was to address the relationship between language proficiency, language impairment and rehabilitation in bilingual Russian-English individuals with aphasia. As a first step, we examined two Russian-English patients' pre-stroke language proficiency using a detailed and comprehensive language use and history questionnaire and…
Mcquaid, Nancy; Bigelow, Ann E.; McLaughlin, Jessica; MacLean, Kim
Mothers' mental state language in conversation with their preschool children, and children's preschool attachment security were examined for their effects on children's mental state language and expressions of emotional understanding in their conversation. Children discussed an emotionally salient event with their mothers and then relayed the…
Stickland, Michael G.; Conrad, Gregory N.; Eaton, Shelley M.
Natural language processing-based knowledge management software, traditionally developed for security organizations, is now becoming commercially available. An informal survey was conducted to discover and examine current NLP and related technologies and potential applications for information retrieval, information extraction, summarization, categorization, terminology management, link analysis, and visualization for possible implementation at Sandia National Laboratories. This report documents our current understanding of the technologies, lists software vendors and their products, and identifies potential applications of these technologies.
Murphy, Kimberly A.; Justice, Laura M.; O'Connell, Ann A.; Pentimonti, Jill M.; Kaderavek, Joan N.
Purpose: The purpose of this study was to retrospectively examine the preschool language and early literacy skills of kindergarten good and poor readers, and to determine the extent to which these skills predict reading status. Method: Participants were 136 children with language impairment enrolled in early childhood special education classrooms.…
Cai, Tianrun; Giannopoulos, Andreas A.; Yu, Sheng; Kelil, Tatiana; Ripley, Beth; Kumamaru, Kanako K.; Rybicki, Frank J.; Mitsouras, Dimitrios
The migration of imaging reports to electronic medical record systems holds great potential in terms of advancing radiology research and practice by leveraging the large volume of data continuously being updated, integrated, and shared. However, there are significant challenges as well, largely due to the heterogeneity of how these data are formatted. Indeed, although there is movement toward structured reporting in radiology (ie, hierarchically itemized reporting with use of standardized terminology), the majority of radiology reports remain unstructured and use free-form language. To effectively “mine” these large datasets for hypothesis testing, a robust strategy for extracting the necessary information is needed. Manual extraction of information is a time-consuming and often unmanageable task. “Intelligent” search engines that instead rely on natural language processing (NLP), a computer-based approach to analyzing free-form text or speech, can be used to automate this data mining task. The overall goal of NLP is to translate natural human language into a structured format (ie, a fixed collection of elements), each with a standardized set of choices for its value, that is easily manipulated by computer programs to (among other things) order into subcategories or query for the presence or absence of a finding. The authors review the fundamentals of NLP and describe various techniques that constitute NLP in radiology, along with some key applications. ©RSNA, 2016 PMID:26761536
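The stated goal, mapping free text onto a fixed collection of elements that each take one of a standard set of values, can be illustrated with a deliberately crude keyword-and-negation heuristic. This is a sketch only, not one of the techniques the review describes; the schema `FINDINGS`, the negation cues, and the three value labels are all assumptions made for the example.

```python
import re

# Hypothetical fixed schema: element name -> surface terms that signal it.
FINDINGS = {
    "pneumothorax": ["pneumothorax"],
    "pleural_effusion": ["pleural effusion", "effusion"],
    "nodule": ["nodule"],
}
# Hypothetical negation cues, matched as whole words before the term.
NEGATION = re.compile(r"\b(no|without|negative for)\b")

def extract(report):
    """Map a free-text report to {element: 'present'|'absent'|'not_mentioned'}."""
    result = {name: "not_mentioned" for name in FINDINGS}
    # Naive sentence split: a negation cue only applies within its own sentence.
    for sentence in re.split(r"(?<=[.!?])\s+", report.lower()):
        for name, terms in FINDINGS.items():
            for term in terms:
                match = re.search(r"\b" + re.escape(term), sentence)
                if match is None:
                    continue
                negated = NEGATION.search(sentence[:match.start()]) is not None
                if not negated:
                    result[name] = "present"      # a positive mention wins
                elif result[name] != "present":
                    result[name] = "absent"
    return result

def has_finding(report, name):
    """Query a report for the presence of one element of the schema."""
    return extract(report)[name] == "present"

print(extract("No pneumothorax. Small left pleural effusion is seen."))
```

Because every report is reduced to the same keys and the same three values, reports can be grouped or queried for presence or absence of a finding without re-reading the text.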
Maurice H. P. M. van Putten
We consider the rate R and variance $\sigma^2$ of Shannon information in snippets of text based on word frequencies in the natural language. We empirically identify Kolmogorov’s scaling law in $\sigma^2 \propto k^{-1.66 \pm 0.12}$ (95% c.l.) as a function of $k = 1/N$ measured by word count N. This result highlights a potential association of information flow in snippets, analogous to energy cascade in turbulent eddies in fluids at high Reynolds numbers. We propose R and $\sigma^2$ as robust utility functions for objective ranking of concordances in efficient search for maximal information seamlessly across different languages and as a starting point for artificial attention.
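One way to estimate a mean rate and a variance of Shannon information over snippets of N words is sketched below. The abstract does not give the paper's exact definitions, so this rests on assumptions: word probabilities are estimated from the same text, snippets are non-overlapping, and the rate of a snippet is its average information per word in bits.

```python
import math
from collections import Counter

def snippet_information(words, n):
    """Mean and variance of per-word Shannon information over n-word snippets.

    Assumptions (not from the paper): probabilities are relative frequencies in
    `words`; snippets are consecutive and non-overlapping; a trailing partial
    snippet is dropped.
    """
    if not 0 < n <= len(words):
        raise ValueError("n must be between 1 and len(words)")
    total = len(words)
    counts = Counter(words)
    # Shannon information of each word type, in bits: -log2 p(w).
    info = {w: -math.log2(c / total) for w, c in counts.items()}
    snippets = [words[i:i + n] for i in range(0, total - n + 1, n)]
    rates = [sum(info[w] for w in s) / n for s in snippets]
    mean = sum(rates) / len(rates)
    variance = sum((r - mean) ** 2 for r in rates) / len(rates)
    return mean, variance

text = "the cat sat on the mat and the dog sat on the log".split()
print(snippet_information(text, 3))
```

Repeating the estimate for several values of N and plotting the variance against k = 1/N on log-log axes would be the natural next step for checking a power law of the kind reported.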
The aim of this paper is to show that with a subset of a natural language, simple systems running on PCs can be developed that can nevertheless be an effective tool for interfacing purposes in the building of an Intelligent Tutoring System (ITS). After presenting the special characteristics of the Smalltalk/V language, which provides an appropriate environment for the development of an interface, the overall architecture of the interface module is discussed. We then show how sentences are parsed by the interface, and how interaction takes place with the user. The knowledge-acquisition phase is subsequently described. Finally, some excerpts from a tutoring session concerned with elementary geometry are discussed, and some of the problems and limitations of the approach are illustrated.
Autism spectrum disorders (ASD) are pervasive neurodevelopmental disorders involving a number of deficits to linguistic cognition. The gap between genetics and the pathophysiology of ASD remains open, in particular regarding its distinctive linguistic profile. The goal of this paper is to attempt to bridge this gap, focusing on how the autistic brain processes language, particularly through the perspective of brain rhythms. Due to the phenomenon of pleiotropy, which may take some decades to overcome, we believe that studies of brain rhythms, which are not faced with problems of this scale, may constitute a more tractable route to interpreting language deficits in ASD and eventually other neurocognitive disorders. Building on recent attempts to link neural oscillations to certain computational primitives of language, we show that interpreting language deficits in ASD as oscillopathic traits is a potentially fruitful way to construct successful endophenotypes of this condition. Additionally, we will show that candidate genes for ASD are overrepresented among the genes that played a role in the evolution of language. These genes include (and are related to) genes involved in brain rhythmicity. We hope that the type of steps taken here will additionally lead to a better understanding of the comorbidity, heterogeneity, and variability of ASD, and may help achieve a better treatment of the affected populations.
Yi Fei Wang; Stephen Petrina
The goal of this article is to explore how learning analytics can be used to predict and advise the design of an intelligent language tutor, chatbot Lucy. With its focus on using student-produced data to understand the design of Lucy to assist English language learning, this research can be a valuable component for language-learning designers to improve second language acquisition. In this article, we present students’ learning journey and data trails, the chatting log architecture and result...
Burk, Robin K.
Computational natural language understanding and generation have been a goal of artificial intelligence since McCarthy, Minsky, Rochester and Shannon first proposed to spend the summer of 1956 studying this and related problems. Although statistical approaches dominate current natural language applications, two current research trends bring…
Seah, Lay Hoon; Clarke, David John; Hart, Christina Eugene
This case study of a science lesson, on the topic of thermal expansion, examines the language demands on students from an integrated science and language perspective. The data were generated during a sequence of 9 lessons on the topic of "States of Matter" in a Grade 7 classroom (12- to 13-year-old students). We identify the language demands…
Shah, Nishal Pradeepkumar
A recent advance in computer technology has permitted scientists to implement and test algorithms that had been known for quite some time (or not) but which were computationally expensive. Two such projects are IBM's Jeopardy, as a part of its DeepQA project, and Wolfram's WolframAlpha. Both these methods implement natural language processing (another goal of AI scientists) and try to answer questions as asked by the user. Though the goal of the two projects is similar, both of them have a ...
Bochkarev, Vladimir V.; Lerner, Eduard Yu; Shevlyakova, Anna V.
This paper is devoted to verifying the empirical Zipf and Heaps laws in natural languages using Google Books Ngram corpus data. The connection between the Zipf law and the Heaps law, which predicts a power-law dependence of the vocabulary size on the text size, is discussed. In fact, the Heaps exponent in this dependence varies as the text corpus grows. To explain this, the obtained results are compared with a probability model of text generation. Quasi-periodic variations with characteristic time periods of 60-100 years were also found.
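The Heaps law states that vocabulary size grows as a power of text size, V(n) ≈ K·n^β. A minimal sketch of estimating the exponent β from a token stream follows, using an ordinary least-squares fit on log-log values. The fitting method is an assumption chosen for brevity and is not necessarily the procedure used in the paper; the function names are hypothetical.

```python
import math

def vocabulary_growth(tokens):
    """Return [(n, V(n))]: number of distinct tokens among the first n tokens."""
    seen, curve = set(), []
    for n, token in enumerate(tokens, start=1):
        seen.add(token)
        curve.append((n, len(seen)))
    return curve

def heaps_exponent(curve):
    """Least-squares slope of log V(n) against log n, i.e. beta in V ~ K * n**beta."""
    if len(curve) < 2:
        raise ValueError("need at least two points to fit a slope")
    xs = [math.log(n) for n, _ in curve]
    ys = [math.log(v) for _, v in curve]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx

tokens = "to be or not to be that is the question".split()
print(heaps_exponent(vocabulary_growth(tokens)))
```

Fitting the slope separately on successive windows of the growth curve is one simple way to see the exponent drift with corpus size, which is the effect the abstract reports.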
We present a publicly available, state-of-the-art research and development platform for Machine Translation and Natural Language Processing that runs on the Amazon Elastic Compute Cloud. This provides a standardized research environment for all users, and enables perfect reproducibility and compatibility. Box also enables users to use their hardware budget to avoid the management and logistical overhead of maintaining a research lab, yet still participate in the global research community with the same state-of-the-art tools.
Gu, Zhen; Li, Siheng; Zhang, Feilong; Wang, Shutao
Nature often exhibits various interesting and unique adhesive surfaces. The attempt to understand natural adhesion phenomena can continuously guide the design of artificial adhesive surfaces by proposing simplified models of surface adhesion. Among those models, a peeling model can often effectively reflect the adhesive property between two surfaces during their attachment and detachment processes. In this context, this review summarizes recent advances in using the peeling model to understand unique adhesive properties on natural and artificial surfaces. It mainly includes four parts: a brief introduction to natural surface adhesion, the theoretical basis and progress of the peeling model, application of the peeling model, and finally, conclusions. It is believed that this review is helpful to various fields, such as surface engineering, biomedicine, microelectronics, and so on.
Pons, Francisco; Lawson, J.; Harris, P.; Rosnay, M. de
Over the last two decades, it has been established that children's emotion understanding changes as they develop. Recent studies have also begun to address individual differences in children's emotion understanding. The first goal of this study was to examine the development of these individual differences across a wide age range with a test assessing nine different components of emotion understanding. The second goal was to examine the relation between language ability and individual differences in emotion understanding. Eighty children ranging in age from 4 to 11 years were tested. Children displayed a clear improvement with age in both their emotion understanding and language ability. In each age group, there were clear individual differences in emotion understanding and language ability. Age and language ability together explained 72% of emotion understanding variance; 20% of this variance...
arrested, detained, incarcerated, jailed, locked up, taken into custody, and thrown into prison. However, not all the paraphrases are uniformly good...jailed, locked up, taken into custody, and thrown into prison, along with a set of incorrect/noisy paraphrases that have different syntactic types or...1 CharLogCR=-0.08004 ContainsX=0 Equivalence=0.427150 Exclusion=0.000101 GlueRule=0 GoogleNgramSim=0.04294 Identity =0 Independent=0.078898 Lex(e1
In this short inquiry I would like to defend the statement that exact science deals with the explanation of models, but not with the understanding (comprehending) of nature. By the word ‘nature’ I mean nature as physis (as a self-moving and self-developing living organism to which humans also belong), not nature as natura naturata (as a nonevolving creature created by someone or something). The Estonian philosopher of science Rein Vihalemm (2008) has shown with his conception of phi-science (φ-science) that exact science is itself an idealized model or theoretical object derived from Galilean mathematical physics.
Molina, Martin; Sanchez-Soriano, Javier; Corcho, Oscar
Providing descriptions of isolated sensors and sensor networks in natural language, understandable by the general public, is useful to help users find relevant sensors and analyze sensor data. In this paper, we discuss the feasibility of using geographic knowledge from public databases available on the Web (such as OpenStreetMap, Geonames, or DBpedia) to automatically construct such descriptions. We present a general method that uses such information to generate sensor descriptions in natural language. The results of the evaluation of our method in a hydrologic national sensor network showed that this approach is feasible and capable of generating adequate sensor descriptions with a lower development effort compared to other approaches. In the paper we also analyze certain problems that we found in public databases (e.g., heterogeneity, non-standard use of labels, or rigid search methods) and their impact in the generation of sensor descriptions.
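Generating a sensor description from geographic knowledge can be sketched as a small template-based generator. Everything concrete here is an assumption for illustration: the record fields (`kind`, `river`, `lat`, `lon`, `name`), the 1 km threshold, the sentence template, and the hard-coded place list, which stands in for a lookup in a public database such as OpenStreetMap or Geonames. It is not the method evaluated in the paper.

```python
import math

EARTH_RADIUS_KM = 6371.0

def distance_km(a, b):
    """Great-circle (haversine) distance between two records with lat/lon in degrees."""
    lat1, lon1, lat2, lon2 = map(
        math.radians, (a["lat"], a["lon"], b["lat"], b["lon"]))
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(h))

def describe_sensor(sensor, places):
    """Describe a sensor relative to the nearest named place (hypothetical template)."""
    nearest = min(places, key=lambda p: distance_km(sensor, p))
    d = distance_km(sensor, nearest)
    if d < 1.0:
        where = f"in {nearest['name']}"
    else:
        where = f"about {round(d)} km from {nearest['name']}"
    return f"{sensor['kind'].capitalize()} sensor on the {sensor['river']} river, {where}."

# Invented example data; a real system would query a geographic database here.
places = [
    {"name": "Riverside", "lat": 41.65, "lon": -0.88},
    {"name": "Hilltown", "lat": 40.42, "lon": -3.70},
]
sensor = {"kind": "water level", "river": "Alder", "lat": 41.65, "lon": -0.88}
print(describe_sensor(sensor, places))
```

The problems the abstract lists (heterogeneous data, non-standard labels, rigid search) would surface in the lookup step that this sketch replaces with a fixed list.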
Moon, Katie; Blackman, Deborah
Natural scientists are increasingly interested in social research because they recognize that conservation problems are commonly social problems. Interpreting social research, however, requires at least a basic understanding of the philosophical principles and theoretical assumptions of the discipline, which are embedded in the design of social research. Natural scientists who engage in social science but are unfamiliar with these principles and assumptions can misinterpret their results. We developed a guide to assist natural scientists in understanding the philosophical basis of social science to support the meaningful interpretation of social research outcomes. The 3 fundamental elements of research are ontology, what exists in the human world that researchers can acquire knowledge about; epistemology, how knowledge is created; and philosophical perspective, the philosophical orientation of the researcher that guides her or his action. Many elements of the guide also apply to the natural sciences. Natural scientists can use the guide to assist them in interpreting social science research to determine how the ontological position of the researcher can influence the nature of the research; how the epistemological position can be used to support the legitimacy of different types of knowledge; and how philosophical perspective can shape the researcher's choice of methods and affect interpretation, communication, and application of results. The use of this guide can also support and promote the effective integration of the natural and social sciences to generate more insightful and relevant conservation research outcomes. © 2014 Society for Conservation Biology.
Nafari, Maryam; Weaver, Chris
Richly interactive visualization tools are increasingly popular for data exploration and analysis in a wide variety of domains. Existing systems and techniques for recording provenance of interaction focus either on comprehensive automated recording of low-level interaction events or on idiosyncratic manual transcription of high-level analysis activities. In this paper, we present the architecture and translation design of a query-to-question (Q2Q) system that automatically records user interactions and presents them semantically using natural language (written English). Q2Q takes advantage of domain knowledge and uses natural language generation (NLG) techniques to translate and transcribe a progression of interactive visualization states into a visual log of styled text that complements and effectively extends the functionality of visualization tools. We present Q2Q as a means to support a cross-examination process in which questions rather than interactions are the focus of analytic reasoning and action. We describe the architecture and implementation of the Q2Q system, discuss key design factors and variations that affect question generation, and present several visualizations that incorporate Q2Q for analysis in a variety of knowledge domains.
Pons, Ewoud; Braun, Loes M M; Hunink, M G Myriam; Kors, Jan A
Radiological reporting has generated large quantities of digital content within the electronic health record, which is potentially a valuable source of information for improving clinical care and supporting research. Although radiology reports are stored for communication and documentation of diagnostic imaging, harnessing their potential requires efficient and automated information extraction: they exist mainly as free-text clinical narrative, from which it is a major challenge to obtain structured data. Natural language processing (NLP) provides techniques that aid the conversion of text into a structured representation, and thus enables computers to derive meaning from human (ie, natural language) input. Used on radiology reports, NLP techniques enable automatic identification and extraction of information. By exploring the various purposes for their use, this review examines how radiology benefits from NLP. A systematic literature search identified 67 relevant publications describing NLP methods that support practical applications in radiology. This review takes a close look at the individual studies in terms of tasks (ie, the extracted information), the NLP methodology and tools used, and their application purpose and performance results. Additionally, limitations, future challenges, and requirements for advancing NLP in radiology will be discussed. ©RSNA, 2016. Online supplemental material is available for this article.
This book explains how to create information extraction (IE) applications that are able to tap the vast amount of relevant information available in natural language sources: Internet pages, official documents such as laws and regulations, books and newspapers, and the social web. Readers are introduced to the problem of IE and its current challenges and limitations, supported with examples. The book discusses the need to fill the gap between documents, data, and people, and provides a broad overview of the technology supporting IE. The authors present a generic architecture for developing systems that are able to learn how to extract relevant information from natural language documents, and illustrate how to implement working systems using state-of-the-art and freely available software tools. The book also discusses concrete applications illustrating IE uses. · Provides an overview of state-of-the-art technology in information extraction (IE), discussing achievements and limitations for t...
Cochrane, Donald Brian
The goal of scientific literacy requires that students develop an understanding of the nature of science to assist them in the reasoned acquisition of science concepts and in their future role as citizens in a participatory democracy. The purpose of this study was to investigate and describe the range of positions that grade six students hold with respect to the nature of science and to investigate whether gender or prior science education was related to students' views of the nature of science. Two grade six classes participated in this study. One class was from a school involved in a long-term elementary science curriculum project. The science curriculum at this school involved constructivist epistemology and pedagogy and a realist ontology. The curriculum stressed hands-on, open-ended activities and the development of science process skills. Students were frequently involved in creating and testing explanations for physical phenomena. The second class was from a matched school that had a traditional science program. Results of the study indicated that students hold a wider range of views of the nature of science than previously documented. Student positions ranged from having almost no understanding of the nature of science to those expressing positions regarding the nature of science that were more developed than previous studies had documented. Despite the range of views documented, all subjects held realist views of scientific knowledge. Contrary to the literature, some students were able to evaluate a scientific theory in light of empirical evidence that they had generated. Results also indicated that students from the project school displayed more advanced views of the nature of science than their matched peers. However, not all students benefited equally from their experiences. No gender differences were found with respect to students' understanding of the nature of science.
Bilican, K.; Cakiroglu, J.; Oztekin, C.
Exploring different contexts to facilitate in-depth nature of science (NOS) views was seen as critical for better professional development of pre-service science teachers, which ultimately would assure better NOS understanding among students and achieve an ultimate goal of current science education reforms. This study aimed to reduce the lack of…
Measures aimed at procedural fairness address conduct during the bargaining process and generally aim at ensuring transparency. Transparency in relation to the terms of a contract relates to whether the terms of the contract are accessible, in clear language, well-structured, and cross-referenced, with prominence being ...
Tremblay, Pascale; Small, Steven L
A controversial question in cognitive neuroscience is whether comprehension of words and sentences engages brain mechanisms specific for decoding linguistic meaning or whether language comprehension occurs through more domain-general sensorimotor processes. Accumulating behavioral and neuroimaging evidence suggests a role for cortical motor and premotor areas in passive action-related language tasks, regions that are known to be involved in action execution and observation. To examine the involvement of these brain regions in language and nonlanguage tasks, we used functional magnetic resonance imaging (fMRI) on a group of 21 healthy adults. During the fMRI session, all participants 1) watched short object-related action movies, 2) looked at pictures of man-made objects, and 3) listened to and produced short sentences describing object-related actions and man-made objects. Our results are among the first to reveal, in the human brain, a functional specialization within the ventral premotor cortex (PMv) for observing actions and for observing objects, and a different organization for processing sentences describing actions and objects. These findings argue against the strongest version of the simulation theory for the processing of action-related language.
Gogolin, Sarah; Krüger, Dirk
Students' understanding of models in science has been subject to a number of investigations. The instruments the researchers used are suitable for educational research but, due to their complexity, cannot be employed directly by teachers. This article presents forced choice (FC) tasks, which, assembled as a diagnostic instrument, are supposed to measure students' understanding of the nature of models efficiently, while being sensitive enough to detect differences between individuals. In order to evaluate if the diagnostic instrument is suitable for its intended use, we propose an approach that complies with the demand to integrate students' responses to the tasks into the validation process. Evidence for validity was gathered based on relations to other variables and on students' response processes. Students' understanding of the nature of models was assessed using three methods: FC tasks, open-ended tasks and interviews (N = 448). Furthermore, concurrent think-aloud protocols (N = 30) were performed. The results suggest that the method and the age of the students have an effect on their understanding of the nature of models. A good understanding of the FC tasks as well as a convergence in the findings across the three methods was documented for grades eleven and twelve. This indicates that teachers can use the diagnostic instrument for an efficient and, at the same time, valid diagnosis for this group. Finally, the findings of this article may provide a possible explanation for alternative findings from previous studies as a result of specific methods that were used.
Rapisardi, Elena; Di Franco, Sabina; Giardino, Marco
not a unique meaning: e.g. Mercury could stand for the Roman god, the metallic element, the planet, or Freddie the singer. Similarly, the word «alert» has a certain meaning in common language, whilst in the civil protection framework it includes regulations, responsibilities and procedures. The NHW is intended as a collaborative virtual source with validated information on geosciences to support a common understanding of natural hazards, risks and civil protection. The NHW aims to become a point of reference both for acknowledged practitioners, who will share their expertise and data, and for citizens, civil servants, media representatives, and students, who are allowed to comment on and contribute to the scientifically validated content. The NHW is a simple tool to support information and communication on natural hazards and civil protection at all levels and would establish shared, common knowledge. Moreover, NHW could represent the first step of a further challenging programme: through the power of «linked data», NHW could develop and contribute first to a natural hazard semantic, then to a «semantic disaster resilience».
S. K. Kostiuchkov
This paper examines the biopolitical nature of man as a biosocial being who belongs to two spheres of life at once: the natural (biological) and the social. It argues for the necessity of understanding human nature, which is by definition biosocial, and for the importance of defining man as an integral, binary-connoted «social individual – a species», characterized by a symmetrical opposition between the social and the biological. The main task of modern political science, and of biopolitical studies in particular, is found to be rethinking the political picture of the world in order to predict the development of a new order or a new chaos. Understanding the formation of a new global civilizational worldview is today one of the most important problems, connected with the main problem of the modern world: the task of preserving life on the planet. It is concluded that the contradictions of human nature (between the biological and the social, the physical and the spiritual, the universal and the particular, the natural and the artificial, the rational and the emotional) are extremely sharp in today’s conditions. This situation requires a more in-depth scientific analysis of human nature and a study of the structural levels of the human being as a biosocial system.
Miller, Amanda C; Keenan, Janice M
This study replicated and extended a phenomenon in the text memory literature referred to as the centrality deficit (Miller & Keenan, Annals of Dyslexia 59:99-113, 2009). It examined how reading in a foreign language (L2) affects one's text representation and ability to recall the most important information. Readers recalled a greater proportion of central than of peripheral ideas, regardless of whether reading in their native language (L1) or a foreign language (L2). Nonetheless, the greatest deficit in participants' L2 recalls, as compared with L1 recalls, was on the central, rather than the peripheral, information. This centrality deficit appears to stem from resources being diverted from comprehension when readers have to devote more cognitive resources to lower level processes (e.g., L2 word identification and syntactic processing), because the deficit was most evident among readers who had lower L2 proficiency. Prior knowledge (PK) of the passage topic helped compensate for the centrality deficit. Readers with less L2 proficiency who did not have PK of the topic displayed a centrality deficit, relative to their L1 recall, but this deficit dissipated when they did possess PK.
This thesis puts forward the view that a purely signal-based approach to natural language processing is both plausible and desirable. By questioning the veracity of symbolic representations of meaning, it argues for a unified, non-symbolic model of knowledge representation that is both biologically plausible and, potentially, highly efficient. Processes to generate a grounded, neural form of this model, dubbed the semantic filter, are discussed. The combined effects of local neural organisation, coincident with perceptual maturation, are used to hypothesise its nature. This theoretical model is then validated in light of a number of fundamental neurological constraints and milestones. The mechanisms of semantic and episodic development that the model predicts are then used to explain linguistic properties, such as propositions and verbs, syntax and scripting. To mimic the growth of locally densely connected structures upon an unbounded neural substrate, a system is developed that can grow arbitrarily large, data-dependent structures composed of individual self-organising neural networks. The maturational nature of the data used results in a structure in which the perception of concepts is refined by the networks, but demarcated by subsequent structure. As a consequence, the overall structure shows significant memory and computational benefits, as predicted by the cognitive and neural models. Furthermore, the localised nature of the neural architecture also avoids the increasing error sensitivity and redundancy of traditional systems as the training domain grows. The semantic and episodic filters have been demonstrated to perform as well as, or better than, more specialist networks, whilst using significantly larger vocabularies, more complex sentence forms and more natural corpora.
Wu, Joy T; Dernoncourt, Franck; Gehrmann, Sebastian; Tyler, Patrick D; Moseley, Edward T; Carlson, Eric T; Grant, David W; Li, Yeran; Welt, Jonathan; Celi, Leo Anthony
Advancement of Artificial Intelligence (AI) capabilities in medicine can help address many pressing problems in healthcare. However, AI research endeavors in healthcare may not be clinically relevant, may have unrealistic expectations, or may not be explicit enough about their limitations. A diverse and well-functioning multidisciplinary team (MDT) can help identify appropriate and achievable AI research agendas in healthcare, and advance medical AI technologies by developing AI algorithms as well as addressing the shortage of appropriately labeled datasets for machine learning. In this paper, our team of engineers, clinicians and machine learning experts share their experience and lessons learned from their two-year-long collaboration on a natural language processing (NLP) research project. We highlight specific challenges encountered in cross-disciplinary teamwork, dataset creation for NLP research, and expectation setting for current medical AI technologies. Copyright © 2017. Published by Elsevier B.V.
Kashyap, Vipul; Turchin, Alexander; Morin, Laura; Chang, Frank; Li, Qi; Hongsermeier, Tonya
Structured Clinical Documentation is a fundamental component of the healthcare enterprise, linking both clinical (e.g., electronic health record, clinical decision support) and administrative functions (e.g., evaluation and management coding, billing). One of the challenges in creating good quality documentation templates has been the inability to address specialized clinical disciplines and adapt to local clinical practices. A one-size-fits-all approach leads to poor adoption and inefficiencies in the documentation process. On the other hand, the cost associated with manual generation of documentation templates is significant. Consequently there is a need for at least partial automation of the template generation process. We propose an approach and methodology for the creation of structured documentation templates for diabetes using Natural Language Processing (NLP).
Deleger, Louise; Li, Qi; Lingren, Todd; Kaiser, Megan; Molnar, Katalin; Stoutenborough, Laura; Kouril, Michal; Marsolo, Keith; Solti, Imre
We present the construction of three annotated corpora to serve as gold standards for medical natural language processing (NLP) tasks. Clinical notes from the medical record, clinical trial announcements, and FDA drug labels are annotated. We report high inter-annotator agreements (overall F-measures between 0.8467 and 0.9176) for the annotation of Personal Health Information (PHI) elements for a de-identification task and of medications, diseases/disorders, and signs/symptoms for an information extraction (IE) task. The annotated corpora of clinical trials and FDA labels will be publicly released. To facilitate translational NLP tasks that require cross-corpora interoperability (e.g. clinical trial eligibility screening), their annotation schemas are aligned with a large-scale, NIH-funded clinical text annotation project.
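As a rough illustration of how an inter-annotator F-measure of the kind reported above can be computed, the sketch below treats one annotator's spans as the reference and the other's as the response, and counts only exact matches. The span format and the name `pairwise_f_measure` are hypothetical; the abstract does not state which matching criterion the authors used.

```python
from typing import Set, Tuple

# A span is (start_offset, end_offset, label). This layout is an assumption
# made for illustration, not the annotation schema of the corpora.
Span = Tuple[int, int, str]


def pairwise_f_measure(annotator_a: Set[Span], annotator_b: Set[Span]) -> float:
    """F1 between two annotators, counting only exact span+label matches."""
    if not annotator_a and not annotator_b:
        return 1.0  # neither annotated anything: treat as full agreement
    matches = len(annotator_a & annotator_b)
    if matches == 0:
        return 0.0
    precision = matches / len(annotator_b)  # annotator A taken as reference
    recall = matches / len(annotator_a)
    return 2 * precision * recall / (precision + recall)


# Toy annotations: two spans agree, one differs.
a = {(0, 7, "MEDICATION"), (12, 20, "DISEASE"), (25, 30, "SYMPTOM")}
b = {(0, 7, "MEDICATION"), (12, 20, "DISEASE"), (40, 45, "SYMPTOM")}
score = pairwise_f_measure(a, b)
```

With exact matching the measure is symmetric, so it does not matter which annotator is taken as the reference.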
Graham, Matthew; Zhang, M.; Djorgovski, S. G.; Donalek, C.; Drake, A. J.; Mahabal, A.
The rapidly emerging field of time domain astronomy is one of the most exciting and vibrant new research frontiers, ranging in scientific scope from studies of the Solar System to extreme relativistic astrophysics and cosmology. It is being enabled by a new generation of large synoptic digital sky surveys - LSST, PanStarrs, CRTS - that cover large areas of sky repeatedly, looking for transient objects and phenomena. One of the biggest challenges facing these is the automated classification of transient events, a process that needs machine-processible astronomical knowledge. Semantic technologies enable the formal representation of concepts and relations within a particular domain. ATELs (http://www.astronomerstelegram.org) are a commonly-used means for reporting and commenting upon new astronomical observations of transient sources (supernovae, stellar outbursts, blazar flares, etc). However, they are loose and unstructured and employ scientific natural language for description: this makes automated processing of them - a necessity within the next decade with petascale data rates - a challenge. Nevertheless they represent a potentially rich corpus of information that could lead to new and valuable insights into transient phenomena. This project lies in the cutting-edge field of astrosemantics, a branch of astroinformatics, which applies semantic technologies to astronomy. The ATELs have been used to develop an appropriate concept scheme - a representation of the information they contain - for transient astronomy using hierarchical clustering of processed natural language. This allows us to automatically organize ATELs based on the vocabulary used. We conclude that we can use simple algorithms to process and extract meaning from astronomical textual data.
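The idea of organizing short reports by their vocabulary can be sketched with a small agglomerative clustering routine. Everything specific here is an assumption for illustration: the abstract does not say which distance or linkage was used, so the sketch picks Jaccard distance over word sets and average linkage, and the example texts are invented rather than taken from real ATELs.

```python
import re
from itertools import combinations
from typing import FrozenSet, List


def tokens(text: str) -> FrozenSet[str]:
    """Lower-cased word set of a report (a deliberately crude 'processing' step)."""
    return frozenset(re.findall(r"[a-z]+", text.lower()))


def jaccard_distance(a: FrozenSet[str], b: FrozenSet[str]) -> float:
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def cluster(docs: List[str], threshold: float) -> List[List[int]]:
    """Average-linkage agglomerative clustering.

    Repeatedly merges the two closest clusters and stops once the closest
    pair is farther apart than `threshold`. Returns lists of document indices.
    """
    vocab = [tokens(d) for d in docs]
    clusters = [[i] for i in range(len(docs))]

    def linkage(c1: List[int], c2: List[int]) -> float:
        total = sum(jaccard_distance(vocab[i], vocab[j]) for i in c1 for j in c2)
        return total / (len(c1) * len(c2))

    while len(clusters) > 1:
        x, y = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: linkage(clusters[p[0]], clusters[p[1]]),
        )
        if linkage(clusters[x], clusters[y]) > threshold:
            break
        merged = clusters[x] + clusters[y]
        clusters = [c for k, c in enumerate(clusters) if k not in (x, y)] + [merged]
    return [sorted(c) for c in clusters]


# Invented example reports (not real ATEL text).
reports = [
    "supernova discovered in nearby galaxy",
    "bright supernova in galaxy confirmed",
    "blazar flare detected in gamma rays",
    "gamma rays flare from blazar",
]
groups = cluster(reports, threshold=0.7)
```

On these four toy reports the two supernova texts end up in one group and the two blazar texts in another, because each pair shares most of its vocabulary.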
Huerta, Margarita; Tong, Fuhui; Irby, Beverly J.; Lara-Alecio, Rafael
The authors of this quantitative study measured and compared the academic language development and conceptual understanding of fifth-grade economically disadvantaged English language learners (ELL), former ELLs, and native English-speaking (ES) students as reflected in their science notebook scores. Using an instrument they developed, the authors…
Elson, Raymond J.; O'Callaghan, Susanne; Walker, John P.; Williams, Robert
Students rely on rote knowledge to learn accounting concepts. However, this approach does not allow them to understand the meta language of accounting. Meta language is simply the concepts and terms that are used in a profession and are easily understood by its users. Terms such as equity, assets, and balance sheet are part of the accounting…
The starting-point of this thesis is the hypothesis that, from at least 22 months old, children who watch movies (i.e. any moving-image media) may be learning how to make sense of them. Rather than looking for evidence of precursors to further learning (such as language, literacy or technological skills) or for the risks or benefits that movie-watching may entail, the thesis argues that viewing behaviour provides enough evidence about the practices and processes through which children of this...
Deane, Paul; Sheehan, Kathleen
This paper is an exploration of the conceptual issues that have arisen in the course of building a natural language generation (NLG) system for automatic test item generation. While natural language processing techniques are applicable to general verbal items, mathematics word problems are particularly tractable targets for natural language…
Teachers’ practical knowledge is considered as teachers’ general knowledge, beliefs and thinking (Borg, 2003), which can be traced in teachers’ practices (Connelly & Clandinin, 1988) and is shaped by various background sources (Borg, 2003; Grossman, 1990; Meijer, Verloop, and Beijaard, 1999). This paper initially discusses how language teachers are influenced by three background sources: teachers’ prior language learning experiences, prior teaching experience, and professional coursework in pre- and in-service education. By drawing its data from the author’s longitudinal study, it also presents the findings of a cross-case theme that emerged from the investigation of three English as a foreign language (EFL) teachers’ prior language learning experiences. The paper also discusses how participation in studies on teachers’ knowledge raises teachers’ own awareness while it informs the research.
This paper presents the results of research into the peculiarities of syntactic development, as an element of language structure at the grammatical level, in children suffering from developmental dysphasia after many years of completed speech pathology treatment. Syntactic level at younger school age was studied by assessing language competence in the accomplishment of communicative sentences with subordinate clauses. The research was performed on samples of school-age children in regular primary schools in Belgrade. The sample comprised 160 respondents who were divided into two groups: target and comparative. The target group consisted of 60 respondents (children suffering from developmental dysphasia after many years of completed speech pathology treatment), and the comparative group consisted of 100 respondents from the regular primary school "Gavrilo Princip" in Zemun. Research results show that grammatical development of children suffering from developmental dysphasia takes place at a considerably slower rate and entails substantially more difficulties in accomplishing predication in subordinate clauses. This paper discusses the consequences which the difficulties in grammatical development can have on school achievement.
Miller, William R; Johnson, Wendy R
Client motivation for change, a topic of high interest to addiction clinicians, is multidimensional and complex, and many different approaches to measurement have been tried. The current effort drew on psycholinguistic research on natural language that is used by clients to describe their own motivation. Seven addiction treatment sites participated in the development of a simple scale to measure client motivation. Twelve items were drafted to represent six potential dimensions of motivation for change that occur in natural discourse. The maximum self-rating of motivation (10 on a 0-10 scale) was the median score on all items, and 43% of respondents rated 10 on all 12 items - a substantial ceiling effect. From 1035 responses, three factors emerged representing importance, ability, and commitment - constructs that are also reflected in several theoretical models of motivation. A 3-item version of the scale, with one marker item for each of these constructs, accounted for 81% of variance in the full scale. The three items are: 1. It is important for me to . . . 2. I could . . . and 3. I am trying to . . . This offers a quick (1-minute) assessment of clients' self-reported motivation for change.
Kumar, Rajesh; Yunus, Reva
This article looks at the contribution of insights from theoretical linguistics to an understanding of language acquisition and the nature of language in terms of their potential benefit to language education. We examine the ideas of innateness and universal language faculty, as well as multilingualism and the language-society relationship. Modern…
Pearson, Barbara Zurer; Conner, Tracy; Jackson, Janice E
Language difference among speakers of African American English (AAE) has often been considered language deficit, based on a lack of understanding about the AAE variety. Following Labov (1972), Wolfram (1969), Green (2002, 2011), and others, we define AAE as a complex rule-governed linguistic system and briefly discuss language structures that it shares with general American English (GAE) and others that are unique to AAE. We suggest ways in which mistaken ideas about the language variety add to children's difficulties in learning the mainstream dialect and, in effect, deny them the benefits of their educational programs. We propose that a linguistically informed approach that highlights correspondences between AAE and the mainstream dialect and trains students and teachers to understand language varieties at a metalinguistic level creates environments that support the academic achievement of AAE-speaking students. Finally, we present 3 program types that are recommended for helping students achieve the skills they need to be successful in multiple linguistic environments.
Milliken, Aimee; Grace, Pamela
Much attention has been paid to the role of the nurse in recognizing and addressing ethical dilemmas. There has been less emphasis, however, on the issue of whether or not nurses understand the ethical nature of everyday practice. Awareness of the inherently ethical nature of practice is a component of nurse ethical sensitivity, which has been identified as a component of ethical decision-making. Ethical sensitivity is generally accepted as a necessary precursor to moral agency, in that recognition of the ethical content of practice is necessary before consistent action on behalf of patient interests can take place. This awareness is also compulsory in ensuring patient good by recognizing the unique interests and wishes of individuals, in line with an ethic of care. Scholarly and research literature are used to argue that bolstering ethical awareness and ensuring that nurses understand the ethical nature of the role are an obligation of the profession. Based on this line of reasoning, recommendations for education and practice, along with directions for future research, are suggested.
Wittrock, Merlin C.
Concepts in cognitive psychology are applied to the language used in military situations, and a sentence classification system for use in analyzing military language is outlined. The system is designed to be used, in part, in conjunction with a natural language query system that allows a user to access a database. The discussion of military…
Nature conservation relies largely on people's rule adherence. However, noncompliance in the conservation context is common: it is one of the largest illegal activities in the world, degrading societies, economies and the environment. Understanding and managing compliance is key for ensuring effective conservation; nevertheless, crucial concepts and tools are scattered across a wide array of literature. Here I review and integrate these concepts and tools in an effort to guide compliance management in the conservation context. First, I address the understanding of compliance by breaking it down into five key questions: who?, what?, when?, where? and why?. A special focus is given to 'why?' because the answer to this question explains the reasons for compliance and noncompliance, providing critical information for management interventions. Second, I review compliance management strategies, from voluntary compliance to coerced compliance. Finally, I suggest a system, initially proposed for tax compliance, to balance these multiple compliance management strategies. This paper differs from others by providing a broad yet practical scope on theory and tools for understanding and managing compliance in the nature conservation context. Copyright © 2015 Elsevier Ltd. All rights reserved.
Woo, Chong Woo; Evens, Martha W; Freedman, Reva; Glass, Michael; Shim, Leem Seop; Zhang, Yuemei; Zhou, Yujian; Michael, Joel
The objective of this research was to build an intelligent tutoring system capable of carrying on a natural language dialogue with a student who is solving a problem in physiology. Previous experiments have shown that students need practice in qualitative causal reasoning to internalize new knowledge and to apply it effectively and that they learn by putting their ideas into words. Analysis of a corpus of 75 hour-long tutoring sessions carried on in keyboard-to-keyboard style by two professors of physiology at Rush Medical College tutoring first-year medical students provided the rules used in tutoring strategies and tactics, parsing, and text generation. The system presents the student with a perturbation to the blood pressure, asks for qualitative predictions of the changes produced in seven important cardiovascular variables, and then launches a dialogue to correct any errors and to probe for possible misconceptions. The natural language understanding component uses a cascade of finite-state machines. The generation is based on lexical functional grammar. Results of experiments with pretests and posttests have shown that using the system for an hour produces significant learning gains and also that even this brief use improves the student's ability to solve problems more than reading textual material on the topic. Student surveys tell us that students like the system and feel that they learn from it. The system is now in regular use in the first-year physiology course at Rush Medical College. We conclude that the CIRCSIM-Tutor system demonstrates that intelligent tutoring systems can implement effective natural language dialogue with current language technology.
Liu et al. provide a comprehensive account of research on dependency distance in human languages. While the article is a very rich and useful report on this complex subject, here I will expand on a few specific issues where research in computational linguistics (specifically natural language processing) can inform DDM research, and vice versa. These aspects have not been explored much in that article or elsewhere, probably due to the little overlap between the two research communities, but they may provide interesting insights for improving our understanding of the evolution of human languages, the mechanisms by which the brain processes and understands language, and the construction of effective computer systems to achieve this goal.
Weimer, Amy A; Gasquoine, Philip G
Belief reasoning and emotion understanding were measured among 102 Mexican American bilingual children ranging from 4 to 7 years old. All children were tested in English and Spanish after ensuring minimum comprehension in each language. Belief reasoning was assessed using 2 false and 1 true belief tasks. Emotion understanding was measured using subtests from the Test for Emotion Comprehension. The influence of family background variables of yearly income, parental education level, and number of siblings on combined Spanish and English vocabulary, belief reasoning, and emotion understanding was assessed by regression analyses. Age and emotion understanding predicted belief reasoning. Vocabulary and belief reasoning predicted emotion understanding. When the sample was divided into language-dominant and balanced bilingual groups on the basis of language proficiency difference scores, there were no significant differences on belief reasoning or emotion understanding. Language groups were demographically similar with regard to child age, parental educational level, and family income. Results suggest Mexican American language-dominant and balanced bilinguals develop belief reasoning and emotion understanding similarly.
Philip N Stoop
The Consumer Protection Act 68 of 2008 came into effect on 1 April 2011. The purpose of this Act is, among other things, to promote fairness, openness and respectable business practice between the suppliers of goods or services and the consumers of such goods and services. In consumer protection legislation fairness is usually approached from two directions, namely substantive and procedural fairness. Measures aimed at procedural fairness address conduct during the bargaining process and generally aim at ensuring transparency. Transparency in relation to the terms of a contract relates to whether the terms of the contract are accessible, in clear language, well-structured, and cross-referenced, with prominence being given to terms that are detrimental to the consumer or that grant important rights. One measure in the Act aimed at addressing procedural fairness is the right to plain and understandable language. The consumer’s right to being given information in plain and understandable language, as it is expressed in section 22, is embedded under the umbrella right of information and disclosure in the Act. Section 22 requires that notices, documents or visual representations that are required in terms of the Act or other law are to be provided in plain and understandable language as well as in the prescribed form, where such a prescription exists. In the analysis of the concept “plain and understandable language” the following aspects are considered in this article: the development of plain language measures in Australia and the United Kingdom; the structure and purpose of section 22; the documents that must be in plain language; the definition of plain language; the use of official languages in consumer contracts; and plain language guidelines (based on the law of the states of Pennsylvania and Connecticut in the United States of America).
Hirschman, Lynette; Fort, Karën; Boué, Stéphanie; Kyrpides, Nikos; Islamaj Doğan, Rezarta; Cohen, Kevin Bretonnel
Crowdsourcing is increasingly utilized for performing tasks in both natural language processing and biocuration. Although there have been many applications of crowdsourcing in these fields, there have been fewer high-level discussions of the methodology and its applicability to biocuration. This paper explores crowdsourcing for biocuration through several case studies that highlight different ways of leveraging 'the crowd'; these raise issues about the kind(s) of expertise needed, the motivations of participants, and questions related to feasibility, cost and quality. The paper is an outgrowth of a panel session held at BioCreative V (Seville, September 9-11, 2015). The session consisted of four short talks, followed by a discussion. In their talks, the panelists explored the role of expertise and the potential to improve crowd performance by training; the challenge of decomposing tasks to make them amenable to crowdsourcing; and the capture of biological data and metadata through community editing. Database URL: http://www.mitre.org/publications/technical-papers/crowdsourcing-and-curation-perspectives. © The Author(s) 2016. Published by Oxford University Press.
A new approach for processing vowelized and unvowelized Arabic texts in order to prepare them for Natural Language Processing (NLP) purposes is described. The developed approach is rule-based and made up of four phases: text tokenization, light stemming of words, morphological analysis of words, and text annotation. The first phase preprocesses the input text in order to isolate the words and represent them in a formal way. The second phase applies a light stemmer in order to extract the stem of each word by eliminating the prefixes and suffixes. The third phase is a rule-based morphological analyzer that determines the root and the morphological pattern for each extracted stem. The last phase produces an annotated text where each word is tagged with its morphological attributes. The preprocessor presented in this paper is capable of dealing with vowelized and unvowelized words, and provides the input words along with relevant linguistic information needed by different applications. It is designed to be used with different NLP applications such as machine translation, text summarization, text correction, information retrieval and automatic vowelization of Arabic text.
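The four phases described above (tokenization, light stemming, morphological analysis, annotation) can be sketched as a toy pipeline. This is only an illustration under stated assumptions, not the authors' preprocessor: the affix lists and the three pattern rules are a tiny hypothetical sample chosen to make the example work, and all function names are invented.

```python
import re
from typing import Dict, List, Optional, Tuple

# Arabic short-vowel and related diacritics (U+064B to U+0652).
DIACRITICS = re.compile(r"[\u064B-\u0652]")
# Runs of Arabic letters and diacritics.
ARABIC_WORD = re.compile(r"[\u0621-\u0652]+")

# Hypothetical, deliberately tiny affix lists (longest prefix first).
PREFIXES = ("وال", "ال", "و")
SUFFIXES = ("ون", "ات", "ين", "ة")


def tokenize(text: str) -> List[str]:
    """Phase 1: isolate words and drop diacritics so that vowelized and
    unvowelized input share one representation."""
    return [DIACRITICS.sub("", w) for w in ARABIC_WORD.findall(text)]


def light_stem(word: str) -> str:
    """Phase 2: strip at most one prefix and one suffix, keeping >= 3 letters."""
    for p in PREFIXES:
        if word.startswith(p) and len(word) - len(p) >= 3:
            word = word[len(p):]
            break
    for s in SUFFIXES:
        if word.endswith(s) and len(word) - len(s) >= 3:
            word = word[: -len(s)]
            break
    return word


def analyze(stem: str) -> Tuple[Optional[str], Optional[str]]:
    """Phase 3: map a stem to (root, pattern) using three toy rules."""
    if len(stem) == 3:
        return stem, "فعل"
    if len(stem) == 4 and stem[1] == "ا":
        return stem[0] + stem[2:], "فاعل"
    if len(stem) == 5 and stem[0] == "م" and stem[3] == "و":
        return stem[1:3] + stem[4], "مفعول"
    return None, None  # no rule applies in this sketch


def annotate(text: str) -> List[Dict[str, Optional[str]]]:
    """Phase 4: tag every word with its stem, root and pattern."""
    result = []
    for word in tokenize(text):
        stem = light_stem(word)
        root, pattern = analyze(stem)
        result.append({"word": word, "stem": stem, "root": root, "pattern": pattern})
    return result


# The same word without and with diacritics (diacritics written as escapes).
plain = annotate("الكاتبون")
voweled = annotate("ال" + "ك\u064Eات\u0650ب\u064Fون\u064E")
```

In this sketch both inputs produce the same annotation, because diacritics are removed during tokenization before any stemming rule is applied. A real analyzer would need far larger affix and pattern inventories and would likely use the diacritics for disambiguation rather than discard them.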
Juuso, Esko K.
Performance improvement is taken as the primary goal in asset management. Advanced data analysis is needed to efficiently integrate condition monitoring data into operation and maintenance. Intelligent stress and condition indices have been developed for control and condition monitoring by combining generalized norms with efficient nonlinear scaling. These nonlinear scaling methodologies can also be used to handle performance measures used for management, since management-oriented indicators can be presented on the same scale as intelligent condition and stress indices. Performance indicators are responses of the process, machine or system to the stress contributions analyzed from process and condition monitoring data. Scaled values are directly used in intelligent temporal analysis to calculate fluctuations and trends. All these methodologies can be used in prognostics and fatigue prediction. The meanings of the variables are beneficial in extracting expert knowledge and representing information in natural language. The idea of dividing the problems into the variable-specific meanings and the directions of interactions provides various improvements for performance monitoring and decision making. The integrated temporal analysis and uncertainty processing facilitate the efficient use of domain expertise. Measurements can be monitored with generalized statistical process control (GSPC) based on the same scaling functions.
Wu, Stephen T; Kaggal, Vinod C; Dligach, Dmitriy; Masanz, James J; Chen, Pei; Becker, Lee; Chapman, Wendy W; Savova, Guergana K; Liu, Hongfang; Chute, Christopher G
One challenge in reusing clinical data stored in electronic medical records is that these data are heterogenous. Clinical Natural Language Processing (NLP) plays an important role in transforming information in clinical text to a standard representation that is comparable and interoperable. Information may be processed and shared when a type system specifies the allowable data structures. Therefore, we aim to define a common type system for clinical NLP that enables interoperability between structured and unstructured data generated in different clinical settings. We describe a common type system for clinical NLP that has an end target of deep semantics based on Clinical Element Models (CEMs), thus interoperating with structured data and accommodating diverse NLP approaches. The type system has been implemented in UIMA (Unstructured Information Management Architecture) and is fully functional in a popular open-source clinical NLP system, cTAKES (clinical Text Analysis and Knowledge Extraction System) versions 2.0 and later. We have created a type system that targets deep semantics, thereby allowing for NLP systems to encapsulate knowledge from text and share it alongside heterogenous clinical data sources. Rather than surface semantics that are typically the end product of NLP algorithms, CEM-based semantics explicitly build in deep clinical semantics as the point of interoperability with more structured data types.
Polišenská, Kamila; Kapalková, Svetlana; Novotková, Monika
The study aims to describe receptive language skills in children with intellectual disability (ID) and to contribute to the debate on deviant versus delayed language development. This is the 1st study of receptive skills in children with ID who speak a Slavic language, providing insight into how language development is affected by disability and also language typology. Twenty-eight Slovak-speaking children participated in the study (14 children with ID and 14 typically developing [TD] children matched on nonverbal reasoning abilities). The children were assessed by receptive language tasks targeting words, sentences, and stories, and the groups were compared quantitatively and qualitatively. The groups showed similar language profiles, with a better understanding of words, followed by sentences, with the poorest comprehension for stories. Nouns were comprehended better than verbs; sentence constructions also showed a qualitatively similar picture, although some dissimilarities emerged. Verb comprehension was strongly related to sentence comprehension in both groups and related to story comprehension in the TD group only. The findings appear to support the view that receptive language skills follow the same developmental route in children with ID as seen in younger TD children, suggesting that language development is a robust process and does not seem to be differentially affected by ID even when delayed.
Charollais, A; Marret, S; Stumpf, M-H; Lemarchand, M; Delaporte, B; Philip, E; Monom-Diverre; Guillois, B; Datin-Dorriere, V; Debillon, T; Simon, M-J; De Barace, C; Pasquet, F; Saliba, E; Zebhib, R
Clinical and radiological knowledge of language development in the former premature infant compared to the newborn allows us to argue for exploration of the sensorimotor co-factors required for proper language development. There are early representations of the maternal language in the infant's visual, auditory, and sensorimotor areas, activated or stabilized by orofacial and articulatory movements. The functional architecture of language is different for vulnerable children such as premature infants. We have already mentioned the impact of early dysfunction of the facial praxis fine motor skills in this population presenting comprehension disorders. A recent meta-analysis confirms the increasing difficulty of understanding between 3 and 12 years, questioning the quality of the initial linguistic processes. A precise analysis of language, referenced from 3 years of age, should be completed by sensorimotor tests to assess possible constraints in automating neurolinguistic foundations. The usual assessment at this age can exclude sensory disturbances and communication and offers guidance and socialization. However, a recent study shows the ineffectiveness of "language-reinforced immersion" at 2 and 3 years in a population of vulnerable children. The LAMOPRESCO study of language and motor skills in the premature infant (National PHRC 2010) has assessed language and sensorimotor skills of preterm-born (theory of speech perception." Early and accurate assessment of language and the patient's constraints should differentiate and specify management strategies for all children, whatever their background and pathologies. Copyright © 2013 Elsevier Masson SAS. All rights reserved.
Appelo, Lisette; Leermakers, M.C.J.; Rous, J.H.G.
A method is described for the generation of related natural-language expressions. The method is based on a formal grammar of the natural language in question, specified in the Controlled M-Grammar (CMG) formalism. In the CMG framework the generation of an utterance is controlled by a derivation
Two studies explored young children’s understanding of the role of shared language in communication by investigating how monolingual English-speaking children interact with an English speaker, a Spanish speaker, and a bilingual experimenter who spoke both English and Spanish. When the bilingual experimenter spoke in Spanish or English to request objects, four-year-old children, but not three-year-olds, used her language choice to determine whom she addressed (e.g. requests in Spanish were directed to the Spanish speaker). Importantly, children used this cue – language choice – only in a communicative context. The findings suggest that by four years, monolingual children recognize that speaking the same language enables successful communication, even when that language is unfamiliar to them. Three-year-old children’s failure to make this distinction suggests that this capacity likely undergoes significant development in early childhood, although other capacities might also be at play.
Popova, Yanna B
This paper proposes an understanding of literary narrative as a form of social cognition and situates the study of such narratives in relation to the new comprehensive approach to human cognition, enaction. The particular form of enactive cognition that narrative understanding is proposed to depend on is that of participatory sense-making, as developed in the work of Di Paolo and De Jaegher. Currently there is no consensus as to what makes a good literary narrative, how it is understood, and why it plays such an irreplaceable role in human experience. The proposal thus identifies a gap in the existing research on narrative by describing narrative as a form of intersubjective process of sense-making between two agents, a teller and a reader. It argues that making sense of narrative literature is an interactional process of co-constructing a story-world with a narrator. Such an understanding of narrative makes a decisive break with both the text-centered approaches that have dominated structuralist and early cognitivist study of narrative and the pragmatic communicative ones that view narrative as a form of linguistic implicature. The interactive experience that narrative affords and necessitates at the same time, I argue, serves to highlight the active yet cooperative and communal nature of human sociality, expressed in the many forms that human beings interact in, including literary ones.
Gullberg, M.; Robert, L.; Dimroth, C.; Veroude, K.; Indefrey, P.
Despite the literature on the role of input in adult second-language (L2) acquisition and on artificial and statistical language learning, surprisingly little is known about how adults break into a new language in the wild. This article reports on a series of behavioral and neuroimaging studies that
Mental image directed semantic theory (MIDST) has proposed an omnisensory mental image model and its description language Lmd. This language is designed to represent and compute human intuitive knowledge of space and can provide multimedia expressions with intermediate semantic descriptions in predicate logic. It is hypothesized that such knowledge and semantic descriptions are controlled by human attention toward the world and therefore subjective to each human individual. This paper describes Lmd expression of human subjective knowledge of space and its application to aware computing in cross-media operation between linguistic and pictorial expressions as spatial language understanding.
Lee, Ming Che; Chang, Jia Wei; Hsieh, Tung Cheng
This paper presents a grammar and semantic corpus based similarity algorithm for natural language sentences. Natural language, in opposition to "artificial language", such as computer programming languages, is the language used by the general public for daily communication. Traditional information retrieval approaches, such as vector models, LSA, HAL, or even the ontology-based approaches that extend to include concept similarity comparison instead of cooccurrence terms/words, may not always determine the perfect matching while there is no obvious relation or concept overlap between two natural language sentences. This paper proposes a sentence similarity algorithm that takes advantage of corpus-based ontology and grammatical rules to overcome the addressed problems. Experiments on two famous benchmarks demonstrate that the proposed algorithm has a significant performance improvement in sentences/short-texts with arbitrary syntax and structure.
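The limitation of vector models mentioned above can be illustrated with a minimal sketch. This is not the algorithm proposed in the paper; it is a plain bag-of-words cosine similarity, and the function name and example sentences are assumptions chosen for illustration. Two sentences with related meaning but no shared words score zero.

```python
import math
from collections import Counter


def cosine_similarity(a: str, b: str) -> float:
    """Bag-of-words cosine similarity (illustrative baseline, not the paper's method)."""
    va = Counter(a.lower().split())
    vb = Counter(b.lower().split())
    # Dot product over the words the two sentences share.
    dot = sum(va[w] * vb[w] for w in va.keys() & vb.keys())
    norm_a = math.sqrt(sum(c * c for c in va.values()))
    norm_b = math.sqrt(sum(c * c for c in vb.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical sentences: related meaning, no word overlap.
related = cosine_similarity("a doctor treats patients", "physicians cure the sick")
identical = cosine_similarity("the cat sat", "the cat sat")
```

Because the first pair shares no tokens, the score is zero even though a human reader would judge the sentences similar, which is the kind of case the abstract says motivates adding ontology and grammar information.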
McNamara, Danielle S; Crossley, Scott A; Roscoe, Rod
The Writing Pal is an intelligent tutoring system that provides writing strategy training. A large part of its artificial intelligence resides in the natural language processing algorithms to assess essay quality and guide feedback to students. Because writing is often highly nuanced and subjective, the development of these algorithms must consider a broad array of linguistic, rhetorical, and contextual features. This study assesses the potential for computational indices to predict human ratings of essay quality. Past studies have demonstrated that linguistic indices related to lexical diversity, word frequency, and syntactic complexity are significant predictors of human judgments of essay quality but that indices of cohesion are not. The present study extends prior work by including a larger data sample and an expanded set of indices to assess new lexical, syntactic, cohesion, rhetorical, and reading ease indices. Three models were assessed. The model reported by McNamara, Crossley, and McCarthy (Written Communication 27:57-86, 2010) including three indices of lexical diversity, word frequency, and syntactic complexity accounted for only 6% of the variance in the larger data set. A regression model including the full set of indices examined in prior studies of writing predicted 38% of the variance in human scores of essay quality with 91% adjacent accuracy (i.e., within 1 point). A regression model that also included new indices related to rhetoric and cohesion predicted 44% of the variance with 94% adjacent accuracy. The new indices increased accuracy but, more importantly, afford the means to provide more meaningful feedback in the context of a writing tutoring system.
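The "adjacent accuracy" figure above (agreement within 1 point) can be computed as in the following sketch. The function name and the score lists are hypothetical; the study's actual rating scale and data are not given in the abstract.

```python
def adjacent_accuracy(predicted, human, tolerance=1):
    """Share of predictions within `tolerance` points of the human rating."""
    if len(predicted) != len(human) or not predicted:
        raise ValueError("score lists must be non-empty and of equal length")
    hits = sum(abs(p - h) <= tolerance for p, h in zip(predicted, human))
    return hits / len(predicted)


# Hypothetical essay scores; the differences are 0, 1, 2 and 0 points.
score = adjacent_accuracy([3, 4, 2, 6], [3, 5, 4, 6])
```

Setting `tolerance=0` gives exact agreement instead, which is always less than or equal to the adjacent figure.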
Haug Peter J
Background: The medical problem list is an important part of the electronic medical record in development in our institution. To serve the functions it is designed for, the problem list has to be as accurate and timely as possible. However, the current problem list is usually incomplete and inaccurate, and is often totally unused. To alleviate this issue, we are building an environment where the problem list can be easily and effectively maintained. Methods: For this project, 80 medical problems were selected for their frequency of use in our future clinical field of evaluation (cardiovascular). We have developed an Automated Problem List system composed of two main components: a background and a foreground application. The background application uses Natural Language Processing (NLP) to harvest potential problem list entries from the list of 80 targeted problems detected in the multiple free-text electronic documents available in our electronic medical record. These proposed medical problems drive the foreground application designed for management of the problem list. Within this application, the extracted problems are proposed to the physicians for addition to the official problem list. Results: The set of 80 targeted medical problems selected for this project covered about 5% of all possible diagnoses coded in ICD-9-CM in our study population (cardiovascular adult inpatients), but about 64% of all instances of these coded diagnoses. The system contains algorithms to detect first document sections, then sentences within these sections, and finally potential problems within the sentences. The initial evaluation of the section and sentence detection algorithms demonstrated a sensitivity and positive predictive value of 100% when detecting sections, and a sensitivity of 89% and a positive predictive value of 94% when detecting sentences. Conclusion: The global aim of our project is to automate the process of creating and maintaining a problem
Redd, Andrew; Pickard, Steve; Meystre, Stephane; Scehnet, Jeffrey; Bolton, Dan; Heavirland, Julia; Weaver, Allison Lynn; Hope, Carol; Garvin, Jennifer Hornung
We introduce and evaluate a new, easily accessible tool using a common statistical analysis and business analytics software suite, SAS, which can be programmed to remove specific protected health information (PHI) from a text document. Removal of PHI is important because the quantity of text documents used for research with natural language processing (NLP) is increasing. When using existing data for research, an investigator must remove all PHI not needed for the research to comply with human subjects' right to privacy. This process is similar, but not identical, to de-identification of a given set of documents. PHI Hunter removes PHI from free-form text. It is a set of rules to identify and remove patterns in text. PHI Hunter was applied to 473 Department of Veterans Affairs (VA) text documents randomly drawn from a research corpus stored as unstructured text in VA files. PHI Hunter performed well with PHI in the form of identification numbers such as Social Security numbers, phone numbers, and medical record numbers. The most commonly missed PHI items were names and locations. Incorrect removal of information occurred with text that looked like identification numbers. PHI Hunter fills a niche role that is related to but not equal to the role of de-identification tools. It gives research staff a tool to reasonably increase patient privacy. It performs well for highly sensitive PHI categories that are rarely used in research, but still shows possible areas for improvement. More development for patterns of text and linked demographic tables from electronic health records (EHRs) would improve the program so that more precise identifiable information can be removed. PHI Hunter is an accessible tool that can flexibly remove PHI not needed for research. If it can be tailored to the specific data set via linked demographic tables, its performance will improve in each new document set.
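The rule-based approach described above can be sketched as follows. PHI Hunter itself is implemented in SAS and its actual rules are not given in the abstract; the two patterns, the placeholder labels, and the function name below are assumptions for illustration only.

```python
import re

# Hypothetical rules: US Social Security and phone number formats only.
PATTERNS = {
    "SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "PHONE": re.compile(r"\b\d{3}[-.]\d{3}[-.]\d{4}\b"),
}


def scrub(text: str) -> str:
    """Replace every match of each rule with a bracketed placeholder."""
    for label, pattern in PATTERNS.items():
        text = pattern.sub(f"[{label}]", text)
    return text


cleaned = scrub("SSN 123-45-6789, call 555-123-4567.")
# A non-PHI string shaped like an identifier is removed as well.
false_positive = scrub("Specimen lot 987-65-4321 was discarded.")
```

The second call shows the failure mode the abstract reports: text that merely looks like an identification number is removed too, while names and locations, which have no fixed shape, would pass through such rules untouched.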
Jednoróg, Katarzyna; Bola, Łukasz; Mostowski, Piotr; Szwed, Marcin; Boguszewski, Paweł M; Marchewka, Artur; Rutkowski, Paweł
In several countries natural sign languages were considered inadequate for education. Instead, new sign-supported systems were created, based on the belief that spoken/written language is grammatically superior. One such system, called SJM (system językowo-migowy), preserves the grammatical and lexical structure of spoken Polish and since the 1960s has been extensively employed in schools and on TV. Nevertheless, the Deaf community avoids using SJM for everyday communication, its preferred language being PJM (polski język migowy), a natural sign language, structurally and grammatically independent of spoken Polish and featuring classifier constructions (CCs). Here, for the first time, we compare, using fMRI, the neural bases of natural vs. devised communication systems. Deaf signers were presented with three types of signed sentences (SJM and PJM with/without CCs). Consistent with previous findings, PJM with CCs compared to either SJM or PJM without CCs recruited the parietal lobes. The reverse comparison revealed activation in the anterior temporal lobes, suggesting increased semantic combinatory processes in lexical sign comprehension. Finally, PJM compared with SJM engaged left posterior superior temporal gyrus and anterior temporal lobe, areas crucial for sentence-level speech comprehension. We suggest that activity in these two areas reflects greater processing efficiency for naturally evolved sign language. Copyright © 2015 Elsevier Ltd. All rights reserved.
Rojas Barahona, Lina Maria; Quaglini, Silvana; Stefanelli, Mario
The prospective home-care management will probably offer intelligent conversational assistants for supporting patients at home through natural language interfaces. Homecare assistance in natural language, HomeNL, is a proof-of-concept dialogue system for the management of patients with hypertension. It follows up a conversation with a patient in which the patient is able to take the initiative. HomeNL processes natural language, makes an internal representation...
Malamud, B. D.; Rhodes, F. H. T.
This paper explores natural hazards teaching and communications through the use of a literary anthology of writings about the earth aimed at non-experts. Teaching natural hazards in high-school and university introductory Earth Science and Geography courses revolves mostly around lectures, examinations, and laboratory demonstrations/activities. Often the result of such a course is that a student 'memorizes' the answers, and is penalized when they miss a given fact [e.g., "You lost one point because you were off by 50 km/hr on the wind speed of an F5 tornado."] Although facts and general methodologies are certainly important when teaching natural hazards, supplementing them with writings about the Earth strongly motivates a student's assimilation of, and enthusiasm for, this knowledge. In this paper, we discuss a literary anthology which we developed [Language of the Earth, Rhodes, Stone, Malamud, Wiley-Blackwell, 2008] which includes many descriptions of natural hazards. Using first- and second-hand accounts of landslides, earthquakes, tsunamis, floods and volcanic eruptions, through the writings of McPhee, Gaskill, Voltaire, Austin, Cloos, and many others, hazards become 'alive', and more than 'just' a compilation of facts and processes. Using short excerpts such as these, or other similar anthologies, of remarkably written accounts and discussions about natural hazards results in 'dry' facts becoming more than just facts. These often highly personal viewpoints of our catastrophic world provide a useful supplement to a student's understanding of the turbulent world in which we live.
The Cross-Lingual Information Retrieval system (CLIR) or Multilingual Information Retrieval (MIR) has become the key issue in electronic documents management systems in a multinational environment. We propose here a multilingual information retrieval system consisting of a morpho-syntactic analyser, a transfer system from source language to target language and an information retrieval system. A thorough investigation into the system architecture and the transfer mechanisms is proposed in this report, using two different performance evaluation methods.
Li, Peggy; Dunham, Yarrow; Carey, Susan
Shown an entity (e.g., a plastic whisk) labeled by a novel noun in neutral syntax, speakers of Japanese, a classifier language, are more likely to assume the noun refers to the substance (plastic) than are speakers of English, a count/mass language, who are instead more likely to assume it refers to the object kind [whisk; Imai, M., & Gentner, D.…
Lane, H. Chad; Vanlehn, Kurt
For beginning programmers, inadequate problem solving and planning skills are among the most salient of their weaknesses. In this paper, we test the efficacy of natural language tutoring to teach and scaffold acquisition of these skills. We describe ProPL (Pro-PELL), a dialogue-based intelligent tutoring system that elicits goal decompositions and program plans from students in natural language. The system uses a variety of tutoring tactics that leverage students' intuitive understandings of the problem, how it might be solved, and the underlying concepts of programming. We report the results of a small-scale evaluation comparing students who used ProPL with a control group who read the same content. Our primary findings are that students who received tutoring from ProPL seem to have developed an improved ability to solve the composition problem and displayed behaviors that suggest they were able to think at greater levels of abstraction than students in the read-only group.
Hunter, James; Freer, Yvonne; Gatt, Albert; Reiter, Ehud; Sripada, Somayajulu; Sykes, Cindy
Our objective was to determine whether and how a computer system could automatically generate helpful natural language nursing shift summaries solely from an electronic patient record system, in a neonatal intensive care unit (NICU). A system was developed which automatically generates partial NICU shift summaries (for the respiratory and cardiovascular systems), using data-to-text technology. It was evaluated for 2 months in the NICU at the Royal Infirmary of Edinburgh, under supervision. In an on-ward evaluation, a substantial majority of the summaries was found by outgoing and incoming nurses to be understandable (90%), and a majority was found to be accurate (70%), and helpful (59%). The evaluation also served to identify some outstanding issues, especially with regard to extra content the nurses wanted to see in the computer-generated summaries. It is technically possible automatically to generate limited natural language NICU shift summaries from an electronic patient record. However, it proved difficult to handle electronic data that was intended primarily for display to the medical staff, and considerable engineering effort would be required to create a deployable system from our proof-of-concept software. Copyright © 2012 Elsevier B.V. All rights reserved.
Paredes-Valverde, Mario Andrés; Valencia-García, Rafael; Rodriguez-Garcia, Miguel Angel; Colomo-Palacios, Ricardo; Alor-Hernández, Giner
The Semantic Web aims to give Web information a well-defined meaning and make it understandable not only by humans but also by computers, thus allowing the automation, integration and reuse of high-quality information across different applications. However, current information retrieval mechanisms for semantic knowledge bases are intended for use only by expert users. In this work, we propose a natural language interface that gives non-expert users access to this kind of information by formulating queries in natural language. The present approach uses a domain-independent ontology model to represent the question's structure and context. This model also allows the answer type expected by the user to be determined from a proposed question classification. To test the effectiveness of our approach, we have conducted an evaluation in the music domain using LinkedBrainz, an effort to provide the MusicBrainz information as structured data on the Web by means of Semantic Web technologies. Our proposal obtained encouraging results based on the F-measure metric, ranging from 0.74 to 0.82 for a corpus of questions generated by a group of real-world end users. © The Author(s) 2015.
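For readers unfamiliar with the F-measure reported above, the following sketch shows the standard definition as the weighted harmonic mean of precision and recall. The function name and the counts are hypothetical; the abstract does not give the per-question counts or say which beta was used, so beta = 1 is assumed here.

```python
def f_measure(tp: int, fp: int, fn: int, beta: float = 1.0) -> float:
    """F-beta score from true-positive, false-positive and false-negative counts."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision == 0.0 and recall == 0.0:
        return 0.0
    b2 = beta ** 2
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Hypothetical counts: 8 correct answers, 2 wrong answers, 2 missed answers.
f1 = f_measure(tp=8, fp=2, fn=2)
```

With these counts precision and recall are both 0.8, so the harmonic mean is also 0.8, which happens to fall inside the range the abstract reports.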
Pelucchi, Bruna; Hay, Jessica F; Saffran, Jenny R
Numerous studies over the past decade support the claim that infants are equipped with powerful statistical language learning mechanisms. The primary evidence for statistical language learning in word segmentation comes from studies using artificial languages, continuous streams of synthesized syllables that are highly simplified relative to real speech. To what extent can these conclusions be scaled up to natural language learning? In the current experiments, English-learning 8-month-old infants' ability to track transitional probabilities in fluent infant-directed Italian speech was tested (N = 72). The results suggest that infants are sensitive to transitional probability cues in unfamiliar natural language stimuli, and support the claim that statistical learning is sufficiently robust to support aspects of real-world language acquisition.
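The transitional probability cue discussed above is commonly defined as the frequency of the pair XY divided by the frequency of X. A minimal sketch under that definition follows; the syllable stream is made up and is not the Italian stimulus material used in the study.

```python
from collections import Counter


def transitional_probabilities(syllables):
    """Map each adjacent pair (x, y) to count(x followed by y) / count(x)."""
    pair_counts = Counter(zip(syllables, syllables[1:]))
    # Count x only where it has a successor, so probabilities out of x sum to 1.
    first_counts = Counter(syllables[:-1])
    return {(x, y): n / first_counts[x] for (x, y), n in pair_counts.items()}


# Made-up stream: "ba" is always followed by "du"; "du" has two continuations.
stream = ["ba", "du", "ki", "lo", "ba", "du", "mi", "no", "ba", "du"]
tp = transitional_probabilities(stream)
```

In this toy stream the within-unit transition ba to du has probability 1.0, while the transitions out of du are split evenly, which is the kind of dip that is taken to signal a word boundary.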
Natural Language Processing is one of the fastest-developing research fields. In most Natural Language Processing applications, the results of morphological analysis and morphological generation are very important, since morphological analysis is the technique used to recognise a word, and its output can be used in later stages. In view of this importance, this paper describes how morphological analysis and morphological generation form an important part of various Natural Language Processing applications such as spell checkers and machine translation.
Vlas, Radu Eduard
Open source projects do have requirements; they are, however, mostly informal, text descriptions found in requests, forums, and other correspondence. Understanding such requirements provides insight into the nature of open source projects. Unfortunately, manual analysis of natural language requirements is time-consuming, and for large projects,…
The aim of this article is to argue that the use of language in liturgy during worship services should be meaningful to contribute to persuasion in the lives of the participants in liturgy. Language is a prominent medium to convey meaning. In fact, the essence of liturgy that has to lead to the liturgy of life is in itself a meaningful act. The question regarding the meaning of worship services that people often raise is another reason why research on the influence of liturgy is crucial. This investigation is anchored in research on the importance of cognition in persuasive language use to promote attitude change. The research gathers insights from the fields of language philosophy and cognitive psychology. It is clear that the meaning of words in language can never be separated from people’s understanding of the meaning of language. Communication and communion are not opposites. In the normative phase of this investigation, perspectives from Romans 12 are offered. The renewal of the mind that leads to discernment of God’s will must also lead to a new cognition (understanding or phronesis) of each believer’s place within the Body of Christ. The insights gained from language philosophy, cognitive psychology and the normative grounding make it evident that people always try to make sense of what they are experiencing and of what they are observing. The attempt to understand necessitates further reflection on the importance of cognition. Finally, practical theological perspectives are offered to indicate that cognition is important to create a meaningful liturgy. This cognition is anchored in God’s presence during worship services and, therefore, it requires meaningful words from liturgists.
Kiraz, George Anton
This book presents a tractable computational model that can cope with complex morphological operations, especially in Semitic languages, and less complex morphological systems present in Western languages. It outlines a new generalized regular rewrite rule system that uses multiple finite-state automata to cater to root-and-pattern morphology,…
Wagner, J C; Solomon, W D; Michel, P A; Juge, C; Baud, R H; Rector, A L; Scherrer, J R
Re-usable and sharable, and therefore language-independent concept models are of increasing importance in the medical domain. The GALEN project (Generalized Architecture for Languages Encyclopedias and Nomenclatures in Medicine) aims at developing language-independent concept representation systems as the foundations for the next generation of multilingual coding systems. For use within clinical applications, the content of the model has to be mapped to natural language. A so-called Multilingual Information Module (MM) establishes the link between the language-independent concept model and different natural languages. This text generation software must be versatile enough to cope at the same time with different languages and with different parts of a compositional model. It has to meet, on the one hand, the properties of the language as used in the medical domain and, on the other hand, the specific characteristics of the underlying model and its representation formalism. We propose a semantic-oriented approach to natural language generation that is based on linguistic annotations to a concept model. This approach is realized as an integral part of a Terminology Server, built around the concept model and offering different terminological services for clinical applications.
Nyachwaya, James M.
The objective of this study was to examine college general chemistry students' conceptual understanding and language fluency in the context of the topic of acids and bases. 115 students worked in groups of 2-4 to complete an activity on conductometry, where they were given a scenario in which a titration of sodium hydroxide solution and dilute…
Jamil, Hasan M
One of the many unique features of biological databases is that the mere existence of a ground data item is not always a precondition for a query response. It may be argued that from a biologist's standpoint, queries are not always best posed using a structured language. By this we mean that approximate and flexible responses to natural language like queries are well suited for this domain. This is partly due to biologists' tendency to seek simpler interfaces and partly due to the fact that questions in biology involve high level concepts that are open to interpretations computed using sophisticated tools. In such highly interpretive environments, rigidly structured databases do not always perform well. In this paper, our goal is to propose a semantic correspondence plug-in to aid natural language query processing over arbitrary biological database schema with an aim to providing cooperative responses to queries tailored to users' interpretations. Natural language interfaces for databases are generally effective when they are tuned to the underlying database schema and its semantics. Therefore, changes in database schema become impossible to support, or a substantial reorganization cost must be absorbed to reflect any change. We leverage developments in natural language parsing, rule languages and ontologies, and data integration technologies to assemble a prototype query processor that is able to transform a natural language query into a semantically equivalent structured query over the database. We allow knowledge rules and their frequent modifications as part of the underlying database schema. The approach we adopt in our plug-in overcomes some of the serious limitations of many contemporary natural language interfaces, including support for schema modifications and independence from underlying database schema. The plug-in introduced in this paper is generic and facilitates connecting user selected natural language interfaces to arbitrary databases using a
Barrera, Rosalinda B.; Aleman, Magdalena
Described is a newspaper project in which elementary students report life as it was in the Middle Ages. Students are involved in a variety of language-centered activities. For example, they gather and evaluate information about medieval times and write, edit, and proofread articles for the newspaper. (RM)
Theune, Mariet; Freedman, R.; Callaway, C.
This paper describes how a language generation system that was originally designed for monologue generation, has been adapted for use in the OVIS spoken dialogue system. To meet the requirement that in a dialogue, the system’s utterances should make up a single, coherent dialogue turn, several
Fedurek, Pawel; Slocombe, Katie E
Language is a uniquely human trait, and questions of how and why it evolved have been intriguing scientists for years. Nonhuman primates (primates) are our closest living relatives, and their behavior can be used to estimate the capacities of our extinct ancestors. As humans and many primate species rely on vocalizations as their primary mode of communication, the vocal behavior of primates has been an obvious target for studies investigating the evolutionary roots of human speech and language. By studying the similarities and differences between human and primate vocalizations, comparative research has the potential to clarify the evolutionary processes that shaped human speech and language. This review examines some of the seminal and recent studies that contribute to our knowledge regarding the link between primate calls and human language and speech. We focus on three main aspects of primate vocal behavior: functional reference, call combinations, and vocal learning. Studies in these areas indicate that despite important differences, primate vocal communication exhibits some key features characterizing human language. They also indicate, however, that some critical aspects of speech, such as vocal plasticity, are not shared with our primate cousins. We conclude that comparative research on primate vocal behavior is a very promising tool for deepening our understanding of the evolution of human speech and language, but much is still to be done as many aspects of monkey and ape vocalizations remain largely unexplored.
Moreno, Iván; de Vega, Manuel; León, Inmaculada
The mu rhythms (8-13 Hz) and the beta rhythms (15 up to 30 Hz) of the EEG are observed in the central electrodes (C3, Cz and C4) in resting states, and become suppressed when participants perform a manual action or when they observe another's action. This has led researchers to consider that these rhythms are electrophysiological markers of the motor neuron activity in humans. This study tested whether the comprehension of action language, unlike abstract language, modulates mu and low beta rhythms (15-20 Hz) in a similar way as the observation of real actions. The log-ratios were calculated for each oscillatory band between each condition and baseline resting periods. The results indicated that both action language and action videos caused mu and beta suppression (negative log-ratios), whereas abstract language did not, confirming the hypothesis that understanding action language activates motor networks in the brain. In other words, the resonance of motor areas associated with action language is compatible with the embodiment approach to linguistic meaning. Copyright © 2013 Elsevier Inc. All rights reserved.
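The log-ratio measure described above can be illustrated with a minimal sketch. The power values below are invented for illustration and are not data from the study; the abstract does not state the logarithm base, so the natural logarithm is an assumption (the sign of the result is the same for any base).

```python
import math

def log_ratio(condition_power: float, baseline_power: float) -> float:
    """Log of band power in a condition relative to a resting baseline.

    Negative values mean the rhythm is suppressed relative to baseline.
    """
    if condition_power <= 0 or baseline_power <= 0:
        raise ValueError("band power must be positive")
    return math.log(condition_power / baseline_power)

# Hypothetical mu-band power values in arbitrary units (not from the study).
baseline = 4.0
conditions = {
    "action_language": 3.0,
    "action_video": 2.5,
    "abstract_language": 4.0,
}
ratios = {name: log_ratio(power, baseline) for name, power in conditions.items()}

for name, value in ratios.items():
    label = "suppression" if value < 0 else "no suppression"
    print(f"{name}: {value:+.3f} ({label})")
```

With these made-up numbers, the two action conditions yield negative log-ratios and the abstract-language condition yields zero, which mirrors the pattern the abstract reports.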
We describe basic concepts and software architectures for the integration of shallow and deep (linguistics-based, semantics-oriented) natural language processing (NLP) components. The main goal of this novel, hybrid integration paradigm is improving robustness of deep processing. After an introduction to constraint-based natural language parsing, we give an overview of typical shallow processing tasks. We introduce XML standoff markup as an additional abstraction layer that eases integration ...
Lutz, Martha Victoria Rosett
Scientific literacy is a central goal of science education. One purpose of this investigation was to reevaluate the definition of 'scientific literacy.' Another purpose was to develop and implement new curriculum involving natural history experiments with insects, with the goal of allowing students opportunities to construct an understanding of the nature of science, a crucial aspect of scientific literacy. This investigation was a qualitative case study. Methods of data collection included direct observations, analysis of sketches and written products created by students and classroom teachers, and analysis of audio tapes. Major findings include: (1) Scientific literacy is generally defined by lists of factual information which students are expected to master. When asked to evaluate their knowledge of selected items on a list published in a science education reform curriculum guide, 15 practicing scientists reported lack of familiarity or comprehension with many items, with the exception of items within their areas of specialization. (2) Genuine natural history experiments using insects can be incorporated into the existing school schedule and need not require any increase in the budget for science materials. (3) Students as young as first through third grade can learn the manual techniques and conceptual skills necessary for designing and conducting original natural history experiments, including manipulating the insects, making accurate sketches, developing testable hypotheses, recording data, and drawing conclusions from their data. Students were generally enthusiastic both about working with live insects and also conducting genuine science experiments. (4) Girls appear both positive and engaged with natural history activities and may be more likely than boys to follow through on designing, conducting, and reporting on independent experiments. The results imply that a valid definition of scientific literacy should be based on the ability to acquire scientific
Snefjella, Bryor; Kuperman, Victor
Existing evidence shows that more abstract mental representations are formed and more abstract language is used to characterize phenomena that are more distant from the self. Yet the precise form of the functional relationship between distance and linguistic abstractness is unknown. In four studies, we tested whether more abstract language is used in textual references to more geographically distant cities (Study 1), time points further into the past or future (Study 2), references to more socially distant people (Study 3), and references to a specific topic (Study 4). Using millions of linguistic productions from thousands of social-media users, we determined that linguistic concreteness is a curvilinear function of the logarithm of distance, and we discuss psychological underpinnings of the mathematical properties of this relationship. We also demonstrated that gradient curvilinear effects of geographic and temporal distance on concreteness are nearly identical, which suggests uniformity in representation of abstractness along multiple dimensions. © The Author(s) 2015.
Botha, Jan A.; Pitler, Emily; Ma, Ji; Bakalov, Anton; Salcianu, Alex; Weiss, David; McDonald, Ryan; Petrov, Slav
We show that small and shallow feed-forward neural networks can achieve near state-of-the-art results on a range of unstructured and structured language processing tasks while being considerably cheaper in memory and computational requirements than deep recurrent models. Motivated by resource-constrained environments like mobile phones, we showcase simple techniques for obtaining such small neural network models, and investigate different tradeoffs when deciding how to allocate a small memory...
Theune, Mariet; Freedman, R.; Callaway, C.
This paper describes how a language generation system that was originally designed for monologue generation, has been adapted for use in the OVIS spoken dialogue system. To meet the requirement that in a dialogue, the system’s utterances should make up a single, coherent dialogue turn, several modifications had to be made to the system. The paper also discusses the influence of dialogue context on information status, and its consequences for the generation of referring expressions and accentu...
Language modeling plays a critical role in natural language processing and understanding. Starting from a general structure, language models are able to learn natural language patterns from rich input data. However, the state-of-the-art language models only take advantage of words themselves, which
Komac, B.; Zorn, M.; Ciglič, R.; Steinführer, A.
The importance of natural-disaster education for social preparedness is presented. Increasing damage caused by natural disasters around the globe draws attention to the fact that even developed societies must adapt to natural processes. Natural-disaster education is a component part of any education strategy for a sustainably oriented society. The purpose of this article is to present the role of formal education in natural disasters in Europe. To ensure a uniform overview, the study used secondary-school geography textbooks from the collection at the Georg Eckert Institute for International Textbook Research in Braunschweig, Germany. Altogether, nearly 190 textbooks from 35 European countries were examined. The greatest focus on natural disasters can be found in textbooks published in western Europe (3.8% of pages describing natural disasters), and the smallest in those published in eastern Europe (0.7%). A share of textbook pages exceeding three percent describing natural disasters can also be found in northern Europe (3.6%) and southeast Europe, including Turkey (3.4%). The shares in central and southern Europe exceed two percent (i.e., 2.8% and 2.3%, respectively). The types and specific examples of natural disasters most commonly covered in textbooks as well as the type of natural disasters presented in textbooks according to the number of casualties and the damage caused were analyzed. The results show that the majority of European (secondary-school) education systems are poorly developed in terms of natural-disaster education. If education is perceived as part of natural-disaster management and governance, greater attention should clearly be dedicated to this activity. In addition to formal education, informal education also raises a series of questions connected with the importance of this type of education. Special attention was drawn to the importance of knowledge that locals have about their region because this aspect of education is important in both
Emmeche, Claus; Hoffmeyer, Jesper Normann
be of considerable value, not only heuristically, but in order to comprehend the irreducible nature of living organisms. In arguing for a semiotic perspective on living nature, it makes a marked difference whether the departure is made from the tradition of F. de Saussure's structural linguistics or from...
Topac, V; Stoicu-Tivadar, V
Patient empowerment is important in order to increase the quality of medical care and the life quality of the patients. An important obstacle to empowering patients is the language barrier the lay patient encounters when accessing medical information. The objective was to design and develop a service that will help increase the understanding of medical language for lay persons. The service identifies and explains medical terminology from a given text by annotating the terms in the original text with the definition. It is based on an original terminology interpretation engine that uses a fuzzy matching dictionary. The service was implemented in two projects: (a) in the server of a tele-care system (TELEASIS), with the purpose of adapting medical text assigned by medical personnel for the assisted patients; (b) in a dedicated web site that can adapt the medical language from raw text or from existing web pages. The output of the service was evaluated by a group of persons, and the results indicate that such a system can increase the understanding of medical texts. Several design decisions were driven by the evaluation and are being considered for future development. Other tests measuring accuracy and time performance for the fuzzy terminology recognition have been performed. Test results revealed good performance for accuracy and excellent results regarding time performance. The current version of the service increases the accessibility of medical language by explaining terminology with a good accuracy, while allowing the user to easily identify errors, in order to reduce the risk of incorrect terminology recognition.
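The idea of annotating terms through a fuzzy matching dictionary can be sketched as follows. This is not the engine described in the abstract, whose internals are not given; it is a minimal illustration that swaps in Python's standard difflib similarity matching. The glossary entries, the `annotate` function, and the 0.85 cutoff are all hypothetical.

```python
import difflib
import re

# Hypothetical three-entry glossary; a real service would use a large dictionary.
GLOSSARY = {
    "hypertension": "high blood pressure",
    "tachycardia": "an abnormally fast heart rate",
    "edema": "swelling caused by trapped fluid",
}

def annotate(text: str, glossary: dict = GLOSSARY, cutoff: float = 0.85) -> str:
    """Append a bracketed definition after each word that fuzzily matches a term."""
    terms = list(glossary)

    def explain(match: re.Match) -> str:
        word = match.group(0)
        # get_close_matches returns the best candidates above the similarity cutoff.
        hits = difflib.get_close_matches(word.lower(), terms, n=1, cutoff=cutoff)
        if not hits:
            return word
        term = hits[0]
        # Showing the matched term lets the reader spot a wrong recognition.
        return f"{word} [{term}: {glossary[term]}]"

    return re.sub(r"[A-Za-z]+", explain, text)

annotated = annotate("Patient has hypertention and edema.")
print(annotated)
```

Printing the matched dictionary term next to the original word is one simple way to let the user identify recognition errors, which the abstract names as a goal. A misspelling such as "hypertention" is still resolved because its similarity to "hypertension" exceeds the cutoff.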
Harispe, Sébastien; Janaqi, Stefan
Artificial Intelligence federates numerous scientific fields in the aim of developing machines able to assist human operators performing complex treatments, most of which demand high cognitive skills (e.g. learning or decision processes). Central to this quest is to give machines the ability to estimate the likeness or similarity between things in the way human beings estimate the similarity between stimuli. In this context, this book focuses on semantic measures: approaches designed for comparing semantic entities such as units of language, e.g. words, sentences, or concepts and instances def
Canfield, K.; Bray, B.; Huff, S.; Warner, H.
We describe a prototype system for semi-automatic database capture of free-text echocardiography reports. The system is very simple and uses a Unified Medical Language System compatible architecture. We use this system and a large body of texts to create a patient database and develop a comprehensive hierarchical dictionary for echocardiography.
Bucks, Gregory Warren
Computers have become an integral part of how engineers complete their work, allowing them to collect and analyze data, model potential solutions and aiding in production through automation and robotics. In addition, computers are essential elements of the products themselves, from tennis shoes to construction materials. An understanding of how computers function, both at the hardware and software level, is essential for the next generation of engineers. Despite the need for engineers to develop a strong background in computing, little opportunity is given for engineering students to develop these skills. Learning to program is widely seen as a difficult task, requiring students to develop not only an understanding of specific concepts, but also a way of thinking. In addition, students are forced to learn a new tool, in the form of the programming environment employed, along with these concepts and thought processes. Because of this, many students will not develop a sufficient proficiency in programming, even after progressing through the traditional introductory programming sequence. This is a significant problem, especially in the engineering disciplines, where very few students receive more than one or two semesters' worth of instruction in an already crowded engineering curriculum. To address these issues, new pedagogical techniques must be investigated in an effort to enhance the ability of engineering students to develop strong computing skills. However, these efforts are hindered by the lack of published assessment instruments available for probing an individual's understanding of programming concepts across programming languages. Traditionally, programming knowledge has been assessed by producing written code in a specific language. This can be an effective method, but does not lend itself well to comparing the pedagogical impact of different programming environments, languages or paradigms. This dissertation presents a phenomenographic research study
Vukotic, Vedran; Raymond, Christian; Gravier, Guillaume
Recurrent Neural Network (RNN) architectures have recently become a very popular choice for Spoken Language Understanding (SLU) problems; however, they represent a big family of different architectures that can furthermore be combined to form more complex neural networks. In this work, we compare different recurrent networks, such as simple Recurrent Neural Networks (RNN), Long Short-Term Memory (LSTM) networks, Gated Recurrent Units (GRU) and their bidirectional versions,...
Jutla, Antarpreet; Khan, Rakibul; Colwell, Rita
Diarrheal diseases remain a serious global public health threat, especially for those populations lacking access to safe water and sanitation infrastructure. Although association of several diarrheal diseases, e.g., cholera, shigellosis, etc., with climatic processes has been documented, the global human population remains at heightened risk of outbreak of diseases after natural disasters, such as earthquakes, floods, or droughts. In this review, cholera was selected as a signature diarrheal disease and the role of natural disasters in triggering and transmitting cholera was analyzed. Key observations include identification of an inherent feedback loop that includes societal structure, prevailing climatic processes, and spatio-temporal seasonal variability of natural disasters. Data obtained from satellite-based remote sensing are concluded to have application, although limited, in predicting risks of a cholera outbreak(s). We argue that with the advent of new high spectral and spatial resolution data, earth observation systems should be seamlessly integrated in a decision support mechanism to mobilize resources when a region suffers a natural disaster. A framework is proposed that can be used to assess the impact of natural disasters with response to outbreak of cholera, providing assessment of short- and long-term influence of climatic processes on disease outbreaks.
Moore, Robert C.; Cohen, Michael H.
Under this effort, SRI has developed spoken-language technology for interactive problem solving, featuring real-time performance for up to several thousand word vocabularies, high semantic accuracy, habitability within the domain, and robustness to many sources of variability. Although the technology is suitable for many applications, efforts to date have focused on developing an Air Travel Information System (ATIS) prototype application. SRI's ATIS system has been evaluated in four ARPA benchmark evaluations, and has consistently been at or near the top in performance. These achievements are the result of SRI's technical progress in speech recognition, natural-language processing, and speech and natural-language integration.
Daltrozzo, Jerome; Emerson, Samantha N; Deocampo, Joanne; Singh, Sonia; Freggens, Marjorie; Branum-Martin, Lee; Conway, Christopher M
Statistical learning (SL) is believed to enable language acquisition by allowing individuals to learn regularities within linguistic input. However, neural evidence supporting a direct relationship between SL and language ability is scarce. We investigated whether there are associations between event-related potential (ERP) correlates of SL and language abilities while controlling for the general level of selective attention. Seventeen adults completed tests of visual SL, receptive vocabulary, grammatical ability, and sentence completion. Response times and ERPs showed that SL is related to receptive vocabulary and grammatical ability. ERPs indicated that the relationship between SL and grammatical ability was independent of attention while the association between SL and receptive vocabulary depended on attention. The implications of these dissociative relationships in terms of underlying mechanisms of SL and language are discussed. These results further elucidate the cognitive nature of the links between SL mechanisms and language abilities. Copyright © 2017 Elsevier Inc. All rights reserved.
Bisikalo, Oleg V.
The task of evaluating uncertainty in the measurement of sense in natural language constructions (NLCs) was researched through formalization of the notions of the language image, formalization of artificial cognitive systems (ACSs) and the formalization of units of meaning. The method for measuring the sense of natural language constructions incorporated fuzzy relations of meaning, which ensures that information about the links between lemmas of the text is taken into account, permitting the evaluation of two types of measurement uncertainty of sense characteristics. Using developed application programs, experiments were conducted to investigate the proposed method to tackle the identification of informative characteristics of text. The experiments resulted in dependencies of parameters being obtained in order to utilise the Pareto distribution law to define relations between lemmas, analysis of which permits the identification of exponents of an average number of connections of the language image as the most informative characteristics of text.
Clody, Michael C
The essay argues that Francis Bacon's considerations of parables and cryptography reflect larger interpretative concerns of his natural philosophic project. Bacon describes nature as having a language distinct from those of God and man, and, in so doing, establishes a central problem of his natural philosophy: namely, how can the language of nature be accessed through scientific representation? Ultimately, Bacon's solution relies on a theory of differential and duplicitous signs that conceal within them the hidden voice of nature, which is best recognized in the natural forms of efficient causality. The "alphabet of nature" (those tables of natural occurrences) consequently plays a central role in his program, as it renders nature's language susceptible to a process of decryption that mirrors the model of the bilateral cipher. It is argued that while the writing of Bacon's natural philosophy strives for literality, its investigative process preserves a space for alterity within scientific representation that is made accessible to those with the interpretative key.
Vanderhoeven, Sonia; Piqueray, Julien; Halford, Mathieu; Nulens, Greet; Vincke, Jan; Mahy, Grégory
We conducted a survey to determine how two professional sectors in Belgium, horticulture professionals and nature reserve managers (those directly involved in conservation), view the issues associated with invasive plant species. We developed and utilized a questionnaire that addressed the themes of awareness, concept and use of language, availability of information, impacts and, finally, control and available solutions. Using co-inertia analyses, we tested to what extent the perception of invasive alien species (IAS) was dependent upon the perception of Nature in general. Only forty-two percent of respondent horticulture professionals and eighty-two percent of nature reserve managers had a general knowledge of IAS. Many individuals in both target groups nonetheless had an accurate understanding of the scientific issues. Our results therefore suggest that the manner in which individuals within the two groups view, or perceive, the IAS issue was more the result of lack of information than simply biased perceptions of target groups. Though IAS perceptions by the two groups diverged, they were on par with how they viewed Nature in general. The descriptions of IAS by participants converged with the ideas and concepts frequently found in the scientific literature. Both managers and horticulture professionals expressed a strong willingness to participate in programs designed to prevent the spread of, and damage caused by, IAS. Despite this, the continued commercial availability of many invasive species highlighted the necessity to use both mandatory and voluntary approaches to reduce their re-introduction and spread. The results of this study provide stakeholders and conservation managers with practical information on which communication and management strategies can be based.
Large, David R; Clark, Leigh; Quandt, Annie; Burnett, Gary; Skrypchuk, Lee
Given the proliferation of 'intelligent' and 'socially-aware' digital assistants embodying everyday mobile technology - and the undeniable logic that utilising voice-activated controls and interfaces in cars reduces the visual and manual distraction of interacting with in-vehicle devices - it appears inevitable that next generation vehicles will be embodied by digital assistants and utilise spoken language as a method of interaction. From a design perspective, defining the language and interaction style that a digital driving assistant should adopt is contingent on the role that they play within the social fabric and context in which they are situated. We therefore conducted a qualitative, Wizard-of-Oz study to explore how drivers might interact linguistically with a natural language digital driving assistant. Twenty-five participants drove for 10 min in a medium-fidelity driving simulator while interacting with a state-of-the-art, high-functioning, conversational digital driving assistant. All exchanges were transcribed and analysed using recognised linguistic techniques, such as discourse and conversation analysis, normally reserved for interpersonal investigation. Language usage patterns demonstrate that interactions with the digital assistant were fundamentally social in nature, with participants affording the assistant equal social status and high-level cognitive processing capability. For example, participants were polite, actively controlled turn-taking during the conversation, and used back-channelling, fillers and hesitation, as they might in human communication. Furthermore, participants expected the digital assistant to understand and process complex requests mitigated with hedging words and expressions, and peppered with vague language and deictic references requiring shared contextual information and mutual understanding. Findings are presented in six themes which emerged during the analysis - formulating responses; turn-taking; back
The two authors, Thi Phuong Thao-Do and Chokchai Yuenyong, explored the Nature of Science as it is understood in Vietnam, a fast-developing "ancient" and modern country which continues to be shaped by uniquely Asian social norms and values. Upon reviewing their paper, I observed strong parallels to the country, the United Arab Emirates,…
The competition for land has become an issue of major concern and cause of conflict, especially between pastoralists and crop farmers, but also between pastoralists and nature conservation institutions. The Biosphere Reserve of W in Benin Republic (WBR) and its surrounding lands are located in
Boran, Gül Hanim; Bag, Hüseyin
The aim of this study is to explore the effects of argumentation on pre-service science teachers' views of the nature of science. The study used a qualitative case study design and was conducted with 20 pre-service science teachers. Data sources include an open-ended questionnaire and audio-taped interviews. According to pretest and posttest…
The themes of human rights and human rights education in South Africa's multi-cultural society are central to the work of Cornelia Roux. This article discusses the human reality and ethics underlying those themes, using an approach based on a view of human nature. It has six sections, starting with an introduction ...
Liu, Haitao; Xu, Chunshan; Liang, Junying
Dependency distance, measured by the linear distance between two syntactically related words in a sentence, is generally held as an important index of memory burden and an indicator of syntactic difficulty. Since this constraint of memory is common for all human beings, there may well be a universal preference for dependency distance minimization (DDM) for the sake of reducing memory burden. This human-driven language universal is supported by big data analyses of various corpora that consistently report shorter overall dependency distance in natural languages than in artificial random languages and long-tailed distributions featuring a majority of short dependencies and a minority of long ones. Human languages, as complex systems, seem to have evolved to come up with diverse syntactic patterns under the universal pressure for dependency distance minimization. However, there always exist a small number of long-distance dependencies in natural languages, which may reflect some other biological or functional constraints. The language system may adapt itself to these sporadic long-distance dependencies. It is these universal constraints that have shaped such a rich diversity of syntactic patterns in human languages. Copyright © 2017. Published by Elsevier B.V.
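The definition of dependency distance given above, the linear distance between two syntactically related words, can be made concrete with a short sketch. The head-index encoding and the example parse are assumptions made for illustration and are not taken from the paper.

```python
def dependency_distances(heads: list[int]) -> list[int]:
    """Linear distance between each word and its syntactic head.

    heads[i] holds the 1-based position of the head of word i + 1.
    A head of 0 marks the root, which has no dependency distance.
    """
    return [
        abs(position - head)
        for position, head in enumerate(heads, start=1)
        if head != 0
    ]

def mean_dependency_distance(heads: list[int]) -> float:
    """Average dependency distance of one sentence (0.0 if it has no arcs)."""
    distances = dependency_distances(heads)
    return sum(distances) / len(distances) if distances else 0.0

# Assumed parse of "The dog chased the cat":
# The -> dog, dog -> chased, chased = root, the -> cat, cat -> chased
heads = [2, 3, 0, 5, 3]
print(dependency_distances(heads))      # [1, 1, 1, 2]
print(mean_dependency_distance(heads))  # 1.25
```

In this toy sentence three of the four dependencies link adjacent words and only one spans two positions, a small-scale picture of the "majority of short dependencies" that the abstract describes.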
Groth, P.T.; Gil, Y
Scientists increasingly use workflows to represent and share their computational experiments. Because of their declarative nature, focus on pre-existing component composition and the availability of visual editors, workflows provide a valuable start for creating user-friendly environments for end
Swartz, Jordan; Koziatek, Christian; Theobald, Jason; Smith, Silas; Iturrate, Eduardo
Testing for venous thromboembolism (VTE) is associated with cost and risk to patients (e.g. radiation). To assess the appropriateness of imaging utilization at the provider level, it is important to know that provider's diagnostic yield (percentage of tests positive for the diagnostic entity of interest). However, determining diagnostic yield typically requires either time-consuming, manual review of radiology reports or the use of complex and/or proprietary natural language processing software. The objectives of this study were twofold: 1) to develop and implement a simple, user-configurable, and open-source natural language processing tool to classify radiology reports with high accuracy and 2) to use the results of the tool to design a provider-specific VTE imaging dashboard, consisting of both utilization rate and diagnostic yield. Two physicians reviewed a training set of 400 lower extremity ultrasound (UTZ) and computed tomography pulmonary angiogram (CTPA) reports to understand the language used in VTE-positive and VTE-negative reports. The insights from this review informed the arguments to the five modifiable parameters of the NLP tool. A validation set of 2,000 studies was then independently classified by the reviewers and by the tool; the classifications were compared and the performance of the tool was calculated. The tool was highly accurate in classifying the presence and absence of VTE for both the UTZ (sensitivity 95.7%; 95% CI 91.5-99.8, specificity 100%; 95% CI 100-100) and CTPA reports (sensitivity 97.1%; 95% CI 94.3-99.9, specificity 98.6%; 95% CI 97.8-99.4). The diagnostic yield was then calculated at the individual provider level and the imaging dashboard was created. We have created a novel NLP tool designed for users without a background in computer programming, which has been used to classify venous thromboembolism reports with a high degree of accuracy. The tool is open-source and available for download at http
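The evaluation quantities named above (sensitivity, specificity, and diagnostic yield as the percentage of tests positive) can be computed as in the following sketch. It does not reproduce the authors' tool; the function names and the eight example labels are hypothetical.

```python
def sensitivity_specificity(reference: list[bool], predicted: list[bool]) -> tuple[float, float]:
    """Compare tool output against reviewer labels (True = VTE-positive report)."""
    if len(reference) != len(predicted):
        raise ValueError("label lists must have the same length")
    pairs = list(zip(reference, predicted))
    tp = sum(1 for r, p in pairs if r and p)          # true positives
    fn = sum(1 for r, p in pairs if r and not p)      # false negatives
    tn = sum(1 for r, p in pairs if not r and not p)  # true negatives
    fp = sum(1 for r, p in pairs if not r and p)      # false positives
    return tp / (tp + fn), tn / (tn + fp)

def diagnostic_yield(results: list[bool]) -> float:
    """Fraction of tests that were positive for the entity of interest."""
    return sum(results) / len(results)

# Hypothetical labels for eight reports (not data from the study).
reviewer = [True, True, True, False, False, False, False, False]
tool = [True, True, False, False, False, False, False, True]

sens, spec = sensitivity_specificity(reviewer, tool)
print(f"sensitivity={sens:.3f} specificity={spec:.3f}")
print(f"provider yield={diagnostic_yield(tool):.3f}")
```

A provider-level dashboard of the kind described would apply `diagnostic_yield` to each provider's classified reports separately; the sketch assumes both classes are present in the reference labels, otherwise the divisions would fail.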
Idrissou, L.; Aarts, M.N.C.; Leeuwis, C.; Paassen, van A.
This paper investigated conflicts in participatory protected areas management in Benin to better understand their dynamics. This review paper is based on four articles written from three case-studies of conflicts that emerged and evolved in participatory protected areas management in Benin and a
Hoyle, Fred; Wickramasinghe, Chandra
We discuss a possible biological explanation of the phenomenon of colour prejudice that hinges on the relative advantages and disadvantages in the expression of the strongly dominant gene(s) for melanin under ice-age conditions at different locations on the Earth. An understanding of the genesis of this prejudice could hopefully eradicate or ameliorate its worst manifestations in modern society.
Artificial intelligence implications for knowledge retrieval. ...through understanding and generalizing plans", "An approach to learning from observation", and "Artificial intelligence implications for knowledge
We propose a stochastic model for the number of different words in a given database which incorporates the dependence on the database size and historical changes. The main feature of our model is the existence of two different classes of words: (i) a finite number of core words, which have higher frequency and do not affect the probability of a new word to be used, and (ii) the remaining virtually infinite number of noncore words, which have lower frequency and, once used, reduce the probability of a new word to be used in the future. Our model relies on a careful analysis of the Google Ngram database of books published in the last centuries, and its main consequence is the generalization of Zipf’s and Heaps’ law to two-scaling regimes. We confirm that these generalizations yield the best simple description of the data among generic descriptive models and that the two free parameters depend only on the language but not on the database. From the point of view of our model, the main change on historical time scales is the composition of the specific words included in the finite list of core words, which we observe to decay exponentially in time with a rate of approximately 30 words per year for English.
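The quantity being modeled, the number of different words as a function of database size, can be measured directly, as the sketch below shows. It only computes the empirical growth curve on a toy token list and does not implement the core/noncore stochastic model from the abstract; the function name and example sentence are hypothetical.

```python
def vocabulary_growth(tokens: list[str]) -> list[int]:
    """Number of distinct words seen after each token (a Heaps-style curve)."""
    seen: set[str] = set()
    growth: list[int] = []
    for token in tokens:
        seen.add(token)
        growth.append(len(seen))
    return growth

# Toy corpus; a real analysis would use a large database such as book n-grams.
tokens = "the cat saw the dog and the dog saw the cat".split()
growth = vocabulary_growth(tokens)
print(growth)  # [1, 2, 3, 3, 4, 5, 5, 5, 5, 5, 5]
```

Plotting such a curve on logarithmic axes for a large corpus is the usual way to inspect how vocabulary size scales with database size, which is the relationship that Heaps' law summarizes.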
This volume examines mathematics as a product of the human mind and analyzes the language of "pure mathematics" from various advanced-level sources. Through analysis of the foundational texts of mathematics, it is demonstrated that math is a complex literary creation, containing objects, actors, actions, projection, prediction, planning, explanation, evaluation, roles, image schemas, metonymy, conceptual blending, and, of course, (natural) language. The book follows the narrative of mathematics in a typical order of presentation for a standard university-level algebra course, beginning with analysis of set theory and mappings and continuing along a path of increasing complexity. At each stage, primary concepts, axioms, definitions, and proofs will be examined in an effort to unfold the tell-tale traces of the basic human cognitive patterns of story and conceptual blending. This book will be of interest to mathematicians, teachers of mathematics, cognitive scientists, cognitive linguists, and anyone interested...
Batchelor, D B; Berry, L A; Bonoli, P T; Carter, M D; Choi, M; D'Azevedo, E; D'Ippolito, D A; Gorelenkov, N; Harvey, R W; Jaeger, E F; Myra, J R; Okuda, H; Phillips, C K; Smithe, D N; Wright, J C
In a magnetized plasma, such as in fusion devices or the Earth's magnetosphere, several different kinds of waves can simultaneously exist, having very different physical properties. Under the right conditions one wave can quite suddenly convert to another type. Depending on the case, this can be either a great benefit or a problem for the use of waves to heat and control fusion plasmas. Understanding and accurately modeling such behavior is a major computational challenge
Feng, Qiangze; Qi, Hongwei; Fukushima, Toshikazu
Information services accessed via mobile phones provide information directly relevant to subscribers’ daily lives and are an area of dynamic market growth worldwide. Although many information services are currently offered by mobile operators, many of the existing solutions require a unique gateway for each service, and it is inconvenient for users to have to remember a large number of such gateways. Furthermore, the Short Message Service (SMS) is very popular in China and Chinese users would prefer to access these services in natural language via SMS. This chapter describes a Natural Language Based Service Selection System (NL3S) for use with a large number of mobile information services. The system can accept user queries in natural language and navigate them to the required service. Since it is difficult for existing methods to achieve high accuracy and high coverage and anticipate which other services a user might want to query, the NL3S is developed based on a Multi-service Ontology (MO) and Multi-service Query Language (MQL). The MO and MQL provide semantic and linguistic knowledge, respectively, to facilitate service selection for a user query and to provide adaptive service recommendations. Experiments show that the NL3S can achieve 75-95% accuracy and 85-95% user satisfaction for processing various styles of natural language queries. A trial involving navigation of 30 different mobile services shows that the NL3S can provide a viable commercial solution for mobile operators.
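The idea of selecting one service for a query while also recommending related ones can be illustrated with a toy keyword matcher. This is only a sketch under stated assumptions: the abstract does not specify the MO or MQL, so the `SERVICE_CONCEPTS` table, the service names, and the overlap-count scoring are all hypothetical stand-ins.

```python
import re
from typing import Dict, List, Set, Tuple

# Hypothetical mini-ontology: service name -> concept keywords.
SERVICE_CONCEPTS: Dict[str, Set[str]] = {
    "weather": {"weather", "rain", "temperature", "forecast"},
    "stocks": {"stock", "share", "price", "market"},
    "flights": {"flight", "airport", "departure", "arrival"},
}


def rank_services(query: str) -> List[Tuple[str, int]]:
    """Rank services by how many of their concept keywords occur in the query."""
    words = set(re.findall(r"[a-z]+", query.lower()))
    scores = [(name, len(words & concepts))
              for name, concepts in SERVICE_CONCEPTS.items()]
    matched = [item for item in scores if item[1] > 0]
    # Highest overlap first; ties broken alphabetically for determinism.
    return sorted(matched, key=lambda item: (-item[1], item[0]))


ranking = rank_services("What is the weather forecast at the airport?")
print(ranking)
```

In this sketch the first entry plays the role of the selected service and the remaining entries play the role of recommendations.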
Pelger, Susanne; Sigrell, Anders
Background: Feedback is one of the most significant factors for students' development of writing skills. For feedback to be successful, however, students and teachers need a common language - a meta-language - for discussing texts. Not least in science education, such a meta-language might contribute to improving writing training and feedback-giving. Purpose: The aim of this study was to explore students' perception of teachers' feedback given on their texts in two genres, and to suggest how writing training and feedback-giving could become more efficient. Sample: The study included 44 degree project students in biology and molecular biology, and 21 supervising teachers at a Swedish university. Design and methods: The study concerned students' writing about their degree projects in two genres: scientific writing and popular science writing. The data consisted of documented teacher feedback on the students' popular science texts. It also included students' and teachers' answers to questionnaires about writing and feedback. All data were collected during the spring of 2012. Teachers' feedback, actual and recalled - by students and teachers, respectively - was analysed and compared using the so-called Canons of rhetoric. Results: While the teachers recalled the given feedback as mainly positive, most students recalled only negative feedback. According to the teachers, suggested improvements concerned firstly the content, and secondly the structure of the text. In contrast, the students mentioned language style first, followed by content. Conclusions: The disagreement between students and teachers regarding how and what feedback was given on the students' texts confirms the need for improved strategies for writing training and feedback-giving in science education. We suggest that the rhetorical meta-language might play a crucial role in overcoming the difficulties observed in this study. We also discuss how training of writing skills may contribute to
Laasonen, Marja; Smolander, Sini; Lahti-Nuuttila, Pekka; Leminen, Miika; Lajunen, Hanna-Reetta; Heinonen, Kati; Pesonen, Anu-Katriina; Bailey, Todd M; Pothos, Emmanuel M; Kujala, Teija; Leppänen, Paavo H T; Bartlett, Christopher W; Geneid, Ahmed; Lauronen, Leena; Service, Elisabet; Kunnari, Sari; Arkkila, Eva
Developmental language disorder (DLD, also called specific language impairment, SLI) is a common developmental disorder comprising the largest disability group in pre-school-aged children. Approximately 7% of the population is expected to have developmental language difficulties. However, the specific etiological factors leading to DLD are not yet known and even the typical linguistic features appear to vary by language. We present here a project that investigates DLD at multiple levels of analysis and aims to make the reliable prediction and early identification of the difficulties possible. Following the multiple deficit model of developmental disorders, we investigate the DLD phenomenon at the etiological, neural, cognitive, behavioral, and psychosocial levels, in a longitudinal study of preschool children. In January 2013, we launched the Helsinki Longitudinal SLI study (HelSLI) at the Helsinki University Hospital ( http://tiny.cc/HelSLI ). We will study 227 children aged 3-6 years with suspected DLD and their 160 typically developing peers. Five subprojects will determine how the child's psychological characteristics and environment correlate with DLD and how the child's well-being relates to DLD, the characteristics of DLD in monolingual versus bilingual children, nonlinguistic cognitive correlates of DLD, electrophysiological underpinnings of DLD, and the role of genetic risk factors. Methods include saliva samples, EEG, computerized cognitive tasks, neuropsychological and speech and language assessments, video-observations, and questionnaires. The project aims to increase our understanding of the multiple interactive risk and protective factors that affect the developing heterogeneous cognitive and behavioral profile of DLD, including factors affecting literacy development. This accumulated knowledge will form a heuristic basis for the development of new interventions targeting linguistic and non-linguistic aspects of DLD.
MILROY, Lesley. Observing and Analysing Natural Language: A Critical Account of Sociolinguistic Method. Oxford: Basil Blackwell, 1987. 230pp.
Iria Werlang Garcia
Lesley Milroy's Observing and Analysing Natural Language is a recent addition to an ever growing number of publications in the field of Sociolinguistics. It carries the weight of one of the experienced present-day authors in the field and should offer basic information to both newcomers and established investigators in natural language.
The two authors, Thi Phuong Thao-Do and Chokchai Yuenyong, explored the Nature of Science as it is understood in Vietnam, a fast-developing 'ancient' and modern country which continues to be shaped by uniquely Asian social norms and values. Upon reviewing their paper, I observed strong parallels to the country, the United Arab Emirates, where I have lived and worked for 20 years. In this forum piece, I described several areas of similarity and one striking area of difference between the two societies.
Lopez, Carlos F.; Nielsen, Steve O.; Moore, Preston B.; Klein, Michael L.
Synthetic and natural peptide assemblies can possess transport or conductance activity across biomembranes through the formation of nanopores. The fundamental mechanisms of membrane insertion necessary for antimicrobial or synthetic pore formation are poorly understood. We observe a lipid-assisted mechanism for passive insertion into a model membrane from molecular dynamics simulations. The assembly used in the study, a generic nanotube functionalized with hydrophilic termini, is assisted in crossing the membrane core by transleaflet lipid flips. Lipid tails occlude a purely hydrophobic nanotube. The observed insertion mechanism requirements for hydrophobic-hydrophilic matching have implications for the design of synthetic channels and antibiotics.
Srisawat, Akkarawat; Aiemsum-ang, Napapan; Yuenyong, Chokchai
This study was conducted on the effect of understanding and instruction of the nature of science of Ms. Wanida, a pre-service student under science education program in biology, Faculty of Education, Khon Kaen University. Wanida was a teaching practicum student majoring in biology at Khon Kaen University Demonstration School (Modindaeng). She was teaching biology for 38 Grade 10 students. The methodology followed an interpretive paradigm. The study aimed to examine 1) Wanida's understanding of the nature of science, 2) Wanida's instruction of the nature of science, 3) students' understanding of the nature of science from Wanida's instruction, and 4) the effects of Wanida's understanding and instruction of the nature of science on students' understanding of the nature of science from Wanida's instruction. Tools of interpretation included teaching observation, a semi-structured interview, an open-ended questionnaire, and an observation record form for the instruction of the nature of science. The data obtained were interpreted, encoded, and classified using descriptive statistics. The findings indicated that Wanida held a good understanding of the nature of science. She mostly applied the deficient nature of science approach, followed by the implicit nature of science approach. Unfortunately, she could not show her teaching as explicit nature of science. However, her students' understanding of the nature of science was good.
The purpose of this qualitative study was to discover the influence of instructional games on middle school learners' use of scientific language, concept understanding, and attitude toward learning science. The rationale for this study stemmed from the lack of research concerning the value of play as an instructional strategy for older learners. Specifically, the study focused on the ways in which 6 average ability 7th grade students demonstrated scientific language and concept use during gameplay. The data were collected for this 6-week study in a southern New Jersey suburban middle school and included audio recordings of the 5 games observed in class, written documents (e.g., student created game questions, self-evaluation forms, pre- and post-assessments, and the final quiz), interviews, and researcher field notes. Data were coded and interpreted borrowing from the framework for scientific literacy developed by Bybee (1997). Based on the findings, the framework was modified to reflect the level of scientific understanding demonstrated by the participants and categorized as: Unacquainted, Nominal, Functional, and Conceptual. Major findings suggested that the participants predominantly achieved the Functional level of scientific literacy (i.e., the ability to adequately and appropriately use scientific language in both written and oral discourse) during games. Further, it was discovered that the participants achieved the Conceptual level of scientific literacy during gameplay. Through games participants were afforded the opportunity to use common, everyday language to explore concepts, promoted through peer collaboration. In games the participants used common language to build understandings that exceeded Nominal or token use of the technical vocabulary and concepts. Additionally, the participants reported through interviews and self-evaluation forms that their attitude (patterns included: Motivation, Interest, Fun, Relief from Boredom, and an Alternate Learning
Mott, David H.; Shemanski, Donald R.; Giammanco, Cheryl; Braines, Dave
A key aspect of an analyst's task in providing relevant information from data is the reasoning about the implications of that data, in order to build a picture of the real world situation. This requires human cognition, based upon domain knowledge about individuals, events and environmental conditions. For a computer system to collaborate with an analyst, it must be capable of following a similar reasoning process to that of the analyst. We describe ITA Controlled English (CE), a subset of English to represent analysts' domain knowledge and reasoning, in a form that is understandable by both analyst and machine. CE can be used to express domain rules, background data, assumptions and inferred conclusions, thus supporting human-machine interaction. A CE reasoning and modeling system can perform inferences from the data and provide the user with conclusions together with their rationale. We present a logical problem called the "Analysis Game", used for training analysts, which presents "analytic pitfalls" inherent in many problems. We explore an iterative approach to its representation in CE, where a person can develop an understanding of the problem solution by incremental construction of relevant concepts and rules. We discuss how such interactions might occur, and propose that such techniques could lead to better collaborative tools to assist the analyst and avoid the "pitfalls".
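Inference that returns conclusions together with their rationale can be sketched with a small forward-chaining loop. This is an illustrative assumption, not ITA CE syntax or the authors' reasoner: facts are plain English strings, the `Rule` tuple layout, the rule names, and the example facts are all hypothetical.

```python
from typing import Dict, FrozenSet, List, Set, Tuple

# Hypothetical rule format: (rule name, premises, conclusion).
Rule = Tuple[str, FrozenSet[str], str]


def infer(facts: Set[str], rules: List[Rule]) -> Dict[str, str]:
    """Forward-chain until nothing new is derived; map each inferred fact to its rationale."""
    known = set(facts)
    rationale: Dict[str, str] = {}
    changed = True
    while changed:
        changed = False
        for name, premises, conclusion in rules:
            # Fire a rule when all premises are known and the conclusion is new.
            if premises <= known and conclusion not in known:
                known.add(conclusion)
                rationale[conclusion] = (
                    f"{name}: because " + " and ".join(sorted(premises))
                )
                changed = True
    return rationale


facts = {"John was seen at the depot", "the depot stores fuel"}
rules = [
    ("R1", frozenset(facts), "John had access to fuel"),
    ("R2", frozenset({"John had access to fuel"}), "John is a person of interest"),
]
result = infer(facts, rules)
for conclusion, why in result.items():
    print(conclusion, "<-", why)
```

Keeping the rationale next to each conclusion lets a reader check every inferred statement against the rule and premises that produced it.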
Chandar, Prem; Nole, Greg; Johnson, Anthony W
Dry skin and moisturization are important topics because they impact the lives of many individuals. For most individuals, dry skin is not a notable concern and can be adequately managed with current moisturizing products. However, dry skin can affect the quality of life of some individuals because of the challenges of either harsh environmental conditions or impaired stratum corneum (SC) dry skin protection processes resulting from various common skin diseases. Dry skin protection processes of the SC, such as the development of natural moisturizing factor (NMF), are complex, carefully balanced, and easily perturbed. We discuss the importance of the filaggrin-NMF system and the composition of NMF in both healthy and dry skin, and also reveal new insights that suggest the properties required for a new generation of moisturizing technologies.
This study is part of a research programme conducted by IRSN on the safety of deep geological disposal of high level and intermediate long-lived radioactive wastes. It more especially concerns the geological medium considered as a full component of the multi-barrier concept proposed by Andra for a deep repository. Indeed, the Callovo-Oxfordian argillite of the Paris Basin, in the east of France, is being investigated by Andra as a potential host rock for this repository. Performance assessment of this natural barrier is based on the knowledge of its confinement properties and therefore on phenomena possibly involved in the mass transport of radionuclides. In this context, this work aimed at studying the distribution of tracers naturally present in pore waters obtained from boreholes having crossed Mesozoic sedimentary series involving impervious and compacted clay rocks in the east (Andra borehole, EST433) and south of France (IRSN boreholes). Radial diffusion and vapour exchange methods were used to calculate the concentrations and diffusion parameters of the studied tracers. In Tournemire formations, the different profiles describe curved shapes attributed to a diffusive exchange between the argillite pore water and the surrounding aquifers. Concerning the Mesozoic formations crossed by EST433, the study of the different profiles confirms the diffusion as the dominant transport mechanism in the Callovo-Oxfordian formation, and permits identifying the transport processes in the whole studied column from the Oxfordian formations down to the Liassic one. This study also helps to identify the Liassic formations as a major source of salinity of the Dogger aquifer
Borges, Olga; Lebre, Filipa; Bento, Dulce; Borchard, Gerrit; Junginger, Hans E
It has long been known that protection against pathogens invading the organism via mucosal surfaces correlates better with the presence of specific antibodies in local secretions than with serum antibodies. The most effective way to induce mucosal immunity is to administer antigens directly to the mucosal surface. The development of vaccines for mucosal application requires antigen delivery systems and immunopotentiators that efficiently facilitate the presentation of the antigen to the mucosal immune system. This review provides an overview of the events within mucosal tissues that lead to protective mucosal immune responses. The understanding of those biological mechanisms, together with knowledge of the technology of vaccines and adjuvants, provides guidance on important technical aspects of mucosal vaccine design. Not being exhaustive, this review also provides information related to modern adjuvants, including polymeric delivery systems and immunopotentiators.
Thessen, Anne; Preciado, Jenette; Jain, Payoj; Martin, James; Palmer, Martha; Bhat, Riyaz
The cTAKES package (using the ClearTK Natural Language Processing toolkit Bethard et al. 2014, http://cleartk.github.io/cleartk/) has been successfully used to automatically read clinical notes in the medical field (Albright et al. 2013, Styler et al. 2014). It is used on a daily basis to automatically process clinical notes and extract relevant information by dozens of medical institutions. ClearEarth is a collaborative project that brings together computational linguistics and domain scient...
Topac, Vasile; Jurcau, Daniel-Alexandru; Stoicu-Tivadar, Vasile
Medical terminology appears in natural language in multiple forms: canonical, derived, or inflected. This research presents an analysis of the form in which medical terminology appears in the Romanian and English languages. The sources of medical language used for the study are web pages presenting medical information for patients and other lay users. The results show that, in English, medical terminology tends to appear more in canonical form while, in the case of Romanian, it is the opposite. This paper also presents the service that was created to perform this analysis. This tool is available for the general public, and it is designed to be easily extensible, allowing the addition of other languages.
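The kind of measurement described here, how often a medical term occurs in its canonical form versus another form, can be sketched as follows. This is a toy illustration and not the authors' service: the `LEXICON` table, its entries, and the function name `canonical_share` are hypothetical, and a real analysis would rely on a proper lemmatizer and terminology resource for each language.

```python
import re
from typing import Dict

# Hypothetical lexicon: surface form -> canonical (dictionary) form.
LEXICON: Dict[str, str] = {
    "fracture": "fracture",
    "fractures": "fracture",
    "fractured": "fracture",
    "infection": "infection",
    "infections": "infection",
}


def canonical_share(text: str) -> float:
    """Fraction of recognised term occurrences that appear in canonical form."""
    tokens = re.findall(r"[a-z]+", text.lower())
    terms = [token for token in tokens if token in LEXICON]
    if not terms:
        return 0.0  # no recognised terminology in the text
    canonical = sum(1 for token in terms if LEXICON[token] == token)
    return canonical / len(terms)


share = canonical_share("The fracture healed, but two fractures showed infection.")
print(round(share, 3))
```

Extending the sketch to another language would mean supplying a lexicon for that language, which mirrors the extensibility the abstract mentions.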
Some of the legal issues relating to exploring for and operating oil and gas properties in Saskatchewan were discussed. An overview of key legislation was provided. The purpose of the Oil and Gas Conservation Act (OGCA) was explained, i.e., (1) to prevent waste, (2) to regulate all oil and gas operations to maximize ultimate recovery through prudent operations, (3) to allow each owner the opportunity to recover its share of oil or gas from a pool, (4) to develop, protect and conserve Saskatchewan's oil and gas resources, and (5) to protect the environment from the harmful effects of oil and gas operations. Legislation regarding vertical wells, horizontal wells, and horizontal well spacing was reviewed. Similar explanations were provided for the key features of the Petroleum and Natural Gas Regulations, the Freehold Oil and Gas Production Tax Act, the Mineral Taxation Act, the Land Titles Act, and the Builder's Lien Act. Registration issues for Crown and freehold lands, and non-contractual operator's duties were also reviewed. A brief reference was also made to a recent report entitled the 'Saskatchewan External Cost Review' which indicated that Saskatchewan had certain advantages for producing oil and gas compared to Alberta, Manitoba, British Columbia and North Dakota. Unfortunately, the report also indicated that the external costs (crown royalties, freehold production taxes, income taxes, sales taxes, etc.) were the highest in Saskatchewan of the four jurisdictions reviewed
Thao-Do, Thi Phuong; Yuenyong, Chokchai
Scholars have shown that nature of science (NOS) has made certain contributions to science teaching and learning. Nonetheless, what, how and how much NOS should be integrated in the science curriculum of each country cannot be a benchmark, due to the influence of culture and society. Before employing NOS in a new context, it should be carefully studied. In assessing views of NOS in Vietnam, a developing country with Eastern culture where NOS is not considered a compulsory learning outcome, there are several issues that researchers and educators should note in order to develop an appropriate instrument that can clearly exhibit Vietnamese views of NOS. They may include: time for the survey; length, content, type, and terms of the questionnaire; Vietnamese epistemology and philosophy; and some other Vietnamese social and cultural aspects. The most important reason for these considerations is that a Vietnamese view of NOS and NOS assessment possibly differs from Western ideas due to the social and cultural impact. As a result, a Western assessment tool may become less effective in an Eastern context. The suggestions and implications in this study were derived from a prolonged investigation on Vietnamese science teacher educators and student teachers of the School of Education at Can Tho University, a state university in the Mekong Delta region, Vietnam.
Long, D M; Bloomfield, D S; Chen, P F; Downs, C; Gallagher, P T; Kwon, R-Y; Vanninathan, K; Veronig, A M; Vourlidas, A; Vršnak, B; Warmuth, A; Žic, T
For almost 20 years the physical nature of globally propagating waves in the solar corona (commonly called "EIT waves") has been controversial and subject to debate. Additional theories have been proposed over the years to explain observations that did not agree with the originally proposed fast-mode wave interpretation. However, the incompatibility of observations made using the Extreme-ultraviolet Imaging Telescope (EIT) onboard the Solar and Heliospheric Observatory with the fast-mode wave interpretation was challenged by differing viewpoints from the twin Solar Terrestrial Relations Observatory spacecraft and data with higher spatial and temporal resolution from the Solar Dynamics Observatory. In this article, we reexamine the theories proposed to explain EIT waves to identify measurable properties and behaviours that can be compared to current and future observations. Most of us conclude that the so-called EIT waves are best described as fast-mode large-amplitude waves or shocks that are initially driven by the impulsive expansion of an erupting coronal mass ejection in the low corona.
Working memory is important for online language processing during conversation. We use it to maintain relevant information, to inhibit or ignore irrelevant information, and to attend to conversation selectively. Working memory helps us to keep track of and actively participate in conversation, including taking turns and following the gist. This paper examines the Ease of Language Understanding model (i.e., the ELU model, Rönnberg, 2003; Rönnberg et al., 2008) in light of new behavioral and neural findings concerning the role of working memory capacity (WMC) in uni-modal and bimodal language processing. The new ELU model is a meaning prediction system that depends on phonological and semantic interactions in rapid implicit and slower explicit processing mechanisms that both depend on WMC albeit in different ways. A revised ELU model is proposed based on findings that address the relationship between WMC and (a) early attention processes in listening to speech, (b) signal processing in hearing aids and its effects on short-term memory, (c) inhibition of speech maskers and its effect on episodic long-term memory, (d) the effects of hearing impairment on episodic and semantic long-term memory, and finally, (e) listening effort. New predictions and clinical implications are outlined. Comparisons with other WMC and speech perception models are made.
The goal of the paper is to show that language can support social and intercultural competence of both students and teachers: one of the ways to do it is teaching cultural taboos and taboo language for intercultural awareness and understanding. The current state of the art in the field points to an increasing interest in the teaching of taboos. The material we analysed consisted of 238 offensive, vulgar and obscene English words that both students and teachers should know to attain social and intercultural competence. The method used is the descriptive one. The degree of novelty is rather high in our cultural area. Results show that there are 134 offensive (slang) words and expressions (referring to the country of origin or to an ethnic group, to sex and sex-related issues (sexual orientation), to race, etc.), 75 vulgar words and expressions (referring to sex and sex-related issues, to body parts, to people, etc.), and 29 obscene words and expressions (referring to body secretions, to sex and sex-related issues, to people, etc.). There seem to be no research limitations given the lexicographic sources that we used. The implications of teaching cultural taboos and taboo language at tertiary level concern both the students and teachers and the organisation they belong to. The paper is original and relevant given the process of globalisation.
Olson, Andrea M; Swabey, Laurie
Despite federal laws that mandate equal access and communication in all healthcare settings for deaf people, consistent provision of quality interpreting in healthcare settings is still not a reality, as recognized by deaf people and American Sign Language (ASL)-English interpreters. The purpose of this study was to better understand the work of ASL interpreters employed in healthcare settings, which can then inform the training and credentialing of interpreters, with the ultimate aim of improving the quality of healthcare and communication access for deaf people. Based on job analysis, researchers designed an online survey with 167 task statements representing 44 categories. American Sign Language interpreters (N = 339) rated the importance of, and frequency with which they performed, each of the 167 tasks. Categories with the highest average importance ratings included language and interpreting, situation assessment, ethical and professional decision making, manage the discourse, and monitor, manage, and/or coordinate appointments. Categories with the highest average frequency ratings included the following: dress appropriately, adapt to a variety of physical settings and locations, adapt to working with a variety of providers in a variety of roles, deal with uncertain and unpredictable work situations, and demonstrate cultural adaptability. To achieve health equity for the deaf community, the training and credentialing of interpreters need to be systematically addressed.
Bedore, Lisa M; Peña, Elizabeth D; Anaya, Jissel B; Nieto, Ricardo; Lugo-Neris, Mirza J; Baron, Alisa
This study examines English performance on a set of 11 grammatical forms in Spanish-English bilingual, school-age children in order to understand how item difficulty of grammatical constructions helps correctly classify language impairment (LI) from expected variability in second language acquisition when taking into account linguistic experience and exposure. Three hundred seventy-eight children's scores on the Bilingual English-Spanish Assessment-Middle Extension (Peña, Bedore, Gutiérrez-Clellen, Iglesias, & Goldstein, 2008) morphosyntax cloze task were analyzed by bilingual experience groups (high Spanish experience, balanced English-Spanish experience, high English experience), ability (typically developing [TD] vs. LI), and grammatical form. Classification accuracy was calculated for the forms that best differentiated TD and LI groups. Children with LI scored lower than TD children across all bilingual experience groups. There were differences by grammatical form across bilingual experience and ability groups. Children from high English experience and balanced English-Spanish experience groups could be accurately classified on the basis of all the English grammatical forms tested except for prepositions. For bilinguals with high Spanish experience, it was possible to rule out LI on the basis of grammatical production but not rule in LI. It is possible to accurately identify LI in English language learners once they use English 40% of the time or more. However, for children with high Spanish experience, more information about development and patterns of impairment is needed to positively identify LI.
Fitzpatrick, A. Liam (Boston U.); Kaplan, Jared (SLAC); Penedones, Joao (Perimeter Inst. Theor. Phys.); Raju, Suvrat (Harish-Chandra Res. Inst.); van Rees, Balt C. (YITP, Stony Brook)
We provide dramatic evidence that 'Mellin space' is the natural home for correlation functions in CFTs with weakly coupled bulk duals. In Mellin space, CFT correlators have poles corresponding to an OPE decomposition into 'left' and 'right' sub-correlators, in direct analogy with the factorization channels of scattering amplitudes. In the regime where these correlators can be computed by tree level Witten diagrams in AdS, we derive an explicit formula for the residues of Mellin amplitudes at the corresponding factorization poles, and we use the conformal Casimir to show that these amplitudes obey algebraic finite difference equations. By analyzing the recursive structure of our factorization formula we obtain simple diagrammatic rules for the construction of Mellin amplitudes corresponding to tree-level Witten diagrams in any bulk scalar theory. We prove the diagrammatic rules using our finite difference equations. Finally, we show that our factorization formula and our diagrammatic rules morph into the flat space S-Matrix of the bulk theory, reproducing the usual Feynman rules, when we take the flat space limit of AdS/CFT. Throughout we emphasize a deep analogy with the properties of flat space scattering amplitudes in momentum space, which suggests that the Mellin amplitude may provide a holographic definition of the flat space S-Matrix.
Denning, R. S.
This paper describes the evolution of understanding of severe accident consequences from the non-mechanistic assumptions of WASH-740 to WASH-1400, NUREG-1150, SOARCA and today in the interpretation of the consequences of the accident at Fukushima. As opposed to the general perception, the radiological human health consequences to members of the Japanese public from the Fukushima accident will be small despite meltdowns at three reactors and loss of containment integrity. In contrast, the radiation-related societal impacts present a substantial additional economic burden on top of the monumental task of economic recovery from the nonnuclear aspects of the earthquake and tsunami damage. The Fukushima accident provides additional evidence that we have mis-characterized the risk of nuclear power plant accidents to ourselves and to the public. The human health risks are extremely small even to people living next door to a nuclear power plant. The principal risk associated with a nuclear power plant accident involves societal impacts: relocation of people, loss of land use, loss of contaminated products, decontamination costs and the need for replacement power. Although two of the three probabilistic safety goals of the NRC address societal risk, the associated quantitative health objectives in reality only address individual human health risk. This paper describes the types of analysis that would address compliance with the societal goals. (authors)
Çetinkaya-Aydın, Gamze; Çakıroğlu, Jale
The purpose of this study was to investigate the possible associations between preservice science teachers' understanding of nature of science and their learner characteristics; understanding of nature of scientific inquiry, science teaching self-efficacy beliefs, metacognitive awareness level, and faith/worldview schemas. The sample of the current study was 60 3rd-year preservice science teachers enrolled in the Nature of Science and History of Science course. Using a descriptive and associational case study design, data were collected by means of different qualitative and quantitative questionnaires. Analysis of the data revealed that preservice science teachers' understanding of nature of science and nature of scientific inquiry were highly associated. Similarly, science teaching self-efficacy beliefs, metacognitive awareness levels, and faith/worldviews of the preservice science teachers were found to be significantly associated with their understanding of nature of science. Thus, it can be concluded that there might be other factors interfering with the learning processes of nature of science.
Leung, Constant; Scarino, Angela
Transformations associated with the increasing speed, scale, and complexity of mobilities, together with the information technology revolution, have changed the demography of most countries of the world and brought about accompanying social, cultural, and economic shifts (Heugh, 2013). This complex diversity has changed the very nature of…
Hassanpour, Saeed; Bay, Graham; Langlotz, Curtis P
We built a natural language processing (NLP) method to automatically extract clinical findings in radiology reports and characterize their level of change and significance according to a radiology-specific information model. We utilized a combination of machine learning and rule-based approaches for this purpose. Our method is unique in capturing different features and levels of abstractions at surface, entity, and discourse levels in text analysis. This combination has enabled us to recognize the underlying semantics of radiology report narratives for this task. We evaluated our method on radiology reports from four major healthcare organizations. Our evaluation showed the efficacy of our method in highlighting important changes (accuracy 99.2%, precision 96.3%, recall 93.5%, and F1 score 94.7%) and identifying significant observations (accuracy 75.8%, precision 75.2%, recall 75.7%, and F1 score 75.3%) to characterize radiology reports. This method can help clinicians quickly understand the key observations in radiology reports and facilitate clinical decision support, review prioritization, and disease surveillance.
Controlling robots by natural language (NL) is attracting increasing attention for its versatility and convenience, and because users need no extensive training. Grounding, which enables robots to understand NL instructions from humans, is a crucial challenge in this problem. This paper mainly explores the object grounding problem and concretely studies how to detect target objects from NL instructions using an RGB-D camera in robotic manipulation applications. In particular, a simple yet robust vision algorithm is applied to segment objects of interest. With the metric information of all segmented objects, the object attributes and relations between objects are further extracted. The NL instructions that incorporate multiple cues for object specifications are parsed into domain-specific annotations. The annotations from NL and extracted information from the RGB-D camera are matched in a computational state estimation framework to search all possible object grounding states. The final grounding is accomplished by selecting the states which have the maximum probabilities. An RGB-D scene dataset associated with different groups of NL instructions based on different cognition levels of the robot is collected. Quantitative evaluations on the dataset illustrate the advantages of the proposed method. The experiments of NL controlled object manipulation and NL-based task programming using a mobile manipulator show its effectiveness and practicability in robotic applications.
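The final step described in this abstract, selecting the grounding state with the maximum probability, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the `SceneObject` fields, the cue names, and the 0.9/0.1 likelihood values are hypothetical and are not taken from the paper, which does not give its model in the abstract.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SceneObject:
    """A segmented object with attributes extracted from the camera (hypothetical fields)."""
    name: str
    color: str
    x: float  # horizontal position in metres; smaller means further left


def cue_likelihood(obj, cue, value, scene):
    """Return an assumed likelihood that `obj` satisfies one parsed cue."""
    if cue == "color":
        return 0.9 if obj.color == value else 0.1
    if cue == "position" and value == "leftmost":
        leftmost = min(scene, key=lambda o: o.x)
        return 0.9 if obj is leftmost else 0.1
    return 0.5  # unknown cue: treated as uninformative


def ground(annotation, scene):
    """Score every candidate object and return the most probable one.

    `annotation` maps cue names to values, standing in for the
    domain-specific annotation parsed from the NL instruction.
    """
    scores = {}
    for obj in scene:
        p = 1.0
        for cue, value in annotation.items():
            p *= cue_likelihood(obj, cue, value, scene)  # cues assumed independent
        scores[obj.name] = p
    total = sum(scores.values())
    posterior = {name: p / total for name, p in scores.items()}
    best = max(posterior, key=posterior.get)
    return best, posterior
```

For an instruction such as "pick up the leftmost red object", the annotation would be `{"color": "red", "position": "leftmost"}`, and `ground` returns the name of the highest-scoring object together with the normalised scores of all candidates.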
Kerlyl, Alice; Hall, Phil; Bull, Susan
There is an extensive body of work on Intelligent Tutoring Systems: computer environments for education, teaching and training that adapt to the needs of the individual learner. Work on personalisation and adaptivity has included research into allowing the student user to enhance the system's adaptivity by improving the accuracy of the underlying learner model. Open Learner Modelling, where the system's model of the user's knowledge is revealed to the user, has been proposed to support student reflection on their learning. Increased accuracy of the learner model can be obtained by the student and system jointly negotiating the learner model. We present the initial investigations into a system to allow people to negotiate the model of their understanding of a topic in natural language. This paper discusses the development and capabilities of both conversational agents (or chatbots) and Intelligent Tutoring Systems, in particular Open Learner Modelling. We describe a Wizard-of-Oz experiment to investigate the feasibility of using a chatbot to support negotiation, and conclude that a fusion of the two fields can lead to developing negotiation techniques for chatbots and the enhancement of the Open Learner Model. This technology, if successful, could have widespread application in schools, universities and other training scenarios.
Nye, Benjamin D.; Graesser, Arthur C.; Hu, Xiangen
AutoTutor is a natural language tutoring system that has produced learning gains across multiple domains (e.g., computer literacy, physics, critical thinking). In this paper, we review the development, key research findings, and systems that have evolved from AutoTutor. First, the rationale for developing AutoTutor is outlined and the advantages…
Higginbotham, D Jeffery; Lesher, Gregory W; Moulton, Bryan J; Roark, Brian
Significant progress has been made in the application of natural language processing (NLP) to augmentative and alternative communication (AAC), particularly in the areas of interface design and word prediction. This article will survey the current state-of-the-science of NLP in AAC and discuss its future applications for the development of the next generation of AAC technology.
Krahmer, E.; Theune, Mariet
We are pleased to present the Proceedings of the 12th European Workshop on Natural Language Generation (ENLG 2009). ENLG 2009 was held in Athens, Greece, as a workshop at the 12th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2009). Following our call, we
A theoretical discussion is offered on whether the subjunctive in the Romance languages is by nature thematic, as suggested in previous studies. English and Spanish samples are used to test the hypothesis; one conclusion is that the subjunctive seems to offer speaker-related information and may express the intensity of the speaker's involvement.…
Laski, Karen E.; And Others
Parents of four nonverbal and four echolalic autistic children, aged five-nine, were trained to increase their children's speech by using the Natural Language Paradigm. Following training, parents increased the frequency with which they required their children to speak, and children increased the frequency of their verbalizations in three…
Stoianov; Nerbonne, J; Bouma, H; Coppen, PA; van Halteren, H; Teunissen, L
Simple Recurrent Networks (SRN) are Neural Network (connectionist) models able to process natural language. Phonotactics concerns the order of symbols in words. We continued an earlier unsuccessful trial to model the phonotactics of Dutch words with SRNs. In order to overcome the previously reported
Ingram, D. E.
The nature and development of the recently released International English Language Testing System (IELTS) instrument are described. The test is the result of a joint Australian-British project to develop a new test for use with foreign students planning to study in English-speaking countries. It is expected that the modular instrument will become…
Tierney, Patrick J.
This paper introduces a method of extending natural language-based processing of qualitative data analysis with the use of a very quantitative tool--graph theory. It is not an attempt to convert qualitative research to a positivist approach with a mathematical black box, nor is it a "graphical solution". Rather, it is a method to help qualitative…
Balyan, Renu; McCarthy, Kathryn S.; McNamara, Danielle S.
This study examined how machine learning and natural language processing (NLP) techniques can be leveraged to assess the interpretive behavior that is required for successful literary text comprehension. We compared the accuracy of seven different machine learning classification algorithms in predicting human ratings of student essays about…
Kyle, Kristopher; Crossley, Scott A.; McNamara, Danielle S.
This study explores the construct validity of speaking tasks included in the TOEFL iBT (e.g., integrated and independent speaking tasks). Specifically, advanced natural language processing (NLP) tools, MANOVA difference statistics, and discriminant function analyses (DFA) are used to assess the degree to which and in what ways responses to these…
Pragmatics is the study of the relation of signs to interpreters. Learners of English as a foreign language (EFL) need both knowledge of pragmatics and comprehensible input in it. This paper is based on a research project. The writer conducted a survey by giving a questionnaire to respondents, who were students from UAD selected at random. Besides the open questionnaire, the writer also collected data through in-depth interviews with some EFL learners and a native speaker who teaches English, and through a literature review of several books. The results provide evidence that EFL learners' difficulties in understanding English pragmatics occur in (1) greeting, (2) apologizing, (3) complimenting, and (4) thanking. The factors that promote these difficulties are (1) the different cultures and values of native speakers and learners and (2) the habits that learners usually follow in their daily lives.
Manjet Kaur Mehar Singh
Malaysian intercultural society is typified by three major ethnic groups, mainly Malays, Chinese and Indians. Although the education system is the best tool for these three major ethnic groups to work together, contemporary research reveals that there is still a lack of intercultural embedding in the education context, and national schools are seen as breeding grounds of racial polarisation. In the Malaysian context, there is a gap in research that focuses on the design of a proper intercultural reading framework for national integration, and such initiatives are viable through schools. The main objective of this conceptual paper is to introduce the English Language Intercultural Reading Programme (ELIRP) in secondary schools to promote intercultural understanding among secondary school students. The proposed framework will facilitate the acquisition of intercultural inputs without being constrained by ideological, political, or psychological demands. This article will focus on elucidating how ELIRP could affect cognitive (knowledge) and behavioural transformations to intercultural perceptions harboured by selected Form 4 students of 20 national schools in Malaysia. Keywords: behavior, knowledge, intercultural reading framework, intercultural understanding, English Language Intercultural Reading Programme, secondary school students
Park, Mihwa; Liu, Xiufeng; Smith, Erica; Waight, Noemi
This study reports the effect of computer models as formative assessment on high school students' understanding of the nature of models. Nine high school teachers integrated computer models and associated formative assessments into their yearlong high school chemistry course. A pre-test and post-test of students' understanding of the nature of…
Students may use the technical engineering terms without knowing what these words mean. This creates a language barrier in engineering that influences student learning. Previous research has been conducted to characterize the difference between colloquial and scientific language. Since this research had not yet been applied explicitly to…
Farrant, Brad M.; Maybery, Murray T.; Fletcher, Janet
The hypothesis that language plays a role in theory-of-mind (ToM) development is supported by a number of lines of evidence (e.g., H. Lohmann & M. Tomasello, 2003). The current study sought to further investigate the relations between maternal language input, memory for false sentential complements, cognitive flexibility, and the development of…
Lott, Jason P; Boudreau, Denise M; Barnhill, Ray L; Weinstock, Martin A; Knopp, Eleanor; Piepkorn, Michael W; Elder, David E; Knezevich, Steven R; Baer, Andrew; Tosteson, Anna N A; Elmore, Joann G
Population-based information on the distribution of histologic diagnoses associated with skin biopsies is unknown. Electronic medical records (EMRs) enable automated extraction of pathology report data to improve our epidemiologic understanding of skin biopsy outcomes, specifically those of melanocytic origin. To determine population-based frequencies and distribution of histologically confirmed melanocytic lesions. A natural language processing (NLP)-based analysis of EMR pathology reports of adult patients who underwent skin biopsies at a large integrated health care delivery system in the US Pacific Northwest from January 1, 2007, through December 31, 2012. Skin biopsy procedure. The primary outcome was histopathologic diagnosis, obtained using an NLP-based system to process EMR pathology reports. We determined the percentage of diagnoses classified as melanocytic vs nonmelanocytic lesions. Diagnoses classified as melanocytic were further subclassified using the Melanocytic Pathology Assessment Tool and Hierarchy for Diagnosis (MPATH-Dx) reporting schema into the following categories: class I (nevi and other benign proliferations such as mildly dysplastic lesions typically requiring no further treatment), class II (moderately dysplastic and other low-risk lesions that may merit narrow reexcision with skin biopsies, performed on 47 529 patients, were examined. Nearly 1 in 4 skin biopsies were of melanocytic lesions (23%; n = 18 715), which were distributed according to MPATH-Dx categories as follows: class I, 83.1% (n = 15 558); class II, 8.3% (n = 1548); class III, 4.5% (n = 842); class IV, 2.2% (n = 405); and class V, 1.9% (n = 362). Approximately one-quarter of skin biopsies resulted in diagnoses of melanocytic proliferations. These data provide the first population-based estimates across the spectrum of melanocytic lesions ranging from benign through dysplastic to malignant. These results may serve as a foundation for future
Arnold, V I
This collection of 39 short stories gives the reader a unique opportunity to take a look at the scientific philosophy of Vladimir Arnold, one of the most original contemporary researchers. Topics of the stories included range from astronomy, to mirages, to motion of glaciers, to geometry of mirrors and beyond. In each case Arnold's explanation is both deep and simple, which makes the book interesting and accessible to an extremely broad readership. Original illustrations hand drawn by the author help the reader to further understand and appreciate Arnold's view on the relationship between math
Fan, Christina Siu-Dschu; Zhu, Xingyu; Dosch, Hans Günter; von Stutterheim, Christiane; Rupp, André
In tonal languages, such as Mandarin Chinese, the pitch contour of vowels discriminates lexical meaning, which is not the case in non-tonal languages such as German. Recent data provide evidence that pitch processing is influenced by language experience. However, there are still many open questions concerning the representation of such phonological and language-related differences at the level of the auditory cortex (AC). Using magnetoencephalography (MEG), we recorded transient and sustained auditory evoked fields (AEF) in native Chinese and German speakers to investigate language related phonological and semantic aspects in the processing of acoustic stimuli. AEF were elicited by spoken meaningful and meaningless syllables, by vowels, and by a French horn tone. Speech sounds were recorded from a native speaker and showed frequency-modulations according to the pitch-contours of Mandarin. The sustained field (SF) evoked by natural speech signals was significantly larger for Chinese than for German listeners. In contrast, the SF elicited by a horn tone was not significantly different between groups. Furthermore, the SF of Chinese subjects was larger when evoked by meaningful syllables compared to meaningless ones, but there was no significant difference regarding whether vowels were part of the Chinese phonological system or not. Moreover, the N100m gave subtle but clear evidence that for Chinese listeners other factors than purely physical properties play a role in processing meaningful signals. These findings show that the N100 and the SF generated in Heschl's gyrus are influenced by language experience, which suggests that AC activity related to specific pitch contours of vowels is influenced in a top-down fashion by higher, language related areas. Such interactions are in line with anatomical findings and neuroimaging data, as well as with the dual-stream model of language of Hickok and Poeppel that highlights the close and reciprocal interaction between
Mathematics and the Laws of Nature, Revised Edition describes the evolution of the idea that nature can be described in the language of mathematics. Colorful chapters explore the earliest attempts to apply deductive methods to the study of the natural world. This revised resource goes on to examine the development of classical conservation laws, including the conservation of momentum, the conservation of mass, and the conservation of energy. Chapters have been updated and revised to reflect recent information, including the mathematical pioneers who introduced new ideas about what it meant to
Parker, Catherine Frieda
A possible contributing factor to students' difficulty in learning advanced mathematics is the conflict between students' "natural" learning styles and the formal structure of mathematics, which is based on definitions, theorems, and proofs. Students' natural learning styles may be a function of their intuition and language skills. The purpose of…
Sanden, Guro Refsum
Purpose: The purpose of this paper is to analyse the consequences of globalisation in the area of corporate communication, and investigate how language may be managed as a strategic resource. Design/methodology/approach: A review of previous studies on the effects of globalisation on corporate communication and the implications of language management initiatives in international business. Findings: Efficient language management can turn language into a strategic resource. Language needs analyses, i.e. linguistic auditing/language check-ups, can be used to determine the language situation of a company. Language policies and/or strategies can be used to regulate a company’s internal modes of communication. Language management tools can be deployed to address existing and expected language needs. Continuous feedback from the front line ensures strategic learning and reduces the risk of suboptimal...
Emotion comprehension is known to be a key correlate and predictor of prosociality from early childhood. The present study looks at their relation within the wide theoretical construct of social understanding, which includes a number of socio-emotional skills as well as cognitive and linguistic abilities. Theory of mind, especially false-belief understanding, has been found to have positive correlations with both emotion comprehension and prosocial orientation. Similarly, language ability is known to play a key role in children’s socio-emotional development. The combined contribution of both false-belief understanding and language in explaining the relation between emotion comprehension and prosociality has yet to be investigated. Thus, in the current study, we conducted an in-depth exploration of how preschoolers’ false-belief understanding and language ability each contribute to modeling the relationship between their comprehension of emotion and their disposition to act prosocially towards others, after controlling for age and gender. Participants were 101 4- to 6-year-old children (54% boys), who were administered measures of language ability, false-belief understanding, emotion comprehension and prosocial orientation. Multiple mediation analysis of the data suggested that false-belief understanding and language ability jointly and fully mediated the effect of preschoolers’ emotion comprehension on their prosocial orientation. Analysis of covariates revealed that gender exerted no statistically significant effect, while age had a trivial positive effect. Theoretical and practical implications of the findings are discussed.
Ornaghi, Veronica; Pepe, Alessandro; Grazzani, Ilaria
Emotion comprehension (EC) is known to be a key correlate and predictor of prosociality from early childhood. In the present study, we examined this relationship within the broad theoretical construct of social understanding which includes a number of socio-emotional skills, as well as cognitive and linguistic abilities. Theory of mind, especially false-belief understanding, has been found to be positively correlated with both EC and prosocial orientation. Similarly, language ability is known to play a key role in children's socio-emotional development. The combined contribution of false-belief understanding and language to explaining the relationship between EC and prosociality has yet to be investigated. Thus, in the current study, we conducted an in-depth exploration of how preschoolers' false-belief understanding and language ability each contribute to modeling the relationship between children's comprehension of emotion and their disposition to act prosocially toward others, after controlling for age and gender. Participants were 101 4- to 6-year-old children (54% boys), who were administered measures of language ability, false-belief understanding, EC and prosocial orientation. Multiple mediation analysis of the data suggested that false-belief understanding and language ability jointly and fully mediated the effect of preschoolers' EC on their prosocial orientation. Analysis of covariates revealed that gender exerted no statistically significant effect, while age had a trivial positive effect. Theoretical and practical implications of the findings are discussed.
Jackendoff, Ray; Pinker, Steven
In a continuation of the conversation with Fitch, Chomsky, and Hauser on the evolution of language, we examine their defense of the claim that the uniquely human, language-specific part of the language faculty (the ''narrow language faculty'') consists only of recursion, and that this part cannot be considered an adaptation to communication. We…
Communicative interactions involve a kind of procedural knowledge that is used by the human brain for processing verbal and nonverbal inputs and for language production. Although considerable work has been done on modeling human language abilities, it has been difficult to bring them together to a comprehensive tabula rasa system compatible with current knowledge of how verbal information is processed in the brain. This work presents a cognitive system, entirely based on a large-scale neural architecture, which was developed to shed light on the procedural knowledge involved in language elaboration. The main component of this system is the central executive, which is a supervising system that coordinates the other components of the working memory. In our model, the central executive is a neural network that takes as input the neural activation states of the short-term memory and yields as output mental actions, which control the flow of information among the working memory components through neural gating mechanisms. The proposed system is capable of learning to communicate through natural language starting from tabula rasa, without any a priori knowledge of the structure of phrases, meaning of words, or role of the different classes of words, only by interacting with a human through a text-based interface, using an open-ended incremental learning process. It is able to learn nouns, verbs, adjectives, pronouns and other word classes, and to use them in expressive language. The model was validated on a corpus of 1587 input sentences, based on literature on early language assessment, at the level of a child of about 4 years, and produced 521 output sentences, expressing a broad range of language processing functionalities.
Rassinoux, Anne-Marie; Baud, Robert H; Rodrigues, Jean-Marie; Lovis, Christian; Geissbühler, Antoine
The importance of clinical communication between providers, consumers and others, as well as the requisite for computer interoperability, strengthens the need for sharing common accepted terminologies. Under the directives of the World Health Organization (WHO), an approach is currently being conducted in Australia to adopt a standardized terminology for medical procedures that is intended to become an international reference. In order to achieve such a standard, a collaborative approach is adopted, in line with the successful experiment conducted for the development of the new French coding system CCAM. Different coding centres are involved in setting up a semantic representation of each term using a formal ontological structure expressed through a logic-based representation language. From this language-independent representation, multilingual natural language generation (NLG) is performed to produce noun phrases in various languages that are further compared for consistency with the original terms. Outcomes are presented for the assessment of the International Classification of Health Interventions (ICHI) and its translation into Portuguese. The initial results clearly emphasize the feasibility and cost-effectiveness of the proposed method for handling both a different classification and an additional language. NLG tools, based on ontology driven semantic representation, facilitate the discovery of ambiguous and inconsistent terms, and, as such, should be promoted for establishing coherent international terminologies.
Sharma, Vivekanand; Law, Wayne; Balick, Michael J; Sarkar, Indra Neil
The growing amount of data describing historical medicinal uses of plants from digitization efforts provides the opportunity to develop systematic approaches for identifying potential plant-based therapies. However, cataloguing plant use information from natural language text is a challenging task for ethnobotanists. To date, there has been only limited adoption of informatics approaches for supporting the identification of ethnobotanical information associated with medicinal uses. This study explored the feasibility of using biomedical terminologies and natural language processing approaches for extracting relevant plant-associated therapeutic use information from the historical biodiversity literature collection available from the Biodiversity Heritage Library. The results from this preliminary study suggest that informatics methods have potential utility for identifying medicinal plant knowledge from digitized resources, and they highlight opportunities for improvement.
Min, Yul Ha; Park, Hyeoun-Ae; Jeon, Eunjoo; Lee, Joo Yun; Jo, Soo Jung
The purpose of this study was to develop an ontology model to generate nursing narratives as natural as human language from the entity-attribute-value triplets of a detailed clinical model using natural language generation technology. The model was based on the types of information and documentation time of the information along the nursing process. The types of information are data characterizing the patient status, inferences made by the nurse from the patient data, and nursing actions selected by the nurse to change the patient status. This information was linked to the nursing process based on the time of documentation. We describe a case study illustrating the application of this model in an acute-care setting. The proposed model provides a strategy for designing an electronic nursing record system.
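As a rough illustration of the idea in this abstract, the sketch below turns entity-attribute-value triplets into a short narrative, ordering them by documentation time and choosing a sentence pattern by information type. It is a template-based stand-in, not the authors' ontology model; the `Triplet` fields, the three type labels, and the sentence templates are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Triplet:
    """One entity-attribute-value record (field names are illustrative assumptions)."""
    entity: str
    attribute: str
    value: str
    info_type: str  # "data", "inference", or "action"
    minute: int     # documentation time, in minutes since the start of the shift


# One hypothetical sentence pattern per type of information.
TEMPLATES = {
    "data": "Patient's {entity} {attribute} was {value}.",
    "inference": "Nurse assessed {entity} {attribute} as {value}.",
    "action": "Nurse performed {entity} ({attribute}: {value}).",
}


def narrate(triplets):
    """Render triplets as sentences in order of documentation time."""
    ordered = sorted(triplets, key=lambda t: t.minute)
    sentences = [
        TEMPLATES[t.info_type].format(
            entity=t.entity, attribute=t.attribute, value=t.value
        )
        for t in ordered
    ]
    return " ".join(sentences)
```

Sorting by `minute` mirrors the abstract's point that information is linked to the nursing process through its time of documentation; a fuller model would replace the fixed templates with rules derived from the ontology.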
Weng, Wei-Hung; Wagholikar, Kavishwar B.; McCray, Alexa T.; Szolovits, Peter; Chueh, Henry C.
Background: The medical subdomain of a clinical note, such as cardiology or neurology, is useful content-derived metadata for developing machine learning downstream applications. To classify the medical subdomain of a note accurately, we have constructed a machine learning-based natural language processing (NLP) pipeline and developed medical subdomain classifiers based on the content of the note. Methods: We constructed the pipeline using the clinical ...
Will, Herbert A.; Mackin, Michael A.
PC software is described which provides flexible natural language process control capability with an IBM PC or compatible machine. Hardware requirements include the PC, and suitable hardware interfaces to all controlled devices. Software required includes the Microsoft Disk Operating System (MS-DOS) operating system, a PC-based FORTRAN-77 compiler, and user-written device drivers. Instructions for use of the software are given as well as a description of an application of the system.
Compact closed categories, Frobenius algebras, and bialgebras have been applied to model and reason about quantum protocols. The same constructions have also been applied to reason about natural language semantics under the name “categorical distributional compositional” semantics, or in short, the “DisCoCat” model. This model combines the statistical vector models of word meaning with the compositional models of grammatical structure. It has been applied to natural language tasks such as disambiguation, paraphrasing and entailment of phrases and sentences. The passage from the grammatical structure to vectors is provided by a functor, similar to the quantization functor of quantum field theory. The original DisCoCat model only used compact closed categories. Later, Frobenius algebras were added to it to model long-distance dependencies such as relative pronouns. Recently, bialgebras have been added to reason about quantifiers. This paper reviews these constructions and their application to natural language semantics. We go over the theory and present some of the core experimental results.
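The core compositional step described above can be illustrated with a minimal sketch. It assumes a simplification that is common in introductions to this model: a transitive verb is a rank-3 tensor over (subject space, sentence space, object space), and the grammatical reduction amounts to contracting it with the subject and object vectors. The vector spaces, the basis labels and the toy values for `dogs`, `bones` and `eat` are hypothetical and are not taken from the paper.

```python
from typing import List

Vector = List[float]
Tensor3 = List[List[List[float]]]  # indexed as [subject][sentence][object]


def compose_transitive(subj: Vector, verb: Tensor3, obj: Vector) -> Vector:
    """Contract a verb tensor with subject and object vectors.

    Computes out[j] = sum over i, k of subj[i] * verb[i][j][k] * obj[k].
    """
    sentence_dim = len(verb[0])
    out = [0.0] * sentence_dim
    for i, s_i in enumerate(subj):
        for j in range(sentence_dim):
            for k, o_k in enumerate(obj):
                out[j] += s_i * verb[i][j][k] * o_k
    return out


# Toy noun space with basis (animal, food); toy sentence space with basis (true, false).
dogs: Vector = [1.0, 0.0]
bones: Vector = [0.0, 1.0]

# Hypothetical verb "eat": an animal eating food is "true", anything else is "false".
eat: Tensor3 = [
    [[0.0, 1.0], [1.0, 0.0]],  # subject = animal, rows are [true, false] over objects
    [[0.0, 0.0], [1.0, 1.0]],  # subject = food
]

dogs_eat_bones = compose_transitive(dogs, eat, bones)
bones_eat_dogs = compose_transitive(bones, eat, dogs)
```

With these toy values, "dogs eat bones" lands on the "true" axis of the sentence space and "bones eat dogs" on the "false" axis, which shows how word order (grammar) changes the result even though the same word vectors are used.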
Machine translation systems often incorporate modeling assumptions motivated by properties of the language pairs they initially target. When such systems are applied to language families with considerably different properties, translation quality can deteriorate. Phrase-based machine translation
regarded as a fairly complete dictionary contains about 18,000 items at solution to the domain-restricted task at translating present, and will be... dictionary access and so on, with an article. Unfortunately, the Weidner system did but as time goes on, one might imagine functionality not know that...superfast type. looped that it will be built with taste by peo. writer ought to be possible in the monolingual case pie who understand languages and
With accelerating globalization, domestic and international communication has become more frequent than ever before. As the major medium of international communication, languages come into contact with each other more actively by the day. In such active contact, any language gradually develops and changes. Pidgin is a unique linguistic phenomenon…
Background: Accurate information is needed to direct healthcare systems’ efforts to control methicillin-resistant Staphylococcus aureus (MRSA). Assembling complete and correct microbiology data is vital to understanding and addressing the multiple drug-resistant organisms in our hospitals. Methods: Herein, we describe a system that securely gathers microbiology data from the Department of Veterans Affairs (VA) network of databases. Using natural language processing methods, we applied an information extraction process to extract organisms and susceptibilities from the free-text data. We then validated the extraction against independently derived electronic data and expert annotation. Results: We estimate that the collected microbiology data are 98.5% complete and that methicillin-resistant Staphylococcus aureus was extracted accurately 99.7% of the time. Conclusions: Applying natural language processing methods to microbiology records appears to be a promising way to extract accurate and useful nosocomial pathogen surveillance data. Both scientific inquiry and the data’s reliability will depend on the surveillance system’s ability to compare data from multiple sources and circumvent systematic error. The dataset constructed and the methods used for this investigation could contribute to a comprehensive infectious disease surveillance system or other pressing needs.
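To make the idea of extracting organisms and susceptibilities from free text concrete, here is a minimal rule-based sketch. The report layout (an `ORGANISM:` line followed by `DRUG  S/I/R` lines), the regular expressions and the helper names are all assumptions made for illustration; the abstract does not describe the actual VA report formats or the extraction rules that were used.

```python
import re
from typing import Dict, List, Tuple

# Hypothetical line formats, assumed only for this sketch.
ORGANISM_RE = re.compile(r"^\s*ORGANISM:\s*(?P<name>.+?)\s*$", re.IGNORECASE)
SUSCEPT_RE = re.compile(r"^\s*(?P<drug>[A-Za-z/\- ]+?)\s+(?P<result>[SIR])\s*$")

Isolate = Tuple[str, Dict[str, str]]


def extract(report: str) -> List[Isolate]:
    """Return (organism, {drug: S/I/R}) pairs found in a free-text report."""
    isolates: List[Isolate] = []
    for line in report.splitlines():
        organism = ORGANISM_RE.match(line)
        if organism:
            isolates.append((organism.group("name").upper(), {}))
            continue
        suscept = SUSCEPT_RE.match(line)
        if suscept and isolates:
            # Attach the result to the most recently seen organism.
            drug = suscept.group("drug").strip().upper()
            isolates[-1][1][drug] = suscept.group("result")
    return isolates


def is_mrsa(organism: str, susceptibilities: Dict[str, str]) -> bool:
    """Flag S. aureus isolates reported resistant to oxacillin or methicillin."""
    resistant = {susceptibilities.get("OXACILLIN"), susceptibilities.get("METHICILLIN")}
    return organism == "STAPHYLOCOCCUS AUREUS" and "R" in resistant


sample_report = """ORGANISM: Staphylococcus aureus
  OXACILLIN     R
  VANCOMYCIN    S
ORGANISM: Escherichia coli
  CIPROFLOXACIN S
"""

isolates = extract(sample_report)
mrsa_flags = [is_mrsa(name, results) for name, results in isolates]
```

A real system would need to handle many more layouts, abbreviations and negations, and would be validated against independent data and expert annotation, as the abstract describes.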
Noguera-Arnaldos, José Ángel
The Internet of Things (IoT) offers opportunities for new applications and services that enable users to access and control their working and home environment from local and remote locations, aiming to perform daily life activities in an easy way. However, the IoT also introduces new challenges, some of which arise from the large range of devices currently available and the heterogeneous interfaces provided for their control. The control and management of this variety of devices and interfaces represent a new challenge for non-expert users, instead of making their life easier. Based on this understanding, in this work we present a natural language interface for the IoT, which takes advantage of Semantic Web technologies to allow non-expert users to control their home environment through an instant messaging application in an easy and intuitive way. We conducted several experiments with a group of end users aiming to evaluate the effectiveness of our approach to control home appliances by means of natural language instructions. The evaluation results proved that without the need for technicalities, the user was able to control the home appliances in an efficient way.
Surveys developments in language revitalization and language death. Focusing on indigenous languages, discusses the role and nature of appropriate linguistic documentation, possibilities for bilingual education, and methods of promoting oral fluency and intergenerational transmission in affected languages. (Author/VWL)
Martin-Dunlop, Catherine S.
This study investigated prospective elementary teachers' understandings of the nature of science and explored associations with their guided-inquiry science learning environment. Over 500 female students completed the Nature of Scientific Knowledge Survey (NSKS), although only four scales were analyzed: Creative, Testable, Amoral, and Unified. The…
One dimension of early Canadian education is the attempt of the government to use the education system as an assimilative tool to integrate the First Nations and Métis people into Euro-Canadian society. Despite these attempts, many First Nations and Métis people retained their culture and their indigenous language. Few science educators have examined First Nations and Western scientific worldviews and the impact they may have on science learning. This study explored the views some First Nations (Cree) and Euro-Canadian Grade-7-level students in Manitoba had about the nature of science. Both qualitative (open-ended questions and interviews) and quantitative (a Likert-scale questionnaire) instruments were used to explore student views. A central hypothesis to this research programme is the possibility that the different world-views of two student populations, Cree and Euro-Canadian, are likely to influence their perceptions of science. This preliminary study explored a range of methodologies to probe the perceptions of the nature of science in these two student populations. It was found that the two cultural groups differed significantly between some of the tenets in a Nature of Scientific Knowledge Scale (NSKS). Cree students significantly differed from Euro-Canadian students on the developmental, testable and unified tenets of the nature of scientific knowledge scale. No significant differences were found in NSKS scores between language groups (Cree students who speak English in the home and those who speak English and Cree or Cree only). The differences found between language groups were primarily in the open-ended questions where preformulated responses were absent. Interviews about critical incidents provided more detailed accounts of the Cree students' perception of the nature of science. The implications of the findings of this study are discussed in relation to the challenges related to research methodology, further areas for investigation, science
Pahisa-Solé, Joan; Herrera-Joancomartí, Jordi
In this article, we describe a compansion system that transforms the telegraphic language that comes from the use of pictogram-based augmentative and alternative communication (AAC) into natural language. The system was tested with four participants with severe cerebral palsy and varying degrees of linguistic competence and intellectual disabilities. Participants had each used pictogram-based AAC for at least the past 30 years and presented a stable linguistic profile. During tests, which consisted of a total of 40 sessions, participants were able to learn new linguistic skills, such as the use of basic verb tenses, while using the compansion system, which proved a source of motivation. The system can be adapted to the linguistic competence of each person and required no learning curve during tests when none of its special features, like gender, number, verb tense, or sentence type modifiers, were used. Furthermore, qualitative and quantitative results showed a mean communication rate increase of 41.59%, compared to the same communication device without the compansion system, and an overall improvement in the communication experience when the output is in natural language. Tests were conducted in Catalan and Spanish.
Elio Jesús Cruz Rondón
Learning a foreign language may be a challenge for most people due to differences in the form and structure between one’s mother tongue and a new one. However, there are some tools that facilitate the teaching and learning of a foreign language, for instance, new applications for digital devices, video blogs, educational platforms, and teaching materials. Therefore, this case study aims at understanding the role of teaching materials among beginners’ level students learning English as a foreign language. After conducting five non-participant classroom observations and nine semi-structured interviews, we found that the way the teacher implemented a pedagogical intervention by integrating the four language skills, promoting interactive learning through the use of online resources, and using the course book led to a global English teaching and learning process.
The semantic web extends the current World Wide Web by adding facilities for the machine-understandable description of meaning. The ontology-based search model is used to enhance the efficiency and accuracy of information retrieval. Ontology is the core technology of the semantic web and its mechanism for representing formal and shared domain descriptions. In this paper, we propose an ontology-based meaningful search using semantic web and Natural Language Processing (NLP) techniques in the educational domain. First we build the educational ontology, then we present the semantic search system. The search model consists of three parts: embedded spell-check, finding synonyms using the WordNet API, and querying the ontology using the SPARQL language. The results are sensitive to both spell-check and synonymous context. This paper provides more accurate results and the complete details for the selected field in a single page.
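The three-part search model (spell-check, synonym expansion, SPARQL query) can be sketched as a small pipeline. Everything concrete here is an assumption made for illustration: the vocabulary, the hand-written synonym table that stands in for a WordNet lookup, and the use of `rdfs:label` for matching are hypothetical and are not taken from the paper's educational ontology.

```python
import difflib
from typing import Dict, List

# Hypothetical ontology vocabulary and synonym table (a stand-in for WordNet).
VOCABULARY: List[str] = ["course", "lecturer", "student", "department"]
SYNONYMS: Dict[str, List[str]] = {
    "lecturer": ["teacher", "instructor"],
    "course": ["class", "module"],
}


def correct(term: str) -> str:
    """Step 1: map a possibly misspelled term to the closest vocabulary entry."""
    term = term.lower()
    matches = difflib.get_close_matches(term, VOCABULARY, n=1, cutoff=0.7)
    return matches[0] if matches else term


def expand(term: str) -> List[str]:
    """Step 2: add synonyms so that equivalent labels also match."""
    return [term] + SYNONYMS.get(term, [])


def build_sparql(labels: List[str]) -> str:
    """Step 3: build a SPARQL query that matches any of the given labels.

    Labels come from the fixed tables above; user-supplied strings would
    need proper escaping before being placed in a query.
    """
    values = " ".join('"%s"' % label for label in labels)
    return (
        "PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>\n"
        "SELECT ?resource ?label WHERE {\n"
        "  VALUES ?wanted { %s }\n"
        "  ?resource rdfs:label ?label .\n"
        "  FILTER(LCASE(STR(?label)) = ?wanted)\n"
        "}" % values
    )


query = build_sparql(expand(correct("lecturrer")))
```

The resulting `query` string could then be sent to any SPARQL endpoint that hosts the ontology; the endpoint and the execution step are left out because the abstract does not specify them.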
Jones, William I.
This study examined the understanding of nature of science among participants in their final year of a 4-year undergraduate teacher education program at a Midwest liberal arts university. The Logic Model Process was used as an integrative framework to focus the collection, organization, analysis, and interpretation of the data for the purpose of (1) describing participant understanding of NOS and (2) to identify participant characteristics and teacher education program features related to those understandings. The Views of Nature of Science Questionnaire form C (VNOS-C) was used to survey participant understanding of 7 target aspects of Nature of Science (NOS). A rubric was developed from a review of the literature to categorize and score participant understanding of the target aspects of NOS. Participants' high school and college transcripts, planning guides for their respective teacher education program majors, and science content and science teaching methods course syllabi were examined to identify and categorize participant characteristics and teacher education program features. The R software (R Project for Statistical Computing, 2010) was used to conduct an exploratory analysis to determine correlations of the antecedent and transaction predictor variables with participants' scores on the 7 target aspects of NOS. Fourteen participant characteristics and teacher education program features were moderately and significantly ( p Middle Childhood with a science concentration program major or in the Adolescent/Young Adult Science Education program major were more likely to have an informed understanding on each of the 7 target aspects of NOS. Analyses of the planning guides and the course syllabi in each teacher education program major revealed differences between the program majors that may account for the results.
.... After examining some situations in which United States and British forces carried out counterinsurgency operations, the author reveals that ground troops with foreign-language skills and cultural...
Geide Rosa Coelho
We report an inquiry into the development of students' understanding of the nature of light. The study took place in a learning environment with a recursive and spiral Physics syllabus. We investigated the change in students' understanding of the nature of light during their 3rd year in High School, and the level of understanding of this subject achieved by students at the end of this year. To assess the students' understanding, we developed an open questionnaire form and a set of hierarchical categories, consisting of five different models of the nature of light. The questionnaire was used to assess the students' understanding at the beginning and at the end of the third level of the recursive curriculum. The results showed that students have a high level of prior knowledge, and also that the Physics learning they experienced had enhanced their understanding, although the effects were not observed in all the Physics classes. By the end of the third year, most of the students explained the nature of light using either a corpuscular electromagnetic model or a dual electromagnetic model, but some students used these models with inconsistencies in their explanations.
Famed for his collection of drawings of naturalia and his thoughts on the relationship between painting and natural knowledge, it now appears that the Bolognese naturalist Ulisse Aldrovandi (1522-1605) also pondered specifically color and pigments, compiling not only lists and diagrams of color terms but also a full-length unpublished manuscript entitled De coloribus or Trattato dei colori. Introducing these writings for the first time, this article portrays a scholar not so much interested in the materiality of pigment production, as in the cultural history of hues. It argues that these writings constituted an effort to build a language of color, in the sense both of a standard nomenclature of hues and of a lexicon, a dictionary of their denotations and connotations as documented in the literature of ancients and moderns. This language would serve the naturalist in his artistic patronage and his natural historical studies, where color was considered one of the most reliable signs for the correct identification of specimens, and a guarantee of accuracy in their illustration. Far from being an exception, Aldrovandi's 'color sensibility' spoke of that of his university-educated nature-loving peers.
Benjamin O. Ladd
Introduction: Change talk (CT) and sustain talk (ST) are thought to reflect underlying motivation and be important mechanisms of behavior change (MOBCs). However, greater specificity and experimental rigor is needed to establish CT and ST as MOBCs. Testing the effects of self-directed language under laboratory conditions is one promising avenue. The current study presents a replication and extension of research examining the feasibility for using simulation tasks to elicit self-directed language. Methods: First-year college students (N=92) responded to the Collegiate Simulated Intoxication Digital Elicitation, a validated task for assessing decision-making in college drinking. Verbal responses elicited via free-response and structured interview formats were coded based on established definitions of CT and ST, with minor modifications to reflect the non-treatment context. Associations between self-directed language and alcohol use at baseline and eight months were examined. Additionally, this study examined whether a contextually-based measure of decision-making, behavioral willingness, mediated relationships between self-directed language and alcohol outcome. Results: Healthy talk and unhealthy talk independently were associated with baseline alcohol use across both elicitation formats. Only healthy talk during the free-response elicitation was associated with alcohol use at follow up; both healthy talk and unhealthy talk during the interview elicitation were associated with 8-month alcohol use. Behavioral willingness significantly mediated the relationship between percent healthy talk and alcohol outcome. Conclusions: Findings support the utility of studying self-directed language under laboratory conditions and suggest that such methods may provide a fruitful strategy to further understand the role of self-directed language as a MOBC. Keywords: Change talk, College students, Alcohol, Simulation task
This volume deals with the computational application of systemic functional grammar (SFG) for natural language generation. In particular, it describes the implementation of a fragment of the grammar of German in the computational framework of KOMET-PENMAN for multilingual generation. The text also presents a specification of explicit well-formedness constraints on syntagmatic structure which are defined in the form of typed feature structures. It thus achieves a model of systemic functional grammar that unites both the strengths of systemics, such as stratification, functional diversification
Vilic, Adnan; Petersen, John Asger; Hoppe, Karsten
This paper presents a data-driven approach to graphically presenting text-based patient journals while still maintaining all textual information. The system first creates a timeline representation of a patient’s physiological condition during an admission, which is assessed by electronically monitoring vital signs and then combining these into Early Warning Scores (EWS). Thereafter, techniques from Natural Language Processing (NLP) are applied to the existing patient journal to extract all entries. Finally, the two methods are combined into an interactive timeline featuring the ability to see drastic changes in the patient’s health, thereby enabling staff to see where in the journal critical events have taken place.
The Quran is the scripture that serves as the main reference for people whose religion is Islam. It covers topics from politics to science, and the vast amount of information it contains requires effort to uncover. Today, the emergence of smartphones has led to the development of a wide range of applications for enhancing knowledge-seeking activities. This project proposes a mobile application that takes a natural language approach to searching topics in the Quran based on keyword searching. The benefit of the application is two-fold: it is intuitive and it saves time.
The aim of this bachelor thesis is to explore this image label database coming from the ESP game from the natural language processing (NLP) point of view. ESP game is an online game, in which human players do useful work - they label images. The output of the ESP game is then a database of images and their labels. What interests us is whether the data collected in the process of labeling images will be of any use in NLP tasks. Specifically, we are interested in the tasks of automatic corefere...
It is shown how certain kinds of domain independent expert systems based on classification problem-solving methods can be constructed directly from natural language descriptions by a human expert. The expert knowledge is not translated into production rules. Rather, it is mapped into conceptual structures which are integrated into long-term memory (LTM). The resulting system is one in which problem-solving, retrieval and memory organization are integrated processes. In other words, the same algorithm and knowledge representation structures are shared by these processes. As a result of this, the system can answer questions, solve problems or reorganize LTM.
Background: Incident reporting is the most common method for detecting adverse events in a hospital. However, under-reporting or non-reporting and delay in submission of reports are problems that prevent early detection of serious adverse events. The aim of this study was to determine whether it is possible to promptly detect serious injuries after inpatient falls by using a natural language processing method and to determine which data source is the most suitable for this purpose. Methods: We tried to detect adverse events from narrative text data of electronic medical records by using a natural language processing method. We made syntactic category decision rules to detect inpatient falls from text data in electronic medical records. We compared how often the true fall events were recorded in various sources of data including progress notes, discharge summaries, image order entries and incident reports. We applied the rules to these data sources and compared F-measures to detect falls between these data sources with reference to the results of a manual chart review. The lag time between event occurrence and data submission and the degree of injury were compared. Results: We made 170 syntactic rules to detect inpatient falls by using a natural language processing method. Information on true fall events was most frequently recorded in progress notes (100%), incident reports (65.0%) and image order entries (12.5%). However, F-measure to detect falls using the rules was poor when using progress notes (0.12) and discharge summaries (0.24) compared with that when using incident reports (1.00) and image order entries (0.91). Since the results suggested that incident reports and image order entries were possible data sources for prompt detection of serious falls, we focused on a comparison of falls found by incident reports and image order entries. Injury caused by falls found by image order entries was significantly more severe than falls detected by
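Since the study compares data sources by F-measure against a manual chart review, a short sketch of that metric may help. It uses the standard definition (the weighted harmonic mean of precision and recall, with F1 when `beta` is 1); the record identifiers in the toy example are hypothetical and unrelated to the study's data.

```python
from typing import Set


def f_measure(predicted: Set[int], gold: Set[int], beta: float = 1.0) -> float:
    """F-measure of predicted positives against a gold standard.

    precision = TP / (TP + FP), recall = TP / (TP + FN),
    F = (1 + beta^2) * precision * recall / (beta^2 * precision + recall).
    """
    true_positives = len(predicted & gold)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(predicted)
    recall = true_positives / len(gold)
    beta_sq = beta * beta
    return (1 + beta_sq) * precision * recall / (beta_sq * precision + recall)


# Hypothetical record IDs: falls confirmed by chart review versus rule output.
chart_review_falls = {1, 2, 3, 4}
rule_detected_falls = {2, 3, 4, 5, 6}

# 3 true positives, 2 false positives, 1 false negative:
# precision = 0.6, recall = 0.75, F1 = 2 * 0.45 / 1.35 = 2/3.
score = f_measure(rule_detected_falls, chart_review_falls)
```

A low F-measure for a source such as progress notes can therefore coexist with a high recording rate: the events are present in the text, but the rules also fire on many records that are not falls, or miss many that are.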
This book introduces basic supervised learning algorithms applicable to natural language processing (NLP) and shows how the performance of these algorithms can often be improved by exploiting the marginal distribution of large amounts of unlabeled data. One reason for that is data sparsity, i.e., the limited amounts of data we have available in NLP. However, in most real-world NLP applications our labeled data is also heavily biased. This book introduces extensions of supervised learning algorithms to cope with data sparsity and different kinds of sampling bias. This book is intended to be both
D'Souza, Dean; Filippi, Roberto
The ability to acquire language is a critical part of human development. Yet there is no consensus on how the skill emerges in early development. Does it constitute an innately-specified, language-processing module or is it acquired progressively? One of Annette Karmiloff-Smith's (1938-2016) key contributions to developmental science addresses…
Tomblin, J. Bruce; Mueller, Kathyrn L.
This article provides a background for the topic of comorbidity of attention-deficit/hyperactivity disorder and spoken and written language and speech disorders that extends through this issue of "Topics in Language Disorders." Comorbidity is common within developmental disorders and may be explained by many possible reasons. Some of these can be…
Drawing on institutional theory, this study describes how cognitive, normative, and regulative mechanisms shape bilingual teachers' language policy implementation in both English-only and bilingual contexts. Aligned with prior educational language policy research, findings indicate the important role that teachers' beliefs play in the policy…
Sauerland, Uli; Grohmann, Kleanthes K.; Guasti, Maria Teresa; Andelkovic, Darinka; Argus, Reili; Armon-Lotem, Sharon; Arosio, Fabrizio; Avram, Larisa; Costa, João; Dabašinskiene, Ineta; de López, Kristine; Gatt, Daniela; Grech, Helen; Haman, Ewa; van Hout, Angeliek; Hrzica, Gordana; Kainhofer, Judith; Kamandulyte-Merfeldiene, Laura; Kunnari, Sari; Kovacevic, Melita; Kuvac Kraljevic, Jelena; Lipowska, Katarzyna; Mejias, Sandrine; Popovic, Maša; Ruzaite, Jurate; Savic, Maja; Sevcenco, Anca; Varlokosta, Spyridoula; Varnava, Marina; Yatsushiro, Kazuko
The comprehension of constituent questions is an important topic for language acquisition research and for applications in the diagnosis of language impairment. This article presents the results of a study investigating the comprehension of different types of questions by 5-year-old, typically developing children across 19 European countries, 18…
This paper investigates the interplay of constructed action and the clause in Finnish Sign Language (FinSL). Constructed action is a form of gestural enactment in which the signers use their hands, face and other parts of the body to represent the actions, thoughts or feelings of someone they are referring to in the discourse. With the help of frequencies calculated from corpus data, this article shows firstly that when FinSL signers are narrating a story, there are differences in how they use constructed action. Then the paper argues that there are differences also in the prototypical structure, linkage type and non-manual activity of clauses, depending on the presence or non-presence of constructed action. Finally, taking the view that gesturality is an integral part of language, the paper discusses the nature of syntax in sign languages and proposes a conceptualization in which syntax is seen as a set of norms distributed on a continuum between a categorial-conventional end and a gradient-unconventional end.
The frequency distribution of words has been a key object of study in statistical linguistics for the past 70 years. This distribution approximately follows a simple mathematical form known as Zipf’s law. This article first shows that human language has a highly complex, reliable structure in the frequency distribution over and above this classic law, although prior data visualization methods have obscured this fact. A number of empirical phenomena related to word frequencies are then reviewed. These facts are chosen to be informative about the mechanisms giving rise to Zipf’s law and are then used to evaluate many of the theoretical explanations of Zipf’s law in language. No prior account straightforwardly explains all the basic facts or is supported with independent evaluation of its underlying assumptions. To make progress at understanding why language obeys Zipf’s law, studies must seek evidence beyond the law itself, testing assumptions and evaluating novel predictions with new, independent data. PMID:24664880
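In its simplest form, Zipf's law states that the frequency of the word at rank r is roughly proportional to 1 / r^alpha, with alpha close to 1. The sketch below builds a rank-frequency list and estimates alpha by ordinary least squares on the log-log values. This is a deliberately crude estimator chosen for brevity, not the method used in the article, and the function names are hypothetical.

```python
import math
from collections import Counter
from typing import Iterable, List


def rank_frequency(tokens: Iterable[str]) -> List[int]:
    """Word counts sorted from most to least frequent (rank 1 first)."""
    return sorted(Counter(tokens).values(), reverse=True)


def zipf_exponent(frequencies: List[int]) -> float:
    """Estimate alpha in f(r) ~ C / r**alpha by least squares on log f vs log r."""
    xs = [math.log(rank) for rank in range(1, len(frequencies) + 1)]
    ys = [math.log(freq) for freq in frequencies]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return -sxy / sxx  # the fitted slope is -alpha


counts = rank_frequency("the cat and the dog and the bird".split())

# Synthetic frequencies that follow f(r) = 1200 / r exactly, so alpha is 1.
alpha = zipf_exponent([1200, 600, 400, 300, 240])
```

Because counts and ranks estimated from the same corpus are correlated, a fit like this can look cleaner than the data really are, which is in keeping with the article's call to seek evidence beyond the law itself.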
Rice, Mabel L
Future perspectives on children with language impairments are framed from what is known about children with specific language impairment (SLI). A summary of the current state of services is followed by discussion of how these children can be overlooked and misunderstood and consideration of why it is so hard for some children to acquire language when it is effortless for most children. Genetic influences are highlighted, with the suggestion that nature plus nurture should be considered in present as well as future intervention approaches. A nurture perspective highlights the family context of the likelihood of SLI for some of the children. Future models of the causal pathways may provide more specific information to guide gene-treatment decisions, in ways parallel to current personalized medicine approaches. Future treatment options can build on the potential of electronic technologies and social media to provide personalized treatment methods available at a time and place convenient for the person to use as often as desired. The speech-language pathologist could oversee a wide range of treatment options and monitor evidence provided electronically to evaluate progress and plan future treatment steps. Most importantly, future methods can provide lifelong language acquisition activities that maintain the privacy and dignity of persons with language impairment, and in so doing will in turn enhance the effectiveness of speech-language pathologists. Thieme Medical Publishers 333 Seventh Avenue, New York, NY 10001, USA.
DiBenedetto, Christina M.
This study is the first of its kind to explore the thoughts, beliefs, attitudes and values of secondary educators as they experience conceptual change in their understanding of the nature of science learning vis a vis the Framework for K-12 Science Education published by the National Research Council. The study takes aim at the existing gap between the vision for science learning as an active process of inquiry and current pedagogical practices in K-12 science classrooms. For students to understand and explain everyday science ideas and succeed in science studies and careers, the means by which they learn science must change. Focusing on this change, the study explores the significance of educator attitudes, beliefs and values to science learning through interpretive phenomenological analysis around the central question, "In what ways do educators understand and articulate attitudes and beliefs toward the nature of science learning?" The study further explores the questions, "How do educators experience changes in their understanding of the nature of science learning?" and "How do educators believe these changes influence their pedagogical practice?" Study findings converge on four conceptions that science learning: is the action of inquiry; is a visible process initiated by both teacher and learner; values student voice and changing conceptions is science learning. These findings have implications for the primacy of educator beliefs, attitudes and values in reform efforts, science teacher leadership and the explicit instruction of both Nature of Science and conceptual change in educator preparation programs. This study supports the understanding that the nature of science learning is cognitive and affective conceptual change. Keywords: conceptual change, educator attitudes and beliefs, framework for K-12 science education, interpretive phenomenological analysis, nature of science learning, next generation science standards, science professional development
Pazos R, Rodolfo A; Aguirre L, Marco A; González B, Juan J; Martínez F, José A; Pérez O, Joaquín; Verástegui O, Andrés A
In the last decades the popularity of natural language interfaces to databases (NLIDBs) has increased, because in many cases information obtained from them is used for making important business decisions. Unfortunately, the complexity of their customization by database administrators makes them difficult to use. For an NLIDB to obtain a high percentage of correctly translated queries, it must be correctly customized for the database to be queried. In most cases the performance reported in the NLIDB literature is the highest possible; i.e., the performance obtained when the interfaces were customized by the implementers. However, for end users the more important figure is the performance that the interface can yield when the NLIDB is customized by someone other than the implementers. Unfortunately, very few articles report NLIDB performance when the NLIDBs are not customized by the implementers. This article presents a semantically-enriched data dictionary (which permits solving many of the problems that occur when translating from natural language to SQL) and an experiment in which two groups of undergraduate students customized our NLIDB and English Language Frontend (ELF), considered one of the best available commercial NLIDBs. The experimental results show that, when customized by the first group, our NLIDB correctly answered 44.69% of queries and ELF 11.83% for the ATIS database, and when customized by the second group, our NLIDB attained 77.05% and ELF 13.48%. The performance attained by our NLIDB when customized by ourselves was 90%.
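The role of a customized data dictionary in translating natural language to SQL can be illustrated with a very small sketch. It is not the authors' semantically-enriched dictionary: the phrase-to-column table, the table name `flight`, the column names and the "phrase is value" pattern are all hypothetical, chosen only to show why the quality of the customization determines which queries can be translated.

```python
import re
from typing import Dict, List, Tuple

# Hypothetical customization: natural language phrases mapped to column names.
TABLE = "flight"
COLUMN_PHRASES: Dict[str, str] = {
    "departure time": "departure_time",
    "airline": "airline_code",
    "origin": "from_airport",
    "destination": "to_airport",
}


def translate(question: str) -> Tuple[str, List[str]]:
    """Translate a simple question into parameterized SQL.

    A phrase followed by "is <value>" becomes a WHERE condition;
    a phrase mentioned on its own becomes a selected column.
    """
    text = question.lower()
    selected: List[str] = []
    conditions: List[str] = []
    params: List[str] = []
    for phrase, column in COLUMN_PHRASES.items():
        condition = re.search(r"\b%s is (\w+)" % re.escape(phrase), text)
        if condition:
            conditions.append("%s = ?" % column)
            params.append(condition.group(1).upper())
        elif re.search(r"\b%s\b" % re.escape(phrase), text):
            selected.append(column)
    sql = "SELECT %s FROM %s" % (", ".join(selected) or "*", TABLE)
    if conditions:
        sql += " WHERE " + " AND ".join(conditions)
    return sql, params


sql, params = translate(
    "List the departure time and airline of flights "
    "whose origin is BOS and destination is DEN"
)
```

A question that uses a phrase missing from `COLUMN_PHRASES` (for example "carrier" instead of "airline") is silently mistranslated, which is the kind of failure that careful customization is meant to prevent.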
Amaechi Uneke Enyi
The study, entitled “Language and Interactional Discourse: Deconstructing the Talk-Generating Machinery in Natural Conversation,” is an analysis of spontaneous and informal conversation. The study, carried out in the theoretical and methodological tradition of Ethnomethodology, was aimed at explicating how ordinary talk is organized and produced, how people coordinate their talk-in-interaction, how meanings are determined, and the role of talk in the wider social processes. The study followed the basic assumption of conversation analysis, which is that talk is not just a product of two ‘speakers-hearers’ who attempt to exchange information or convey messages to each other. Rather, participants in conversation are seen to be mutually orienting to, and collaborating in order to achieve, orderly and meaningful communication. The analytic objective is therefore to make clear the procedures on which speakers rely to produce utterances and by which they make sense of other speakers’ talk. The datum used for this study was a recorded informal conversation between two (and later three) middle-class civil servants who are friends. The recording was done in such a way that the participants were not aware that they were being recorded. The recording was later transcribed in a way that we believe is faithful to the spontaneity and informality of the talk. Our findings showed that conversation has its own features and is an ordered and structured day-to-day social event. Specifically, utterances are designed and informed by organized procedures, methods and resources which are tied to the contexts in which they are produced, and which participants are privy to by virtue of their membership of a culture or a natural language community. Keywords: Language, Discourse and Conversation
Kreimeyer, Kory; Foster, Matthew; Pandey, Abhishek; Arya, Nina; Halford, Gwendolyn; Jones, Sandra F; Forshee, Richard; Walderhaug, Mark; Botsis, Taxiarchis
We followed a systematic approach based on the Preferred Reporting Items for Systematic Reviews and Meta-Analyses to identify existing clinical natural language processing (NLP) systems that generate structured information from unstructured free text. Seven literature databases were searched with a query combining the concepts of natural language processing and structured data capture. Two reviewers screened all records for relevance during two screening phases, and information about clinical NLP systems was collected from the final set of papers. A total of 7149 records (after removing duplicates) were retrieved and screened, and 86 were determined to fit the review criteria. These papers contained information about 71 different clinical NLP systems, which were then analyzed. The NLP systems address a wide variety of important clinical and research tasks. Certain tasks are well addressed by the existing systems, while others remain as open challenges that only a small number of systems attempt, such as extraction of temporal information or normalization of concepts to standard terminologies. This review has identified many NLP systems capable of processing clinical free text and generating structured output, and the information collected and evaluated here will be important for prioritizing development of new approaches for clinical NLP. Copyright © 2017 Elsevier Inc. All rights reserved.
Language is a systematic way of communicating using sounds or symbols that carry meaning, uttered through the mouth. Language is also written following the applicable rules. One of the languages widely used around the world is English. However, there are several obstacles when learning from a teacher or instructor. The time a teacher can give is limited to school or tutoring hours. Once students leave school or tutoring, they must study English independently. From these problems arose the idea of a study on building an application that can teach students how to learn English independently, covering the transformation of positive sentences into negative and interrogative sentences. In addition, the application can teach how to pronounce sentences in English. In essence, the contribution of this research is that interested parties from junior high school (SMP) through senior high and vocational school (SMU/SMK) can use a text-to-speech application based on natural language processing to study English tenses. The application can read English sentences aloud and can construct interrogative and negative sentences from their positive counterparts in several English tenses. Keywords: Natural language processing, Text to speech
Alexandr I Krupnov
The article discusses the results of an empirical study of the association between variables of persistence and academic achievement in foreign languages. The sample includes students of the Faculty of Physics, Mathematics and Natural Science at the RUDN University (n = 115), divided into 5 subsamples, two of which are featured in the present study (the most and the least successful student subsamples). Persistence as a personality trait is studied within A.I. Krupnov’s system-functional approach. A.I. Krupnov’s paper-and-pencil test was used to measure persistence variables. Academic achievement was measured according to four parameters: Phonetics, Grammar, Speaking and Political vocabulary, based on the grades students received during the academic year. The analysis revealed that persistence displays different associations with academic achievement variables in the more and less successful student subsamples; the general prominence of this trait is more important for unsuccessful students. Phonetics is the academic achievement variable most associated with persistence due to its nature: it is a skill one can acquire through hard work and practice, which is the definition of persistence. Grammar as an academic achievement variable is not associated with persistence and probably relates to other factors. Unsuccessful students may have difficulties in separating various aspects of language acquisition from each other, which should be taken into consideration by teachers.
Genuardi, Michael T.
One strategy for machine-aided indexing (MAI) is to provide a concept-level analysis of the textual elements of documents or document abstracts. In such systems, natural-language phrases are analyzed in order to identify and classify concepts related to a particular subject domain. The overall performance of these MAI systems is largely dependent on the quality and comprehensiveness of their knowledge bases. These knowledge bases function to (1) define the relations between a controlled indexing vocabulary and natural language expressions; (2) provide a simple mechanism for disambiguation and the determination of relevancy; and (3) allow the extension of concept-hierarchical structure to all elements of the knowledge file. After a brief description of the NASA Machine-Aided Indexing system, concerns related to the development and maintenance of MAI knowledge bases are discussed. Particular emphasis is given to statistically-based text analysis tools designed to aid the knowledge base developer. One such tool, the Knowledge Base Building (KBB) program, presents the domain expert with a well-filtered list of synonyms and conceptually-related phrases for each thesaurus concept. Another tool, the Knowledge Base Maintenance (KBM) program, functions to identify areas of the knowledge base affected by changes in the conceptual domain (for example, the addition of a new thesaurus term). An alternate use of the KBM as an aid in thesaurus construction is also discussed.
Varelas, Maria; Pappas, Christine; Barry, Anne; O'Neill, Amy
Presents units that address states of matter and changes of states of matter linked with the water cycle and integrates literacy and science. Discusses the language in science books. Lists characteristics of good science inquiry units. (Contains 11 references.) (ASK)
Bell, Randy L.; Matkins, Juanita Jo; Gansneder, Bruce M.
This mixed-methods investigation compared the relative impacts of instructional approach and context of nature of science instruction on preservice elementary teachers' understandings. The sample consisted of 75 preservice teachers enrolled in four sections of an elementary science methods course. Independent variables included instructional…
do Nascimento Rocha, Maristela; Gurgel, Ivã
This paper performs a critical analysis of the consensual and family resemblance approaches to the nature of science. Despite the debate that surrounds them, between a pragmatic consensus and a more comprehensive understanding, both approaches have in common the goal of helping students to "internalize" knowledge about science in a…
Sengdala, Phoxay; Yuenyong, Chokchai
This paper aimed to study Grade 12 students' understanding of the nature of science in learning about atom for peace through the science, technology and society (STS) approach. Participants were 51 Grade 12 students at Thongphong High School, Vientiane Capital City, Lao PDR, in the 1st semester of the 2012 academic year. This research regarded interpretive…
Chuy, Maria; Scardamalia, Marlene; Bereiter, Carl; Prinsen, Fleur; Resendes, Monica; Messina, Richard; Hunsburger, Winifred; Teplovs, Chris; Chow, Angela
In 1993 Carey and Smith conjectured that the most promising way to boost students' understanding of the nature of science is a "theory-building approach to teaching about inquiry." The research reported here tested this conjecture by comparing results from two Grade 4 classrooms that differed in their emphasis on and technological…
McComas, William F.
The nature of science (NOS) is a phrase used to represent the rules of the game of science. Arguably, NOS is the most important content issue in science instruction because it helps students understand the way in which knowledge is generated and validated within the scientific enterprise. This article offers a proposal for the elements of NOS that…
This dissertation may be located in the wide debate on the effectiveness of policy interventions in developing countries, in the field of natural resource management (NRM). It is especially concerned with contributing to the understanding of the limited effectiveness of fishery management
Gotch, Chad; Hall, Troy
The Theory of Reasoned Action has proven to be a valuable tool for predicting and understanding behavior and, as such, provides a potentially important basis for environmental education program design. This study used a Theory of Reasoned Action approach to examine a unique type of behavior (nature-related activities) and a unique population…
The seventh issue of Complex Systems Informatics and Modeling Quarterly presents five papers devoted to two distinct research topics: systems modeling and natural language processing (NLP). Both of these subjects are very important in computer science. Through modeling we can simplify the studied problem by concentrating on only one aspect at a time. Moreover, a properly constructed model allows the modeler to work at higher levels of abstraction without having to concentrate on details. Since the size and complexity of information systems grow rapidly, creating good models of such systems is crucial. The analysis of natural language is slowly becoming a widely used tool in commerce and day-to-day life. Opinion mining allows recommender systems to provide accurate recommendations based on user-generated reviews. Speech recognition and NLP are the basis for such widely used personal assistants as Apple’s Siri, Microsoft’s Cortana, and Google Now. While a lot of work has already been done on natural language processing, the research usually concerns widely used languages, such as English. Consequently, natural language processing in languages other than English is a very relevant subject and is addressed in this issue.
Ma, Cuixia; Dai, Guozhong
The natural user interface is one of the important next-generation forms of interaction. Computers are no longer tools only for specialists or particular fields but for most people. Ubiquitous computing makes the world magic and more comfortable. In the design domain, current systems, which require detailed information, cannot conveniently support conceptual design in the early phase. Pen and paper are natural and simple tools in daily life, especially in the design domain. Gestures are a useful and natural mode of pen-based interaction. In a natural UI, gestures can be introduced and used in a manner similar to the existing interaction resources. However, gestures are always defined beforehand, without regard to the users' intentions, and are recognized as representing something in a particular application without being transferable to others. We provide the gesture description language (GDL) to make useful gestures conveniently available to applications. It can be used as an independent control resource, like menus or icons in applications. We present the idea from two perspectives: one from the application-dependent point of view and the other from the application-independent point of view.
Individual classroom experiences: a sociocultural comparison for understanding efl classroom language learning
This paper compares the classroom experiences (CEs) of two university students in their process of learning English as a foreign language (EFL). The CEs emerged from individual interviews, where classroom videos promoted reflection. The analysis revealed that cognitive, social and affective experiences directly influence the learning process and that those which refer to setting, learner’s personal background, beliefs and goals influence the learning process indirectly. The analysis also revealed the singularity of some of these CEs, which led to their categorization as individual CEs (ICEs). When comparing the ICEs of the two participants, the importance of a sociocultural analysis of the classroom learning process becomes evident. We conclude with an analysis of the value of sociocultural theory in the study of classroom EFL learning and with the implications of this study for teachers and researchers.
Grazzani, Ilaria; Ornaghi, Veronica; Conte, Elisabetta; Pepe, Alessandro; Caprin, Claudia
Although a significant body of research has investigated the relationships among children's emotion understanding (EU), theory of mind (ToM), and language abilities, as far as we know, no study to date has been conducted with a sizeable sample of both preschool and school-age children exploring the direct effect of EU on ToM when the role of language was evaluated as a potential exogenous factor in a single comprehensive model. Participants in the current study were 389 children (age range: 37-97 months, M = 60.79 months; SD = 12.66), to whom a False-Belief understanding battery, the Test of Emotion Comprehension, and the Peabody Test were administered. Children's EU, ToM, and language ability (receptive vocabulary) were positively correlated. Furthermore, EU scores explained variability in ToM scores independently of participants' age and gender. Finally, language was found to play a crucial role in both explaining variance in ToM scores and in mediating the relationship between EU and ToM. We discuss the theoretical and educational implications of these outcomes, particularly in relation to offering social and emotional learning programs through schools.
Zhang, Xingyu; Kim, Joyce; Patzer, Rachel E; Pitts, Stephen R; Patzer, Aaron; Schrager, Justin D
To describe and compare logistic regression and neural network modeling strategies to predict hospital admission or transfer following initial presentation to Emergency Department (ED) triage with and without the addition of natural language processing elements. Using data from the National Hospital Ambulatory Medical Care Survey (NHAMCS), a cross-sectional probability sample of United States EDs from the 2012 and 2013 survey years, we developed several predictive models with the outcome being admission to the hospital or transfer vs. discharge home. We included patient characteristics immediately available after the patient has presented to the ED and undergone a triage process. We used this information to construct logistic regression (LR) and multilayer neural network (MLNN) models which included natural language processing (NLP) and principal component analysis from the patient's reason for visit. Ten-fold cross validation was used to test the predictive capacity of each model, and the area under the receiver operating characteristic curve (AUC) was then calculated for each model. Of the 47,200 ED visits from 642 hospitals, 6,335 (13.42%) resulted in hospital admission (or transfer). A total of 48 principal components were extracted by NLP from the reason for visit fields, which explained 75% of the overall variance for hospitalization. In the model including only structured variables, the AUC was 0.824 (95% CI 0.818-0.830) for logistic regression and 0.823 (95% CI 0.817-0.829) for MLNN. Models including only free-text information generated an AUC of 0.742 (95% CI 0.731-0.753) for logistic regression and 0.753 (95% CI 0.742-0.764) for MLNN. When both structured variables and free text variables were included, the AUC reached 0.846 (95% CI 0.839-0.853) for logistic regression and 0.844 (95% CI 0.836-0.852) for MLNN. The predictive accuracy of hospital admission or transfer for patients who presented to ED triage overall was good, and was improved with the inclusion of free text data from a patient
When using assistive systems, the consideration of individual and cultural meaning is crucial for the utility and acceptance of technology. Orientation, communication and interaction are rooted in perception and therefore always happen in material space. We understand that a major problem lies in the difference between human and technical perception of space. Cultural policies are based on meanings including their spatial situation and their rich relationships. Therefore, we have developed an approach where the different perception systems share a hybrid spatial model that is generated by artificial intelligence, a joint effort by humans and assistive systems. The aim of our project is to create a spatial model of cultural meaning based on interaction between humans and robots. We define the role of humanoid robots as becoming our companions. This calls for technical systems to include still inconceivable human and cultural agendas for the perception of space. In two experiments, we tested a first prototype of the communication module that allows a humanoid to learn cultural meanings through a machine learning system. Interaction is achieved by non-verbal and natural-language communication between humanoids and test persons. This helps us to better understand how a spatial model of cultural meaning can be developed.
Aiemsum-ang, Napapan; Yuenyong, Chokchai
This paper aimed to investigate the existing ideas of nature of science (NOS) teaching in a Thai biology classroom. The study reported the existing ideas of NOS teaching of one biology teacher, Mrs. Mali, who had been teaching for 6 years at a school in Khon Kaen city. The methodology followed the interpretive paradigm. Tools of interpretation included 2 months of classroom observation, interviews, and a NOS questionnaire. The findings revealed that Mali held a good understanding of the nature of science with respect to the use of evidence, knowledge inquiry through different observation and deduction, creativity and imagination influencing science knowledge inquiry, and the changeable nature of scientific knowledge. Her biology teaching indicated that she used both the deficient nature of science approach and the implicit nature of science approach. The implicit nature of science approach was applied most, in 7 periods, and only 2 periods were arranged using the deficient nature of science approach. The paper has implications for professional development and pre-service programs on NOS teaching in Thailand.
The word "radioactivity" has something scary about it; it makes us think of something intangible, creeping dangers, the mysterious ticking of Geiger counters, reactor disasters, dirty bombs, nuclear contamination and destruction. True: whole landscapes were made uninhabitable by accidents involving radioactive material such as Windscale, Sellafield and Chernobyl and others that were kept largely secret from the public. While to some they brought premature death, for the great majority of the world population their effects have so far been insignificant. By contrast, how little known is the fact that natural radioactivity has been around since human beginnings and that the cells of the human body have always been equipped to repair damage from radioactive radiation or other causes, provided such damage does not occur too frequently. Elmar Traebert presents the physics underlying radioactivity without resorting to formulas and explains in an easily understandable manner the different types of radiation, their measurement and sources (in medicine, power plants, and weapons technology) and how they should be handled. He describes nuclear power plants and the safety problems they involve, sunburn, radiation therapy, uranium ammunition and uranium mining. Whoever knows about these things can more easily cope with his own fears and maybe allay some of them. He can also see through statements made by different interest groups with regard to radioactive material and duly form his own opinion.
Lassiter, Daniel; Goodman, Noah D
The "new paradigm" unifying deductive and inductive reasoning in a Bayesian framework (Oaksford & Chater, 2007; Over, 2009) has been claimed to be falsified by results which show sharp differences between reasoning about necessity vs. plausibility (Heit & Rotello, 2010; Rips, 2001; Rotello & Heit, 2009). We provide a probabilistic model of reasoning with modal expressions such as "necessary" and "plausible" informed by recent work in formal semantics of natural language, and show that it predicts the possibility of non-linear response patterns which have been claimed to be problematic. Our model also makes a strong monotonicity prediction, while two-dimensional theories predict the possibility of reversals in argument strength depending on the modal word chosen. Predictions were tested using a novel experimental paradigm that replicates the previously-reported response patterns with a minimal manipulation, changing only one word of the stimulus between conditions. We found a spectrum of reasoning "modes" corresponding to different modal words, and strong support for our model's monotonicity prediction. This indicates that probabilistic approaches to reasoning can account in a clear and parsimonious way for data previously argued to falsify them, as well as new, more fine-grained, data. It also illustrates the importance of careful attention to the semantics of language employed in reasoning experiments. Copyright © 2014 Elsevier B.V. All rights reserved.
Soysal, Ergin; Wang, Jingqi; Jiang, Min; Wu, Yonghui; Pakhomov, Serguei; Liu, Hongfang; Xu, Hua
Existing general clinical natural language processing (NLP) systems such as MetaMap and Clinical Text Analysis and Knowledge Extraction System have been successfully applied to information extraction from clinical text. However, end users often have to customize existing systems for their individual tasks, which can require substantial NLP skills. Here we present CLAMP (Clinical Language Annotation, Modeling, and Processing), a newly developed clinical NLP toolkit that provides not only state-of-the-art NLP components, but also a user-friendly graphic user interface that can help users quickly build customized NLP pipelines for their individual applications. Our evaluation shows that the CLAMP default pipeline achieved good performance on named entity recognition and concept encoding. We also demonstrate the efficiency of the CLAMP graphic user interface in building customized, high-performance NLP pipelines with 2 use cases, extracting smoking status and lab test values. CLAMP is publicly available for research use, and we believe it is a unique asset for the clinical NLP community. © The Author 2017. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: firstname.lastname@example.org.
Levitt, Ash; Schlauch, Robert C; Bartholow, Bruce D; Sher, Kenneth J
Examining the natural language college students use to describe various levels of intoxication can provide important insight into subjective perceptions of college alcohol use. Previous research (Levitt et al., Alcohol Clin Exp Res 2009; 33: 448) has shown that intoxication terms reflect moderate and heavy levels of intoxication and that self-use of these terms differs by gender among college students. However, it is still unknown whether these terms similarly apply to other individuals and, if so, whether similar gender differences exist. To address these issues, the current study examined the application of intoxication terms to characters in experimentally manipulated vignettes of naturalistic drinking situations within a sample of university undergraduates (n = 145). Findings supported and extended previous research by showing that other-directed applications of intoxication terms are similar to self-directed applications and depend on the gender of both the target and the user. Specifically, moderate intoxication terms were applied to and from women more than men, even when the character was heavily intoxicated, whereas heavy intoxication terms were applied to and from men more than women. The findings suggest that gender differences in the application of intoxication terms are other-directed as well as self-directed and that intoxication language can inform gender-specific prevention and intervention efforts targeting problematic alcohol use among college students. Copyright © 2013 by the Research Society on Alcoholism.
Payne, Philip R O; Kwok, Alan; Dhaval, Rakesh; Borlawsky, Tara B
The conduct of large-scale translational studies presents significant challenges related to the storage, management and analysis of integrative data sets. Ideally, the application of methodologies such as conceptual knowledge discovery in databases (CKDD) provides a means for moving beyond intuitive hypothesis discovery and testing in such data sets, and towards the high-throughput generation and evaluation of knowledge-anchored relationships between complex bio-molecular and phenotypic variables. However, the induction of such high-throughput hypotheses is non-trivial, and requires correspondingly high-throughput validation methodologies. In this manuscript, we describe an evaluation of the efficacy of a natural language processing-based approach to validating such hypotheses. As part of this evaluation, we will examine a phenomenon that we have labeled as "Conceptual Dissonance" in which conceptual knowledge derived from two or more sources of comparable scope and granularity cannot be readily integrated or compared using conventional methods and automated tools.
Juan Andres Laura
In recent studies Recurrent Neural Networks were used for generative processes and their surprising performance can be explained by their ability to create good predictions. In addition, Data Compression is also based on prediction. What the problem comes down to is whether a data compressor could be used to perform as well as recurrent neural networks in the natural language processing tasks of sentiment analysis and automatic text generation. If this is possible, then the problem comes down to determining if a compression algorithm is even more intelligent than a neural network in such tasks. In our journey, a fundamental difference between a Data Compression Algorithm and Recurrent Neural Networks has been discovered.
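The abstract above does not say how a compressor is applied to sentiment analysis. One generic way to illustrate the idea, offered only as a sketch and not as the author's method, is to assign a text to the class whose reference corpus it compresses best with: a compressor that has already "seen" similar text needs fewer extra bytes to encode it. The function names, the toy corpora and the choice of `zlib` are all assumptions made for this illustration.

```python
import zlib


def compressed_size(text: str) -> int:
    """Number of bytes zlib needs for the UTF-8 encoding of text."""
    return len(zlib.compress(text.encode("utf-8"), 9))


def classify(text: str, corpora: dict) -> str:
    """Return the label whose corpus makes `text` cheapest to encode.

    For each label, measure how many extra compressed bytes are needed
    when `text` is appended to that label's reference corpus, and pick
    the label with the smallest increase.
    """
    def extra_bytes(corpus: str) -> int:
        return compressed_size(corpus + " " + text) - compressed_size(corpus)

    return min(corpora, key=lambda label: extra_bytes(corpora[label]))


# Toy reference corpora (hypothetical data, for illustration only).
corpora = {
    "positive": "great wonderful excellent lovely superb " * 20,
    "negative": "awful terrible horrible dreadful boring " * 20,
}

label = classify("great wonderful excellent lovely superb", corpora)
print(label)
```

Real corpora would be far larger and noisier than this toy data, and a general-purpose compressor such as `zlib` has a limited look-back window, so this only shows the principle that prediction and compression can be used interchangeably for classification.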
In this paper, I investigate the problem of finding the most similar music tracks using techniques popular in Natural Language Processing, such as TF-IDF and LDA. I defined a document as a music track. Each music track is transformed into a spectrogram; thanks to that, I can use well-known techniques to get words from images. I used the SURF operation to detect characteristic points and a novel approach for their description. Standard k-means was used for clustering. Clustering is here identical to dictionary building, so after that I can transform spectrograms into text documents and perform TF-IDF and LDA. Finally, I can make a query in the obtained vector space. The research was done on 16 music tracks for training and 336 for testing, split into four categories: Hiphop, Jazz, Metal and Pop. Although the technique used is completely unsupervised, the results are satisfactory and encouraging for further research.
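The last steps of the pipeline described above (documents made of cluster IDs, TF-IDF weighting, a similarity query) can be sketched as follows. This is a minimal illustration under stated assumptions: the SURF and k-means stages are skipped, each track is assumed to be already reduced to a list of "visual word" IDs, the track names and word IDs are invented, cosine similarity is assumed as the query metric, and LDA is not shown.

```python
import math
from collections import Counter


def tfidf_vectors(docs: dict) -> dict:
    """Map each document name to a sparse {word: tf-idf weight} vector."""
    n_docs = len(docs)
    doc_freq = Counter()
    for words in docs.values():
        doc_freq.update(set(words))  # count each word once per document
    vectors = {}
    for name, words in docs.items():
        counts = Counter(words)
        vectors[name] = {
            w: (c / len(words)) * math.log(n_docs / doc_freq[w])
            for w, c in counts.items()
        }
    return vectors


def cosine(a: dict, b: dict) -> float:
    """Cosine similarity between two sparse vectors."""
    dot = sum(weight * b.get(word, 0.0) for word, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def most_similar(query: str, vectors: dict) -> str:
    """Name of the track closest to `query`, excluding the query itself."""
    others = (name for name in vectors if name != query)
    return max(others, key=lambda name: cosine(vectors[query], vectors[name]))


# Hypothetical tracks, each already converted to visual-word IDs.
tracks = {
    "jazz_a": ["w1", "w1", "w2", "w3"],
    "jazz_b": ["w1", "w2", "w2", "w4"],
    "metal_a": ["w7", "w8", "w8", "w9"],
}
vectors = tfidf_vectors(tracks)
print(most_similar("jazz_a", vectors))
```

In this toy data the two jazz tracks share visual words while the metal track shares none, so the query for `jazz_a` lands on `jazz_b`; with real spectrogram features the vocabulary would be the k-means cluster centres.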
In recent decades, Natural Language Processing (NLP) has achieved a high level of success. Interactions between NLP and Serious Games have started, and some Serious Games already include NLP techniques. The objectives of this paper are twofold: on the one hand, providing a simple framework to enable analysis of potential uses of NLP in Serious Games and, on the other hand, applying the NLP framework to existing Serious Games and giving an overview of the use of NLP in pedagogical Serious Games. In this paper we present 11 serious games exploiting NLP techniques. We present them systematically, according to the following structure: first, we highlight possible uses of NLP techniques in Serious Games; second, we describe the type of NLP implemented in each specific Serious Game; and, third, we provide a link to possible purposes of use for the different actors interacting in the Serious Game.
Bosco, Cristina; Delmonte, Rodolfo; Moschitti, Alessandro; Simi, Maria
The papers collected in this volume are selected as a sample of the progress in Natural Language Processing (NLP) performed within the Italian NLP community and especially attested by the PARLI project. PARLI (Portale per l’Accesso alle Risorse in Lingua Italiana) is a project partially funded by the Ministero Italiano per l’Università e la Ricerca (PRIN 2008) from 2008 to 2012 for monitoring and fostering the harmonic growth and coordination of the activities of Italian NLP. It was proposed by various teams of researchers working in Italian universities and research institutions. According to the spirit of the PARLI project, most of the resources and tools created within the project and here described are freely distributed and they did not terminate their life at the end of the project itself, hoping they could be a key factor in future development of computational linguistics.
Pai, Vinay M; Rodgers, Mary; Conroy, Richard; Luo, James; Zhou, Ruixia; Seto, Belinda
In April 2012, the National Institutes of Health organized a two-day workshop entitled 'Natural Language Processing: State of the Art, Future Directions and Applications for Enhancing Clinical Decision-Making' (NLP-CDS). This report is a summary of the discussions during the second day of the workshop. Collectively, the workshop presenters and participants emphasized the need for unstructured clinical notes to be included in the decision making workflow and the need for individualized longitudinal data tracking. The workshop also discussed the need to: (1) combine evidence-based literature and patient records with machine-learning and prediction models; (2) provide trusted and reproducible clinical advice; (3) prioritize evidence and test results; and (4) engage healthcare professionals, caregivers, and patients. The overall consensus of the NLP-CDS workshop was that there are promising opportunities for NLP and CDS to deliver cognitive support for healthcare professionals, caregivers, and patients.
Redman, Joseph S; Natarajan, Yamini; Hou, Jason K; Wang, Jingqi; Hanif, Muzammil; Feng, Hua; Kramer, Jennifer R; Desiderio, Roxanne; Xu, Hua; El-Serag, Hashem B; Kanwal, Fasiha
Natural language processing is a powerful technique of machine learning capable of maximizing data extraction from complex electronic medical records. We utilized this technique to develop algorithms capable of "reading" full-text radiology reports to accurately identify the presence of fatty liver disease. Abdominal ultrasound, computerized tomography, and magnetic resonance imaging reports were retrieved from the Veterans Affairs Corporate Data Warehouse from a random national sample of 652 patients. Radiographic fatty liver disease was determined by manual review by two physicians and verified with an expert radiologist. A split validation method was utilized for algorithm development. For all three imaging modalities, the algorithms could identify fatty liver disease with >90% recall and precision, with F-measures >90%. These algorithms could be used to rapidly screen patient records to establish a large cohort to facilitate epidemiological and clinical studies and examine the clinical course and outcomes of patients with radiographic hepatic steatosis.
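For readers unfamiliar with the metrics reported above, the following sketch shows how precision, recall, and the balanced F-measure (F1) are computed from classification counts. The counts in the usage example are hypothetical and are not taken from the study; the function name is likewise an assumption.

```python
def precision_recall_f1(tp: int, fp: int, fn: int):
    """Compute precision, recall and F1 from raw counts.

    tp: reports correctly flagged as positive
    fp: reports flagged as positive that are actually negative
    fn: positive reports the algorithm missed
    """
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    # F1 is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


if __name__ == "__main__":
    # Hypothetical counts chosen only to illustrate the arithmetic.
    p, r, f = precision_recall_f1(tp=90, fp=10, fn=10)
    print(f"precision={p:.2f} recall={r:.2f} F1={f:.2f}")
```

Because F1 is a harmonic mean, it exceeds 90% only when precision and recall are both high, which is why it is reported alongside them.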
Li, Muqun; Carrell, David; Aberdeen, John; Hirschman, Lynette; Kirby, Jacqueline; Li, Bo; Vorobeychik, Yevgeniy; Malin, Bradley A
Electronic medical records (EMRs) are increasingly repurposed for activities beyond clinical care, such as to support translational research and public policy analysis. To mitigate privacy risks, healthcare organizations (HCOs) aim to remove potentially identifying patient information. A substantial quantity of EMR data is in natural language form and there are concerns that automated tools for detecting identifiers are imperfect and leak information that can be exploited by ill-intentioned data recipients. Thus, HCOs have been encouraged to invest as much effort as possible to find and detect potential identifiers, but such a strategy assumes the recipients are sufficiently incentivized and capable of exploiting leaked identifiers. In practice, such an assumption may not hold true and HCOs may overinvest in de-identification technology. The goal of this study is to design a natural language de-identification framework, rooted in game theory, which enables an HCO to optimize their investments given the expected capabilities of an adversarial recipient. We introduce a Stackelberg game to balance risk and utility in natural language de-identification. This game represents a cost-benefit model that enables an HCO with a fixed budget to minimize their investment in the de-identification process. We evaluate this model by assessing the overall payoff to the HCO and the adversary using 2100 clinical notes from Vanderbilt University Medical Center. We simulate several policy alternatives using a range of parameters, including the cost of training a de-identification model and the loss in data utility due to the removal of terms that are not identifiers. In addition, we compare policy options where, when an attacker is fined for misuse, a monetary penalty is paid to the publishing HCO as opposed to a third party (e.g., a federal regulator). Our results show that when an HCO is forced to exhaust a limited budget (set to $2000 in the study), the precision and recall of the
Goldstein, Ayelet; Shahar, Yuval
Physicians are required to interpret, abstract and present in free-text large amounts of clinical data in their daily tasks. This is especially true for chronic-disease domains, but holds also in other clinical domains. We have recently developed a prototype system, CliniText, which, given a time-oriented clinical database, and appropriate formal abstraction and summarization knowledge, combines the computational mechanisms of knowledge-based temporal data abstraction, textual summarization, abduction, and natural-language generation techniques, to generate an intelligent textual summary of longitudinal clinical data. We demonstrate our methodology, and the feasibility of providing a free-text summary of longitudinal electronic patient records, by generating summaries in two very different domains - Diabetes Management and Cardiothoracic surgery. In particular, we explain the process of generating a discharge summary of a patient who had undergone a Coronary Artery Bypass Graft operation, and a brief summary of the treatment of a diabetes patient for five years.
El Saadawi, Gilan M.; Tseytlin, Eugene; Legowski, Elizabeth; Jukic, Drazen; Castine, Melissa; Fine, Jeffrey; Gormley, Robert; Crowley, Rebecca S.
Introduction: We developed and evaluated a Natural Language Interface (NLI) for an Intelligent Tutoring System (ITS) in Diagnostic Pathology. The system teaches residents to examine pathologic slides and write accurate pathology reports while providing immediate feedback on errors they make in their slide review and diagnostic reports. Residents can ask for help at any point in the case, and will receive context-specific feedback. Research Questions: We evaluated (1) the performance of our natural language system, (2) the effect of the system on learning, (3) the effect of feedback timing on learning gains, and (4) the effect of ReportTutor on performance to self-assessment correlations. Methods: The study uses a crossover 2×2 factorial design. We recruited 20 subjects from 4 academic programs. Subjects were randomly assigned to one of the four conditions - two conditions for the immediate interface, and two for the delayed interface. An expert dermatopathologist created a reference standard and 2 board certified AP/CP pathology fellows manually coded the residents' assessment reports. Subjects were given the opportunity to self-grade their performance and we used a survey to determine student response to both interfaces. Results: Our results show a highly significant improvement in report writing after one tutoring session, with a 4-fold increase in the learning gains with both interfaces but no effect of feedback timing on performance gains. Residents who used the immediate feedback interface first experienced a feature learning gain that is correlated with the number of cases they viewed. There was no correlation between performance and self-assessment in either condition. PMID:17934789
The complex environment of the typical research laboratory requires flexible process control. This program provides natural language process control from an IBM PC or compatible machine. Sometimes process control schedules require changes frequently, even several times per day. These changes may include adding, deleting, and rearranging steps in a process. This program sets up a process control system that can either run without an operator, or be run by workers with limited programming skills. The software system includes three programs. Two of the programs, written in FORTRAN77, record data and control research processes. The third program, written in Pascal, generates the FORTRAN subroutines used by the other two programs to identify the user commands with the user-written device drivers. The software system also includes an input data set which allows the user to define the user commands which are to be executed by the computer. To set the system up the operator writes device driver routines for all of the controlled devices. Once set up, this system requires only an input file containing natural language command lines which tell the system what to do and when to do it. The operator can make up custom commands for operating and taking data from external research equipment at any time of the day or night without the operator in attendance. This process control system requires a personal computer operating under MS-DOS with suitable hardware interfaces to all controlled devices. The program requires a FORTRAN77 compiler and user-written device drivers. This program was developed in 1989 and has a memory requirement of about 62 Kbytes.
Baneyx, Audrey; Charlet, Jean; Jaulent, Marie-Christine
Pathologies and acts are classified in thesauri to help physicians to code their activity. In practice, the use of thesauri is not sufficient to reduce variability in coding and thesauri are not suitable for computer processing. We think the automation of the coding task requires a conceptual modeling of medical items: an ontology. Our task is to help lung specialists code acts and diagnoses with software that represents medical knowledge of this concerned specialty by an ontology. The objective of the reported work was to build an ontology of pulmonary diseases dedicated to the coding process. To carry out this objective, we develop a precise methodological process for the knowledge engineer in order to build various types of medical ontologies. This process is based on the need to express precisely in natural language the meaning of each concept using differential semantics principles. A differential ontology is a hierarchy of concepts and relationships organized according to their similarities and differences. Our main research hypothesis is to apply natural language processing tools to corpora to develop the resources needed to build the ontology. We consider two corpora, one composed of patient discharge summaries and the other being a teaching book. We propose to combine two approaches to enrich the ontology building: (i) a method which consists of building terminological resources through distributional analysis and (ii) a method based on the observation of corpus sequences in order to reveal semantic relationships. Our ontology currently includes 1550 concepts and the software implementing the coding process is still under development. Results show that the proposed approach is operational and indicates that the combination of these methods and the comparison of the resulting terminological structures give interesting clues to a knowledge engineer for the building of an ontology.
Charity Hudley, Anne H.; Mallinson, Christine
In today's culturally diverse classrooms, students possess and use many culturally, ethnically, and regionally diverse English language varieties that may differ from standardized English. This book helps classroom teachers become attuned to these differences and offers practical strategies to support student achievement while fostering positive…
Prizant, Barry M.
The paper examines theoretical issues regarding the symptomatology of echolalia in the language of visually impaired children. Literature on echolalia is reviewed from a variety of perspectives and clinical work and research with visual impairment and with autism is discussed. Problems of definition are cited, and explanations for occurrence of…
DeKeyser, Robert M.
The effect of age of acquisition on ultimate attainment in second language learning has been a controversial topic for years. After providing a very brief overview of the ideas that are at the core of the controversy, I discuss the two main reasons why these issues are so controversial: conceptual misunderstandings and methodological difficulties.…
Mainela-Arnold, Elina; Evans, Julia L.; Alibali, Martha W.
Purpose: The authors investigated mental representations of Piagetian conservation tasks in children with specific language impairment (SLI) and typically developing peers. Children with SLI have normal nonverbal intelligence; however, they exhibit difficulties in Piagetian conservation tasks. The authors tested the hypothesis that conservation…
The prevalence of academic procrastination has long been the subject of attention among researchers. However, there is still a paucity of studies examining language learners since most of the studies focus on similar participants such as psychology students. The present study was conducted among students trying to learn English in the first year…
Espin, Christine A; Cevasco, Jazmin; van den Broek, Paul; Baker, Scott; Gersten, Russell
In this study, we examine the nature and quality of students' comprehension of history. Specifically, we explore whether cognitive-psychological theories developed to capture the comprehension of narrative text can be used to capture the comprehension of history. Participants were 36 students with learning disabilities who had taken part in an earlier study designed to investigate the effects of an interactive instructional intervention in history. The results of the original study supported the effectiveness of the intervention in terms of amount recalled. The results of the present study reveal that historical understanding can be characterized as the construction of meaning through the creation of a causal network of events. The study of history within a causal network framework has implications for understanding the nature and quality of students' learning of history, and for potentially identifying sources of failure in learning.
Nature of science (NOS) is considered to be a controversial topic by historians, philosophers of science and science educators. It is paradoxical that we all teach science and still have difficulties in understanding what science is and how it develops and progresses. A major obstacle in understanding NOS is that science is primarily ‘unnatural’, that is, it cannot be learned by a simple observation of phenomena. In most parts of the world history and philosophy of science are ‘inside’ science content and as such can guide our understanding of NOS. However, some science educators consider the ‘historical turn’ as dated and hence neglect the historical approach and instead emphasize the model based naturalist view of science. The objective of this presentation is to show that the historical approach is very much a part of teaching science and actually complements naturalism. Understanding NOS generally requires two aspects of science: domain general and domain specific. In the classroom this can be illustrated by discussing the atomic models developed in the early 20th century, which constitute the domain specific aspect of NOS. This can then lead to an understanding of the tentative nature of science, which is a domain general aspect of NOS. A review of the literature in science education reveals three views (among others) of understanding NOS: (a) Consensus view: it attempts to include only those domain-general NOS aspects that are the least controversial (Lederman, Abd-El-Khalick); (b) Family resemblance view: based on the ideas of Wittgenstein, this view promotes science as a cognitive system (Irzik, Nola); (c) Integrated view: this view postulates that both domain general and domain specific aspects of NOS are not dichotomous but rather need to be integrated and are essential if we want students to understand ‘science in the making’ (Niaz). The following framework helps to facilitate integration: (i) Elaboration of a theoretical framework
Background: The Institute of Medicine has identified patient safety as a key goal for health care in the United States. Detecting vaccine adverse events is an important public health activity that contributes to patient safety. Reports about adverse events following immunization (AEFI) from surveillance systems contain free-text components that can be analyzed using natural language processing. To extract Unified Medical Language System (UMLS) concepts from free text and classify AEFI reports based on concepts they contain, we first needed to clean the text by expanding abbreviations and shortcuts and correcting spelling errors. Our objective in this paper was to create a UMLS-based spelling error correction tool as a first step in the natural language processing (NLP) pipeline for AEFI reports. Methods: We developed spell checking algorithms using open source tools. We used de-identified AEFI surveillance reports to create free-text data sets for analysis. After expansion of abbreviated clinical terms and shortcuts, we performed spelling correction in four steps: (1) error detection, (2) word list generation, (3) word list disambiguation and (4) error correction. We then measured the performance of the resulting spell checker by comparing it to manual correction. Results: We used 12,056 words to train the spell checker and tested its performance on 8,131 words. During testing, sensitivity, specificity, and positive predictive value (PPV) for the spell checker were 74% (95% CI: 74–75), 100% (95% CI: 100–100), and 47% (95% CI: 46%–48%), respectively. Conclusion: We created a prototype spell checker that can be used to process AEFI reports. We used the UMLS Specialist Lexicon as the primary source of dictionary terms and the WordNet lexicon as a secondary source. We used the UMLS as a domain-specific source of dictionary terms to compare potentially misspelled words in the corpus. The prototype sensitivity was comparable to currently available
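The four correction steps listed in the Methods can be sketched as below. This is a simplified illustration, not the authors' tool: the UMLS Specialist Lexicon and WordNet are replaced by a tiny hypothetical word-frequency table (`LEXICON`), candidate generation uses string similarity from Python's standard `difflib` module, and disambiguation simply prefers the most frequent candidate. The similarity cutoff of 0.75 is also an assumption.

```python
import difflib

# Hypothetical mini-lexicon with made-up frequencies; a stand-in for a
# real domain dictionary.
LEXICON = {
    "fever": 120, "rash": 80, "swelling": 60,
    "injection": 90, "site": 150, "at": 500, "and": 900,
}


def correct(tokens, lexicon=LEXICON, cutoff=0.75):
    """Return tokens with likely misspellings replaced by lexicon words."""
    corrected = []
    for token in tokens:
        word = token.lower()
        # Step 1: error detection. Known words are left alone.
        if word in lexicon:
            corrected.append(token)
            continue
        # Step 2: word list generation. Collect similar lexicon entries.
        candidates = difflib.get_close_matches(
            word, list(lexicon), n=5, cutoff=cutoff
        )
        if not candidates:
            corrected.append(token)  # nothing plausible: keep original
            continue
        # Step 3: disambiguation. Prefer the most frequent candidate.
        best = max(candidates, key=lambda c: lexicon[c])
        # Step 4: error correction. Substitute the chosen word.
        corrected.append(best)
    return corrected


if __name__ == "__main__":
    print(correct(["fevr", "and", "sweling", "at", "injecton", "site"]))
```

Tokens with no sufficiently similar lexicon entry are left unchanged, which trades sensitivity for fewer false corrections.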
It is known that radiation is detected at random and the radiation counts fluctuate statistically. In the present study, a radiation measurement experiment was performed to understand the randomness and statistical fluctuation of radiation counts. In the measurement, three natural radiation sources were used. The sources were fabricated from potassium chloride chemicals, chemical fertilizers and kelps. These materials contain naturally occurring potassium-40, which is a radionuclide. Nine teachers from high schools, junior high schools and elementary schools participated in the radiation measurement experiment. Each participant measured the 1-min integration counts of radiation five times using GM survey meters, and 45 sets of data were obtained for the respective natural radiation sources. It was found that the frequency of occurrence of radiation counts was distributed according to a Gaussian distribution curve, although the obtained 45 data sets of radiation counts superficially appeared to fluctuate meaninglessly. (author)
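The statistical behaviour described above can be reproduced with a small simulation. The sketch below assumes that detections arrive as a Poisson process, so the waiting times between detections are exponentially distributed; the count rate of 2 counts per second is hypothetical and not taken from the experiment. For such a process the variance of the counts is close to their mean, and for large means the distribution of counts is approximately Gaussian.

```python
import random
import statistics


def count_in_interval(rate_per_s, seconds, rng):
    """Count random detections in one measurement interval.

    Waiting times between detections are drawn from an exponential
    distribution, which is what a Poisson process implies.
    """
    elapsed, counts = 0.0, 0
    while True:
        elapsed += rng.expovariate(rate_per_s)
        if elapsed > seconds:
            return counts
        counts += 1


def simulate(n_measurements, rate_per_s=2.0, seconds=60.0, seed=0):
    """Simulate repeated fixed-time counting measurements."""
    rng = random.Random(seed)
    return [
        count_in_interval(rate_per_s, seconds, rng)
        for _ in range(n_measurements)
    ]


if __name__ == "__main__":
    # 45 one-minute measurements, mirroring the size of the data set.
    counts = simulate(45)
    mean = statistics.mean(counts)
    print("mean:", mean)
    print("standard deviation:", statistics.stdev(counts))
    print("sqrt(mean):", mean ** 0.5)
```

With only 45 measurements the sample standard deviation will scatter around the square root of the mean; the agreement tightens as the number of measurements grows.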
Gong, Tao; Shuai, Lan
Memory is essential to many cognitive tasks including language. Apart from empirical studies of memory effects on language acquisition and use, there is a lack of evolutionary explorations of whether a high level of memory capacity is a prerequisite for language and whether language origin could influence memory capacity. In line with evolutionary theories that natural selection refined language-related cognitive abilities, we advocated a coevolution scenario between language and memory capacity, which incorporated the genetic transmission of individual memory capacity, cultural transmission of idiolects, and natural and cultural selections on individual reproduction and language teaching. To illustrate the coevolution dynamics, we adopted a multi-agent computational model simulating the emergence of lexical items and simple syntax through iterated communications. Simulations showed that, along with the origin of a communal language, an initially low memory capacity for acquired linguistic knowledge was boosted; that this coherent increase in linguistic understandability and memory capacities reflected a language-memory coevolution; and that the coevolution stopped once memory capacities became sufficient for language communications. Statistical analyses revealed that the coevolution was realized mainly by natural selection based on individual communicative success in cultural transmissions. This work elaborated the biology-culture parallelism of language evolution, demonstrated the driving force of culturally-constituted factors for natural selection of individual cognitive abilities, and suggested that the degree difference in language-related cognitive abilities between humans and nonhuman animals could result from a coevolution with language. PMID:26544876
copular sentences in Arabic and Russian, and structures similar to predicate can be found in Cantonese (our...thanks to K. Fu for the Cantonese data). This being the case, it is not surprising that analogous... The effect of an elided subject on subsequent focusing is the same as that of an overt... Location was construed very abstractly, and included, e.g., measures of whether the antecedent and... adapted from those used in studying social inter-
Memo No. 43, Paoli Research Center, System Development Corporation, 1986. L. Hirschman and K. Puder, Restriction Grammar in Prolog. In Pr... of as...causes and results of SAC failures. 3. METHODOLOGY The essential feature of our parser which facilitates the collecting of syntactic patterns is the
Ragonis, Noa; Shilo, Gila
The paper presents a theoretical investigational study of the potential advantages that secondary school learners may gain from learning two different subjects, namely, logic programming within computer science studies and argumentation texts within linguistics studies. The study suggests drawing an analogy between the two subjects since they both…
McColl, Derek; Jiang, Chuan; Nejat, Goldie
For social robots to be successfully integrated and accepted within society, they need to be able to interpret human social cues that are displayed through natural modes of communication. In particular, a key challenge in the design of social robots is developing the robot's ability to recognize a person's affective states (emotions, moods, and attitudes) in order to respond appropriately during social human-robot interactions (HRIs). In this paper, we present and discuss social HRI experiments we have conducted to investigate the development of an accessibility-aware social robot able to autonomously determine a person's degree of accessibility (rapport, openness) toward the robot based on the person's natural static body language. In particular, we present two one-on-one HRI experiments to: 1) determine the performance of our automated system in being able to recognize and classify a person's accessibility levels and 2) investigate how people interact with an accessibility-aware robot which determines its own behaviors based on a person's speech and accessibility levels.
Musolino, Julien; Landau, Barbara
In this article, we discuss two experiments of nature and their implications for the sciences of the mind. The first, Williams syndrome, bears on one of cognitive science's holy grails: the possibility of unravelling the causal chain between genes and cognition. We sketch the outline of a general framework to study the relationship between genes and cognition, focusing as our case study on the development of language in individuals with Williams syndrome. Our approach emphasizes the role of three key ingredients: the need to specify a clear level of analysis, the need to provide a theoretical account of the relevant cognitive structure at that level, and the importance of the (typical) developmental process itself. The promise offered by the case of Williams syndrome has also given rise to two strongly conflicting theoretical approaches-modularity and neuroconstructivism-themselves offshoots of a perennial debate between nativism and empiricism. We apply our framework to explore the tension created by these two conflicting perspectives. To this end, we discuss a second experiment of nature, which allows us to compare the two competing perspectives in what comes close to a controlled experimental setting. From this comparison, we conclude that the "meaningful debate assumption", a widespread assumption suggesting that neuroconstructivism and modularity address the same questions and represent genuine theoretical alternatives, rests on a fallacy.
Conceptual knowledge accessed by language may involve the re-activation of the associated primary sensory-motor processes. Whether these embodied representations are indeed constitutive to conceptual knowledge is hotly debated, particularly since direct evidence that sensory-motor expertise can improve conceptual processing is scarce. In this study, we sought this crucial piece of evidence by training naive healthy subjects to perform complex manual actions and by measuring, before and after training, their performance in a semantic language task. 19 participants engaged in 3 weeks of motor training. Each participant was trained in 3 complex manual actions (e.g. origami). Before and after the training period, each subject underwent a series of manual dexterity tests and a semantic language task. The latter consisted of a sentence-picture semantic congruency judgment task, with 6 target congruent sentence-picture pairs (semantically related to the trained manual actions), 6 non-target congruent pairs (semantically unrelated), and 12 filler incongruent pairs. Manual action training induced a significant improvement in all manual dexterity tests, demonstrating the successful acquisition of sensory-motor expertise. In the semantic language task, the reaction times to both target and non-target congruent sentence-image pairs decreased after action training, indicating a more efficient conceptual-semantic processing. Notably, the reaction times for target pairs decreased more than those for non-target pairs, as indicated by the 2x2 interaction. These results were confirmed when controlling for the potential bias of increased frequency of use of target lexical items during manual training. The results of the present study suggest that sensory-motor expertise gained by training of specific manual actions can lead to an improvement of cognitive-linguistic skills related to the specific conceptual-semantic domain associated with the trained actions.
Mayberry, Marshall R.; Crocker, Matthew W.
The Adaptive Mechanisms in Human Language Processing (ALPHA) project features both experimental and computational tracks designed to complement each other in the investigation of the cognitive mechanisms that underlie situated human utterance processing. The models developed in the computational track replicate results obtained in the experimental track and, in turn, suggest further experiments by virtue of behavior that arises as a by-product of their operation.
Beyer, Sebastian E; McKee, Brady J; Regis, Shawn M; McKee, Andrea B; Flacke, Sebastian; El Saadawi, Gilan; Wald, Christoph
Our aim was to train a natural language processing (NLP) algorithm to capture imaging characteristics of lung nodules reported in a structured CT report and suggest the applicable Lung-RADS™ (LR) category. Our study included structured, clinical reports of consecutive CT lung screening (CTLS) exams performed from 08/2014 to 08/2015 at an ACR accredited Lung Cancer Screening Center. All patients screened were at high risk for lung cancer according to the NCCN Guidelines®. All exams were interpreted by one of three radiologists credentialed to read CTLS exams using LR and a standard reporting template. Training and test sets consisted of consecutive exams. Lung screening exams were divided into two groups: three training sets (500, 120, and 383 reports each) and one final evaluation set (498 reports). NLP algorithm results were compared with the gold standard of LR category assigned by the radiologist. The sensitivity/specificity of the NLP algorithm to correctly assign LR categories for suspicious nodules (LR 4) and positive nodules (LR 3/4) were 74.1%/98.6% and 75.0%/98.8% respectively. The majority of mismatches occurred in cases where pulmonary findings were present that are not currently addressed by LR. Misclassifications also resulted from the failure to identify exams as follow-up and the failure to completely characterize part-solid nodules. In a sub-group analysis among structured reports with standardized language, the sensitivity and specificity to detect LR 4 nodules were 87.0% and 99.5%, respectively. An NLP system can accurately suggest the appropriate LR category from CTLS exam findings when standardized reporting is used.
In 1993 Carey and Smith conjectured that the most promising way to boost students’ understanding of the nature of science is a “theory-building approach to teaching about inquiry.” The research reported here tested this conjecture by comparing results from two Grade 4 classrooms that differed in their emphasis on and technological support for creating and improving theories. One class followed a Knowledge Building approach and used Knowledge Forum®, which together emphasize theory improvement and sustained creative work with ideas. The other class followed an inquiry approach mediated through collaborative project-based activities. Apart from this, the two classes were demographically similar and both fell within the broad category of constructivist, inquiry-based approaches and employed a range of modes and media for investigative research and reports. An augmented version of Carey and Smith’s Nature of Science Interview showed that the Knowledge Building approach resulted in deeper understanding of the nature of theoretical progress, the connections between theories and facts, and the role of ideas in scientific inquiry.
Ward, Gillian; Haigh, Mavis
Teachers need an understanding of the nature of science (NOS) to enable them to incorporate NOS into their teaching of science. The current study examines the usefulness of a strategy for challenging or changing teachers' understandings of NOS. The teachers who participated in this study were 10 initial teacher education chemistry students and six experienced teachers from secondary and primary schools who were introduced to an explicit and reflective activity, a dramatic reading about a historical scientific development. Concept maps were used before and after the activity to assess teachers' knowledge of NOS. The participants also took part in a focus group interview to establish whether they perceived the activity as useful in developing their own understanding of NOS. Initial analysis led us to ask another group, comprising seven initial teacher education chemistry students, to take part in a modified study. These participants not only completed the same tasks as the previous participants but also completed a written reflection commenting on whether the activity and focus group discussion enhanced their understanding of NOS. Both Lederman et al.'s (Journal of Research in Science Teaching, 39(6), 497-521, 2002) concepts of NOS and notions of "naive" and "informed" understandings of NOS and Hay's (Studies in Higher Education, 32(1), 39-57, 2007) notions of "surface" and "deep" learning were used as frameworks to examine the participants' specific understandings of NOS and the depth of their learning. The ways in which participants' understandings of NOS were broadened or changed by taking part in the dramatic reading are presented. The impact of the data-gathering tools on the participants' professional learning is also discussed.
Fong, Allan; Harriott, Nicole; Walters, Donna M; Foley, Hanan; Morrissey, Richard; Ratwani, Raj R
Many healthcare providers have implemented patient safety event reporting systems to better understand and improve patient safety. Reviewing and analyzing these reports is often time consuming and resource intensive because of both the quantity of reports and length of free-text descriptions in the reports. Natural language processing (NLP) experts collaborated with clinical experts on a patient safety committee to assist in the identification and analysis of medication related patient safety events. Different NLP algorithmic approaches were developed to identify four types of medication related patient safety events and the models were compared. Well performing NLP models were generated to categorize medication related events into pharmacy delivery delays, dispensing errors, Pyxis discrepancies, and prescriber errors with receiver operating characteristic areas under the curve of 0.96, 0.87, 0.96, and 0.81 respectively. We also found that modeling the brief without the resolution text generally improved model performance. These models were integrated into a dashboard visualization to support the patient safety committee review process. We demonstrate the capabilities of various NLP models and the use of two text inclusion strategies at categorizing medication related patient safety events. The NLP models and visualization could be used to improve the efficiency of patient safety event data review and analysis. Copyright © 2017 Elsevier B.V. All rights reserved.
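The area under the ROC curve reported above can be read as the probability that a randomly chosen positive report scores higher than a randomly chosen negative one. The following is a minimal sketch of that pairwise definition, not the authors' evaluation code; the function name `roc_auc` and the toy labels and scores are assumptions made for illustration.

```python
def roc_auc(labels, scores):
    """Pairwise (rank-based) AUC: fraction of positive/negative pairs
    in which the positive example receives the higher score.
    Ties count as half a win. Assumes both classes are present."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical classifier scores for four reports (1 = event of interest).
auc = roc_auc([1, 1, 0, 0], [0.9, 0.4, 0.5, 0.1])
```

The quadratic pairwise loop is chosen for readability; a sort-based computation would be preferable for large report collections.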
Khanna, Anirudh; Das, Bhagwan; Pandey, Bishwajeet
With the advent of AI and IoT, the idea of incorporating smart things/appliances in our day to day life is converting into a reality. The paper discusses the possibilities and potential of designing IoT systems which can be controlled via natural language, with help of Quick Script as a development...
He, Qiwei; Veldkamp, Bernard P.; Glas, Cornelis A.W.; de Vries, Theo
Patients’ narratives about traumatic experiences and symptoms are useful in clinical screening and diagnostic procedures. In this study, we presented an automated assessment system to screen patients for posttraumatic stress disorder via a natural language processing and text-mining approach. Four
Dessus, Philippe; Trausan-Matu, Stefan; Van Rosmalen, Peter; Wild, Fridolin
Dessus, P., Trausan-Matu, S., Van Rosmalen, P., & Wild, F. (Eds.) (2009). AIED 2009 Workshops Proceedings Volume 10 Natural Language Processing in Support of Learning: Metrics, Feedback and Connectivity. In S. D. Craig & D. Dicheva (Eds.), AIED 2009: 14th International Conference in Artificial
Sermet, M. Y.; Demir, I.; Krajewski, W. F.
The Iowa Flood Information System (IFIS) is a web-based platform developed by the Iowa Flood Center (IFC) to provide access to flood inundation maps, real-time flood conditions, flood forecasts, flood-related data, information and interactive visualizations for communities in Iowa. The IFIS is designed for use by the general public, often people with no domain knowledge and limited general science background. To improve communication with this audience, we have introduced a voice-enabled knowledge engine on flood-related issues in IFIS. Instead of requiring users to navigate the many features and interfaces of the information system and web-based sources, the system provides dynamic computations based on a collection of built-in data, analysis, and methods. The IFIS Knowledge Engine connects to real-time stream gauges, in-house data sources, and analysis and visualization tools to answer natural language questions. Our goal is the systematization of data and modeling results on flood-related issues in Iowa, and to provide an interface for definitive answers to factual queries. The goal of the knowledge engine is to make all flood-related knowledge in Iowa easily accessible to everyone, and to support voice-enabled natural language input. We aim to integrate and curate all flood-related data, implement analytical and visualization tools, and make it possible to compute answers from questions. The IFIS explicitly implements analytical methods and models as algorithms, and curates all flood-related data and resources so that all these resources are computable. The IFIS Knowledge Engine computes the answer by deriving it from its computational knowledge base. The knowledge engine processes the statement, accesses the data warehouse, runs complex database queries on the server side and returns outputs in various formats. This presentation provides an overview of IFIS Knowledge Engine, its unique information interface and functionality as an educational tool, and discusses the future plans
Student views on the nature of science are shaped by a variety of out-of-school forces and television-mediated science is a significant force. To attempt to achieve a science for all, we need to recognize and understand the diverse messages about science that students access and think about on a regular basis. In this work I examine how high school students think about science that is mediated by four different program genres on television: documentary, magazine-format programming, network news, and dramatic or fictional programming. The following categories of findings are discussed: the ethics and validity of science, final form science, science as portrayed by its practitioners, and school science and television science. Student perceptions of the nature of science depicted on the program sample used in this study ranged from seeing science as comprising tentative knowledge claims to seeing science as a fixed body of facts.
Rebecca S. Toupal
Multicultural demands on public lands in the United States continue to challenge federal land managers to address social and cultural concerns in their planning efforts. Specifically, they lack adequate knowledge of cultural concerns, as well as a consistent strategy for acquiring that knowledge for use in decision-making. Current federal approaches to understanding such issues as access, use, and control of resources include public participation, conservation partnerships, government-to-government consultations with American Indian tribes, cultural resource inventories, and landscape analysis. Given that cultural knowledge arises from human-nature relationships and shared perceptions of natural environments, and that landscapes are the ultimate expression of such knowledge, an exploratory methodology was developed to provide a different approach to understanding cultural concerns through landscape perceptions. Using cultural landscape theories and applications from the natural and social sciences, this study examines the landscape perceptions of four groups concerned with management planning of the Baboquivari Wilderness Area in southern Arizona: the Bureau of Land Management, the landowners of the Altar Valley, recreationists, and members of the Tohono O'odham Nation. The methodology is based on a human-nature relationship rather than cultural aspects or features. It takes a holistic approach that differs from other perception studies in that it includes: emic aspects of data collection and analysis; a spatial component (triangulation of data collection through narrative and graphic descriptions); ethnographic, on-site interviews; and cultural consensus analysis and small-sample theory. The results include: verification of four cultural groups; two levels of consensus (in the population of concern, and in each group) that overlap in some aspects of landscape perception; descriptions of four cultural landscapes that illustrate similarities and
Koerber, Susanne; Osterhaus, Christopher; Sodian, Beate
Understanding the nature of science (NOS) is a critical aspect of scientific reasoning, yet few studies have investigated its developmental beginnings and initial structure. One contributing reason is the lack of an adequate instrument. Two studies assessed NOS understanding among third graders using a multiple-select (MS) paper-and-pencil test. Study 1 investigated the validity of the MS test by presenting the items to 68 third graders (9-year-olds) and subsequently interviewing them on their underlying NOS conception of the items. All items were significantly related between formats, indicating that the test was valid. Study 2 applied the same instrument to a larger sample of 243 third graders, and their performance was compared to a multiple-choice (MC) version of the test. Although the MC format inflated the guessing probability, there was a significant relation between the two formats. In summary, the MS format was a valid method revealing third graders' NOS understanding, thereby representing an economical test instrument. A latent class analysis identified three groups of children with expertise in qualitatively different aspects of NOS, suggesting that there is not a single common starting point for the development of NOS understanding; instead, multiple developmental pathways may exist. © 2014 The British Psychological Society.
Corballis, Michael C.
The mirror system provided a natural platform for the subsequent evolution of language. In nonhuman primates, the system provides for the understanding of biological action, and possibly for imitation, both prerequisites for language. I argue that language evolved from manual gestures, initially as a system of pantomime, but with gestures…
Falomir, Zoe; Kluth, Thomas
The challenge of describing real 3D scenes is tackled in this paper using qualitative spatial descriptors. A key point to study is which qualitative descriptors to use and how these qualitative descriptors must be organized to produce a suitable cognitive explanation. In order to find answers, a survey test was carried out with human participants who openly described a scene containing some pieces of furniture. The data obtained in this survey are analysed, and taking this into account, the QSn3D computational approach was developed, which uses an Xbox 360 Kinect to obtain 3D data from a real indoor scene. Object features are computed on these 3D data to identify objects in indoor scenes. The object orientation is computed, and qualitative spatial relations between the objects are extracted. These qualitative spatial relations are the input to a grammar which applies saliency rules obtained from the survey study and generates cognitive natural language descriptions of scenes. Moreover, these qualitative descriptors can be expressed as first-order logical facts in Prolog for further reasoning. Finally, a validation study is carried out to test whether the descriptions provided by the QSn3D approach are human readable. The obtained results show that their acceptability is higher than 82%.
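To make the step from 3D data to Prolog-style facts concrete, here is a minimal sketch that derives two qualitative relations from object centroids. It is not the QSn3D model: the relation names `left_of` and `in_front_of`, the coordinate convention, the tolerance `tol`, and the sample coordinates are all assumptions chosen for illustration.

```python
def spatial_relations(objects, tol=0.1):
    """Emit Prolog-style facts from object centroids.

    objects maps a name to an (x, y, z) centroid in metres, with x pointing
    right and z pointing away from the sensor (an assumed convention).
    A relation is asserted only if the gap exceeds tol on that axis.
    """
    facts = []
    names = sorted(objects)
    for a in names:
        for b in names:
            if a == b:
                continue
            ax, _, az = objects[a]
            bx, _, bz = objects[b]
            if ax < bx - tol:
                facts.append(f"left_of({a}, {b}).")
            if az < bz - tol:
                facts.append(f"in_front_of({a}, {b}).")
    return facts


# Hypothetical centroids for two pieces of furniture.
scene = {"chair": (0.0, 0.0, 1.0), "table": (1.0, 0.0, 2.0)}
for fact in spatial_relations(scene):
    print(fact)
```

A fuller treatment would also need object extents and orientation, which the abstract mentions but this sketch omits.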
Recent advances in Natural Language Processing and Machine Learning provide us with the tools to build predictive models that can be used to unveil patterns driving judicial decisions. This can be useful, for both lawyers and judges, as an assisting tool to rapidly identify cases and extract patterns which lead to certain decisions. This paper presents the first systematic study on predicting the outcome of cases tried by the European Court of Human Rights based solely on textual content. We formulate a binary classification task where the input of our classifiers is the textual content extracted from a case and the target output is the actual judgment as to whether there has been a violation of an article of the convention of human rights. Textual information is represented using contiguous word sequences, i.e., N-grams, and topics. Our models can predict the court’s decisions with a strong accuracy (79% on average). Our empirical analysis indicates that the formal facts of a case are the most important predictive factor. This is consistent with the theory of legal realism suggesting that judicial decision-making is significantly affected by the stimulus of the facts. We also observe that the topical content of a case is another important feature in this classification task and explore this relationship further by conducting a qualitative analysis.
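The N-gram representation mentioned above can be illustrated with a short sketch. It shows only how contiguous word sequences are extracted and counted; the whitespace tokenization, the function names, and the sample sentence are assumptions, and the study's actual preprocessing and classifier are not reproduced here.

```python
from collections import Counter


def word_ngrams(text, n):
    """Return the contiguous word sequences of length n in text."""
    tokens = text.lower().split()  # naive whitespace tokenization (assumption)
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def ngram_features(text, max_n=2):
    """Count all 1..max_n-grams; the counts form a bag-of-N-grams vector."""
    counts = Counter()
    for n in range(1, max_n + 1):
        counts.update(word_ngrams(text, n))
    return counts


# Hypothetical sentence standing in for the text of a case.
features = ngram_features("the court found a violation of the article")
```

Counts like these would typically be fed to a binary classifier, one feature per distinct N-gram.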
As we discuss, a stationary stochastic process is nonergodic when a random persistent topic can be detected in the infinite random text sampled from the process, whereas we call the process strongly nonergodic when an infinite sequence of independent random bits, called probabilistic facts, is needed to describe this topic completely. Replacing probabilistic facts with an algorithmically random sequence of bits, called algorithmic facts, we adapt this property back to ergodic processes. Subsequently, we call a process perigraphic if the number of algorithmic facts which can be inferred from a finite text sampled from the process grows like a power of the text length. We present a simple example of such a process. Moreover, we demonstrate an assertion which we call the theorem about facts and words. This proposition states that the number of probabilistic or algorithmic facts which can be inferred from a text drawn from a process must be roughly smaller than the number of distinct word-like strings detected in this text by means of the Prediction by Partial Matching (PPM) compression algorithm. We also observe that the number of word-like strings for a sample of plays by Shakespeare follows an empirical stepwise power law, in stark contrast to Markov processes. Hence, we suppose that natural language considered as a process is not only non-Markov but also perigraphic.
Badr, Hoda; Milbury, Kathrin; Majeed, Nadia; Carmack, Cindy L.; Ahmad, Zeba; Gritz, Ellen R.
Objective This multimethod prospective study examined whether emotional disclosure and coping focus as conveyed through natural language use is associated with the psychological and marital adjustment of head and neck cancer patients and their spouses. Methods One-hundred twenty-three patients (85% men; age X‒=56.8 years, SD=10.4) and their spouses completed surveys prior to, following, and 4-months after engaging in a videotaped discussion about cancer in the laboratory. Linguistic Inquiry and Word Count (LIWC) software assessed counts of positive/negative emotion words and first-person singular (I-talk), second person (you-talk), and first-person plural (we-talk) pronouns. Using a Grounded Theory approach, discussions were also analyzed to describe how emotion words and pronouns were used and what was being discussed. Results Emotion words were most often used to disclose thoughts/feelings or worry/uncertainty about the future, and to express gratitude or acknowledgment to one’s partner. Although patients who disclosed more negative emotion during the discussion reported more positive mood following the discussion (ppsychological and marital adjustment were found. Patients used significantly more I-talk than spouses and spouses used significantly more you-talk than patients (p’sdistress at the 4-month follow-up assessment when their partners used more we-talk (p disclosure may be less important to one’s cancer adjustment than having a partner who one sees as instrumental to the coping process. PMID:27441867
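Counting pronoun categories of the kind described above amounts to dictionary lookup over tokens. The sketch below illustrates the idea only: the word lists are short illustrative stand-ins and are NOT the LIWC dictionaries, and the category keys and sample utterance are assumptions.

```python
import re
from collections import Counter

# Illustrative word lists only; these are NOT the LIWC dictionaries.
CATEGORIES = {
    "i_talk": {"i", "me", "my", "mine", "myself"},
    "you_talk": {"you", "your", "yours", "yourself"},
    "we_talk": {"we", "us", "our", "ours", "ourselves"},
}


def pronoun_counts(utterance):
    """Count tokens falling into each pronoun category."""
    tokens = re.findall(r"[a-z']+", utterance.lower())
    counts = Counter()
    for tok in tokens:
        for name, words in CATEGORIES.items():
            if tok in words:
                counts[name] += 1
    return counts


# Hypothetical utterance from a couple's discussion.
counts = pronoun_counts("We will get through this, and I know you are with me.")
```

In practice such counts are usually normalized by the total number of words spoken so that speakers of different verbosity can be compared.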
Sevenster, Merlijn; Bozeman, Jeffrey; Cowhy, Andrea; Trost, William
To standardize and objectivize treatment response assessment in oncology, guidelines have been proposed that are driven by radiological measurements, which are typically communicated in free-text reports defying automated processing. We study through inter-annotator agreement and natural language processing (NLP) algorithm development the task of pairing measurements that quantify the same finding across consecutive radiology reports, such that each measurement is paired with at most one other ("partial uniqueness"). Ground truth is created based on 283 abdomen and 311 chest CT reports of 50 patients each. A pre-processing engine segments reports and extracts measurements. Thirteen features are developed based on volumetric similarity between measurements, semantic similarity between their respective narrative contexts and structural properties of their report positions. A Random Forest classifier (RF) integrates all features. A "mutual best match" (MBM) post-processor ensures partial uniqueness. In an end-to-end evaluation, RF has precision 0.841, recall 0.807, F-measure 0.824 and AUC 0.971; with MBM, which performs above chance level (P0.960) indicates that the task is well defined. Domain properties and inter-section differences are discussed to explain superior performance in abdomen. Enforcing partial uniqueness has mixed but minor effects on performance. A combined machine learning-filtering approach is proposed for pairing measurements, which can support prospective (supporting treatment response assessment) and retrospective purposes (data mining). Copyright © 2014 Elsevier Inc. All rights reserved.
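The "mutual best match" idea and the reported F-measure can be sketched briefly. This is an illustration under stated assumptions rather than the authors' post-processor: the score dictionary layout, the `threshold` value, and the measurement identifiers are hypothetical. The `f_measure` helper simply checks that the reported precision and recall are consistent with the reported F-measure.

```python
def mutual_best_match(scores, threshold=0.5):
    """Keep pair (a, b) only if b is a's top-scoring partner and vice versa.

    scores maps (prior_measurement, current_measurement) -> classifier score.
    Each measurement ends up in at most one pair ("partial uniqueness").
    The threshold value is an illustrative assumption.
    """
    best_for_a, best_for_b = {}, {}
    for (a, b), s in scores.items():
        if s < threshold:
            continue
        if a not in best_for_a or s > scores[(a, best_for_a[a])]:
            best_for_a[a] = b
        if b not in best_for_b or s > scores[(best_for_b[b], b)]:
            best_for_b[b] = a
    return sorted((a, b) for a, b in best_for_a.items() if best_for_b[b] == a)


def f_measure(precision, recall):
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


# Hypothetical classifier scores for measurements in two consecutive reports.
scores = {
    ("p1", "c1"): 0.9, ("p1", "c2"): 0.6,
    ("p2", "c1"): 0.8, ("p2", "c2"): 0.7,
}
pairs = mutual_best_match(scores)
```

In this toy input `p2` prefers `c1`, but `c1` prefers `p1`, so `p2` stays unpaired; that is the price of enforcing partial uniqueness without a global assignment step.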
A. E. Pismak
Subject of Research. The paper focuses on the structural organization of Wiktionary articles with a view to using them as the basis for a semantic network. Wiktionary community references, article templates and article markup features are analyzed. The problem of numerically estimating the semantic similarity of structural elements in Wiktionary articles is considered. Existing software for estimating the semantic similarity of such elements is analyzed; the algorithms by which it operates are studied; and their advantages and disadvantages are shown. Methods. Methods of mathematical statistics were used to analyze Wiktionary article markup features. A method of computing semantic similarity based on statistical data for the compared structural elements was proposed. Main Results. We have concluded that Wiktionary articles cannot be used directly as the source for a semantic network. We have proposed finding hidden similarity between article elements, and for that purpose we have developed an algorithm that calculates confidence coefficients indicating that each pair of sentences is semantically close. A study of the quantitative and qualitative characteristics of the developed algorithm has shown a major performance advantage over the other existing solutions, at the cost of an insignificantly higher error rate. Practical Relevance. The resulting algorithm may be useful in developing tools for automatic parsing of Wiktionary articles. The developed method could be used to compute semantic similarity for short natural-language text fragments when performance requirements outweigh accuracy requirements.
Dang, Pragya A; Kalra, Mannudeep K; Blake, Michael A; Schultz, Thomas J; Stout, Markus; Lemay, Paul R; Freshman, David J; Halpern, Elkan F; Dreyer, Keith J
The study purpose was to describe the use of natural language processing (NLP) and online analytic processing (OLAP) for assessing patterns in recommendations in unstructured radiology reports on the basis of patient and imaging characteristics, such as age, gender, referring physicians, radiology subspecialty, modality, indications, diseases, and patient status (inpatient vs outpatient). A database of 4,279,179 radiology reports from a single tertiary health care center during a 10-year period (1995-2004) was created. The database includes reports of computed tomography, magnetic resonance imaging, fluoroscopy, nuclear medicine, ultrasound, radiography, mammography, angiography, special procedures, and unclassified imaging tests with patient demographics. A clinical data mining and analysis NLP program (Leximer, Nuance Inc, Burlington, Massachusetts) in conjunction with OLAP was used for classifying reports into those with recommendations (I(REC)) and without recommendations (N(REC)) for imaging and determining I(REC) rates for different patient age groups, gender, imaging modalities, indications, diseases, subspecialties, and referring physicians. In addition, temporal trends for I(REC) were also determined. There was a significant difference in the I(REC) rates in different age groups, varying between 4.8% (10-19 years) and 9.5% (>70 years) (P OLAP revealed considerable differences between recommendation trends for different imaging modalities and other patient and imaging characteristics.
Rivas, Michael Gerald
This action research project studies preservice elementary teachers in a science methods course. The purpose of this research project was to enhance preservice teachers' understanding of specific nature of science (NOS) tenets so as to promote equity and access within the elementary science classroom. In particular, I chose five NOS tenets that were listed in the first chapter of the AAAS (1989) document titled, "The Nature of Science," and connected them to equitable educational goals and practices. The theoretical framework guiding this study came from bodies of scholarship relating to the NOS, social constructivism, and action research. This study addressed the following three questions: (1) What opportunities were provided the preservice teachers so that they could enhance their understandings of the NOS? (2) What were the changes in preservice teachers' understanding of the NOS as a result? (3) How did the prospective teachers' understandings of the NOS translate into their classroom practice? The analysis revealed that the science methods course's operational curriculum consisted of implicit and explicit teaching of the NOS, as well as intended and unintended NOS tenets. The prospective teachers initially held a limited view of the NOS, but by the end of the course their view had been enhanced. In addition, the participants made direct connections between their new understandings of the NOS and equity and access in the science classroom. In their teaching, the preservice teachers as a group implicitly taught all five of the NOS tenets. In fact, a majority taught three of the five intended tenets. Explicitly, only one tenet was taught, but it was taught with a direct connection to making the science classroom more inclusive. The findings of this study indicate that preservice teachers can have their views of the NOS enhanced even though they may have experienced years of deficient science instruction. They pointed out that this enhanced view of the NOS can be
Cole, Merryn L.
This dissertation employed a mixed-methods approach to examine the relationship between spatial reasoning ability and understanding of chemistry content for both middle school students and their science teachers. Spatial reasoning has been linked to success in learning STEM subjects (Wai, Lubinski, & Benbow, 2009). Previous studies have shown a correlation between understanding of chemistry content and spatial reasoning ability (e.g., Pribyl & Bodner, 1987; Wu & Shah, 2003; Stieff, 2013), raising the importance of developing the spatial reasoning ability of both teachers and students. Few studies examine middle school students' or in-service middle school teachers' understanding of chemistry concepts or its relation to spatial reasoning ability. The first paper in this dissertation addresses the quantitative relationship between mental rotation, a type of spatial reasoning ability, and understanding a fundamental concept in chemistry, the particulate nature of matter. The data showed a significant, positive correlation between scores on the Purdue Spatial Visualization Test of Rotations (PSVT; Bodner & Guay, 1997) and the Particulate Nature of Matter Assessment (ParNoMA; Yezierski, 2003) for middle school students prior to and after chemistry instruction. A significant difference in spatial ability among students choosing different answer choices on ParNoMA questions was also found. The second paper examined the ways in which students of different spatial abilities talked about matter and chemicals differently. Students with higher spatial ability tended to provide more of an explanation, though not necessarily in an articulate manner. In contrast, lower spatial ability students tended to use any keywords that seemed relevant, but provided little or no explanation. The third paper examined the relationship between mental reasoning and understanding chemistry for middle school science teachers. Similar to their students, a significant, positive correlation between
Lindhe, Christina; Hartelius, Lena
The aim of the study was to describe the subjective ratings of the course 'Training of the student's own voice and speech', from a student-centred perspective. A questionnaire was completed after each of the six individual sessions. Six speech and language pathology (SLP) students rated how they perceived the practical exercises in terms of doing and understanding. The results showed that five of the six participants rated the exercises as significantly easier to understand than to do. The exercises were also rated as easier to do over time. Results are interpreted within a theoretical framework of approaches to learning. The findings support the importance of both the physical and reflective aspects of the voice training process.
Patton, Desmond Upton; MacBeth, Jamie; Schoenebeck, Sarita; Shear, Katherine; McKeown, Kathleen
There is a dearth of research investigating youths’ experience of grief and mourning after the death of close friends or family. Even less research has explored the question of how youth use social media sites to engage in the grieving process. This study employs qualitative analysis and natural language processing to examine tweets that follow 2 deaths. First, we conducted a close textual read on a sample of tweets by Gakirah Barnes, a gang-involved teenaged girl in Chicago, and members of her Twitter network, over a 19-day period in 2014 during which 2 significant deaths occurred: that of Raason “Lil B” Shaw and Gakirah’s own death. We leverage the grief literature to understand the way Gakirah and her peers express thoughts, feelings, and behaviors at the time of these deaths. We also present and explain the rich and complex style of online communication among gang-involved youth, one that has been overlooked in prior research. Next, we overview the natural language processing output for expressions of loss and grief in our data set based on qualitative findings and present an error analysis on its output for grief. We conclude with a call for interdisciplinary research that analyzes online and offline behaviors to help understand physical and emotional violence and other problematic behaviors prevalent among marginalized communities. PMID:29636619
Yu. S. Hetsevich
The article focuses on problems in text-to-speech synthesis. Different morphological, lexical and syntactic elements were localized with the help of the Belarusian module of the NooJ program. The types of errors that occur in Belarusian texts were analyzed and corrected. A language model and a part-of-speech tagging model were built. Natural language processing of a Belarusian corpus was carried out with the developed machine-learning algorithm. The precision of the developed machine-learning models was 80–90%. The dictionary was enriched with new words for further use in Belarusian speech synthesis systems.
Weng, Wei-Hung; Wagholikar, Kavishwar B; McCray, Alexa T; Szolovits, Peter; Chueh, Henry C
The medical subdomain of a clinical note, such as cardiology or neurology, is useful content-derived metadata for developing machine learning downstream applications. To classify the medical subdomain of a note accurately, we have constructed a machine learning-based natural language processing (NLP) pipeline and developed medical subdomain classifiers based on the content of the note. We constructed the pipeline using the clinical NLP system, clinical Text Analysis and Knowledge Extraction System (cTAKES), the Unified Medical Language System (UMLS) Metathesaurus, Semantic Network, and learning algorithms to extract features from two datasets - clinical notes from Integrating Data for Analysis, Anonymization, and Sharing (iDASH) data repository (n = 431) and Massachusetts General Hospital (MGH) (n = 91,237), and built medical subdomain classifiers with different combinations of data representation methods and supervised learning algorithms. We evaluated the performance of classifiers and their portability across the two datasets. The convolutional recurrent neural network with neural word embeddings trained-medical subdomain classifier yielded the best performance measurement on iDASH and MGH datasets with area under receiver operating characteristic curve (AUC) of 0.975 and 0.991, and F1 scores of 0.845 and 0.870, respectively. Considering better clinical interpretability, linear support vector machine-trained medical subdomain classifier using hybrid bag-of-words and clinically relevant UMLS concepts as the feature representation, with term frequency-inverse document frequency (tf-idf)-weighting, outperformed other shallow learning classifiers on iDASH and MGH datasets with AUC of 0.957 and 0.964, and F1 scores of 0.932 and 0.934 respectively. We trained classifiers on one dataset, applied to the other dataset and yielded the threshold of F1 score of 0.7 in classifiers for half of the medical subdomains we studied. Our study shows that a supervised
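The tf-idf weighting mentioned above can be illustrated with a small sketch over tokenized notes. It uses raw term frequency and idf = log(N / df), which is one common variant and not necessarily the one used in the study; the function name and the toy documents are assumptions.

```python
import math
from collections import Counter


def tfidf(docs):
    """Return one {term: weight} dict per tokenized document.

    Uses raw term frequency and idf = log(N / df); other variants exist.
    """
    n_docs = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        tf = Counter(doc)
        weights.append({t: tf[t] * math.log(n_docs / df[t]) for t in tf})
    return weights


# Hypothetical tokenized notes standing in for clinical text.
notes = [["chest", "pain", "chest"], ["head", "pain"]]
weights = tfidf(notes)
```

A term that appears in every document (here "pain") receives weight zero, which is why tf-idf favours terms that discriminate between subdomains.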
Cofré, Hernán; Cuevas, Emilia; Becerra, Beatriz
Despite the importance of the theory of evolution (TE) to scientific knowledge, a number of misconceptions continue to be found among biology teachers. In this context, the first objective of this study was to identify the impact of professional development programme (PDP) on teachers' understanding of nature of science (NOS) and evolution and on the acceptance of this theory. Its second objective was to study the relationship among these variables. Three instruments were used to quantify these variables: the Views of the Nature of Science Version D (VNOS D+), the Assessing Contextual Reasoning about Natural Selection (ACORN), and the Measure of Acceptance of Theory of Evolution (MATE). The results indicate that the PDP had a positive impact on teachers, significantly improving their understanding of the NOS and natural selection, as well as their acceptance of the TE. Furthermore, a positive correlation between the understanding of the NOS obtained by teachers in the first part of the PDP and the understanding and acceptance of evolution that these teachers showed at the end of the programme was determined. However, no relationship between an understanding of the NOS and gains in the understanding and acceptance of evolution was found.
Ruohotie-Lyhty, Maria; Korppi, Aino; Moate, Josephine; Nyman, Tarja
Teaching is recognised as an emotional practice. Studies have highlighted the importance of teachers' emotional literacy in the development of pupils' emotional skills, the central position of emotions in teachers' ways of knowing, and in their professional development. This longitudinal study draws on a dialogic understanding of emotion to…
van der Kroon, Linda; Jauregi Ondarra, M.K.; ten Thije, J.D.
The development of intercultural communicative competence is increasingly important in this globalised and highly digitalised world. This implies the adequate understanding of otherness, which entails a myriad of complex cognitive competences, skills and behaviour. The TILA project aims to study how
Wekesa, Duncan Wasike
Mathematical knowledge and understanding is important not only for scientific progress and development but also for its day-to-day application in social sciences and arts, government, business and management studies and household chores. But the general performance in school mathematics in Kenya has been poor over the years. There is evidence that…
Rappleye, Jeremy; Imoto, Yuki; Horiguchi, Sachiko
Globalisation and convergence in educational policy worldwide has reinvigorated, while rendering more complex, the classic theme of educational transfer. Framed by this wider pursuit of new understandings of a changing transfer/context puzzle, this paper explores how an ethnographic "thick description" might complement and extend recent…
Bucks, Gregory Warren
Computers have become an integral part of how engineers complete their work, allowing them to collect and analyze data, model potential solutions and aiding in production through automation and robotics. In addition, computers are essential elements of the products themselves, from tennis shoes to construction materials. An understanding of how…
analogy from Wittgenstein’s term "language game" (Wittgenstein, 1958). However, Dialogue-games represent knowledge people have about language as used to...and memory of narrative discourse. Cognitive Psychology, 1977, 9, 77-110. Wittgenstein, L. Philosophical investigations (3rd ed.). New York
Giovana Fracari Hautrive
The literacy of deaf children currently directs attention to teaching practice that makes demands beyond the school. Questions arising from daily practice became a challenge requiring an investigative attitude. The article aims to problematize the process of literacy of deaf children, and the proposed reflection emerges from daily practice. It is structured around threads that include theoretical studies by Vigotskii (1989, 1994, 1996, 1998), Stumpf (2005), Quadros (1997), Bolzan (1998, 2002), and Skliar (1997a, 1997b, 1998), from which it problematizes the processes involved in the construction of written language. The result is the importance of the instrumentalization of sign language as the first language in the education of deaf people and of learning sign language writing. Important aspects for deaf students are observed when they are able to become literate in their mother tongue. The article points out the need to redirect the literacy of deaf children so that important aspects of language, its role in the structuring of thought, and its communicative aspect are respected and considered in this process. It therefore emphasizes the learning of sign language writing as fundamental: it should occupy a central role in classroom teaching proposals, encouraging the contradictions that place the student in a situation of cognitive conflict while respecting the diversity inherent in each human being. The production of sign language writing is considered an appropriate tool for deaf students to record their visual language.
Ben Abacha, Asma; Dos Reis, Julio Cesar; Mrabet, Yassine; Pruski, Cédric; Da Silveira, Marcos
The increasing number of open-access ontologies and their key role in several applications such as decision-support systems highlight the importance of their validation. Human expertise is crucial for the validation of ontologies from a domain point-of-view. However, the growing number of ontologies and their fast evolution over time make manual validation challenging. We propose a novel semi-automatic approach based on the generation of natural language (NL) questions to support the validation of ontologies and their evolution. The proposed approach includes the automatic generation, factorization and ordering of NL questions from medical ontologies. The final validation and correction is performed by submitting these questions to domain experts and automatically analyzing their feedback. We also propose a second approach for the validation of mappings impacted by ontology changes. The method exploits the context of the changes to propose correction alternatives presented as Multiple Choice Questions. This research provides a question optimization strategy to maximize the validation of ontology entities with a reduced number of questions. We evaluate our approach for the validation of three medical ontologies. We also evaluate the feasibility and efficiency of our mappings validation approach in the context of ontology evolution. These experiments are performed with different versions of SNOMED-CT and ICD9. The obtained experimental results suggest the feasibility and adequacy of our approach to support the validation of interconnected and evolving ontologies. Results also suggest that taking into account RDFS and OWL entailment helps reducing the number of questions and validation time. The application of our approach to validate mapping evolution also shows the difficulty of adapting mapping evolution over time and highlights the importance of semi-automatic validation.
Jay, Caroline; Harper, Simon; Dunlop, Ian; Smith, Sam; Sufi, Shoaib; Goble, Carole; Buchan, Iain
Data discovery, particularly the discovery of key variables and their inter-relationships, is key to secondary data analysis, and in turn, the evolving field of data science. Interface designers have presumed that their users are domain experts, and so they have provided complex interfaces to support these "experts." Such interfaces hark back to a time when searches needed to be accurate first time as there was a high computational cost associated with each search. Our work is part of a governmental research initiative between the medical and social research funding bodies to improve the use of social data in medical research. The cross-disciplinary nature of data science can make no assumptions regarding the domain expertise of a particular scientist, whose interests may intersect multiple domains. Here we consider the common requirement for scientists to seek archived data for secondary analysis. This has more in common with search needs of the "Google generation" than with their single-domain, single-tool forebears. Our study compares a Google-like interface with traditional ways of searching for noncomplex health data in a data archive. Two user interfaces are evaluated for the same set of tasks in extracting data from surveys stored in the UK Data Archive (UKDA). One interface, Web search, is "Google-like," enabling users to browse, search for, and view metadata about study variables, whereas the other, traditional search, has a standard multioption user interface. Using a comprehensive set of tasks with 20 volunteers, we found that the Web search interface met data discovery needs and expectations better than the traditional search. A task × interface repeated measures analysis showed a main effect indicating that answers found through the Web search interface were more likely to be correct (F1,19=37.3, P...natural language search interfaces for variable search supporting in particular: query reformulation; data browsing; faceted search; surrogates; relevance
Tseytlin, Eugene; Mitchell, Kevin; Legowski, Elizabeth; Corrigan, Julia; Chavan, Girish; Jacobson, Rebecca S
Natural language processing (NLP) applications are increasingly important in biomedical data analysis, knowledge engineering, and decision support. Concept recognition is an important component task for NLP pipelines, and can be either general-purpose or domain-specific. We describe a novel, flexible, and general-purpose concept recognition component for NLP pipelines, and compare its speed and accuracy against five commonly used alternatives on both a biological and clinical corpus. NOBLE Coder implements a general algorithm for matching terms to concepts from an arbitrary vocabulary set. The system's matching options can be configured individually or in combination to yield specific system behavior for a variety of NLP tasks. The software is open source, freely available, and easily integrated into UIMA or GATE. We benchmarked speed and accuracy of the system against the CRAFT and ShARe corpora as reference standards and compared it to MMTx, MGrep, Concept Mapper, cTAKES Dictionary Lookup Annotator, and cTAKES Fast Dictionary Lookup Annotator. We describe key advantages of the NOBLE Coder system and associated tools, including its greedy algorithm, configurable matching strategies, and multiple terminology input formats. These features provide unique functionality when compared with existing alternatives, including state-of-the-art systems. On two benchmarking tasks, NOBLE's performance exceeded commonly used alternatives, performing almost as well as the most advanced systems. Error analysis revealed differences in error profiles among systems. NOBLE Coder is comparable to other widely used concept recognition systems in terms of accuracy and speed. Advantages of NOBLE Coder include its interactive terminology builder tool, ease of configuration, and adaptability to various domains and tasks. NOBLE provides a term-to-concept matching system suitable for general concept recognition in biomedical NLP pipelines.
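As an illustration of what greedy term-to-concept matching against an arbitrary vocabulary can look like, here is a minimal Python sketch. It is not NOBLE Coder's actual algorithm or API: the function name `match_concepts`, the `max_len` window, and the concept codes are hypothetical, and the matching rule (lowercase, whitespace tokens, longest match first) is an assumption made for the example.

```python
from typing import Dict, List, Tuple


def match_concepts(
    text: str, vocabulary: Dict[str, str], max_len: int = 5
) -> List[Tuple[str, str]]:
    """Greedy longest-match lookup of vocabulary terms in a text.

    `vocabulary` maps a lowercase term to a (hypothetical) concept code.
    At each token position the longest matching term wins, and scanning
    resumes after the matched span.
    """
    tokens = text.lower().split()
    matches: List[Tuple[str, str]] = []
    i = 0
    while i < len(tokens):
        # Try the longest candidate span first, then shrink it.
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            candidate = " ".join(tokens[i:i + n])
            if candidate in vocabulary:
                matches.append((candidate, vocabulary[candidate]))
                i += n
                break
        else:
            # No term starts at this token; move on by one.
            i += 1
    return matches


# Hypothetical vocabulary and codes, for illustration only.
vocab = {"prostate cancer": "C001", "cancer": "C002", "gleason grade": "C003"}
print(match_concepts("Prostate cancer with high Gleason grade", vocab))
```

Because the longest span is tried first, "prostate cancer" is matched as one concept instead of the shorter term "cancer". A configurable system would expose choices such as this one as matching options.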
Kim, Brian J; Merchant, Madhur; Zheng, Chengyi; Thomas, Anil A; Contreras, Richard; Jacobsen, Steven J; Chien, Gary W
Natural language processing (NLP) software programs have been widely developed to transform complex free text into simplified organized data. Potential applications in the field of medicine include automated report summaries, physician alerts, patient repositories, electronic medical record (EMR) billing, and quality metric reports. Despite these prospects and the recent widespread adoption of EMR, NLP has been relatively underutilized. The objective of this study was to evaluate the performance of an internally developed NLP program in extracting select pathologic findings from radical prostatectomy specimen reports in the EMR. An NLP program was generated by a software engineer to extract key variables from prostatectomy reports in the EMR within our healthcare system, which included the TNM stage, Gleason grade, presence of a tertiary Gleason pattern, histologic subtype, size of dominant tumor nodule, seminal vesicle invasion (SVI), perineural invasion (PNI), angiolymphatic invasion (ALI), extracapsular extension (ECE), and surgical margin status (SMS). The program was validated by comparing NLP results to a gold standard compiled by two blinded manual reviewers for 100 random pathology reports. NLP demonstrated 100% accuracy for identifying the Gleason grade, presence of a tertiary Gleason pattern, SVI, ALI, and ECE. It also demonstrated near-perfect accuracy for extracting histologic subtype (99.0%), PNI (98.9%), TNM stage (98.0%), SMS (97.0%), and dominant tumor size (95.7%). The overall accuracy of NLP was 98.7%. NLP generated a result in report. This novel program demonstrated high accuracy and efficiency identifying key pathologic details from the prostatectomy report within an EMR system. NLP has the potential to assist urologists by summarizing and highlighting relevant information from verbose pathology reports. It may also facilitate future urologic research through the rapid and automated creation of large databases.
Zheng, Chengyi; Rashid, Nazia; Wu, Yi-Lin; Koblick, River; Lin, Antony T; Levy, Gerald D; Cheetham, T Craig
Gout flares are not well documented by diagnosis codes, making it difficult to conduct accurate database studies. We implemented a computer-based method to automatically identify gout flares using natural language processing (NLP) and machine learning (ML) from electronic clinical notes. Of 16,519 patients, 1,264 and 1,192 clinical notes from 2 separate sets of 100 patients were selected as the training and evaluation data sets, respectively, which were reviewed by rheumatologists. We created separate NLP searches to capture different aspects of gout flares. For each note, the NLP search outputs became the ML system inputs, which provided the final classification decisions. The note-level classifications were grouped into patient-level gout flares. Our NLP+ML results were validated using a gold standard data set and compared with the claims-based method used in the prior literature. For 16,519 patients with a diagnosis of gout and a prescription for a urate-lowering therapy, we identified 18,869 clinical notes as gout flare positive (sensitivity 82.1%, specificity 91.5%): 1,402 patients with ≥3 flares (sensitivity 93.5%, specificity 84.6%), 5,954 with 1 or 2 flares, and 9,163 with no flare (sensitivity 98.5%, specificity 96.4%). Our method identified more flare cases (18,869 versus 7,861) and patients with ≥3 flares (1,402 versus 516) when compared to the claims-based method. We developed a computer-based method (NLP and ML) to identify gout flares from the clinical notes. Our method was validated as an accurate tool for identifying gout flares with higher sensitivity and specificity compared to previous studies. Copyright © 2014 by the American College of Rheumatology.
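The evaluation described in this abstract rests on two simple computations: sensitivity and specificity against a gold standard, and the grouping of note-level labels into patient-level categories. The Python sketch below illustrates both under stated assumptions. The function names are hypothetical, the abstract does not say how notes are grouped into flares, and counting positive notes per patient is a simplification chosen only for the example.

```python
from collections import Counter
from typing import Dict, Iterable, List, Tuple


def sensitivity_specificity(
    predicted: List[bool], gold: List[bool]
) -> Tuple[float, float]:
    """Return (sensitivity, specificity) of predictions against a gold standard.

    Assumes the gold standard contains at least one positive and one
    negative case, so neither denominator is zero.
    """
    tp = sum(1 for p, g in zip(predicted, gold) if p and g)
    fn = sum(1 for p, g in zip(predicted, gold) if not p and g)
    tn = sum(1 for p, g in zip(predicted, gold) if not p and not g)
    fp = sum(1 for p, g in zip(predicted, gold) if p and not g)
    return tp / (tp + fn), tn / (tn + fp)


def flare_buckets(note_labels: Iterable[Tuple[str, bool]]) -> Dict[str, str]:
    """Group note-level labels into the three patient-level categories.

    Assumption: each flare-positive note counts as one flare. The study's
    actual grouping rule is not given in the abstract.
    """
    counts: Counter = Counter()
    for patient_id, is_flare in note_labels:
        counts[patient_id] += int(is_flare)  # also registers patients with 0 flares

    def bucket(n: int) -> str:
        if n == 0:
            return "none"
        return "1-2" if n <= 2 else ">=3"

    return {patient_id: bucket(n) for patient_id, n in counts.items()}
```

For example, predictions `[True, True, False, False]` against gold labels `[True, False, False, True]` give one case in each cell of the confusion matrix, so both measures equal 0.5.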
Arika E Wieneke
Background: Pathology reports typically require manual review to abstract research data. We developed a natural language processing (NLP) system to automatically interpret free-text breast pathology reports with limited assistance from manual abstraction. Methods: We used an iterative approach of machine learning algorithms and constructed groups of related findings to identify breast-related procedures and results from free-text pathology reports. We evaluated the NLP system using an all-or-nothing approach to determine which reports could be processed entirely using NLP and which reports needed manual review beyond NLP. We divided 3234 reports for development (2910, 90%) and evaluation (324, 10%) purposes using manually reviewed pathology data as our gold standard. Results: NLP correctly coded 12.7% of the evaluation set, flagged 49.1% of reports for manual review, incorrectly coded 30.8%, and correctly omitted 7.4% from the evaluation set due to irrelevancy (i.e. not breast-related). Common procedures and results were identified correctly (e.g. invasive ductal) with 95.5% precision and 94.0% sensitivity, but entire reports were flagged for manual review because of rare findings and substantial variation in pathology report text. Conclusions: The NLP system we developed did not perform sufficiently for abstracting entire breast pathology reports. The all-or-nothing approach resulted in too broad of a scope of work and limited our flexibility to identify breast pathology procedures and results. Our NLP system was also limited by the lack of the gold standard data on rare findings and wide variation in pathology text. Focusing on individual, common elements and improving pathology text report standardization may improve performance.
The following sections are included: * Definition of Dynamical Languages * Distinct Excluded Blocks * Definition and Properties * L and L″ in Chomsky Hierarchy * A Natural Equivalence Relation * Symbolic Flows * Symbolic Flows and Dynamical Languages * Subshifts of Finite Type * Sofic Systems * Graphs and Dynamical Languages * Graphs and Shannon-Graphs * Transitive Languages * Topological Entropy
Wanjari Ghate Sonalika
Giant cell fibroma (GCF) is a rare lesion with unique histopathology. It belongs to the broad category of fibrous hyperplastic lesions of the oral cavity. It is often mistaken for fibroma and papilloma because of its clinical resemblance to them; only its peculiar histopathological features help to distinguish it from them. The origin of the giant cell is still controversial. The available data are too sparse to predict the exact behavior of the lesion. Hence, we report a case of GCF of the tongue in a 19-year-old male. Special emphasis is given to understanding the basic process of development of the lesion, the nature of the giant cells, and the need for the formation of these peculiar cells. The differential diagnosis for GCF is briefly tabulated.
Callahan, Brendan E.
There is a distinct divide between theory and practice in American science education. Research indicates that a constructivist philosophy, in which students construct their own knowledge, is conducive to learning, while in many cases teachers continue to present science in a more traditional manner. This study sought to explore possible relationships between a socioscientific issues based curriculum and three outcome variables: nature of science understanding, reflective judgment, and argumentation skill. Both quantitative and qualitative methods were used to examine whole class differences as well as individual differences between the beginning and end of a semester of high school Biology I. Results indicated that the socioscientific issues based curriculum did not produce statistically significant changes over the course of one semester. However, the treatment group scored better on all three instruments than the comparison group. The small sample size may have contributed to the inability to find statistical significance in this study. The qualitative interviews did indicate that some students provided more sophisticated views on nature of science and reflective judgment, and were able to provide slightly more complex argumentation structures. Theoretical implications regarding the explicit use of socioscientific issues in the classroom are presented.
Markon, Kristian E
The literature suggests that internalizing psychopathology relates to impairment incrementally and gradually. However, the form of this relationship has not been characterized. This form is critical to understanding internalizing psychopathology, as it is possible that internalizing may accelerate in effect at some level of severity, defining a natural boundary of abnormality. Here, a novel method, semiparametric structural equation modeling, was used to model the relationship between internalizing and impairment in a sample of 8,580 individuals from the 2000 British Office for National Statistics Survey of Psychiatric Morbidity, a large, population-representative study of psychopathology. This method allows one to model relationships between latent internalizing and impairment without assuming any particular form a priori and to compare models in which the relationship is constant and linear. Results suggest that the relationship between internalizing and impairment is in fact linear and constant across the entire range of internalizing variation and that it is impossible to nonarbitrarily define a specific level of internalizing beyond which consequences suddenly become catastrophic in nature. Results demonstrate the phenomenological continuity of internalizing psychopathology, highlight the importance of impairment as well as symptoms, and have clear implications for defining mental disorder. Copyright 2010 APA, all rights reserved
Scientific publications written in natural language still play a central role as our knowledge source. However, due to the flood of publications, the literature survey process has become highly time-consuming and tangled, especially for novices in the discipline. Therefore, tools supporting the literature-survey process may help the individual scientist to explore new useful domains. Natural language processing (NLP) is expected to be one of the promising techniques to retrieve, abstract, and extract knowledge. In this contribution, NLP is first applied to the literature of chemical vapor deposition (CVD), which is a sub-discipline of materials science and a complex and interdisciplinary field of research involving chemists, physicists, engineers, and materials scientists. Causal knowledge extraction from the literature is demonstrated using NLP.
Merker, Bjorn; Okanoya, Kazuo
Human languages are quintessentially historical phenomena. Every known aspect of linguistic form and content is subject to change in historical time (Lehmann, 1995; Bybee, 2004). Many facts of language, syntactic no less than semantic, find their explanation in the historical processes that generated them. If adpositions were once verbs, then the fact that they tend to occur on the same side of their arguments as do verbs ("cross-category harmony": Hawkins, 1983) is a matter of historical contingency rather than a reflection of inherent structural constraints on human language (Delancey, 1993).
Language change can be understood as an evolutionary process. Language change occurs at two different timescales, corresponding to the two steps of the evolutionary process. The first timescale is very short, namely, the production of an utterance: this is where linguistic structures are replicated and language variation is generated. The second timescale is (or can be) very long, namely, the propagation of linguistic variants in the speech community: this is where certain variants are selected over others. At both timescales, the evolutionary process is driven by social interaction and the role language plays in it. An understanding of social interaction at the micro-level—face-to-face interactions—and at the macro-level—the structure of speech communities—gives us the basis for understanding the generation and propagation of language structures, and understanding the nature of language itself.
Kirrie J. Ballard
Researchers have interpreted the behaviours of individuals with acquired apraxia of speech (AOS) as impairment of linguistic phonological processing, motor control, or both. Acoustic, kinematic, and perceptual studies of speech in more recent years have led to significant advances in our understanding of the disorder and wide acceptance that it affects phonetic-motoric planning of speech. However, newly developed methods for studying nonspeech motor control are providing new insights, indicating that the motor control impairment of AOS extends beyond speech and is manifest in nonspeech movements of the oral structures. We present the most recent developments in theory and methods to examine and define the nature of AOS. Theories of the disorder are then related to existing treatment approaches and the efficacy of these approaches is examined. Directions for development of new treatments are posited. It is proposed that treatment programmes driven by a principled account of how the motor system learns to produce skilled actions will provide the most efficient and effective framework for treating motor-based speech disorders. In turn, well controlled and theoretically motivated studies of treatment efficacy promise to stimulate further development of theoretical accounts and contribute to our understanding of AOS.
Kar, B.; Robinson, C.; Koch, D. B.; Omitaomu, O.
The Sendai Framework for Disaster Risk Reduction 2015-2030 identified the following four priorities to prevent and reduce disaster risks: i) understanding disaster risk; ii) strengthening governance to manage disaster risk; iii) investing in disaster risk reduction for resilience and; iv) enhancing disaster preparedness for effective response, and to "Build Back Better" in recovery, rehabilitation and reconstruction. While forecasting and decision making tools are in place to predict and understand future impacts of natural hazards, the knowledge to action approach that currently exists fails to provide updated information needed by decision makers to undertake response and recovery efforts following a hazard event. For instance, during a tropical storm event advisories are released every two to three hours, but manual analysis of geospatial data to determine potential impacts of the event tends to be time-consuming and a post-event process. Researchers at Oak Ridge National Laboratory have developed a Spatial Decision Support System that enables real-time analysis of storm impact based on updated advisory. A prototype of the tool that focuses on determining projected power outage areas and projected duration of outages demonstrates the feasibility of integrating science with decision making for emergency management personnel to act in real time to protect communities and reduce risk.
Hajicova, E; Sgall, P
The authors briefly mention one experiment of natural language interface with databases of a common type. The main part of this paper is devoted to the prepared system of natural language understanding with an automatic construction of the collection of data. 12 references.
Saunders, Daniel R; Bex, Peter J; Woods, Russell L
Crowdsourcing has become a valuable method for collecting medical research data. This approach, recruiting through open calls on the Web, is particularly useful for assembling large normative datasets. However, it is not known how natural language datasets collected over the Web differ from those collected under controlled laboratory conditions. To compare the natural language responses obtained from a crowdsourced sample of participants with responses collected in a conventional laboratory setting from participants recruited according to specific age and gender criteria. We collected natural language descriptions of 200 half-minute movie clips, from Amazon Mechanical Turk workers (crowdsourced) and 60 participants recruited from the community (lab-sourced). Crowdsourced participants responded to as many clips as they wanted and typed their responses, whereas lab-sourced participants gave spoken responses to 40 clips, and their responses were transcribed. The content of the responses was evaluated using a take-one-out procedure, which compared responses to other responses to the same clip and to other clips, with a comparison of the average number of shared words. In contrast to the 13 months of recruiting that was required to collect normative data from 60 lab-sourced participants (with specific demographic characteristics), only 34 days were needed to collect normative data from 99 crowdsourced participants (contributing a median of 22 responses). The majority of crowdsourced workers were female, and the median age was 35 years, lower than the lab-sourced median of 62 years but similar to the median age of the US population. The responses contributed by the crowdsourced participants were longer on average, that is, 33 words compared to 28 words (Pcrowdsourced participants had more shared words (P=.004 and .01 respectively), whereas younger participants had higher numbers of shared words in the lab-sourced population (P=.01). Crowdsourcing is an effective approach
and contained technological trajectories on a national level using a combination of methods from statistical natural language processing, vector space modelling and network analysis. The proposed approach does not aim at replacing the researcher or expert but rather offers the possibility to algorithmically...... in Denmark. Results show that in the explored case it is not mainly new technologies and applications that are driving change but innovative re-combinations of old and new technologies....
Jones, David; Johnson, Gareth; Hicks, Nigel; Bond, Clare; Gilfillan, Stuart; Kremer, Yannick; Lister, Bob; Nkwane, Mzikayise; Maupa, Thulani; Munyangane, Portia; Robey, Kate; Saunders, Ian; Shipton, Zoe; Pearce, Jonathan; Haszeldine, Stuart
Natural CO2 leakage along the Bongwana Fault in South Africa is being studied to help understand processes of CO2 leakage and develop monitoring protocols. The Bongwana Fault crops out over approximately 80 km in KwaZulu-Natal province, South Africa. In outcrop the fault is expressed as a broad fracture corridor in Dwyka Tillite, with fractures oriented approximately N-S. Natural emissions of CO2 occur at various points along the fault, manifest as travertine cones and terraces, bubbling in the rivers and as gas fluxes through soil. Exposed rock outcrop shows evidence for Fe-staining around fractures and is locally extensively kaolinitised. The gas has also been released through a shallow water well, and was exploited commercially in the past. Preliminary studies have been carried out to better document the surface emissions using near surface gas monitoring, understand the origin of the gas through major gas composition and stable and noble gas isotopes and improve understanding of the structural controls on gas leakage through mapping. In addition the impact of the leaking CO2 on local water sources (surface and ground) is being investigated, along with the seismic activity of the fault. The investigation will help to build technical capacity in South Africa and to develop monitoring techniques and plans for a future CO2 storage pilot there. Early results suggest that CO2 leakage is confined to a relatively small number of spatially-restricted locations along the weakly seismically active fault. Fracture permeability appears to be the main method by which the CO2 migrates to the surface. The bulk of the CO2 is of deep origin with a minor contribution from near surface biogenic processes as determined by major gas composition. Water chemistry, including pH, DO and TDS is notably different between CO2-rich and CO2-poor sites. Soil gas content and flux effectively delineates the fault trace in active leakage sites. The fault provides an effective testing ground for
Toma, Irina; Brighiu, Stefan Mihai; Dascalu, Mihai; Trausan-Matu, Stefan
Learning a new language includes multiple aspects, from vocabulary acquisition to exercising words in sentences, and developing discourse building capabilities. In most learning scenarios, students learn individually and interact only during classes; therefore, it is difficult to enhance their
Dependency distance: A new perspective on the syntactic development in second language acquisition. Comment on "Dependency distance: A new perspective on syntactic patterns in natural language" by Haitao Liu et al.
Jiang, Jingyang; Ouyang, Jinghui
Liu et al. offer a clear and informative account of the use of dependency distance in studying natural languages, with a focus on the viewpoint that dependency distance minimization (DDM) can be regarded as a linguistic universal. We would like to add the perspective of employing dependency distance in studies of second language acquisition (SLA), particularly studies of syntactic development.
Cofré, Hernán; Cuevas, Emilia; Becerra, Beatriz
Despite the importance of the theory of evolution (TE) to scientific knowledge, a number of misconceptions continue to be found among biology teachers. In this context, the first objective of this study was to identify the impact of professional development programme (PDP) on teachers' understanding of nature of science (NOS) and evolution and on…
Smith, Sam; Sufi, Shoaib; Goble, Carole; Buchan, Iain
Background Data discovery, particularly the discovery of key variables and their inter-relationships, is key to secondary data analysis, and in-turn, the evolving field of data science. Interface designers have presumed that their users are domain experts, and so they have provided complex interfaces to support these “experts.” Such interfaces hark back to a time when searches needed to be accurate first time as there was a high computational cost associated with each search. Our work is part of a governmental research initiative between the medical and social research funding bodies to improve the use of social data in medical research. Objective The cross-disciplinary nature of data science can make no assumptions regarding the domain expertise of a particular scientist, whose interests may intersect multiple domains. Here we consider the common requirement for scientists to seek archived data for secondary analysis. This has more in common with search needs of the “Google generation” than with their single-domain, single-tool forebears. Our study compares a Google-like interface with traditional ways of searching for noncomplex health data in a data archive. Methods Two user interfaces are evaluated for the same set of tasks in extracting data from surveys stored in the UK Data Archive (UKDA). One interface, Web search, is “Google-like,” enabling users to browse, search for, and view metadata about study variables, whereas the other, traditional search, has standard multioption user interface. Results Using a comprehensive set of tasks with 20 volunteers, we found that the Web search interface met data discovery needs and expectations better than the traditional search. A task × interface repeated measures analysis showed a main effect indicating that answers found through the Web search interface were more likely to be correct (F 1,19=37.3, Peffect of task (F 3,57=6.3, Pinterface (F 1,19=18.0, Peffect of task (F 2,38=4.1, P=.025, Greenhouse
Scarpaci, J L
This essay examines methodological problems concerning the conceptualization and operationalization of phenomena central to medical geography. Its main argument is that qualitative research can be strengthened if the differences between instrumental and apparent validity are better understood than the current research in medical geography suggests. Its premise is that our definitions of key terms and concepts must be reinforced throughout the design of research should our knowledge and understanding be enhanced. In doing so, the paper aims to move the methodological debate beyond the simple dichotomies of quantitative vs qualitative approaches and logical positivism vs phenomenology. Instead, the argument is couched in a postmodernist hermeneutic sense which questions the validity of one discourse of investigation over another. The paper begins by discussing methods used in conceptualizing and operationalizing variables in quantitative and qualitative research design. Examples derive from concepts central to a geography of health-care behavior and well-being. The latter half of the essay shows the uses and misuses of validity studies in selected health services research and the current debate on national health insurance.
An attempt is made to specify the structure of the hominin bands that took the first steps toward language. Storytelling could evolve without need for language yet be strongly subject to natural selection, and it could provide a major feedback process in evolving language. A storytelling model is examined, including its effects on the evolution of consciousness and the possible timing of language evolution. Behavior planning is presented as a model of language evolution from storytelling. The behavior programming mechanism, operating in both directions, provides a model of creating and understanding behavior and language. Culture began with societies, then family evolution and family life in troops, but storytelling created a culture of experiences, a final step in the long process of achieving experienced adults by natural selection. Most language evolution occurred in conversations where evolving non-verbal feedback ensured mutual agreement on understanding. Natural language evolved in conversations, with feedback providing understanding of changes.